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SBYPGSYLTK NDNWLVRHPD CINVPDWRGA ICSGCYAQMY IQAYKTSNLR MKIIKNDFPS 660 

HPLYLEGALT RSTHYQQYGP WTLQKGYTI HWDQTAPAEL AIWLINFNKG DWIRVGLCYP 720 

RGTTFSILSD VHNRLLKQTS KTGVFVRTLQ MDKVEQSYPG RSHYYWDEDS GLLFLKLKAQ 780 

NEREKFAPCS MKGCERIKIK ALIPKNAGVS DCTATAYPKF TERAWDVPM PKKLFGSQLK 840 

TKDHFLEVKM ESSKQHFFHL WNDFAYIEVD GKKYPSSEDG IQWVIDGNQ GRWSHTSFR 900 

NSILQGIPWQ LFNYVATIPD KSIVLMASKG RYVSRGPWTR VLEKLGADRG LKLKEQMAFV 960 
GPKGSFRPIW VTLDTEDHKA KIFQWPIPV VKKKKL 



Seq ID NO: 141 DNA sequence 

Nucleic Acid Accession # : Eos sequence 

Coding sequence: 2 61.. 2 8 61 



1 11 21 31 41 SI 

I I I I I I 

GAGCTAGCGC TCAAGCAGAG CCCAGCGCGG TGCTATCGGA CAGAGCCTGG CGAGCGCAAG 60 

CGGCGCGGGG AGCCAGCGGG GCTGAGCGCG GCCAGGGTCT GAACCCAGAT TTCCCAGACT 120 

AGCTACCACT CCGCTTGCCC ACGCCCCGGG AGCTCGCGGC GCCTGGCGGT CAGCGACCAG 180 

ACGTCCGGGG CCGCTGCGCT CCTGGCCCGC GAGGCGTGAC ACTGTCTCGG CTACAGACCC 240 

AGAGGGAGCA CACTGCCAGG ATGGGAGCTG CTGGGAGGCA GGACTTCCTC TTCAAGGCCA 300 

TGCTGACCAT CAGCTGGCTC ACTCTGACCT GCTTCCCTGG GGCCACATCC ACAGTGGCTG 360 

CTGGGTGCCC TGACCAGAGC CCTGAGTTGC AACCCTGGAA CCCTGGCCAT GACCAAGACC 420 

ACCATGTGCA TATCGGCCAG GGCAAGACAC TGCTGCTCAC CTCTTCTGCC ACGGTCTATT 480 

CCATCCACAT CTCAGAGGGA GGCAAGCTGG TCATTAAAGA CCACGACGAG CCGATTGTTT 540 

TGCGAACCCG GCACATCCTG ATTGACAACG GAGGAGAGCT GCATGCTGGG AGTGCCCTCT 600 

GCCCTTTCCA GGGCAATTTC ACCATCATTT TGTATGGAAG GGCTGATGAA GGTATTCAGC 660 

CGGATCCTTA CTATGGTCTG AAGTACATTG GGGTTGGTAA AGGAGGCGCT CTTGAGTTGC 720 

ATGGACAGAA AAAGCTCTCC TGGACATTTC TGAACAAGAC CCTTCACCCA GGTGGCATGG 780 

CAGAAGGAGG CTATTTTTTT GAAAGGAGCT GGGGCCACCG TGGAGTTATT GTTCATGTCA 840 

TCGACCCCAA ATCAGGCACA GTCATCCATT CTGACCGGTT TGACACCTAT AGATCCAAGA 900 

AAGAGAGTGA ACGTCTGGTC CAGTATTTGA ACGCGGTGCC CGATGGCAGG ATCCTTTCTG 960 

TTGCAGTGAA TGATGAAGGT TCTCGAAATC TGGATGACAT GGCCAGGAAG GCGATGACCA 1020 

AATTGGGAAG CAAACACTTC CTGCACCTTG GATTTAGACA CCCTTGGAGT TTTCTAACTG 1080 

TGAAAGGAAA TCCATCATCT TCAGTGGAAG ACCATATTGA ATATCATGGA CATCGAGGCT 1140 

CTGCTGCTGC CCGGGTATTC AAATTGTTCC AGACAGAGCA TGGCGAATAT TTCAATGTTT 1200 

CTTTGTCCAG TGAGTGGGTT CAAGACGTGG AGTGGACGGA GTGGTTCGAT CATGATAAAG 1260 

TATCTCAGAC TAAAGGTGGG GAGAAAATTT CAGACCTCTG GAAAGCTCAC CCAGGAAAAA 1320 

TATGCAATCG TCCCATTGAT ATACAGGCCA CTACAATGGA TGGAGTTAAC CTCAGCACCG 1380 

AGGTTGTCTA CAAAAAAGGC CAGGATTATA GGTTTGCTTG CTACGACOGG GGCAGAGCCT 1440 

GCCGGAGCTA CCGTGTACGG TTCCTCTGTG GGAAGCCTGT GAGGCCCAAA CTCACAGTCA 1500 

CCATTGACAC CAATGTGAAC AGCACCATTC TGAACTTGGA GGATAATGTA CAGTCATGGA 1560 

AACCTGGAGA TACCCTGGTC ATTGCCAGTA CTGATTACTC CATGTACCAG GCAGAAGAGT 1620 

TCCAGGTGCT TCCCTGCAGA TCCTGCGCCC CCAACCAGGT CAAAGTGGCA GGGAAACCAA 1680 

TGTACCTGCA CATCGGGGAG GAGATAGACG GCGTGGACAT GCGGGCGGAG GTTGGGCTTC 1740 

TGAGCCGGAA CAT CATA GTG ATGGGGGAGA TGGAGGACAA ATGCTACCCC TACAGAAACC 1800 

ACATCTGCAA TTTCTTTGAC TTOGATACCT TTGGGGGCCA CATCAAGTTT GCTCTGGGAT 1860 

TTAAGGCAGC ACACTTGGAG GGCACGGAGC TGAAGCATAT GGGACAGCAG CTGGTGGGTC 1920 

AGTACCCGAT TCACTTCCAC CTGGCCGGTG ATGTAGACGA AAGGGGAGGT TATGACCCAC 1980 

CCACATACAT CAGGGACCTC TCCATCCATC ATACATTCTC TCGCTGCGTC ACAGTCCATG 2040 

GCTCCAATGG CTTGTTGATC AAGGACGTTG TGGGCTATAA CTCTTTGGGC CACTGCTTCT 2100 

TCACGGAAGA TGGGCCGGAG GAACGCAACA CTTTTGACCA CTGTCTTGGC CTCCTTGTCA 2160 

AGTCTGGAAC CCTCCTCCCC TCGGACCGTG ACAGCAAGAT GTGCAAGATG ATCACAGAGG 2220 

ACTCCTACCC AGGGTACATC CCCAAGCCCA GGCAAGACTG CAATGCTGTG TCCACCTTCT 2280 

GGATGGCCAA TCCCAACAAC AACCTCATCA ACTGTGCCGC TGCAGGATCT GAGGAAACTG 2340 

GATTTTGGTT TATTTTTCAC CACGTACCAA CGGGCCCCTC CGTGGGAATG TACTCCCCAG 2400 

GTTATTCAGA GCACATTCCA CTGGGAAAAT TCTATAACAA CCGAGCACAT TCCAACTACC 2460 

GGGCTGGCAT GATCATAGAC AACGGAGTCA AAACCACCGA GGCCTCTGCC AAGGACAAGC 2520 

GGCCGTTCCT CTCAATCATC TCTGCCAGAT ACAGCCCTCA CCAGGACGCC GACCCGCTGA 2580 

AGCCCCGGGA GCCGGCCATC ATCAGACACT T CATTGCCT A CAAGAACCAG GACCACGGGG 2640 

CCTGGCTGCG CGGCGGGGAT GTGTGGCTGG ACAGCTGCCA TTTCAGAGGG GAGGCTCAGG 2700 

AAGGCTTCTT GCTTACAGGA ATGAAGGCTG GGGGCATTTT GCTGGGGGGA GATGAGGCAG 2760 

CCTCTGGAAT GGCTCAGGGA TTCAGCCCTC CCTGCCGCTG CCTGCTGAAG CTGGTGACTA 2820 

CGGGGTCGCC CT TTGCT CAC GTCTCTCTGG CCCACTCATG ATGGAGAAGT GTGGTCAGAG 2880 

GGGAGCAATG GGCTTTGCTG CTTATGAGCA CAGAGGAATT CAGTCCCCAG GCAGCCCTGC 2940 

CTCTGACTCC AAGAGGGTGA AGTCCACAGA AGTGAGCTCC TGCCTTAGGG CCTCATTTGC 3000 

TCTTCATCCA GGGAACTGAG CACAGGGGGC CTCCAGGAGA CCCTAGATGT GCTCGTACTC 3060 

CCTCGGCCTG GGATTTCAGA GCTGGAAATA TAGAAAATAT CTAGCCCAAA GCCTTCATTT 3120 

TAACAGATGG GGAAAGTGAG CCCCCAAGAT GGGAAAGAAC CACACAGCTA AGGGAGGGCC 3180 

TGGGGAGCCC CACCCTAGCC CTTGCTGCCA CACCACATTG CCTCAACAAC CGGCCCCAGA 3240 

GTGCCCAGGC ACTCCTGAGG TAGCTTCTGG AAATGGGGAC AAGTCCCCTC GAAGGAAAGG 3300 

AAATGACTAG AGTAGAATGA CAGCTAGCAG ATCTCTTCCC TCCTGCTCCC AGCGCACACA 3360 

AACCCGCCCT CCCCTTGGTG TTGGCGGTCC CTGTGGCCTT CACTTTGTTC ACTACCTGTC 3420 

AGCCCAGCCT GGGTGCACAG TAGCTGCAAC TCCCCATTGG TGCTACCTGG CTCTCCTGTC 3480 

TCTGCAGCTC TACAGGTGAG GCCCAGCAGA GGGAGTAGGG CTCGCCATGT TTCTGGTGAG 3540 

CCAATTTGGC TGATCTTGGG TGTCTGAACA GCTATTGGGT CCACCCCAGT CCCTTTCAGC 3600 

TGCTGCTTAA TGCCCTGCTC TCTCCCTGGC CCACCTTATA GAGAGCCCAA AGAGCTCCTG 3660 

TAAGAGGGAG AACTCTATCT GTGGTTTATA ATCTTGCACG AGGCACCAGA GTCTCCCTGG 3720 

GTCTTGTGAT GAACTACATT TATCCCCTTT CCTGCCCCAA CCACAAACTC TTTCCTTCAA 3780 

AGAGGGCCTG CCTGGCTCCC TCCACCCAAC TGCACCCATG AGACTCGGTC CAAGAGTCCA 3840 

TTCCCCAGGT GGGAGCCAAC TGTCAGGGAG GTCTTTCCCA CCAAACATCT TTCAGCTGCT 3900 

GGGAGGTGAC CATAGGGCTC TGCTTTTAAA GATATGGCTG CTTCAAAGGC CAGAGTCACA 3960 

GGAAGGACTT CTTCCAGGGA GATTAGTGGT GATGGAGAGG AGAGTTAAAA TGACCTCATG 4020 

TCCTTCTTGT CCACGGTTTT GTTGAGTTTT CACTCTTCTA ATGCAAGGGT CTCACACTGT 4080 

GAACCACTTA GGATGTGATC ACTTTCAGGT GGCCAGGAAT GTTGAATGTC TTTGGCTCAG 4140 

TTCATTTAAA AAAGATATCT ATTTGAAAGT TCTCAGAGTT GTACATATGT TTCACAGTAC 4200 
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AGGATCTGTA CATAAAAGTT TCTTTCCTAA ACCATTCACC AAGAGCCAAT ATCTAGGCAT 4260 

TTTCTTGGTA GCACAAATTT TCTTATTGCT TAGAAAATTG TCCTCCTTGT TATTTCTGTT 4320 

TGTAAGACTT AAGTGAGTTA GGTCTTTAAG GAAAGCAAOG CTCCTCTGAA ATGCTTGTCT 4380 

TTTTTCTGTT GCCGAAATAG CTGGTCCTTT TTCGGGAGTT AGATGTATAG AGTGTTTGTA 4440 

TGTAAACATT TCTTGTAGGC ATCACCATGA ACAAAGATAT ATTTTCTATT TATTTATTAT 4500 

ATGTGCACTT CAAGAAGTCA CTGTCAGAGA AATAAAGAAT TGTCTTAAAT GTCATGATTG 4S60 

GAGATGTCCT TTGCATTGCT TGGAAGGGGT GTACCTAGAG CCAAGGAAAT TGGCTCTGGT 4620 

TTGGAAAAAT TTTGCTGTTA TTATAGTAAA CATACAAAGG ATGTCAAAAA AAAAAAAAAA 4680 
AAAAAAAAAA AAAAAAAAAA AA 



Seq ID NO: 142 Protein sequence 
Protein Accession H : Eos sequence 



Seq ID NO: 143 NM DNA sequence 
Nucleic Acid Accession #: NM_006017 
Coding sequence: 3 8.. 2 635 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



1 11 

I I 
MGAAGRQDFL FKAMLTISWL 
GKTLLLTSSA TVYSIHISEG 
TIILYGRADE GIQPDPYYGL 
ERSWGHRGVI VHVIDPKSGT 
SRNLDDMARK AMTKLGSKHP 
KLFQTEHGEY FNVSLSSEWV 
IQATTMDGVN LSTEWYKKG 
STILNLEDNV QSWKPGDTLV 
EIDGVDMRAE VGLLSRNIIV 
GTELKHMGQQ LVGQYPIHFH 
KDWGYNSLG HCFFTBDGPE 
PKPRQDCNAV STFWMANPNN 
LGKFYNNRAH SNYRAGMI ID 
IRHFIAYKNQ DHGAWLRGGD 
FSPPCRCLLK LVTTGSPFAH 



21 31 

I I 

TLTCFPGATS TVAAGCPDQS 
GKLVIKDHDE PIVLRTRHIL 
KYIGVGKGGA LELHGQKKLS 
VIHSDRFDTY RSKKESERLV 
LHLGFRHPWS FLTVKGNPSS 
QDVEWTEWFD HDKVSQTKGG 
QDYRFACYDR GRACRSYRVR 
IASTDYSMYQ AEEFQVLPCR 
MGEMEDKCYP YRNHICNFFD 
LAGDVDERGG YDPPTYIRDL 
ERNTFDHCLG LLVKSGTLLP 
NLINCAAAGS EETGFWFIFH 
NGVKTTEASA KDKRPFLSII 
VWLDSCHFRG EAQEGFLIiTG 
VSLAHS 



41 51 
I I 
PELQPWNPGH DQDHHVHIGQ 
IDNGGELHAG SALCPFQGNF 
WTFLNKTLHP GGMAEGGYFF 
QYLNAVPDGR ILSVAVNDEG 
SVEDHIEYHG HRGSAAARVF 
EKISDLWKAH PGKICNRPID 
FLCGKPVRPK LTVTIDTNVN 
SCAPNQVKVA GKPMYLHIGE 
FDTFGGHIKF ALGFKAAHLE 
SIHHTFSRCV TVHGSNGLliI 
SDRDSKMCKM ITEDSYPGYI 
HVPTGPSVGM YSPGYSEHIP 
SARYSPHQDA DPLKPREPAI 
MKAGGILLGG DEAASGHAQG 



1 11 21 31 41 51 

I 1 I 1 I | . 

CCAAGTTCTA CCTCATGTTT GGAGGATCTT GCTAGCTATG GCCCTCGTAC TCGGCTCCCT 60 

GTTGCTGCTG GGGCTGTGCG GGAACTCCTT TTCAGGAGGG CAGCCTTCAT CCACAGATGC 120 

TCCTAAGGCT TGGAATTATG AATTGCCPGC AACAAATTAT GAGACCCAAG ACTCCCATAA 180 

AGCTGGACCC ATTGGCATTC TCTTTGAACT AGTGCATATC TTTCTCTATG TGGTACAGCC 240 

GCGTGATTTC CCAGAAGATA CTTTGAGAAA ATTCTTACAG AAGGCATATG AATCCAAAAT 300 

TGATTATGAC AAGCCAGAAA CTGTAATCTT AGGTCTAAAG ATTGTCTACT ATGAAGCAGG 360 

GATTATTCTA TGCTGTGTCC TGGGGCTGCT GTTTATTATT CTGATGCCTC TGGTGGGGTA 420 

TTTCTTTTGT ATGTGTCGTT GCTGTAACAA ATGTGGTGGA GAAATGCACC AGCGACAGAA 480 

GGAAAATGGG CCCTTCCTGA GGAAATGCTT TGCAATCTCC CTGTTGGTGA TTTGTATAAT 540 

AATAAGCATT GGCATCTTCT ATGGTTTTGT GGCAAATCAC CAGGTAAGAA CCCGGATCAA 600 

AAGGAGTCGG AAACTGGCAG ATAGCAATTT CAAGGACTTG CGAACTCTCT TGAATGAAAC 660 

TCCAGAGCAA ATCAAATATA TATTGGCCCA GTACAACACT ACCAAGGACA AGGCGTTCAC 720 

AGATCTGAAC AGTATCAATT CAGTGCTAGG AGGOGGAATT CTTGACCGAC TGAGACCCAA 780 

CATCATCCCT GTTCTTGATG AGATTAAGTC CATGGCAACA GCGATCAAGG AGACCAAAGA 840 

GGCGTTGGAG AACATGAACA GCACCTTGAA GAGCTTGCAC CAACAAAGTA CACAGCTTAG 900 

CAGCAGTCTG ACCAGCGTGA AAACTAGCCT GOGGTCATCT CTCAATGACC CTCTGTGCTT 960 

GGTGCATCCA TCAAGTGAAA CCTGCAACAG CATCAGATTG TCTCTAAGCC AGCTGAATAG 1020 

CAACCCTGAA CTGAGGCAGC TrCCACCCGT GGATGCAGAA CTTGACAACG TTAATAACGT 1080 

TCTTAGGACA GATTTGGATG GCCTGGTCCA ACAGGGCTAT CAATCCCTTA ATGATATACC 1140 

TGACAGAGTA CAACGCCAAA CCACGACTGT CGTAGCAGGT ATCAAAAGGG TCTTGAATTC 1200 

CATTGGTTCA GATATCGACA ATGTAACTCA GCGTCTTCCT ATTCAGGATA TACTCTCAGC 1260 

ATTCTCTGTT TATGTTAATA ACACTGAAAG TTACATCCAC AGAAATTTAC CTACATTGGA 1320 

AGAGTATGAT TCATACTGGT GGCTGGGTGG CCTGGTCATC TGCTCTCTGC TGACCCTCAT 1380 

CGTGATTTTT TACTACCTGG GCTTACTGTG TGGCGTGTGC GGCTATGACA GGCATGCCAC 1440 

CCCGACCACC CGAGGCTGTG TCTCCAACAC CGGAGGCGTC TTCCTCATGG TTGGAGTTGG 1500 

ATTAAGTTTC CTCTTTTGCT GGATATTGAT GATCATTGTG GTTCTTACCT TTGTCTTTGG 1560 

TGCAAATGTG GAAAAACTGA TCTGTGAACC TTACACGAGC AAGGAATTAT TCCGGGTTTT 1620 

GGATACACCC TACTTACTAA ATGAAGACTG GGAATACTAT CTCTCTGGGA AGCTATTTAA 1680 

TAAATCAAAA ATGAAGCTCA CTTTTGAACA AGTTTACAGT GACTGCAAAA AAAATAGAGG 1740 

CACTTACGGC ACTCTTCACC TGCAGAACAG CTTCAATATC AGTGAACATC TCAACATTAA 1800 

TGAGCATACT GGAAGCATAA GCAGTGAATT GGAAAGTCTG AAGGTAAATC TTAATATCTT I860 

TCTGTTGGGT GCAGCAGGAA GAAAAAACCT TCAGGATTTT GCTGCTTGTG GAATAGACAG 1920 

AATGAATTAT GACAGCTACT TGGCTCAGAC TGGTAAATCC CCCGCAGGAG TGAATCTTTT 1980 

ATCATTTGCA TATGATCTAG AAGCAAAAGC AAACAGTTTG OCCCCAGGAA ATTTGAGGAA 2040 

CTCCCTGAAA AGAGATGCAC AAACTATTAA AACAATTCAC CAGCAACGAG TCCTTCCTAT 2100 

AGAACAATCA CTGAGCACTC TATACCAAAG CGTCAAGATA CTTCAACGCA CAGGGAATGG 2160 

ATTGTTGGAG AGAGTAACTA GGATTCTAGC TTCTCTGGAT TTTGCTCAGA ACTTCATCAC 2220 

AAACAATACT TCCTCTGTTA TTATTGAGGA AACTAAGAAG TATGGGAGAA CAATAATAGG 2280 

ATATTTTGAA CATTATCTGC AGTGGATCGA GTTCTCTATC AGTGAGAAAG TGGCATCGTG 2340 

CAAACCTGTG GCCACCGCTC TAGATACTGC TGTTGATGTC TTTCTGTGTA GCTACATTAT 2400 

CGACCCCTTG AATTTGTTTT GGTTTGGCAT AGGAAAAGCT ACTGTATTTT TACTTCCGGC 2460 

TCTAATTTTT GGGGTAAAAC TGGCTAAGTA CTATCGTCGA ATGGATTCGG AGGACGTGTA 2520 

CGATGATGTT GAAACTATAC CCATGAAAAA TATGGAAAAT GGTAATAATG GTTATCATAA 2580 

AGATCATGTA TATGGTATTC ACAATCCTGT TATGACAAGC CCATCACAAC ATTGATAGCT 2640 

GATGTTGAAA CTGCTTGAGC ATCAGGATAC TCAAAGTGGA AAGGATCACA GATTTTTGGT 2700 

AGTTTCTGGG TCTACAAGGA CTTTCCAAAT CCAGGAGCAA CGCCAGTGGC AACGTAGTGA 2760 

CTCAGGCGGG CACCAAGGCA ACGGCACCAT TGGTCTCTGG GTAGTGCTTT AAGAATGAAC 2820 
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ACAATCACGT TATAGTCCAT 
TATTTTTGTT TTTTACTTTT 
GTTTGTTCCC ATTGGATGCA 
ACATATTGCC ATGTGTGGAG 
GTGTACAGTA AACGGTGTAT 
TTATAGGACT TTCTTCTAAA 
AAATAATCCA TTTTCACTAA 
TTCATCTATT ATACTTTATC 
CCATCATAGA GAAACCTGCG 
GAGAGAGAAA TGCTGTTCGT 
ACAGTTTAAC AGAGTTTTCT 
GAGTATCATC AGACTAGTAT 
TTCTTAGATC TGGTGTCCAG 
AAAGCACAAG GTAAATATTC 
AAGTGATCAG CAATGAAGAA 
GATGTAGGCA TTCCCCCCAG 
AAAAAGGAAC TTGG 

Seq ID NO: 144 NP Protein sequence 
Protein Accession #: NP_006008.1 

1 11 21 31 41 51 

I I I 1 I I 

MALVLGSLLL LGLCGNSFSG GQPSSTDAPK AWNYELPATN YETQDSHKAG PIGILFELVH 60 

I FLYWQPRD FPEDTLRKFL QKAYESKIDY DKPETVILGL KIVYYEAGI I LCCVLGLLFI 120 

ILMPLVGYFF CMCRCCNKCG GEMHQRQKEN GPFLRKCFAI SLLVICIIIS IGIFYGFVAN 180 

HQVRTRIKRS RKLADSNFKD LRTLLNETPB QIKYILAQYN TTKDKAFTDL NSINSVLGGG 240 

ILDRLiRPNII PVLDEIKSMA TAIKETKEAL ENMNSTLKSL HQQSTQLSSS LTSVKTSLRS 300 

SLNDPL&jVH PSSETCNSIR LSLSQLNSNP ELRQLPPVDA ELDNVNNVLR TDLDGLVQQG 360 

YQSLNDIPDR VQRQTTTWA GIKRVLNSIG SDIDNVTQRL PIQDILSAFS VYVNNTESYI 420 

HRNLPTLEEY DSYWWLGGIiV ICSLLTLIVI FYYLGLLCGV CGYDRHATPT TRGCVSNTGG 480 

VFLMVGVGLS FLFCWILMII WLTFVFGAN VEKLICEPYT SKELFRVLDT PYLLNEDWBY 540 

YLSGKLFNKS KMKLTFEQVY SDCKKKRGTY GTLHLQNSFN ISEHLNINEH TGSISSELES 600 

LKVNLNIFLL GAAGRKNLQD FAACGIDRMN YDSYLAQTGK SPAGVNLLSF AYDLEAKANS 660 

LPPGNLRNSL KRDAQTIKTI HQQRVLPIEQ SLSTLYQSVK ILQRTGNGLL ERVTR I IiASL 720 

DFAQNFITNN TSSVIIEETK KYGRTIIGYF EHYLQWIEFS ISEKVASCKP VATALDTAVD 780 

VFLCSYIIDP LNLFWFGIGK ATVFL.LPALI FAVKLAKYYR RMDSEDVYDD VETIFMKNME 840 
NGNNGYHKDH VYGIHNPVMT SPSQH 

Seq 10 NO: 145 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequences 38.. 2605 

1 11 21 31 41 51 

I I I I I I 

CCAAGTTCTA CCTCATGTTT GGAGGATCTT GCTAGCTATG GCCCTCGTAC TCGGCTCCCT 60 

GTTGCTGCTG GGGCTGTGCG GGAACTCCTT TTCAGGAGGG CAGCCTTCAT CCACAGATGC 120 

TCCTAAGGCT TGGAATTATG AATTGCCTGC AACAAATTAT GAGACCCAAG ACTCCCATAA 180 

AGCTGGACCC ATTGGCATTC TCTTTGAACT AGTGCATATC TTTCTCTATG TGGTACAGCC 240 

GCGTGATTTC CCAGAAGATA CTTTGAGAAA ATTCTTACAG AAGGCATATG AATCCAAAAT 300 

TGATTATGAC ATTGTCTACT ATGAAGCAGG GATTATTCTA TGCTGTGTCC TGGGGCTGCT 360 

GTTTATTATT CTGATGCCTC TGGTGGGGTA TTTCTTTTGT ATGTGTCGTT GCTGTAACAA 420 

ATGTGGTGGA GAAATGCACC AGCGACAGAA GGAAAATGGG CCCTTCCTGA GGAAATGCTT 480 

TGCAATCTCC CTGTTGGTGA TTTGTATAAT AATAAGCATT GGCATCTTCT ATGGTTTTGT 540 

GGCAAATCAC CAGGTAAGAA CCCGGATCAA AAGGAGTCGG AAACTGGCAG ATAGCAATTT 600 

CAAGGACTTG CGAACTCTCT TGAATGAAAC TCCAGAGCAA ATCAAATATA TATTGGCCCA 660 

GTACAACACT ACCAAGGACA AGGCGTTCAC AGATCTGAAC AGTATCAATT CAGTGCTAGG 720 

AGGCGGAATT CTTGACCGAC TGAGACCCAA CATCATCCCT GTTCTTGATG AGATTAAGTC 780 

CATGGCAACA GCGATCAAGG AGACCAAAGA GGCGTTGGAG AACATGAACA GCACCTTGAA 840 

GAGCTTGCAC CAACAAAGTA CACAGCTTAG CAGCAGTCTG ACCAGCGTGA AAACTAGCCT 900 

GCGGTCATCT CTCAATGACC CTCTGTGCTT GGTGCATCCA TCAAGTGAAA CCTGCAACAG 960 

CATCAGATTG TCTCTAAGCC AGCTGAATAG CAACCCTGAA CTGAGGCAGC TTCCACCCGT 1020 

GGATGCAGAA CTTGACAACG TTAATAACGT TCTTAGGACA GATTTGGATG GCCTGGTCCA 1080 

ACAGGGCTAT CAATCCCTTA ATGATATACC TGACAGAGTA CAACGCCAAA CCACGACTGT 1140 

CGTAGCAGGT ATCAAAAGGG TCTTGAATTC CATTGGTTCA GATATCGACA ATGTAACTCA 1200 

GCGTCTTCCT ATTCAGGATA TACTCTCAGC ATTCTCTGTT TATGTTAATA ACACTGAAAG 1260 

TTACATCCAC AGAAATTTAC CTACATTGGA AGAGTATGAT TCATACTGGT GGCTGGGTGG 1320 

CCTGGTCATC TGCTCTCTGC TGACCCTCAT CGTGATTTTT TACTACCTGG GCTTACTGTG 1380 

TGGCGTGTGC GGCTATGACA GGCATGCCAC CCCGACCACC CGAGGCTGTG TCTCCAACAC 1440 

CGGAGGCGTC TTCCTCATGG TTGGAGTTGG ATTAAGTTTC CTCTTTTGCT GGATATTGAT 1500 

GATCATTGTG GTTCTTACCT TTGTCTTTGG TGCAAATGTG GAAAAACTGA TCTGTGAACC 1560 

TTACAOGAGC AAGGAATTAT TCCGGGTTTT GGATACACCC TACTTACTAA ATGAAGACTG 1620 

GGAATACTAT CTCTCTGGGA AGCTATTTAA TAAATCAAAA ATGAAGCTCA CTTTTGAACA 1680 

AGTTTACAGT GACTGCAAAA AAAATAGAGG CACTTACGGC ACTCTTCACC TGCAGAACAG 1740 

CTTCAATATC AGTGAACATC TCAACATTAA TGAGCATACT GGAAGCATAA GCAGTGAATT 1800 

GGAAAGTCTG AAGGTAAATC TTAATATCTT TCTGTTGGGT GCAGCAGGAA GAAAAAACCT 1860 

TCAGGATTTT GCTGCTTGTG GAATAGACAG AATGAATTAT GACAGCTACT TGGCTCAGAC 1920 

TGGTAAATCC CCCGCAGGAG TGAATCTTTT ATCATTTGCA TATGATCTAG AAGCAAAAGC 1980 

AAACAGTTTG CCCCCAGGAA ATTTGAGGAA CTCCCTGAAA AGAGATGCAC AAACTATTAA 2040 

AACAATTCAC CAGCAACGAG TCCTTCCTAT AGAACAATCA CTGAGCACTC TATACCAAAG 210 0 

CGTCAAGATA CTTCAACGCA CAGGGAATGG ATTGTTGGAG AGAGTAACTA GGATTCTAGC 2 ISO 

TTCTCTGGAT TTTGCTCAGA ACTTCATCAC AAACAATACT TCCTCTGTTA TTATTGAGGA 2220 

AACTAAGAAG TATGGGAGAA CAATAATAGG ATATTTTGAA CATTATCTGC AGTGGATCGA 2280 

GTT CTCTATC AGTGAGAAAG TGGCATCGTG CAAACCTGTG GCCACCGCTC TAGATACTGC 2340 



GGTCCATCAC TATTCAAGGA TGACTCCCTC CCTTCCTGTC 2880 

TTACACTGAG TTTCTATTTA GACACTACAA CATATGGGGT 2940 

TTTCTATCAA AACTCTATCA AATGTGATGG CTAGATTCTA 3000 

TGTGCTGAAC ACACACCAGT TTACAGGAAA GATGCATTTT 3060 

ATACCTTTTG TTACCACAGA GTTTTTTAAA CAAATGAGTA 3120 

TGAGCTAAAT AAGTCACCAT TGACTTCTTG GTGCTGTTGA 3180 

AAGTGTGTGA AACCTACAGC ATATTCTTCA CGCAGAGATT 3240 

AAAGATTGGC CATGTTCCAC TTGGAAATGG CATGCAAAAG 3300 

TAACTCCATC TGACAAATTC AAAAGAGAGA GAGAGATCTT 3360 

TCAAAAGTGG AGTTGTTTTA ACAGATGCCA ATTACGGTGT 3420 

GTTGCATTAG GATAAACATT AATTGGAGTG CAGCTAACAT 3480 

CAAGTGTTCT AAAATGAAAT ATGAGAAGAT CCTGTCACAA 3540 

CATGGATGAA ACCTTTGAGT TTGGTCCCTA AATTTGCATG 3600 

ATTTGCTTCA GGAGTTTCAT GTTGGATCTG TCATTATCAA 3660 

CTGGTCGGAC AAAATTTAAC GTTGATGTAA TGGAATTCCA 3720 

GTCTTTTCAT GTGCAGATTG CAGTTCTGAT TCATTTGAAT 3780 
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TGTTGATGTC TTTCTGTGTA GCTACATTAT CGACCCCTTG A ATTTGTTTT GGTTTGGCAT 2400 

AGGAAAAGCT ACTGTATTTT TACTTCCGGC TCTAATTTTT GCGGTAAAAC TGGCTAAGTA 2460 

CTATCGTCGA ATGGATTCGG AGGACGTGTA CGATGATGTT GAAACTATAC CCATGAAAAA 2520 

TATGGAAAAT GGTAATAATG GTTATCATAA AGATCATGTA TATGGTATTC ACAATCCTGT 2580 

J TATGACAAGC CCATCACAAC ATTGATAGCT GATGTTGAAA CTGCTTGAGC ATCAGGATAC 2640 

TCAAAGTGGA AAGGATCACA GATTTTTGGT AGTTTCTGGG TCTACAAGGA CTTTCCAAAT 2700 

CCAGGAGCAA CGCCAGTGGC AACGTAGTGA CTCAGGCGGG CACCAAGGCA ACGGCACCAT 2760 

TGGTCTCTGG GTAGTGCTTT AAGAATGAAC ACAATCACGT TATAGTCCAT GGTCCATCAC 2820 

TATTCAAGGA TGACTCCCTC CCTTCCTGTC TATTTTTGTT TTTTACTTTT TTACACTGAG 2880 

iU TTTCTATTTA GACACTACAA CATATGGGGT GTTTGTTCCC ATTGGATGCA TTTCTATCAA 2940 

AACTCTATCA AATGTGATGG CTAGATTCTA ACATATTGCC ATGTGTGGAG TGTGCTGAAC 3000 

ACACACCAGT TTACAGGAAA GATGCATTTT GTGTACAGTA AACGGTGTAT ATACCTTTTG 3060 

TTACCACAGA GTTTTTTAAA CAAATGAGTA TTATAGGACT TTCTTCTAAA TGAGCTAAAT 3120 

AAGTCACCAT TGACTTCTTG GTGCTGTTGA AAATAATCCA TTTTCACTAA AAGTGTGTGA 3180 

O AACCTACAGC ATATTCTTCA OGCAGAGATT TTCATCTATT ATACTTTATC AAAGATTGGC 3240 

CATGTTCCAC TTGGAAATGG CATGCAAAAG CCATCATAGA GAAACCTGCG TAACTCCATC 3300 

TGACA AATT C AAAAGAGAGA GAGAGATCTT GAGAGAGAAA TGCTGTTCGT TCAAAAGTGG 3360 

AGTTGTTTTA ACAGATGCCA ATTACGGTGT ACAGTTTAAC AGAGTTTTCT GTTGCATTAG 3420 

GATAAACATT AATTGGAGTG CAGCTAACAT GAGTATCATC AGACTAGTAT CAAGTGTTCT 3480 

AAAATGAAAT ATGAGAAGAT CCTGTCACAA TTCTTAGATC TGGTGTCCAG CATGGATGAA 3540 

ACCTTTGAGT TTGGTCCCTA AATTTGCATG AAAGCACAAG GTAAATATTC ATTTGCTTCA 3600 

GGAGTTTCAT GTTGGATCTG TCATTATCAA AAGTGATCAG CAATGAAGAA CTGGTCGGAC 3660 

AAAATTTAAC GTTGATGTAA TGGAATTCCA GATGTAGGCA TTCCCCCCAG GTCTTTTCAT 3720 
GTGCAGATTG CAGTTCTGAT TCATTTQAAT AAAAAGGAAC TTGG 
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Seq ID NO: 146 Protein sequence 
Protein Accession 8: 



on 1 11 21 31 41 51 

30 | | | | | | 

MALVLGSLLL LGLCGNSFSG GQPSSTDAPK AWNYELPATN YETQDSHKAG PIGILFELVH 60 

IFLYWQPRD FPEDTLRKFL QKAYESKIDY DIVYYEAGII LCCVLGLLFI ILMPLVGYFP 120 

CMCRCCNKCG GEMHQRQKEN GPPLRKCFAI SLLVICIIIS IGIFYGFVAN HQVRTRIKRS 180 

RKLADSNFKD LRTLLNETPE QIKYILAQYN TTKDKAFTDL NSINSVLGGG ILDRLRPNII 240 

PVLDEIKSMA TAIKETKEAL ENMNSTLKSL HQQSTQLSSS LTSVKTSLRS SLNDPLCLVH 300 

PSSETCNSIR LSLSQLNSNP ELRQLPPVDA ELDNVNNVLR TDLDGLVQQG YQSLNDIPDR 360 

VQRQTTTWA GIKRVLNSXG SDIDNVTQRL PIQDILSAFS VYVNNTESYI HRNLPTLEBY 420 

DSYWWLGGLV ICSLLTLIVI PYYLGLLCGV CGYDRHATPT TRGCVSNTGG VFLMVGVGLS 480 

FLFCWILMII WLTFVFGAN VEKLICEPYT SKELFRVLDT PYLLNEDWEY YLSGKLFNKS 540 

KMKLTFEQVY SDCKKKRGTY GTLHLQN9FN ISEHLNINEH TGSISSELES LKVNLNIFLL 600 

GAAGRKNLQD FAACGIDRMN YDSYIAQTGK SPAGVNLLSF AYDLEAKANS LPPGNLRNSL 660 

KRDAQTIKTI HQQRVLPIEQ SLSTLYQSVK ILQRTGNGLL ERVTRILASL DFAQNFITNN 720 

TSSVIIEETK KYGRTI IGYF EHYLQWIEPS ISEKVASCKP VATALDTAVD VFLCSYIIDP 780 

LNLFWFGIGK ATVFLLPAItl FAVKLAKYYR RMDSEDVYDD VETIPMKNME NGNNGYHKDH 840 
40 VYGIHNPVMT SPSQH 



Seq ID NO: 147 DNA sequence 

Nucleic Acid Accession #: BCO12089.1 

Coding sequence: 236.. 2806 



1 11 21 31 41 51 

I I 1 I i I 

ATGCTCTCAG CTCTCCCGCC GCGGGATGGT GCCTTGAGTG AATGACCCCC TTGGAGAACA 60 

TTCTTCCGCA TCCCTCGCCT CAAGCCAGCC TCAGACAGAA AACTGAAGAT TCAGCAGATC 120 

CAGTGCTTCC TGCTCCTCTT CTGCCCAGGA ACACGCTTGC CTTCCCCAAG GCTTCCAGAA 180 

GCTCTGAGGC AGGAGGCACC AAGTTCTACC TCATGTTTGG AGGATCTTGC TAGCTATGGC 240 

CCTOGTACTC GGCTCCCTGT TGCTGCTGGG GCTGTGCGGG AACTCCTTTT CAGGAGGGCA 300 

GCCTTCATCC ACAGATGCTC CTAAGGCTTG GAATTATGAA TTGCCTGCAA CAAATTATGA 360 

„ GACCCAAGAC TCCCATAAAG CTGGACCCAT TGGCATTCTC TTTGAACTAG TGCATATCTT 420 

OU TCTCTATGTG GTACAGCCGC GTGATTTCCC AGAAGATACT TTGAGAAAAT TCTTACAGAA 480 

GGCATATGAA .TCCAAAATTG ATTATGACAA GATTGTCTAC TATGAAGCAG GGATTATTCT 540 

ATGCTGTGTC CTGGGGCTGC TGTTTATTAT TCTGATGCCT CTGGTGGGGT ATTTCTTTTG 600 

TATGTGTCGT TGCTGTAACA AATGTGGTGG AGAAATGCAC CAGCGACAGA AGGAAAATGG 660 

- GCCCTTCCTG AGGAAATGCT TTGCAATCTC CCTGTTGGTG ATTTGTATAA TAATAAGCAT 720 

03 TGGCATCTTC TATGGTTTTG TGGCAAATCA CCAGGTAAGA ACCCGGATCA AAAGGAGTCG 780 

GAAACTGGCA GATAGCAATT TCAAGGACTT GCGAACTCTC TTGAATGAAA CTCCAGAGCA 640 

AATCAAATAT ATATTGGCCC AGTACAACAC TACCAAGGAC AAGGCGTTCA CAGATCTGAA 900 

CAGTATCAAT TCAGTGCTAG GAGGCGGAAT TCTTGACCGA CTGAGACCCA ACATCATCCC 960 

TGTTCTTGAT GAGATTAAGT CCATGGCAAC AGCGATCAAG GAGACCAAAG AGGCGTTGGA 1020 

/U GAACATGAAC AGCACCTTGA AGAGCTTGCA CCAACAAAGT ACACAGCTTA GCAGCAGTCT 1080 

GACCAGCGTG AAAACTAGCC TGCGGTCATC TCTCAATGAC CCTCTGTGCT TGGTGCATCC 1140 

ATCAAGTGAA ACCTGCAACA GCATCAGATT GTCTCTAAGC CAGCTGAATA GCAACCCTGA 1200 

ACTGAGGCAG CTTCCACCCG TGGATGCAGA ACTTGACAAC GTTAATAACG TTCTTAGGAC 1260 

„ AGATTTGGAT GGCCTGGTCC AACAGGGCTA TCAATCCCTT AATGATATAC CTGACAGAGT 1320 

IJ ACAACGCCAA ACCACGACTG TCGTAGCAGG TATCAAAAGG GTCTTGAATT CCATTGGTTC 1380 

AGATATCGAC AATGTAACTC AGCGTCTTCC TATTCAGGAT ATACTCTCAG CATTCTCTGT 1440 

TTATGTTAAT AACACTGAAA GTTACATCCA CAGAAATTTA CCTACATTGG AAGAGTATGA 1500 

TTCATACTGG TGGCTGGGTG GCCTGGTCAT CTGCTCTCTG CTGACCCTCA TCGTGATTTT 1560 

_ n TTACTACCTG GGCTTACTGT GTGGCGTGTG CGGCTATGAC AGGCATGCCA CCCCGACCAC 1620 

OU CCG AGGCT GT GTCTCCAACA CCGGAGGCGT CTTCCTCATG GTTGGAGTTG GATTAAGTTT 1680 

CCTCTTTTGC TGGATATTGA TGATCATTGT GGTTCTTACC TTTGTCTTTG GTGCAAATGT 1740 

GGAAAAACTG ATCTGTGAAC CTTACACGAG CAAGGAATTA TTCCGGGTTT TGGATACACC 1800 

CTACTTACTA AATGAAGACT GGGAATACTA TCTCTCTGGG AAGCTATTTA ATAAATCAAA 1860 

AATGAAGCTC ACTTTTGAAC AAGTTTACAG TGACTGCAAA AAAAATAGAG GCACTTACGG 1920 
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CACTCTTCAC CTGCAGAACA GCTTCAATAT CAGTGAACAT CTCAACATTA ATGAGCATAC 1980 

TGGAAGCATA AGCAGTGAAT TGGAAAGTCT GAAGGTAAAT CTTAATATCT TTCTGTTGGG 2040 

TGCAGCAGGA AGAAAAAACC TTCAGGATTT TGCTGCTTGT GGAATAGACA GAATGAATTA 2100 

TGACAGCTAC TTGGCTCAGA CTGGTAAATC CCCCGCAGGA GTGAATCTTT TATCATTTGC 2160 

ATATGATCTA GAAGCAAAAG CAAACAGTTT GCCCCCAGGA AATTTGAGGA ACTCCCTGAA 2220 

AAGAGATGCA CAAACTATTA AAACAATTCA CCAGCAACGA GTCCTTCCTA TAGAACAATC 2280 

ACTGAGCACT CTATACCAAA GCGTCAAGAT ACTTCAACGC ACAGGGAATG GATTGTTGGA 2340 

GAGAGTAACT AGGATTCTAG CTTCTCTGGA TTTTGCTCAG AACTTCATCA CAAACAATAC 2400 

TTCCTCTGTT ATTATTGAGG AAACTAAGAA GTATGGGAGA ACAATAATAG GATATTTTGA 24S0 

ACATTATCTG CAGTGGATCG AGTTCTCTAT CAGTGAGAAA GTGGCATCGT GCAAACCTGT 2520 

GGCCACCGCT CTAGATACTG CTGTTGATGT CTTTCTGTGT AGCTACATTA TCGACCCCTT 2580 

GAATTTGTTT TGGTTTGGCA TAGGAAAAGC TACTGTATTT TTACTTCCGG CTCTAATTTT 2640 

TGCGGTAAAA CTGGCTAAGT ACTATCGTCG AATGGATTCG GAGGACGTGT ACGATGATGT 2700 

TGAAACTATA CCCATGAAAA ATATGGAAAA TGGTAATAAT GGTTATCATA AAGATCATGT 2760 

ATATGGTATT CACAATCCTG TTATGACAAG CCCATCACAA CATTGATAGC TGATGTTGAA 2820 

ACTGCTTGAG CATCAGGATA CTCAAAGTGG AAAGGATCAC AGATTTTTGG TAGTTTCTGG 2880 

GTCTACAAGG ACTTTCCAAA TCCAGGAGCA ACGCCAGTGG CAACGTAGTG ACTCAGGCGG 2940 

GCACCAAGGC AACGGCACCA TTGGTCTCTG GGTAGTGCTT TAAGAATGAA CACAATCACG 3000 

TTATAGTCCA TGGTCCATCA CTATTCAAGG ATGACTCCCT CCCTTCCTGT CTATTTTTGT 3060 

TTTTTACTTT TTTACACTGA GTTTCTATTT AGACACTACA ACATATGGGG TGTTTGTTCC 3120 

CATTGGATGC ATTTCTATCA AAACTCTATC AAATGTGATG GCTAGATTCT AACATATTGC 3180 

CATGTGTGGA GTGTGCTGAA CACACACCAG TTTACAGGAA AGATGCATTT TGTGTACAGT 3240 

AAACGGTGTA TATACCTTTT GTTACCACAG AGTTTTTTAA ACAAATGAGT ATTATAGGAC 3300 

TTTCTTCTAA ATGAGCTAAA TAAGTCACCA TTGACTTCTT GGTGCTGTTG AAAATAATCC 3360 

ATTTTCACTA AAAGTGTGTG AAACCTACAG CATATTCTTC ACGCAGAGAT TTTCATCTAT 3420 

TATACTTTAT CAAAGATTGG CCATGTTCCA CTTGGAAATG GCATGCAAAA GCCATCATAG 3480 

AGAAACCTGC GTAACTCCAT CTGACAAATT CAAAAGAGAG AGAGAGATCT TGAGAGAGAA 3540 

ATGCTGTTCG TTCAAAAGTG GAGTTGTTTT AACAGATGCC AATTACGGTG TACAGTTTAA 3600 

CAGAGTTTTC TGTTGCATTA GGATAAACAT TAATTGGAGT GCAGCTAACA TGAGTATCAT 3660 

CAGACTAGTA TCAAGTGTTC TAAAATGAAA TATGAGAAGA TCCTGTCACA ATTCTTAGAT 3720 

CTGGTGTCCA GCATGGATGA AACCTTTGAG TTTGGTCCCT AAATTTGCAT GAAAGCACAA 3780 

GGTAAATATT CATTTGCTTC AGGAGTTTCA TGTTGGATCT GTCATTATCA AAAGTGATCA 3840 

GCAATGAAGA ACTGGTCGGA CAAAATTTAA CGTTGATGTA ATGGAATTCC AGATGTAGGC 3900 

ATTCCCCCCA GGTCTTTTCA TGTGCAGATT GCAGTTCTGA TTCATTTGAA TAAAAAGGAA 3960 
CTTGGAAAAA AAAAAAAAAA AAA 

Seq ID NO: 148 Protein sequence 
Protein Accession Jh AAH12 089.1 

1 11 21 31 41 51 

I I I If I 

MALVLGSLIiL LGLCGNSFSG GQPSSTDAPK AWNYELPATN YETQDSHXAG PIGILFELVH 60 

IFLYWQPRD FPEDTLRKPL QKAYESKIDY DKIVYYEAGI ILCCVLGLLF 1 1 LMPLVGYF 120 

FCMCRCCNKC GGEMHQRQKE NGPFLRKCFA ISLLVICIII SIGIFYGFVA NHQVRTRIKR 180 

SRKLADSNFK DLRTLLNETP EQIKYILAQY NTTKDKAFTD LNSINSVLGG GILDRLRPNI 240 

IPVLDEIKSM ATAIKETKEA LENMNSTLKS LHQQSTQLSS SLTSVKTSLR SSLNDPLCLV 300 

HPSSETCNSI RLSLSQLNSN PELRQLPPVD AELDNVNNVL RTDLDGLVQQ GYQSLNDIPD 360 

RVQRQTTTW AGIKRVLNSI GSDIDNVTQR LPIQDILSAF SVYVNNTESY IHRNLPTLEE 420 

YDSYWWLGGL VICSLLTLIV IFYYLGLLCG VCGTDRHATP TTRGCVSNTG GVFLMVGVGL 480 

SFLFCWILMI IWLTFVFGA NVEKLICEPY TSKELFRVLD TPYLLNEDWE YYLSGKLFNK 540 

SKMKLTFEQV YSDCKKNROT YGTLHLQNSF NISEHLNINE HTGSISSELE SLKVNLNIFL 600 

LGAAGRKNLQ DFAACGIDRM NYDSYLAQTG KSPAGVNLLS FAYDLEAKAN SLPPGNLRNS 660 

LKRDAOTIKT IHQQRVLPIE QSLSTLYQSV KILQRTGNGL LERVTRILAS LDFAQNFITN 720 

NTSSVIIBET KKYGRTIIGY FEHYLQMIEF SISEKVASCK PVATALDTAV DVFLCSYIID 780 

PLNLFWFGIG KATVFLLPAL IFAVKLAKYY RRMDSEDVYD DVETIPMKNM ENGNNGYHKD 840 
HVYGIHNPVM TSPSQH 

Seq ID NO i 149 DNA sequence 

Nucleic Acid Accession Si NM_033049.l 

Coding sequence: 28.. 1566 

1 11 21 31 41 51 

I I I I I I 

CCACGCGTCC GAGCAAGAAC AGCTAAAATG AAAGCCATCA TTCATCTTAC TCTTCTTGCT 60 

CTCCTTTCTG TAAACACAGC CACCAACCAA GGCAACTCAG CTGATGCTGT AACAACCACA 120 

GAAACTGCGA CTAGTGGTCC TACAGTAGCT GCAGCTGATA CCACTGAAAC TAATTTCCCT 180 

GAAACTGCTA GCACCACAGC AAATACACCT TCTTTCCCAA CAGCTACTTC ACCTGCTCCC 240 

CCCATAATTA GTACACATAG TTCCTCCACA ATTCCTACAC CTGCTCCCCC CATAATTAGT 300 

ACACATAGTT CCTCCACAAT TCCTATACCT ACTGCTGCAG ACAGTGAGTC AACCACAAAT 360 

GTAAATTCAT TAGCTACCTC TGACATAATC ACCGCTTCAT CTCCAAATGA TGGATTAATC 420 

ACAATGGTTC CTTCTGAAAC ACAAAGTAAC AATGAAATGT CCCCCACCAC AGAAGACAAT 480 

CAATCATCAG GGCCTCCCAC TGGCACCGCT TTATTGGAGA CCAGCACCCT AAACAGCACA 540 

GGTCCCAGCA ATCCTTGCCA AGATGATCCC TGTGCAGATA ATTCGTTATG TGTTAAGCTG 600 

CATAATACAA GTTTTTGCCT GTGTTTAGAA GGGTATTACT ACAACTCTTC TACATGTAAG 660 

AAAGGAAAGG TATTCCCTGG GAAGATTTCA GTGACAGTAT CAGAAACATT TGACCCAGAA 720 

GAGAAACATT CCATGGCCTA TCAAGACTTG CATAGTGAAA TTACTAGCTT GTTTAAAGAT 780 

GTATTTGGCA CATCTGTTTA TGGACAGACT GTAATTCTTA CTGTAAGCAC ATCTCTGTCA 840 

CCAAGATCTG AAATGCGTGC TGATGACAAG TTTGTTAATG TAACAATAGT AACAATTTTG 900 

GCAGAAACCA CAAGTGACAA TGAGAAGACT GTGACTGAGA AAATTAATAA AGCAATTAGA 960 

AGTAGCTCAA GCAACTTTCT AAACTATGAT TTGACCCTTC GGTGTGATTA TTATGGCTGT 1020 

AACCAGACTG CGGATGACTG CCTCAATGGT TTAGCATGCG ATTGCAAATC TGACCTGCAA 1080 

AGGCCTAACC CACAGAGCCC TTTCTGCGTT GCTTCCAGTC TCAAGTGTCC TGATGCCTGC 1140 

AACGCACAGC ACAAGCAATG CTTAATAAAG AAGAGTGGTG GGGCCCCTGA GTGTGCGTGC 1200 

GTGCCCGGCT ACCAGGAAGA TGCTAATGGG. AACTGCCAAA AGTGTGCATT TGGCTACAGT 1260 
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GGACTCGACT GTAAGGACAA ATTTCAGCTG ATCCTCACTA TTGTGGGCAC CATCGCTGQC 1320 

ATTGTCATTC TCAGCATGAT AATTGCATTG ATTGTCACAG CAAGATCAAA TAACAAAACO 1380 

AAGCATATTG AAGAAGAGAA CTTGATTGAC GAAGACTTTC AAAATCTAAA ACTGCGGTCG 1440 

ACAGGCTTCA CCAATCTTGG AGCAGAAGGG AGCGTCTTTC CTAAGGTCAG GATAACGGCC 1500 

TCCAGAGACA GCCAGATGCA AAATCCCTAT TCAAGACACA GCAGCATGCC CCGCCCTGAC 1560 

TATTAGAATC ATAAGAATGT GGAACCCGCC ATGGCCCCCA ACCAATGTAC AAGCTATTAT 1620 

TTAGAGTGTT TAGAAAGACT GATGGAGAAG TGAGCACCAG TAAAGATCTG GCCTCCGGGG 1680 

TTTTTCTTCC ATCTGACATC TGCCAGCCTC TCTGAATGGA AGTTGTGAAT GTTTGCAACG 1740 

AATCCAGCTC ACTTGCTAAA TAAGAATCTA TGACATTAAA TGTAGTAGAT GCTATTAGCG 1800 

CTTGTCAGAG AGGTGGTTTT CTTCAATCAG TACAAAGTAC TGAGACAATG GTTAGGGTTG 1860 

TTTTCTTAAT TCTTTTCCTG GTAGGGCAAC AAGAACCATT TCCAATCTAG AGGAAAGCTC 1920 

CCCAGCATTG CTTGCTCCTG GGCAAACATT GCTCTTGAGT TAAGTGACCT AATTCCCCTG 1980 

GGAGACATAC GCATCAACTG TGGAGGTCCG AGGGGATGAG AAGGGATACC CACCACCTTT 2040 

CAAGGGTCAC AAGCTCACTC TCTGACAAGT CAGAATAGGG ACACTGCTTC TATCCCTCCA 2100 

ATGGAGAGAT TCTGGCAACC TTTGAACAGC CCAGAGCTTG CAACCTAGCC TCACCCAAGA 2160 

AGACTGGAAA GAGACATATC TCTCAGCTTT TTCAGGAGGC GTGCCTGGGA ATCCAGGAAC 2220 

TTTTTGATGC TAATTAGAAG GCCTGGACTA AAAATGTCCA CTATGGGGTG CACTCTACAG 2280 

TTTTTGAAAT GCTAGGAGGC AGAAGGGGCA GAGAGTAAAA AACATGACCT GGTAGAAGGA 2340 

AGAGAGGCAA AGGAAACTGG GTGGGGAGGA TCAATTAGAG AGGAGGCACC TGGGATCCAC 2400 

CTTCTTCCTT AGGTCCCCTC CTCCATCAGC AAAGGAGCAC TTCTCTAATC ATGCCCTCCC 2460 

GAAGACTGGC TGGGAGAAGG TTTAAAAACA AAAAATCCAG GAGTAAGAGC CTTAGGTCAG 2520 

TTTGAAATTG GAGACAAACT GTCTGGCAAA GGGTGCGAGA GGGAGCTTGT GCTCAGGAGT 2580 

CCAGCCGTCC AGCCTCGGGG TGTAGGTTTC TGAGGTGTGC CATTGGGGCC TCAGCCTTCT 2640 

CTGGTGACAG AGGCTCAGCT GTGGCCACCA ACACACAACC ACACACACAC AACCACACAC 2700 

ACAAATGGGG GCAACCACAT CCAGTACAAG CTTTTACAAA TGTTATTAGT GTCCTTTTTT 2760 

ATTTCTAATG CCTTGTCCTC TTAAAAGTTA TTTTATTTGT TATTATTATT TGTTCTTGAC 2820 

TGTTAATTGT GAATGGTAAT GCAATAAAGT GCCTTTGTTA GATGGTGAAA AAAAAAAAAA 2880 
AAAAAAA 

Seq ID NO: 150 Protein sequence 
Protein Accession #: NP_149038.1 

1 11 21 31 41 51 

t I I I 1.1 

MKAIXHLTLIi ALLSVNTATN QGNSADAVTT TETATSGPTV AAADTTETNF PETASTTANT 60 

PSFPTATSPA PPIISTHSSS TIPTPAPPII STHSSSTIPI PTAADSESTT NVNSLATSDI 120 

ITASSPNDGL ITMVPSETQS NNEMSPTTED NQSSGPPTGT ALLETSTLNS TGPSNPCQDD 180 

PCADNSLCVK LHNTSPCLCL BGYYYNSSTC KKGKVFPGKI SVTVSETFDP EEKHSMAYQD 240 

LHSEITSLFK DVPGTSVYGQ TVILTVSTSL SPRSEMRADD KFVNVTIVTI LAETTSDNEK 300 

TVTEKINKAI RSSSSNFLNY DLTLRCDYYG CNQTADDCLN GLACDCKSDL QRPNPQSPFC 360 

VASSLKCPDA CNAQHKQCLI KKSGGAPECA CVPGYQEDAN GNCQKCAFGY SGLDCKDKFQ 420 

LILTXVGTIA GIVILSMIIA LIVTARSNNK TKHIEEENLI DEDFQNLKLR STGFTNLGAE 480 
GSVFPKVRIT ASRDSQMQNP YSRHSSMPRP DY 

Seq ID NO: 151 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence : 87 . . 914, 

1 11 21 31 41 51 

1 I till 

AGTACGCGGG GCCGCGTTTT GCCTCCGCAG CAGCTCTGGG CTCTTCTCAG CTGCGCGAGC 60 

AGCTGCTCCA ATGCCCCGGA GTGGCCATGG GCGCCCCGCA CTGGTGGGAC CAGCTGCAGG 120 

CTGGTAGCTC GGAGGTGGAC TGGTGCGAGG ACAACTACAC CATCGTGCCT GCTATCGCCG 180 

AGTTCTACAA CACGATCAGC AATGTCTTAT TTTTCATTTT ACCGCCCATC TGCATGTGCT 240 

TGTTTCGTCA GTATGCAACA TGCTTCAACA GTGGCATCTA CTTAATCTGG ACTCTTTTGG 300 

TTGTAGTGGG AATTGGATCC GTCTACTTCC ATGCAACCCT TAGTTTCTTG GGTCAGATGC 360 

TTGATGAACT TGCAGTCCTT TGGGTTCTGA TGTGTGCTTT GGCCATGTGG TTCCCCAGAA 420 

GGTATCTACC AAAGATCTTT CGGAATGACC GGGGTAGGTT CAAGGTGGTG GTCAGTGTCC 480 

TGTCTGCGGT TACGACGTGC CTGGCATTTG TCAAGCCTGC CATCAACAAC ATCTCTCTGA 540 

TGACCCTGGG AGTTCCTTGC ACTGCACTGC TCATCGCAGA GCTAAAGAGG TGTGACAACA 600 

TGCGTGTGTT TAAGCTGGGC CTCTTCTCX3G GCCTCTGGTG GACCCTGGCC CTGTTCTGCT 660 

GGATCAGTGA CCGAGCTTTC TGCGAGCTGC TGTCATCCTT CAACTTCCCC TACCTGCACT 720 

GCATGTGGCA CATCCTCATC TGCCTTGCTG CCTACCTGGG CTGTGTATGC TTTGCCTACT 780 

TTGATGCTGC CTCAGAGATT CCTGAGCAAG GCCCTGTCAT CAAGTTCTGG CCCAATGAGA 840 

AATGGGCCTT CATTGGTGTC CCCTATGTGT CCCTCCTGTG TGCCAACAAG AAATCATCAG 900 

TCAAGATCAC GTGATGGCAA GATGGTGGCT GGCTTCTCTG CTTATCGCCC CTCATGCAGT 960 

GGGCTTCCTT TGCTAGGAAG ACAGCCAAGG GAGTTCGAAT AGTTGGGGTG TGGGCTATCT 1020 

TTTCAAAAAT CTATTTGCTG GGGCTCTTAA TTTCTTTAGT GTTCTTTQTA TGTAGGGATT 1080 

TAAACTTTGT CATATGGTAC AAATATTCCC TGCCCCCCTG CAGTTTCCCA TTTGTCTTTC 1140 

AGTATGTTAA TATTTTTGTG CCATACTGGT TTTAAACTTT CATGTTGTCA CATCTGTTAA 1200 

TCTTTTCTTT AGGATTTCTG GATTTTGTGT AATTTTTAAA AAGGTCCCCT CCTCCTCCCT 1260 

AATGTGTCTG TGGACCACCT GGATTCCACT GTACAAGGGG AAAAGTGTCT ATTCCTTTCC 1320 

CAAAGATGGA AAATGGAGGG CTTAGGGACA CTAGATGCAT CTTTCTCAGC ATCACTTCCA 1380 

GATGCAGTGA CTTGTTGGGC TGCGTCCTTA ATGGCCATGG CAGAGCAGTC CCTTGGGGGA 1440 

TCCAGCCCTG TACAATGCAT CTCTTCCTGG AGAAAGCTGG CCTGCTCCAG ACCCCACCAT 1500 

TCCCAGGCGC CCTTGGAGTG GACTCTACTG ATGACAGACA GACCCTCTGA GAGACAAGAC 1560 

CCTCTGACTC TGTGATGGAA GATGCCAGAG ATTTTCCTTT GGGGTAATTG TCCTTAAACA 1620 

AAACCAAACA GATGAAACAC ACACAGGACT TGTGGCTAAA AAGGCTAGTT TTTCACTTGC 1680 

ATTTCTCAAC TAACCCAGGT TTTACATGCA TCTGTGAATC CTTTTACTAC TACCTCTGTG 1740 

GAGAGATGGA GAGACTTCAG ATAAACGTGA AGCTAATGAG TAAAACCCTC TCTGCCAAAA 1800 

CCTACACTCC ACTTTAGGCC CTTCTTGAAG ATGAGCACAA TTTTTAAATA CTGAGCACAA 1860 

TTTTTAAATA CTGACATCAC TTCCTCTTCC CCCTCCCACC CCAGCTCAGC AGCCTCAAAT 1920 

CTACAGAGAA GAAGAATTAT GGCATGAACA TTCCCACAGA CCCACCATCT TTAAGACTTG 1980 

ACCTCTGTAA GTTTACCAAA GGGCTCCTCA CAATTGTGGT GGGGGTTCTG GTTCAAAATT 2040 
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TGGAGCAAAC ATGAAGTTTT TGGAAACGTT TTCTCATTTG AAGCCTCCAG TATGCTGTAC 2100 
TATTCTGGAA ATTACCTTCA AGAGTCTCAC TTCTTGTTTC TGTTGTGTTT TCTGTGGGCA 2160 
TCATGTTCTT CACGCTTGCA GTAGAAGGTG CTTTCTCGGT TTCCCAGAGT ATCCAACGGC 2220 
TCACCTTTCT CAAGTGCTGG CAGTAGCTAT GCACTCACGG GCTGGTTTGG GTOGCTGGTG 2280 
CAGCAGCGCA AATCTGTTGC CTTCTGAATT TTTCTCACCT AATGTGACAC TGGCTACAAT 2340 
GAATCTTCTC TTCATCGGGC TGAATGAAAG ATTCAAGAAC CATCTTCAAG GTGCATGGTG 2400 
GGAATTATCA ACCTCAGGGA TACTCATTTT AACTCAGGCG TGTCCTGCTT TGTAACATTC 2460 
CATTGTTGGG AGAGGGCAGG ACAGGTGTGT TCTTCTGTGG GCAGGAGTCA TGTCACTGTC 2520 
CTACATATGT AAGAGTTGGG AAGGTGACGA TTTTTGACAC ATCCAGGAAC TCTTACTCTA 2580 
GTTAGAATTT GTACCAGATC CAAGGTGAAA ACCCCAATAA GCAACTGAAT TTAGAGTTTA 2640 
AAAATGAATG ACTTTATGCT ACATCTGTGG TTATCAAATT ATATAGGTTG TTGAGAAGCA 2700 
GAACGCTGTT TGTAGTAAGA AATCTTTGTG GAACCCCAGT GTGTGAAGTA AATTGTATGT 2760 
TATTAAATTT ATTTAAGGTT AAATTTATGG CATTTACTTA ATAATATATG AGGTGGTGAA 2820 
AATGCAAATT AACAAATTGG TAATTTCCAA GGTAGAAAAA TTAGGTGTTG AATGAATGTA 2880 
TATGTTGGTT TATTATAGTT TTATATATAT ATAGAGAGAG TGTTTTGTTT TTGAGTCAGG 2940 
GTCTTGCTCT ATTACCCAGG CTGGAGTGCA GTGGTGCAGT CATGACTCAC TGTAGCCTCT 3000 
GTCTCCCAGG CTCAAGTGAT CCTCTCACCT CAGCCTCCCC AGGAGCTGGG AGTACTGGTG 3060 
OSTGCCTCCA CTCCCAGCTA ATTTTTGTAT TTTTTCATAG AGATGGGGTT TCACCATTTT 3120 
GCCCAGGCTG GTCTCAAACT TCTGGGCTCA AGAGATTCGC CCGCCTCGGC CTCCCAAAGT 3180 
GCTGGGAATA GGCATGAGCC GTCACGCCTG GCCTAAAAAA TATTTTTAAA ATGATCTTTG 3240 
AATTAAATAT TCQTAGAATT TCTAATGTAT CTCTTTGAGA CCTAGGAGGT TGATGGAACA 3300 
GAACTGCTGT TAAGTCCTTT GGGTTTCAAG TCTAGAATTT TTTAAAGGCA AATATCAGCT 3360 
CATTCTTATT TTAGATTGAC CTTATCAGGC ATGGATTCTG GTCT CATCTA CTTTATGGTA 3420 
TAAATGCTCC AAGGTAGGGG GTTTGGTATA TATTTTAAGC CCGGCTTTTT TTTTTTTTTT 34 80 
TTTTTTTTTT TTTTTAATGT GAGAAGCAGA ATGTGCTTCT AGAAACTGGT TTTAAAGAGA 3540 
TGAGCTGAGA AAGAAATGTG GAATGGAGTA TATTTGAGGA GGACAAAACA TAACTTCACT 3600 
TTTGAACAGA AATCACTCTA GCTTGCCAGC ATGGGATGTA AACCAAGAGA GTAGAAATAT 3660 
ACCCATCTTA TTTTAAGTTG GGTTTATGGC ATCGCTCATA TATGTAAAAG CACTACAAAC 3720 
TCTTTAAAGA AAATTGGGAA ACTACAGAGA AGTCAAAGAA AAAAAAAAGT AACCCATATT 3780 
TCTATTGCCC AGGTATAATC CTTGTTAATA TTTTGGTTTG GTCTCCTCTT TTTTTCCCCC 3840 
AATATAGTTG TAAATAAATG ATGTCTTTCA GAGTTGACAT TTATCCTGTA ' GCTTGAATGG 3900 
CATGTAAATG CCAGTTGTAT ATTTTTTCAT GAAGTGTAGG TTTGGAATAC ACTAGAGTTA 3960 
GCTATATGCT TGAATGCTGA TCACTGGATT CTGAGACTGA CTACTGAGTC TACC TTTTTA 4020 
ATCAAGCCTA ACATGAATGG GCTCCAAAAA GTAATGAATG TAATTGTACT TTTTGATGTG 4080 
CCTCTGCACT TGGCTTGGTG AGTCATCATA AATAGCTGTT AAATATGTGA CTTTACAGAT 4140 
TTTGATATGT TCAGATTGTA AAAAATGAAT AGTTTATTTC ATTAATTGAT GGGCAGTCAA 4200 
GAATCTCCCT CC 

Seq ID NO: 152 Protein sequence 
Protein Accession ft: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MGAPHWWDQL QAGSSEVDWC EDNYTIVPAI AEFYNTISNV LFFILPPICM CLFRQYATCF 60 
NSGIYLIWTL LWVGIGSVY FHATLSFLGQ MLDELAVLWV LMCALAMWFP RRYLPKIFRN 120 
DRGRPKWVS VLSAVTTCLA FVKPAINNIS LMTLGVPCTA LLIAELKRCD NMRVFKLGLF 180 
SGLWWTLALF CWISDRAFCE LLSSFNFPYL HCMWHILICL AAYLGCVCFA YFDAASEIPE 240 
QGPVIKFWPN EKWAFIGVPY VSLLCANKKS SVKIT 

Seq ID NO: 153 DNA sequence 

Nucleic Acid Accession ft: NM_001432.1 

Coding sequence: 167.. 67 6 

1 11 21 31 41 51 

! I I 1 1 I 

TCACTTGCCT GATATTTCCA GTGTCAGAGG GACACAGCCA ACGTGGGGTC CCTTCTAGGC 60 

TGACAGCCGC TCTCCAGCCA CTGCCGCGAG CCCGTCTGCT CCCGCCCTGC CCGTGCACTC 120 

TCCGCAGCCG CCCTCCGCCA AGCCCCAGCG CCCGCTCCCA TCGCCGATGA CCGCGGGGAG 180 

GAGGATGGAG ATGCTCTGTG CCGGCAGGGT CCCTGCGCTG CTGCTCTGCC TGGGTTTCCA 240 

TCTTCTACAG GCAGTCCTCA GTACAACTGT GATTCCATCA TGTATCCCAG GAGAGTCCAG 300 

TGATAACTGC ACAGCTTTAG TTCAGACAGA AGACAATCCA CGTGTGGCTC AAGTGTCAAT 360 

AACAAAGTGT AGCTCTGACA TGAATGGCTA TTGTTTGCAT GGACAGTGCA TCTATCTGGT 420 

GGACATGAGT CAAAACTACT GCAGGTGTGA AGTGGGTTAT ACTGGTGTCC GATGTGAACA 480 

CTTCTTTTTA ACCGTCCACC AACCTTTAAG CAAAGAGTAT GTGGCTTTGA COGTGATTCT 540 

TATTATTTTG TTTCTTATCA CAGTCGTCGG TTCCACATAT TATTTCTGCA GATGGTACAG 600 

AAATCGAAAA AGTAAAGAAC CAAAGAAGGA ATATGAGAGA GTTACCTCAG GGGATCCAGA 660 

GTTGCCGCAA GTCTGAATGG CGCCATCAAA CTTATGGGCA GGGATAACAG TGTGCCTGGT 720 

TAATATTAAT ATTCCATTTT ATTAATAATA TTTATGTTGG GTCAAGTGTT AGGTCAATAA 780 

CACTGTATTT TAATGTACTT GAAAAATGTT TTTATTTTTG TTTTATTTTT GACAGACTAT 840 

TTGCTAATGT ATAATGTGCA GAAAATATTT AATATCAAAA GAAAATTGAT ATTTTTATAC 900 

AAGTAATTTC CTGAGCTAAA TGCTTCATTG AAAGCTTCAA AGTTTATATG CCTGGTGCAC 960 

AGTGCTTAGA AGTAAGCAAT TCCCAGGTCA TAGCTCAAGA ATTGTTAGCA AATGACAGAT 1020 

TTCTGTAAGC CTATATATAT AGTCAAATCG ATTTAGTAAG TATGTTTTTT ATGTTCCTCA 1080 

AATCAGTGAT AATTGGTTTG ACTGTACCAT GGTTTGATAT GTAGTTGGCA CCATGGTATC 1140 

ATATATTAAA ACAATAATGC AATTAGAATT TGGGAGAAGC AAATATAGGT CCTGTGTTAA 1200 

ACACTACACA TTTGAAACAA GCTAACCCTG GGGAGTCTAT GGTCTCTTCA CTCAGGTCTC 1260 

AGCTATAATT CTGTTATATG AGGGGCAGTG GACAGTTCCC TATGCCAACT CACGACTCCT 1320 

ACAGGTACTA GTCACTCATC TACCAGATTC TGCCTATGTA AAATGAATTG A AAAACAA TT 1380 

TTCTGTAATC TTTTATTTAA GTAGTGGGCA TTTCATAGCT TCACAATGTT CCTTTTTTGT 1440 

ATATTACAAC ATTTATGTGA GGTAATTATT GCTCAACAGA CAATTAGAAA AAAGTCCACA 1500 

CTTGAAGCCT AA A TTTGT G C TTTTTAAGAA TATTTTTAGA CTATTTCTTT TTATAGGGGC 1560 

TTTGCTGAAT TCTAACATTA AATCACAGCC CAAAATTTGA TGGACTAATT ATTATTTTAA 1620 

AATATATGAA GACAATAATT CTACATGTTG TCTTAAGATG GAAATACAGT TATTTCATCT 1680 

TTTATTCAAG GAAGTTTTAA CTTTAATACA GCTCAGTAAA TGGCTTCTTC TAGAATGTAA 1740 
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AGTTATGTAT TTAAAGTTGT ATCTTGACAC AGGAAATGGG AAAAAACTTA AAAATTAATA 1800 

TGGTGTATTT TTCCAAATGA AAAATCTCAA TTGAAAGCTT TTAAAATGTA GAAACTTAAA I860 

CACACCTTCC TGTGGAGGCT GAGATGAAAA CTAGGGCTCA TTTTCCTGAC ATTTGTTTAT 1920 

TTTTTGGAAG AGACAAAGAT TTCTTCTGCA CTCTGAGCCC ATAGGTCTCA GAGAGTTAAT 1980 

AGGAGTATTT TTGGGCTATT GCATAAGGAG CCACTGCTGC CACCACTTTT GGATTTTATG 2040 

GGAGGCTCCT TCATCGAATG CTAAACCTTT GAGTAGAGTC TCCCTGGATC ACATACCAGG 2100 

TCAGGGAGGA TCTGTTCTTC CTCTACGTTT ATCCTGGCAT GTGCTAGGGT AAACGAAGGC 2160 

ATAATAAGCC ATGGCTGACC TCTGGAGCAC CAGGTGCCAG GACTTGTCTC CATGTGTATC 2220 

CATGCATTAT ATACCCTGGT GCAATCACAC GACTGTCATC TAAAGTCCTG GCCCTGGCCC 2280 

TTACTATTAG GAAAATAAAC AGACAAAAAC AAGTAAATAT ATATGGTCCT ATACATATTG 2340 

TATATATATT CATATACAAA CATGTATGTA TACATGACCT TAATGGATCA TAGAATTGCA 2400 

GTCATTTGGT GCTCTGCTAA CCATTTATAT AAAACTTAAA AACAAGAGAA AAGAAAAATC 2460 

AATTAGATCT AAACAGTTAT TTCTGTTTCC TATTTAATAT AGCTGAAGTC AAAATATGTA 2520 

AGAACACATT TTAAATACTC TACTTACAGT TGGCCCTCTG TGGTTAGTTC CACATCTGTG 2580 

GATTCAACCA ACCAAGGACG GAAAATGCTT AAAAAATAAT ACAACAACAA CAAAAAATAC 2640 

ATTATAACAA CTATTTACTT TTTTTTTTTT CTTTTTGAGA TGGAGTCTCG CTCTGTTGCC 2700 

CAGGTTGGAG TGCAGTGGCA CGATCTOGGC TCACTGCAAC CTCACCTCCC GGGTTCAAGA 2760 

GATCCTCCTG CCTCAGCCTC CTGAGCAGCT GGGACTACAG GCGCATGCCA CCATGCCCAG 2820 

CTAATTTTTG TATTTTTAGT AGAGGCGGGG TTTCACCATG TTGGCCAGGA TGGTCTCAAT 2880 

CTCCTAACCT TGAGATCCAC CCTCCACAGC CTCCCAAACT GCTGGGATTA CAGGCGTGAG 2940 

CCACCGCACG TAGCATTTAC ATTAGGTATT ACAAGTAATG TAAAGATGAT TTAAGTATAC 3000 

AGGAGGATGT GAATAGGTTA TATGCAAGCA CTATGCCCTT TTATATAAGT GACTTGAACA 3060 

TCTGTGCCCG ATTTTAGTAT GTGCAGGGGG GCGATCTGGG AATCAGTCCC CTGTGGATAC 3120 

CAAGGTACAA CTGTATTTAT TAAOGCTTAC TAGATGTGAG GAGAGTCTGA ATATTTTCAG 3180 

TGATCTTGGC TGTTTCAAAA AAATCTATTG ACTTTTCAAT AAATCAGCTG CAATCCATTT 3240 

ATTTCATTTA CAAAAGATTT ATTGTAAGCC TCTCAATCTT GGTTTTTCAG TTGATCTTAA 3300 

GCATGTCAAT TCATAAAAAC AAGTCATTTT TGTATTTTTC ATCTTTAAGA ATGCTTAAAA 3360 

AAGCTAATCC CTAAAATAGT TAGATCTTTG TAAATGCATA TTAAATAATA AAGTATGACC 3420 

CACATTACTT TTTATGGGTG AAAATAAGAC AAAAATAATA GTTTTAGTGA GGATGGTGCT 3480 

GAGTAAACAT AAAAACTGAT TTGCTCTCAG CTGATGTGTC CTGTACACAG TGGGAAGATT 3540 

TTAGTTCACA CTTAGTCTAA CTCCCCCATT TTACAGATTT CTCACTATAT ATATTTCTAG 3600 

AAGGGGCTAT GCATATTCAA TGTATTGAGA ACCAAAGCAA CCACAAATGC ATAAATGCAT 3660 

AATTTATGGT CTTCAACCAA GGCCACATAA TAACCCAGTT AACTTACTCT TTAACCAGGA 3720 

ATATTAAGTT CTATAACTAG TACTCAAGGT TTAACCTTAA AATTAAGATT TCCTTAACCT 3780 

TAACCTTAAA ATTGATATTA TATTAAACAT ACATAATACA ATGTAACTCC ACTGTTCTCC 3840 

TGAATATTTT TTGCTCTAAT CTCTCTGCGG AAAGTCAAAG TGATGGGAGA ATTGGTATAC 3900 

TGGTATGACT AOGTCTTAAG TCAGATTTTT ATTTATGAGT CTTTGAGACT AAATTCAATC 3960 

ACGACCAGGT ATCAAATCAA CTTTTATGCA GCAAATATAT GATTCTAGTG TCTGACTTTT 4020 

GTTAAATTCA GTAATGCAGT TTTTAAAAAC CTGTATCTGA CCCACTTTGT AATTTTTGCT 4080 

CCAATATCCA TTCTGTAGAC TTTTGAAAAA AAAGTTTTTA ATTTGATGCC CAATATATTC 4140 

TGACCGTTAA AAAATTCTTG TTCATATGGG AGAAGGGGGA GTAATGACTT GTACAAACAG 4200 

TATTTCTGGT GTATATTTTA ATGTTTTTAA AAAGAGTAAT TTCATTTAAA TATCTGTTAT 4260 

TCAAATTTGA TGATGTTAAA TGTAATATAA TGTATTTTCT TTTTATTTTG CACTCTGTAA 4320 

TTGCACTTTT TAAGTTTGAA GAGCCATTTT GGTAAACGGT TTTTATTAAA GATGCTATGG 4380 

AACATAAAGT TGTATTGCAT GCAATTTAAA GTAACTTATT TGACTATGAA TATTATCGGA 4440 

TTACTGAATT GTATCAATTT GTTTGTGTTC AATATCAGCT TTGATAATTG TGTACCTTAA 4500 

GATATTGAAG GAGAAAATAG ATAATTTACA AGATATTATT AATTTTTATT TATTTTTCTT 4560 

GGGAATTGAA AAAAAXTGAA ATAAATAAAA ATGCATTGAA CATCTTGCAT TCAAAATCTT 4620 
CACTGAC 

Seq ID NO: 154 Protein sequence 
Protein Accession UP 001423.1 



1 11 21 31 41 51 

55 | | | | | | 

MTAGRRMEML CAGRVPALLL CLGFHLLQAV LSTTVIPSCI PGESSDNCTA LVQTEDNPRV 60 

AQVSITKCSS DMNGYCLHGQ CTYLVDMSQN YCRCEVGYTG VRCEHFFLTV HQPLSKEYVA 120 
LTVILIILPL ITWGSTYYP CRWYRNRKSK EPKKEYERVT SGDPELPQV 

60 Seq ID NO: 155 DNA sequence 

Nucleic Acid Accession ft: NM_013282.2 
Coding sequence: 85. .2466 

« ' 1 11 21 31 41 51 

65 I | | i | | 

CGACTCCTTA GAGCATGGCA TGGCTCAGAG GTGCTGGTAA AACTGATGGG GGTTTTTGCT 60 

GTCCCTCCCC TCAGCGCCGA CACCATGTGG ATCCAGGTTC GGACCATGGA CGGGAGGCAG 120 

ACCCACACGG TGGACTCGCT GTCCAGGCTG ACCAAGGTGG AGGAGCTGAG GCGGAAGATC 180 

CAGGAGCTGT TCCACGTGGA GCCAGGCCTG CAGAGGCTGT TCTACAGGGG CAAACAGATG 240 

GAGGACGGCC ATACCCTCTT OGACTACGAG GTCCGCCTGA ATGACACCAT CCAGCTCCTG 300 

GTCCGCCAGA GCCTCGTGCT CCCCCACAGC ACCAAGGAGC GGGACTCCGA GCTCTCCGAC 360 

ACCGACTCCG GCTGCTGCCT GGGCCAGAGT GAGTCAGACA AGTCCTCCAC CCACGGCGAG 420 

GCGGCCGCCG AGACTGACAG CAGGCCAGCC GATGAGGACA TGTGGGATGA GACGGAATTG 480 

GGGCTGTACA AGGTCAATGA GTAOGTCGAT GCTCGGGACA OGAACATGGG GGCGTGGTTT 540 

GAGGCGCAGG TGGTCAGGGT GACGCGGAAG GCCCCCTCCC GGGAOGAGCC CTGCAGCTCC 600 

ACGTCCAGGC CGGOGCTGGA GGAGGACGTC ATTTACCACG TGAAATACGA OGACTACCCG 660 

GAGAACGGOG TGGTCCAGAT GAACTCCAGG GACGTCOGAG CGCGOGCCCG CACCATCATC 720 

AAGTGGCAGG ACCTGGAGGT GGGCCAGGTG GTCATGCTCA ACTACAACCC CGACAACCCC 780 

AAGGAGCGGG GCTTCTGGTA CGACGCGGAG ATCTCCAGGA AGCGOGAGAC CAGGACGGCG 640 

CGGGAACTCT ACGCCAACGT GGTGCTGGGG GATGATTCTC TGAACGACTG TCGGATCATC 900 

TTCGTGGACG AAGTCTTCAA GATTGAGCGG COGGGTGAAG GGAGCCCCAT GGTTGACAAC 960 

CCCATGAGAC GGAAGAGCGG GCCGTCCTGC AAGCACTGCA AGGACGACGT GAACAGACTC 1020 

TGCCGGGTCT GCGCCTGCCA CCTGTGCGGG GGCCGGCAGG ACCCCGACAA GCAGCTCATG 1080 

TGOGATGAGT GCGACATGGC CTTCCACATC TACTGCCTGG ACCCGCCCCT CAGCAGTGTT 1140 
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CCCAGCGAGG ACGAGTGGTA CTGCCCTGAG TGCCGGAATG ATGCCAGCGA GGTGGTACTG 1200 

GCGGGAGAGC GGCTGAGAGA GAGCAAGAAG AAGGCGAAGA TGGCCTCGGC CACATCGTCC 1260 

TCACAGCGGG ACTGGGGCAA GGGCATGGCC TGTGTGGGCC GCACCAAGGA ATGTACCATC 1320 

GTCCCGTCCA ACCACTACGG ACCCATCCCG GGGATCCCCG TGGGCACCAT GTGGCGGTTC 1380 

CGAGTCCAGG TCAGCGAGTC GGGTGTCCAT CGGCCCCACG TGGCTGGCAT ACACGGCCGG 1440 

AGCAACGACG GAGCGTACTC CCTAGTCCTG GCGGGGGGCT ATGAGGATGA CGTGGACCAT 1500 

GGGAATTTTT TCACATACAC GGGTAGTGGT GGTCGAGATC TTTCCGGCAA CAAGAGGACC 1560 

GCGGAACAGT CTTGTGATCA GAAACTCACC AACACCAACA GGGCGCTGGC TCTCAACTGC 1620 

TTTGCTCCCA TCAATGACCA AGAAGGGGCC GAGGCCAAGG ACTGGCGGTC GGGGAAGCCG 1680 

GTCAGGGTGG TGCGCAATGT CAAGGGTGGC AAGAATAGCA AGTACGCCCC CGCTGAGGGC 1740 

AACCGCTACG ATGGCATCTA CAAGGTTGTG AAATACTGGC CCGAGAAGGG GAAGTCCGGG 1800 

TTTCTCGTGT GGCGCTACCT TCTGCGGAGG GACGATGATG AGCCTGGCCC TTGGACGAAG 1860 

GAGGGGAAGG ACCGGATCAA GAAGCTGGGG CTGACCATGC AGTATCCAGA AGGCTACCTG 1920 

GAAGCCCTGG CCAACCGAGA GCGAGAGAAG GAGAACAGCA AGAGGGAGGA GGAGGAGCAG 1980 

CAGGAGGGGG GCTTCGCGTC CCCCAGGACG GGCAAGGGCA AGTGGAAGCG GAAGTCGGCA 2040 

GGAGGTGGCC CGAGCAGGGC CGGGTCCCCG CGCCGGACAT CCAAGAAAAC CAAGGTGGAG 2100 

CCCTACAGTC TCACGGCCCA GCAGAGCAGC CTCATCAGAG AGGACAAGAG CAACGCCAAG 2160 

CTGTGGAATG AGGTCCTGGC GTCACTCAAG GACCGGCCGG CGAGCGGCAG CCCGTTCCAG 2220 

TTGTTCCTGA GTAAAGTGGA GGAGACGTTC CAGTGTATCT GCTGTCAGGA GCTGGTGTTC 2280 

CGGCCCATCA CGACCGTGTG CCAGCACAAC GTGTGCAAGG ACTGCCTGGA CAGATCCTTT 2340 

CGGGCACAGG TGTTCAGCTG CCCTGCCTGC CGCTACGACC TGGGCCGCAG CTATGCCATG 2400 

CAGGTGAACC AGCCTCTGCA GACCGTCCTC AACCAGCTCT TCCCCGGCTA CGGCAATGGC 2460 

OGGTGATCTC CAAGCACTTC TCGACAGGCG TTTTGCTGAA AACGTGTCGG AGGGCTCGTT 2520 

CATCGGCACT GATTTTGTTC TTAGTGGGCT TAACTTAAAC AGGTAGTGTT TCCTCCGTTC 2580 

CCTAAAAAGG TTTGTCTTCC TTTTTTTTTA TTTTTATTTT TCAAATCTAT ACATTTTCAG 2640 

GAATTTATGT ATTCTGGCTA AAAGTTGGAC TTCTCAGTAT TGTGTTTAGT TCTTTGAAAA 2700 

CATAAAAGCC TGCAATTTCT CGACAAAACA ACACAAGATT TTTTAAAGAT GGAATCAGAA 2760 

ACTACGTGGT GTGGAGGCTG TTGATGTTTC TGGTGTCAAG TTCTCAGAAG TTGCTGCCAC 2820 

CAACTCTTTA AGAAGGCGAC AGGATCAGTC CTTCTCTAGG GTTCTGGCCC CCAAGGTCAG 2880 

AGCAAGCATC TTCCTGACAG CATTTTGTCA TCTAAAGTCC AGTGACATGG TTCCCCGTGG 2940 

TGGCCCGTGG CAGCCCGTGG CATOGCGTGG CTCAGCTGTC TGTTGAAGTT GTTGCAAGGA 3000 

AAAGAGGAAA CATCTCGGGC CTAGTTCAAA CCTTTGCCTC AAAGCCATCC CCCACCAGAC 3060 

TGCTTAGCGT CTGAGATCCG CGTGAAAAGT CCTCTGCCCA CGAGAGCAGG GAGTTGGGGC 3120 

CACGCAGAAA TGGCCTCAAG GGGACTCTGC TCCACGTGGG GCCAGGCGTG TGACTGACGC 3180 

TGTCCGACGA AGGCGGCCAC GGACGGACGC CAGCACACGA AGTCACGTGC AAGTGCCTTT 3240 

GATTCGTTCC TTCTTTCTAA AGACGACAGT CTTTGTTGTT AGCACTGAAT TATTGAAAAT 3300 

GTCAACCAGA TTCTAGAAAC TGCGGTCATC CAGTTCTTCC TGACACCGGA TGGGTGCTTG 3360 

GGAACCGTTT GAGCCTTATA GATCATTTAC ATTCAATTTT TTTAACTCAG CAAGTGAGAA 3420 

CTTACAAGAG GGTTTTTTTT TAATTTTTTT TTCTCTTAAT GAACACATTT TCTAAATGAA 3480 

TTTTTTTTGT AGTTACTGTA TATGTACCAA GAAAGATATA ACGTTAGGGT TTGGTTGTTT 3540 

TTGTTTTTGT ATTTTTTTTC TTTTGAAAGG GTTTGTTAAT TTTTCTAATT TTACCAAAGT 3600 

TTGCAGCCTA TACCTCAATA AAACAGGGAT ATTTTAAATC ACATACCTGC AGACAAACTG 3660 

GAGCAATGTT ATTTTTAAAG GGTTTTTTTC ACCTCCTTAT TCTTAGATTA TTAATGTATT 3720 

AGGGAAGAAT GAGACAATTT TGTGTAGGCT TTTTCTAAAG TCCAGTACTT TGTCCAGATT 3780 
TTAGATTCTC AGAATAAATG TTTTTCACAG ATTGAAAAAA AAAAAAAA 

Seq ID NO: 156 Protein sequence 
Protein Accession #: NP 037414.2 



1 
I 

MWIQVRTMDG 
YEVRLNDTIQ 
PADEDMWDET 
DVIYHVKYDD 
AEISRKRETR 
SCKHCKDDVN 
PECRNDASEV 
IPGIPVGTMW 



GGKNSKYAPA 
LGLTMQYPEG 
SPRRTSKKTK 
TFQCICCQEL 



11 
1 

RQTHTVDSLS 
LLVRQSLVLP 
ELGLYKVNEY 
YPENGWQMN 
TARELYANW 
RLCRVCACHL 
VLAGERLRES 
RFRVQVSESG 
RTAEQSCDQK 
EGNRYDGIYK 
YLEALANRER 
VEPYSLTAQQ 
VFRPITTVCQ 



21 
I 

RLTKVEELRR 
HSTKERDSEL 
VDARDTNMGA 
SRDVRARART 
LGDDSLNDCR 
CGGRQDFDKQ 
KKKAKMASAT 
VHRPHVAGIH 
LTNTNRALAL 
WKYWPEKGK 



SSLIREDKSN 
HNVCKDCLDR 



31 
I 

KIQELFHVEP 
SDTDSGCCLG 
WFEAQWRVT 
IIKWQDLEVG 
IIFVDEVFKI 
LMCDECDMAF 
SSSQRDWGK3 
GRSNDGAYSL 
NCFAPINDQE 
SGFLVWRYLL 
EQQEGGFASP 
AKLWNEVLAS 
SFRAQVFSCP 



41 
I 

GLQRLFYRGK 
QSESDKSSTH 
RKAPSRDEPC 
QWMLNYNPD 
ERPGEGSPMV 
HIYCLDPPLS 
KACVGRTKEC 
VLAGGYEDDV 
GAEAKDWRSG 
RRDDDEPGPW 
RTGKGKWKRK 
LKDRPASGSP 
ACRYDLGRSY 



51 
I 

QMEDGHTLFD 
GEAAAETDSR 
SSTSRPALEE 
NPKERGFWYD 
DNPMRRKSGP 
SVPSEDEWYC 
TIVPSNHYGP 
DHGNFFTYTG 
KPVRWRNVK 



Seq ID NO: 157 DNA sequence 

Nucleic Acid Accession #: NM_0007S6.1 

Coding sequence : 186 . . 776 



AGAAACTCAG 
GAGAGTGGGA 
TCTCTGCAGA 
CTAACATGCG 
GCCCGCCATG 
AGCACCCTCA 
CGCAGGCTCG 
TCAACAAGAG 
GCAGCCGCCC 
TGCTGCTGCC 
ATGCCCTCGG 
TCTCCCTGGA 
AGTTAGCACA 



11 
I 

AGACCAAGTC 
ACAGTAAAGA 
GAGGCGGCAG 
GCTGCCGCTG 
CAGGGCGCTC 
GCCCTTGGAT 
GCCGGTCCTG 
CCCGGCCGCT 
TTCGCCGGAA 
TCGGCGCTCG 
CGGCCACCAG 
TCTCACCTTC 
GCAAGCTCAC 



21 
I 

CATTGAGAGA 
GAAAGGAAGA 
CACCCGGCTC 
CTTGTGTCCG 
CTGAGCCGCG 
TTCTTCCAGC 
CTCCGCATGG 
CCCCTTTCGC 
CAGGCGACCG 
CTCGACAGCC 
GAGGCACCGG 
CACCTCCTCC 
AGCAACAGGA 



31 
I 

CTGAGGGGAA 
CAACCTCCAG 
ACCTGCGAAG 
OGGGAGTCCT 
GGCCGGTCCC 
CGCCGCCGCA 
GAGAGGAGTA 

CCAACTTTTT 

AGAGAGAAAG 
GGGAAGTCTT 
AACTCATQGA 



41 
I 

AGAGAGGAGA 
AGAAAGCCCC 
GGCCTGGGAA 
GCTGGTGGCT 
GGGAGCTCGG 
GTCCGAGCAG 
CTTCCTCCGC 
GCTCCTCGCC 
CCGCGTGTTG 
CGCGGAGCGC 
GCGGTCCGAG 
GGAAATGGCC 
GATTATTGGG 



SAGGGPSRAG 
FQLFIiSKVEE 
AMQVNQPLQT 



51 
I 

GAAAGAAAAA 
CGGAGACGTC 
GCGAGTGCCC 
CTCCTGCCCT 
CAGGCGCCGC 
CCCCAGCAGC 
CTGGGGAACC 
GGAGGCAGCG 
CTGCAGCAGC 
GGCGCTAGGA 
GAGCCTCCCA 
AGGGCCGAGC 
AAATAAAACG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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GTGCGTTTGG CCAAAAAGAA TCTGCATTTA GCACAAAAAA AATTTAAAAA AATACAGTAT 840 

TCTGTACCAT AGCGCTGCTC TTATGCCATT TGTTTATTTT TATATAGCTT GAAACATAGA 900 

GGGAGAGAGG GAGAGAGOCT ATACCCCTTA CTTAGCATGC ACAAAGTGTA TTCACGTGCA 960 

GCAGCAACAC AATGTTATTC GTTTTGTCTA CGTTTAGTTT CCGTTTCCAG GTGTTTATAG 1020 

TGGTGTTTTA AAGAGAATGT AGACCTGTGA GAAAACGTTT TGTTTGAAAA AGCAGACAGA 1080 

AGTCACTCAA TTGTTTTTGT TGTGGTCTGA GCCAAAGAGA ATGCCATTCT CTTGGGTGGG 1140 

TAAGACTAAA TCTGTAAGCT CTTTGAAACA ACTTTCTCTT GTAAACGTTT CAGTAATAAA 1200 

ACATCTTTCC AGTCCTTGGT CAGTTTGGTT GTGTAAGAGA ATGTTGAATA CTTATATTTT 1260 
TAATAAAAGT TGCAAAGGT 

Seq ID NO: 15B Protein sequence 
Protein Accession 8: NP 000747.1 



ic } " 21 31 41 51 

MRLPLLVSAG VLLVALLPCP PCRALLSRGP VPGARQAPQH PQPLDFFQPP PQSEQPQQPQ 60 

ARPVLLRMGB EYFLRLGNLN KSPAAPLSPA SSLLAGGSGS RPSPEQATAN FFRVLLQQLL 120 

LPRRSL.DSPA ALAERGARNA LGGHQEAPER ERRSEEPPIS LDLTFHLLRE VLEMARAEQL 180 
AQQAHSNRKL MEIIGK 



Seq ID NOs 159 dna aeguencg 

Nucleic Acid Accession #: NM 001200.1 

Coding sequence: 325.. 1514 ~~ 



25 1 11 21 31 41 51 

I I I I I ! 

GGGGACTTCT TGAACTTGCA GGGAGAATAA CTTGCGCACC CCACTTTGCG CCGGTGCCTT 60 

TGCCCCAGCG GAGCCTGCTT CGCCATCTCC GAGCCCCACC GCCCCTCCAC TCCTCGGCCT 120 

TGCCCGACAC TGAGACGCTG TTCCCAGCGT GAAAAGAGAG ACTGCGCGGC CGGCACCCGG 180 

JV GAGAAGGAGG AGGCAAAGAA AAGGAACGGA CATTCGGTCC TTGCGCCAGG TCCTTTGACC 240 

AGAGTTTTTC CATGTGGACG CTCTTTCAAT GGACGTGTCC CCGCGTGCTT CTTAGACGGA 300 

CTGCGGTCTC CTAAAGGTCG ACCATGGTGG CCGGGACCCG CTGTCTTCTA GCGTTGCTGC 360 

TTCCCCAGGT CCTCCTGGGC GGCGCGGCTG GCCTCGTTCC GGAGCTGGGC CGCAGGAAGT 420 

oc TCGCGGCGGC GTCGTCGGGC CGCCCCTCAT CCCAGCCCTC TGACGAGGTC CTGAGCGAGT 480 

J J TCGAGTTGCG GCTGCTCAGC ATGTTCGGCC TGAAACAGAG ACCCACCCCC AGCAGGGACG 540 

CCGTGGTGCC CCCCTACATG CTAGACCTGT ATCGCAGGCA CTCAGGTCAG CCGGGCTCAC 600 

CCGCCCCAGA CCACCGGTTG GAGAGGGCAG CCAGCCGAGC CAACACTGTG CGCAGCTTCC 660 

ACCAT GAAGA ATCTTTGGAA GAACTACCAG AAACGAGTGG GAAAACAACC CGGAGATTCT 720 

. TCTTTAATTT AAGTTCTATC CCCACGGAGG AGTTTATCAC CTCAGCAGAG CTTCAGGTTT 780 

H\J TCCGAGAACA GATGCAAGAT GCTTTAGGAA ACAATAGCAG TTTCCATCAC CGAATTAATA 840 

TTTATGAAAT CATAAAACCT GCAACAGCCA ACTCGAAATT CCCCGTGACC AGACTTTTGG 900 

ACACCAGGTT GGTGAATCAG AATGCAAGCA GGTGGGAAAG TTTTGATGTC ACCCCCGCTG 960 

TGATGCGGTG GACTGCACAG GGACACGCCA ACCATGGATT CGTGGTGGAA GTGGCCCACT 1020 

TGGAGGAGAA ACAAGGTGTC TCCAAGAGAC ATGTTAGGAT AAGCAGGTCT TTGCACCAAG 1080 

ATGAACACAG CTGGTCACAG ATAAGGCCAT TGCTAGTAAC TTTTGGCCAT GATGGAAAAG 1140 

GGCATCCTCT CCACAAAAGA GAAAAACGTC AAGCCAAACA CAAACAGCGG AAACGCCTTA 1200 

AGTCCAGCTG TAAGAGACAC CCTTTGTACG TGGACTTCAG TGACGTGGGG TGGAATGACT 1260 

GGATTGTGGC TCCCCCGGGG TATCACGCCT TTTACTGCCA CGGAGAATGC CCTTTTCCTC 1320 

TGGCTGATCA TCTGAACTCC ACTAATCATG CCATTGTTCA GACGTTGGTC AACTCTGTTA 1380 

DV ACTCTAAGAT TCCTAAGGCA TGCTGTGTCC CGACAGAACT CAGTGCTATC TCGATGCTGT 1440 

ACCTTGACGA GAATGAAAAG GTTGTATTAA AGAACTATCA GGACATGGTT GTGGAGGGTT 1500 
GTGGGTGTCG CTAGTACAGC AAAATTAAAT ACATAAATAT ATATATA 

rc Seq ID N0 * 1S0 Protein sequence 
OD Protein Accession #: NP_001191.l 

1 11 21 31 41 SI 

I I I : I ! | 

MVAGTRCLLA LLLPQVLIX3G AAGLVPELGR RKFAAASSGR PSSQPSDEVL SEFELRLLSM 60 

OU FGLKQRPTPS RDAWPPYML DLYRRHSGQP GSPAPDHRLE RAASRANTVR SFKHEESLEE 120 

LPETSGKTTR RFFFNLSSIP TEEFITSAEL QVFREQMQDA LGNNSSFHHR INIYEIIKPA 180 
TANSKFPVTR LLDT 



Seq ID NO: 161 DNA sequence 

Nucleic Acid Accession ft: NMJJ01999-2 

Coding sequence: 1..8736 



1 11 21 31 41 51 

70 1 1 1 I i I 

f\J ATGGGGAGAA GACGGAGGCT GTGTCTCCAG CTCTACTTCC TGTGGCTGGG CTGTGTGGTG 60 

CTCTGGGCGC AGGGCACGGC CGGCCAGCCT CAGCCTCCTC CGCCCAAGCC GCCCCGGCCC 120 

CAGCCGCCGC CGCAACAGGT TCGGTCCGCT ACAGCAGGCT CTGAAGGCGG GTTTCTAGCG 180 

CCCGAGTATC GCGAGGAGGG TGCCGCAGTG GCCAGCCGCG TCCGCCGGCG AGGACAGCAG 240 

GACGTGCTCC GAGGGCCCAA CGTGTGCGGC TCCAGATTCC ACTCCTACTG CTGCCCTGGA 300 

/D TGGAAGACGC TCCCTGGAGG AAACCAGTGC ATTGTCCCGA TTTGTAGAAA TAGTTGTGGA 360 

GATGGATTTT GTTCCCGTCC TAACATGTGT ACTTGTTCCA GTGGGCAAAT ATCATCAACC 420 

TGTGGATCAA AATCAATTCA GCAGTGCAGT GTGAGATGCA TGAATGGTGG GACCTGTGCA 480 

GATGACCACT GCCAGTGCCA GAAAGGATAT ATTGGAACTT ATTGTGGACA ACCTGTCTGT 540 

cn GAAAATGGAT GTCAGAATGG TGGACGTTGC ATCGCCCAAC CGTGTGCTTG TGTTTATGGG 600 

OU TTCACTGGTC CACAGTGTGA AAGAGATTAC AGGACAGGCC CGTGTTTCAC TCAGGTCAAC 660 

AACCAGATGT GCCAAGGGCA GCTGACAGGC ATTGTCTGCA CGAAGACTCT GTGCTGTGCC 720 

ACCACTGGAC GGGCGTGGGG CCATCCCTGT GAGATGTGTC CAGCCCAGCC TCAGCCCTGC 780 

CGACGGGGTT TCATCCCCAA CATCCGCACT GGAGCTTGCC AAGATGTTGA TGAATGCCAG 840 

GCTATCCCAG GGATATGCCA AGGAGGAAAC TGTATCAATA CAGTGGGCTC TTTTGAATGC 900 
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AGATGCCCTG CTGGTCACAA ACAGAGTGAA ACTACTCAGA AATGTGAAGA CATTGATGAG 960 
TGCAGCATCA TTCCTGGGAT ATGTGAAACT GGTGAATGTT CCAACACCGT GGGAAGCTAT 1020 
TTTTGTGTTT GTCCACGTGG ATATGTAACC TCAACAGATG GCTCTCGATG CATCGATCAG 1080 
AGAACAGGCA TGTGTTTCTC GGGCCTGGTG AATGGCCGCT GTGCACAAGA GCTCCCGGGG 1140 
5 AGAATGACGA AAATGCAGTG CTGCTGTGAG CCTGGCCGCT GCTGGGGCAT CGGAACCATT 1200 
CCTGAAGCCT GTCCTGTCAG AGGTTCTGAG GAATATCGCA GACTTTGCAT GGATGGACTT 1260 
CCAATGGGAG GAATTCCAGG GAGTGCTGGT TCCAGACCTG GAGGCACTGG GGGAAATGGC 1320 
TTTGCCCCAA GTGGCAATGG CAATGGCTAT GGCCCAGGAG GGACAGGCTT CATCCCCATC 1380 
CCTGGAGGCA ATGGCTTTTC TCCTGGCGTT GGGGGAGCCG GTGTGGGGGC CGGGGGACAG 1440 
10 GGACCTATCA TCACTGGACT AACAATTCTG AACCAGACAA TAGATATCTG TAAGCATCAT 1500 
GCTAACCTTT GTTTAAATGG ACGCTGTATA CCAACTGTCT CAAGCTACCG ATGTGAATGC 1560 
AACATGGGTT ATAAGCAGGA TGCAAATGGA GATTGTATAG ATGTTGATGA ATGCACATCA 1620 
AATCCCTGCA CTAATGGAGA TTGTGTTAAC ACACCTGGTT CCTATTATTG TAAATGTCAT 1680 
GCTGGATTCC AGAGGACTCC TACCAAGCAA GCATGCATTG ATATTGATGA GTGCATCCAG 1740 
15 AATGGGGTTC TTTGTAAAAA CGGTCGATGC GTGAACTCAG ATGGAAGTTT CCAGTGCATT 1800 
TGCAATGCCG GCTTTGAATT AACTACAGAT GGAAAAAACT GTGTTGATCA TGATGAATGT 1860 
ACAACTACCA ACATGTGTTT GAATGGAATG TGCATCAATG AAGATGGCAG CTTCAAGTGC 1920 
ATCTGCAAAC CAGGATTTGT CTTGGCTCCA AATGGGCGTT ACTGTACTGA TGTTGATGAA 1980 
TGCCAGACCC CAGGAATCTG CATGAATGGG CACTGCATCA ACAGTGAAGG GTCCTTCCGC 2040 
20 TGTGACTGTC CCCCAGGCCT GGCTGTGGGC ATGGATGGAC GTGTGTGTGT TGATACTCAC 2100 
ATGCGCAGTA CCTGCTATGG AGGAATCAAG AAAGGAGTGT GTGTGCGTCC TTTCCCCGGT 2160 
GCAGTGACCA AGTCCGAATG CTGCTGTGCC AATCCAGACT ATGGTTTTGG AGAACCCTGC 2220 
CAGCCATGCC CTGCAAAAAA TTCAGCTGAA TTCCACGGCC TTTGTAGTAG TGGAGTAGGT 2280 
ATCACTGTGG ATGGAAGAGA TATCAATGAA TGTGCTTTGG ATCCTGATAT ATGTGCCAAT 2340 
25 GGGATTTGTG AAAACTTACG TGGTAGTTAC CGTTGTAATT GCAACAGTGG CTATGAACCA 2400 
GATGCCTCTG GAAGAAACTG TATTGACATT GATGAATGTT TAGTAAACAG ACTGCTTTGT 2460 
GATAACGGAT TGTGCCGAAA CACGCCAGGA AGTTACAGCT GTACGTGCCC ACCAGGGTAT 2520 
GTGTTCAGGA CTGAGACAGA GACCTGTGAA GATATAAATG AATGTGAAAG CAACCCATGT 2580 
GTCAATGGGG CCTGCAGAAA CAACCTTGGA TCTTTCAATT GTGAATGTTC GCCCGGCAGC 2640 
30 AAACTCAGCT CCACAGGATT GATCTGTATT GACAGCCTGA AGGGGACCTG TTGGCTCAAC 2700 
ATCCAGGACA GCCGCTGTGA GGTGAATATT AATGGAGCCA CTCTGAAATC TGAATGCTGT 2760 
GCCACCCTCG GAGCCGCCTG GGGGAGCCCC TGTGAGCGGT GTGAACTAGA TACAGCTTGC 2820 
CCAAGAGGGC TTGCCAGGAT TAAAGGTGTT ACGTGTGAAG ATGTTAATGA GTGTGAGGTG 2B80 
TTCCCTGGCG TTTGTCCAAA TGGACGCTGT GTCAACAGTA AGGGATCTTT TCATTGCGAG 2940 
35 TGCCCTGAAG GCCTTACGTT GGATGGGACT GGCCGTGTAT GTTTGGATAT TCGCATGGAG 3000 
CAGTGTTACT TGAAGTGGGA TGAAGATGAA TGCATCCACC CCGTTCCTGG AAAGTTCCGC 3060 
ATGGATGCCT GCTGCTGTGC TGTCGGGGCG GCTTGGGGCA CCGAGTGTGA GGAGTGCCCC 3120 
AAACCTGGCA CCAAGGAATA CGAGACACTG TGCCCCCGCG GGGCTGGCTT TGCTAACCGA 3180 
GGGGATGTTC TTACTGGGCG GCCATTTTAC AAAGACATCA ATGAATGCAA AGCATTTCCT 3240 
40 GGGATGTGCA CTTATGGGAA GTGCAGAAAT ACAATCGGAA GCTTCAAATG CCGTTGCAAT 3300 
AGTGGCTTTG CTCTAGACAT GGAGGAAAGA AACTGCACGG ACATCGACGA GTGCAGGATT 3360 
TCTCCTGACC TCTGTGGCAG TGGAATCTGC GTCAATACAC CGGGCAGCTT TGAGTGCGAG 3420 
TGCTTCGAAG GCTATGAAAG TGGCTTCATG ATGATGAAGA ACTGCATGGA CATTGACGGA 3480 
TGTGAACGTA ACCCTCTCCT TTGTAGGGGT GGCACCTGTG TGAACACTGA GGGCAGCTTT 3540 
45 CAGTGTGACT GCCCACTGGG ACACGAGCTG TCACCATCCC GTGAGGACTG TGTGGATATT 3600 
AATGAATGCT CCCTGAGTGA CAATCTCTGC AGAAATGGAA AATGTGTGAA CATGATTGGA 3660 
ACCTATCAGT GCTCTTGCAA TCCTGGATAT CAGGCTACGC CAGACCGCCA GGGCTGTACA 3720 
GATATTGATG AATGTATGAT AATGAACGGA GGCTGTGACA CCCAGTGCAC AAATTCAGAG 3780 
GGAAGCTACG AATGCAGCTG CAGTGAGGGT TATGCCCTGA TGCCAGATGG GAGATCGTGT 3840 
50 GCAGACATTG ATGAATGTGA AAACAATCCT GATATCTGTG ATGGCGGCCA GTGTACCAAC 3900 
ATTCCTGGAG AGTATCGCTG CCTCTGCTAT GATGGCTTCA TGGCTTCCAT GGACATGAAA 3960 
ACATGCATTG ATGTCAATGA ATGTGACCTA AATTCAAATA TCTGCATGTT TGGGGAATGT 4020 
GAGAACACAA AGGGATCCTT CATTTGCCAC TGTCAGCTGG GTTACTCAGT GAAGAAGGGG 4080 
ACCACAGGAT GTACAGATGT GGATGAGTGT GAAATTGGTG CTCATAACTG CGACATGCAT 4140 
55 GCCTCATGTC TGAATATCCC AGGAAGCTTC AAGTGTAGCT GCAGAGAAGG CTGGATTGGA 4200 
AACGGCATCA AGTGTATTGA TCTGGACGAA TGTTCTAATG GAACCCACCA GTGTAGCATC 4260 
AATGCTCAGT GTGTAAATAC CCCGGGCTCA TACCGCTGTG CCTGCTCCGA AGGTTTCACT 4320 
GGTGATGGCT TTACCTGCTC AGATGTTGAT GAGTGTGCAG AAAACATAAA CCTCTGTGAG 4380 
AACGGACAGT GCCTTAATGT CCCGGGTGCA TATCGCTGOG AGTGTGAGAT GGGCTTCACT 4440 
60 CCAGCCTCAG ACAGCAGATC CTGCCAAGAT ATTGATGAAT GCTCCTTCCA AAACATTTGT 4S00 
GTCTCTGGAA CATGTAATAA CCTGCCTGGA ATGTTTCATT GCATCTGCGA TGATGGTTAT 4560 
GAATTGGACA GAACAGGAGG GAACTGTACA GATATTGATG AGTGTGCAGA TCCTATAAAC 4620 
TGTGTCAATG GCCTATGTGT CAACACGCCT GGTCGCTATG AGTGTAACTG CCCACCCGAT 4680 
TTTCAGTTGA ACCCAACTGG TGTGGGTTGT GTTGACAACC GTGTGGGCAA CTGCTACCTG 4740 
65 AAGTTTGGAC CTGGAGGAGA TGGGAGTCTG TCTTGCAACA CCGAGATCGG GGTGGGCGTC 4800 
AGTCGCTCTT CATGCTGCTG CTCTCTGGGA AAGGCCTGGG GAAACCCCTG TGAGACATGC 4860 
CCCCCTGTCA ATAGCACTGA ATATTACACC CTGTGTCCCG GAGGTGAAGG CTTCAGACCT 4920 
AACCCCATCA CAATCATTTT AGAAGACATT GACGAATGCC AGGAGTTACC AGGTCTCTGC 4980 
CAGGGTGGAA ACTGCATCAA CACTTTTGGG AGCTTCCAGT GTGAGTGCCC ACAAGGCTAC 5040 
70 TACCTCAGCG AGGATACCCG CATCTGTGAG GATATTGATG AGTGTTTTGC ACATCCTGGT 5100 
GTGTGTGGGC CTGGGACCTG CTATAACACC CTGGGAAATT ACACCTGCAT TTGCCCACCT 5160 
GAGTACATGC AGGTCAATGG AGGCCACAAC TGCATGGACA TGAGAAAAAG CTTTTGCTAC 5220 
CGAAGCTATA ATGGAACCAC TTGTGAGAAT GAGTTGCCTT TCAATGTGAC AAAAAGGATG 52 BO 
TGCTGCTGCA CATATAATGT GGGCAAAGCT GGGAACAAAC CTTGTGAACC ATGCCCAACT 5340 
75 CCAGGAACAG CTGACTTTAA AACCATATGT GGAAATATTC CTGGATTCAC CTTTGACATT 5400 
CACACAGGAA AAGCTGTTGA CATTGATGAA TGTAAAGAGA TTCCAGGCAT TTGTGCAAAT 5460 
GGTGTGTGCA TTAACCAGAT TGGCAGTTTC CGCTGTGAAT GCCCTACAGG ATTCAGTTAC 5520 
AATGACCTGC TGTTGGTTTG TGAAGATATA GATGAGTGCA GCAATGGTGA TAATCTCTGC 5580 
CAGCGGAATG CAGACTGCAT CAATAGTCCT GGTAGTTACC GCTGTGAATG TGCCGCGGGT 5640 
80 TTCAAACTTT CACCCAATGG GGCCTGTGTA GATCGCAATG AATGTTTAGA AATTCCTAAC 5700 
GTTTGCAGTC ATGGCTTGTG TGTTGATCTG CAAGGAAGTT ACCAGTGCAT CTGCCACAAT 5760 
GGCTTTAAGG CTTCTCAGGA CCAGACCATG TGCATGGATG TTGATGAGTG CGAGCGGCAC 5820 
CCATGTGGAA ATGGAACTTG TAAAAACACC GTTGGATCCT ATAACTGTCT GTG CTACCCA 5880 
GGGTTTGAAC TCACTCATAA TAATGATTGC CTGGACATAG ATGAGTGCAG TTCCTTTTTT 5940 
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GGTCAGGTGT GCAGAAATGG AOGTTGTTTT AATGAAATTG GTTCTTTCAA GTGTCTATGT 6000 
AACGAAGGTT ATGAACTTAC CCCAGATGGC AAAAACTGTA TAGACACTAA TGAGTGTGTC 6060 
GCCCTTCCCG GCTCTTGCTC TCCTGGTACC TGTCAGAATT TGGAGGGATC CTTCAGATGC 6120 
ATCTGTCCCC CAGGGTATGA AGTAAAAAGC GAGAACTGCA TTGATATAAA TGAATGTGAT 6180. 
5 GAAGATCCCA ACATTTGTCT TTTTGGTTCC TGTACTAATA CTCCAGGGGG CTTCCAGTGC 6240 
CTCTGCCCCC CTGGCTTTGT ACTATCTGAT AATGGAOGGA GATGCTTTGA TACTCGCCAG 6300 
AGCTTCTGCT TCACAAATTT TGAAAATGGA AAGTGTTCTG TACCCAAAGC TTTCAACACC 6360 
ACAAAAGCAA AATGCTGCTG TAGTAAGATG CCAGGAGAGG GCTGGGGGGA CCCCTGTGAG 6420 
CTGTGCCCCA AAGACGATGA AGTTGCATTT CAGGATTTGT GTCCATATGG CCATGGAACT 64 80 
10 GTCCCTAGTC TTCATGATAC ACGTGAAGAT GTCAATGAGT GTCTTGAGAG CCCAGGCATT 6540 
TGTTCAAATG GTCAATGTAT CAACACCGAC GGATCTTTTC GCTGTGAATX3 TCCAATGGGC 6600 
TACAACCTTG ACTACACTGG AGTACGCTGT GTGGATACTG ATGAGTGTTC AATCGGCAAT 6660 
CCGTGTGGAA ATGGTACATG CACCAATGTT ATTGGGAGTT TTGAATGCAA TTGCAATGAA 6720 
GGCTTTGAGC CAGGGCCCAT GATGAATTGT GAAGATATCA ACGAATGTGC CCAGAACCCA 6780 
15 CTGCTGTGTG CTTTACGCTG CATGAACACT TTTGGGTCCT ATGAATGCAC GTGCCCGATT 6840 
GGCTATGCCC TCAGGGAAGA TCAAAAGATG TGCAAAGATC TGGATGAATG TGCTGAAGGG 6900 
TTACACGACT GTGAATCTAG GGGCATGATG TGTAAGAATC TAATCGGCAC CTTCATGTGC 6960 
ATCTGCCCTC CTGGAATGGC CCGAAGGCCC GATGGAGAAG GCTGTGTAGA TGAAAATGAA 7020 
TGCAGGACCA AGCCAGGAAT CTGTGAAAAT GGACGTTGTG TTAACATTAT TGGAAGCTAT 7080 
20 AGATGTGAGT GTAATGAAGG ATTCCAGTCA AGTTCTTCAG GCACTGAATG CCTTGACAAT 7140 
CGACAGGGTC TCTGCTTTGC AGAGGTACTG CAGACAATAT GTCAAATGGC ATCCAGTAGT 7200 
CGCAATCTCG TCACTAAGTC AGAATGCTGC TGTGATGGTG GGCGAGGCTG GGGCCACCAG 72 SO 
TGCGAGCTTT GCCCACTTCC TGGAACTGCC CAGTACAAAA AGATATGTCC TCATGGCCCA 7320 
GGATATACAA CTGATGGAAG AGATATTGAT GAATGTAAGG TAATGCCAAA CCTCTGCACC 7380 
25 AATGGTCAGT GCATCAATAC CATGGGCTCA TTCCGATGCT TCTGCAAGGT TGGCTACACC 7440 
ACAGACATCA GTGGAACCTC TTGTATAGAC CTTGATGAAT GCTCCCAGTC CCCGAAACCA 7500 
TGCAACTACA TCTGCAAGAA CACTGAGGGG AGTTATCAGT GTTCATGTCC GAGGGGGTAT 7560 
GTCCTGCAAG AGGATGGAAA GACATGCAAA GACCTTGATG AATGTCAAAC AAAGCAGCAT 7620 
AACTGCCAGT TCCTCTGTGT CAACACCCTG GGGGGGTTTA CCTGTAAATG TCCACCTGGT 76 BO 
30 TTCACACAGC ATCACACTGC TTGTATCGAC AACAACGAAT GTGGGTCTCA ACCTTTGCTT 7740 
TGTGGAGGAA AGGGAATCTG TCAAAACACT CCAGGCAGTT TCAGCTGTGA ATGCCAAAGA 7 BOO 
GGGTTCTCTC TTGATGCCAC CGGACTGAAC TGTGAAGATG TTGATGAATG TGATGGGAAC 7860 
CACAGGTGCC AACACGGCTG CCAGAACATC CTGGGTGGCT ACAGATGTGG CTGCCCCCAA 7920 
GGCTACATCC AGCACTACCA GTGGAATCAG TGTGTCGATG AGAATGAATG CTCCAATCCC 7980 
35 AATGCCTGTG GCTCTGCTTC CTGCTACAAC ACCCTGGGGA GTTACAAGTG CGCCTGCCCC 8040 
TCGGGGTTCT CCTTCGACCA GTTCTCCAGT GCCTGCCACG ACGTGAATGA GTGCTCGTCC 8100 
TCCAAGAACC CCTGCAATTA CGGCTGCTCT AACACGGAGG GGGGCTACCT CTGTGGCTGC 8160 
CCCCCTGGGT ATTACAGAGT GGGACAAGGC CACTGTGTCT CAGGAATGGG ATTTAACAAG 8220 
GGGCAGTACC TGTCACTGGA TACAGAGGTC GATGAGGAAA ATGCTCTGTC CCCAGAAGCA 8280 
40 TGCTACGAGT GCAAAATCAA CGGCTATCCT AAGAAAGACA GCAGGCAGAA GAGAAGTATT 8340 
CATGAACCTG ATCCCACTGC TGTTGAACAG ATCAGCCTAG AGAGTGTCGA CATGGACAGC 8400 
CGCGTCAACA TGAAGTTCAA CCTCTCCCAC CTCGGCTCTA AGGAGCACAT CCTGGAACTA 8460 
AGGCCCGCCA TCCAGCCCCT CAACAACCAC ATCCGTTATG TCATCTCTCA AGGGAACGAT 8520 
GACAGCGTCT TCCGCATCCA CCAAAGGAAT GGGCTCAGCT ACTTGCACAC GGCCAAGAAG 8580 
45 AAGCTCATGC CCGGCACATA CACACTGGAA ATCACTAGCA TCCCTCTCTA CAAGAAGAAG 8640 
GAGCTTAAGA AACTGGAAGA GAGCAATGAG GATGACTACC TCCTAGGGGA GCTTGGGGAG 8700 
GCTCTCAGAA TGAGGCTGCA GATTCAGCTC TATTAACCGT TCACAGACTT GGGCCCAGGC B760 
TCAAATCCTA GCACAGCCAG TCTGCAGAAG CATTTGAAAA GTCAAGGACT AATTTTAAAG B820 
AGGAAAAATA ATAATAACTC TTGTTTCTTT CCTCCCTGTC TTAGACTTTG AATGTTGACC 8880 
50 CTCACAGGGA GGGATAATTT AGACTCTGGT ATGGCCAAAG ATTTGAGCTC AAAGGCAACC 8940 
GTGGTTACTG TATTTTTTAT ATAACTTCAT TTTAAAATAT ATTAAAAGAA ACCTAAATGT 9000 
TCAAGATATC AGCATATGGC ACTAAATGCA CAAAAATAAT GTGAGCTTTT TTTTTTTTTT 9060 
CCTGTTAGCA GTCTGTAACA CTTTGGGTAT TTTGCTATAG TTGCTAATTA AAAAAATATA 9120 
GATGTTTATT TATTTTTAAT GCAGTAATAT ATGGAGAAAT GAACAAACTA TGTAAACAAA 9180 
55 AAGGGAAACT CACTT G TT T T TCTTTAGATT TATAAATTTG AGCTATTTTT TTTAGAGGTG 9240 
CTTTTTAAAA ATCCAATAGA TACAAGAGAT GTTTCCTTTG GTTTTCTGCC AGTCATCCAG 9300 
CTGATACACA CCTGATCGAT TTTAAAGAAA GCCACACAGA GCTGAATCGG GCAGTGCTAA 9360 
TCAATAATTT AAAAGACATG AATGTCATTA GATCCTTTAT AACGTAGATC GAAGCCAAAG 9420 
- CAGCTCATTT GTGACAACAT TTCATATCAC CAGACACACC AGGCAACAGA AGTTGAAGCA 9480 

60 CAACCACTGT AGCAAAATAC CTTGACTGCT TGTGAGACCA TTAGCATTGC AGGCCAAACC 9540 
GTACTGTATT TCCTTCTCAT AACCTCAAGG AACCATATGT GCTACCCACA ACACCTCATT 9600 
CTTACCCAGG GTGCGCTGCG TCCTCATGGT ACTGTAGGCA GCTGAAGAAC GGCOGTTCCC 9660 
TTGAAAGGGA ACACCTGGCA TTCTGTGGTG TTTCGTGCTG TCTTAAATAA TGGTGCATTT 9720 
_ c ATTATGTTCA AGTTATTTCA GGATTGCCAT ATGTGCAAAC AAATCATGCA ATGCAGCCAA 9780 
05 GGAATATATG TTGTTGTTGT TGTTTTAAAC CCATTTTTTT TTTAGAATTT TCATTAATAC 9B40 
TGTAGTTATA CACGATATGC CTCATTTTAT CATAGCCTAT TGTGTATGAA AGATGTTTGT 9900 
ACAATGAATT GATGTTTAGT TTGCTTTAGT CATTTAAAAA GATATTGTAC CAGGATGTGC 9960 
TATTAAGAGC ACGTATCCAT TATTCTTCTC AACCCAAGAA CCTGTTTCCT GGACCAGTGA 10020 
CCAAACCTCA TATGTGAAAT GGCCAAAGCA CATGCAGGCT CCTGGTTGTT CCTCTCAAAC 10080 
70 CTGTGCTGAC CAAAGATTAG TAACCAGTTA TACCCAGTAT TTTGAGGTTT TATTGTTTTT 10140 
TTAATAACTA AAAAAAAACT CGTGCC 

Seq ID NO i 162 Protein sequence 
^ Protein Accession #: NPJJ01990.1 

1 11 21 31 41 SI 

1 I II I I 

MGRRRRLCLQ LYFLWLGCW LWAQGTAGQP QPPPPKPPRP QPPPQQVRSA TAGSEGGFLA 60 

PEYREEGAAV ASRVRRRGQQ DVLRGPNVCG SRFHSYCCPG WKTLPGGNQC IVPICRNSCG 120 

80 DGFCSRPNMC TCSSGQISST CGSKSIQQCS VRCMNGGTCA DDHCQCQKGY IGTYCGQPVC 180 

ENGCQNGGRC IAQPCACVYG FTGPQCERDY RTGPCFTQVN NQMOQGQLTG IVCTKTLCCA 240 

TTGRAWGHPC EMCPAQPQPC RRGFIPNIRT GAOQDVDECQ AIPGICQGGN CINTVGSFEC 300 

RCPAGHKQSE TTQKCEDIDE CSIIPGICET GECSNTVGSY FCVCPRGYVT STDGSRCIDQ 360 

RTGMCPSGLV NGRCAQELPG RMTKMQCCCE PGRCWGIGTI PEACPVRGSE EYRRLCMDGL 420 
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PMGGIPGSAG SRPGGTGGNG FAPSGNGNGY GPGGTGFIPI PGGNGFSPGV GGAGVGAGGQ 480 

GPIITGLiTIL NQTIDICKHH ANLCLNGRCI PTVSSYRCEC NMGYKQDANG DCIDVDECTS 540 

NPCTNGDCVN TPGSYYCKCH AGFQRTPTKQ ACIDIDECIQ NGVLCKNGRC VNSDGSFQCI 600 

CNAGFELTTD GKNCVDHDEC TTTNMCLNGM CINEDGSFKC ICKPGFVLAP NGRYCTDVDB 660 

CQTPGICMNG HCINSEGSFR CDCPPGLAVG MDGRVCVDTH MRSTCYGGIK KGVCVRPFPG 720 

AVTKSECCCA NPDYGFGEPC QPCPAKNSAE FHGLCSSGVG ITVDGRDINE CALDPDICAN 780 

GICENLRGSY RCNCNSGYEP DASGRNCIDI DECLVNRLLC DNGLCRNTPG SYSCTCPPGY 840 

VFRTETETCE DINECESNPC VNGACRNNLG SFNCECSPGS KLSSTGLICI DSLKGTCWLN 900 

IQDSRCEVNI NGATLKSECC ATLGAAWGSP CERCELDTAC PRGLARIKGV TCEDVNECEV 960 

FPGVCPNGRC VNSKGSFHCE CPEGLTLDGT GRVCLDIRME QCYLKWDEDE CIHPVPGKFR 1020 

MDACCCAVGA AWGTECEECP KPGTKEYETL CPRGAGFANR GDVLTGRPFY KDINECKAFP 10 80 

GMCTYGKCRN TIGSFKCRCN SGFALDMBER NCTDIDECRI SPDLGGSGIC VNTPGSFECE 1140 

CFEGYESGFM MMKNCMDIDG CERNPLLCRG GTCVNTEGSP QCDCPLGHEL SPSREDCVDI 1200 

NECSLSDNLC RNGKCVNMIG TYQCSCNPGY QATPDRQGCT DIDECMIMNG GCDTQCTNSE 1260 

GSYBCSCSEG YALMPDGRSC ADIDECENNP DICDGGQCTN IPGEYRCLCY DGFMASMDMK 1320 

TCIDVNECDL NSNICMFGEC ENTKGSFICH CQLGYSVKKG TTGCTDVDEC EIGAHNCDMH 1380 

ASCLNIPGSF KCSCREGWIG NGIKCIDLDE CSNGTHQCSI NAQCVNTPGS YRCACSEGFT 1440 

GDGFTCSDVD ECAENINLCE NGQCLNVPGA YRCECEMGFT PASDSRSCQD IDECSFQNIC 1500 

VSGTCNNLPG MFHCICDDGY ELDRTGGNCT DIDECADPIN CVNGLCVNTP GRYECNCPPD 1560 

FQLNPTGVGC VDNRVGNCYL KFGPRGDGSL SCNTEIGVGV SRSSCCCSLG KAWGNPCETC 1620 

PPVNSTEYYT LCPGGEGFRP NPITIILEDI DECQELPGLC QGGNCINTFG SFQCECPQGY 1680 

YLSEDTRICE DIDECFAHPG VCGPGTCYNT LGNYTCICPP EYMQVNGGHN CMDMRKSFCY 1740 

RSYNGTTCEN ELPFNVTKRM CCCTYNVGKA GNKPCEPCPT PGTADFKTIC GNIPGFTFDI 1800 

HTGKAVDIDE CKEIPGICAN GVCINQIGSF RCECPTGFSY NDLLLVCEDI DECSNGDNLC 1860 

QRNADCINSP GSYRCECAAG FKLSPNGACV DRNECLEIPN VCSHGLCVDL QGSYQCICHN 1920 

GFKASQDQTM CMDVDECERH PCGNGTCKNT VGSYNCLCYP GFELTHNNDC LDIDECSSFF 1980 

GQVCRNGRCF NEIGSFKCLC NEGYELTPDG KNCIDTNECV ALPGSCSPGT CQNLEGSFRC 2040 

ICPPGYEVKS ENCIDINECD EDPNICLFGS CTNTPGGFQC LCPPGFVLSD NGRRCFDTRQ 2100 

SFCFTNFENG KCSVPKAFNT TKAKCCCSKM PGEGWGDPCE LCPKDDEVAF QDLCPYGHGT 2160 

VPSLHDTRED VNECLESPGI CSNGQCINTD GSFRCECPMG YNLDYTGVRC VDTDECSIGN 2220 

PCGNGTCTNV IGSFECNCNE GFEPGPMMNC EDINECAQNP LLCALRCMNT FGSYECTCPI 2280 

GYALREDQKM CKDLDECAEG LHDCESRGMM CKNLIGTFMC ICPPGMARRP DGEGCVDENE 2340 

CRTKPGICEN GRCVNIIGSY RCECNEGFQS SSSGTECLDN RQGLCFAEVL QTICQMASSS 2400 

RNLVTKSECC CDGGRGWGHQ CELCPLPGTA QYKKICPHGP GYTTDGRDID ECKVMPKLCT 2460 

NGQCINTMGS FRCPCKVGYT TDISGTSCID LDECSQSPKP CNYICKNTEG SYQCSCPRGY 2520 

VLQEDGKTCK DLDECQTKQH NCX}FLCVNTL GGFTCKCPPG FTQHHTACID NNECGSQPLL 2580 

CGGKGICQNT PGSFSCECQR GFSLDATGLM CEDVDECDGN HRCQHGCQNI LGGYRCGCPQ 2640 

GYIQHYQWNQ CVDENECSNP NACGSASCYN TLGSYKCACP SGFSFDQFSS ACHDVNECSS 2700 

SKNPCNYGCS NTEGGYLOGC PPGYYRVGQG HCVSGMGFNK GQYLSLDTEV DEENALSPEA 2760 

CYECKINGYP KKDSRQKRSI HEPDPTAVEQ ISLBSVDMDS PVNMKFNLSH LGSKEHILBL 2820 

RPAIQPLNNH IRYVISQGND DSVFRIHQRN GLSYLHTAKK KLMPGTYTLE ITSIPLYKKK 2880 
ELKKLEESNE DDYLLGELGE ALRMRLQIQL Y 

Seq ID NO: 163 DNA sequence 

Nucleic Acid Accession #: NM_013372.1 

Coding sequence: 63.. 617 

1 11 21 31 41 • 51 

I I I I I I 

GCGGCCGCAC TCAGCGCCAC GCGTCGAAAG CGCAGGCCCC GAGGACCCGC CGCACTGACA 60 

GTATGAGCCG CACAGCCTAC ACGGTGGGAG CCCTGCTTCT CCTCTTGGGG ACCCTGCTGC 120 

CGGCTGCTGA AGGGAAAAAG AAAGGGTCCC AAGGTGCCAT CCCCCOGCCA GACAAGGCCC 180 

AGCACAATGA CTCAGAGCAG ACTCAGTCGC CCCAGCAGCC TGGCTCCAGG AACCGGGGGC 240 

GGGGCCAAGG GCGGGGCACT GCCATGCCCG GGGAGGAGGT GCTGGAGTCC AGCCAAGAGG 300 

CCCTGCATGT GACGGAGCGC AAATACCTGA AGCGAGACTG GTGCAAAACC CAGCCGCTTA 360 

AGCAGACCAT CCACGAGGAA GGCTGCAACA GTCGCACCAT CATCAACCGC TTCTGTTACG 420 

GCCAGTGCAA CTCTTTCTAC ATCCCCAGGC ACATCCGGAA GGAGGAAGGT TCCTTTCAGT 480 

CCTGCTCCTT CTGCAAGCCC AAGAAATTCA CTACCATGAT GGTC3VCACTC AACTGCCCTG 540 

AACTACAGCC ACCTACCAAG AAGAAGAGAG TCACACGTGT GAAGCAGTGT CGTTGCATAT 600 

CCATCGATTT GGATTAAGCC AAATCCAGGT GCACCCAGCA TGTCCTAGGA ATGCAGCCCC 660 

AGGAAGTCCC AGACCTAAAA CAACCAGATT CTTACTTGGC TTAAACCTAG AGGCCAGAAG 720 

AACCCCCAGC TGCCTCCTGG CAGGAGCCTG CTTGTGCGTA GTTCGTGTGC ATGAGTGTGG 780 

ATGGGTGCCT GTGGGTGTTT TTAGACACCA GAGAAAACAC AGTCTCTGCT AGAGAGCACT 840 

CCCTATTTTG TAAACATATC TGCTTTAATG GGGATGTACC AGAAACCCAC CTCACCCCGG 900 

CTCACATCTA AAGGGGCGGG GCCGTGGTCT GGTTCTGACT TTGTGTTTTT GTGCCCTCCT 960 

GGGGAGCAGA ATCTCCTTTC GGAATGAATG TTCATGGAAG AGGCTCCTCT GAGGGCAAGA 1020 

GACCTGTTTT AGTGCTGCAT TCGACATGGA AAAGTCCTTT TAACCTGTGC TTGCATCCTC 1080 

CTTTCCTCCT CCTCCTCACA ATCCATCTCT TCTTAAGTTG ATAGTGACTA TGTCAGTCTA 1140 

ATCTCTTGTT TGCCAAGGTT CCTAAATTAA TTCACTTAAC CATGATGCAA ATGTTTTTCA 1200 

TTTTGTGAAG ACCCTCCAGA CTCTGGGAGA GGCTGGTGTG GGCAAGGACA AGCAGGATAG 1260 

TGGAGTGAGA AAGGGAGGGT GGAGGGTGAG GCCAAATCAG GTCCAGCAAA AGTCAGTAGG 1320 

GACATTGCAG AAGCTTGAAA GGCCAATACC AGAACACAGG CTGATGCTTC TGAGAAAGTC 1380 

TTTTCCTAGT ATTTAACAGA ACCCAAGTGA ACAGAGGAGA AATGAGATTG CCAGAAAGTG 1440 

ATTAACTTTG GCCGTTGCAA TCTGCTCAAA CCTAACACCA AACTGAAAAC ATAAATACTG 1500 

ACCACTCCTA TGTTCGGACC CAAGCAAGTT AGCTAAACCA AACCAACTCC TCTGCTTTGT 1560 

CCCTCAGGTG GAAAAGAGAG GTAGTTTAGA ACTCTCTGCA TAGGGGTGGG AATTAATCAA 1620 

AAACCKCAGA GGCTGAAATT CCTAATACCT TTCCTTTATC GTGGTTATAG TCAGCTCATT 1680 

TCCATTCCAC TATTTCCCAT AATGCTTCTG AGAGCCACTA ACTTGATTGA TAAAGATCCT 1740 

GCCTCTGCTG AGTGTACCPG ACAGTAAGTC TAAAGATGAR AGAGTTTAGG GACTACTCTG 1800 

TTTTAGCAAG ARATATTKTG GGGGTCTTTT TGTTTTAACT ATTGTCAGGA GATTGGGCTA 1860 

RAGAGAAGAC GACGAGAGTA AGGAAATAAA GGGRATTGCC TCTGGCTAGA GAGTAAGTTA 1920 

GGTGTTAATA CCTGGTAGAA ATGTAAGGGA TATGACCTCC CTTTCTTTAT GTGCTCACTG 1980 

AGGATCTGAG GGGACCCTGT TAGGAGAGCA TAGCATCATG ATGTATTAGC TGTTCATCTG 2040 

CTACTGGTTG GATGGACATA ACTATTGTAA CTATTCAGTA TTTACTGGTA GGCACTGTCC 2100 
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TCTGATTAAA CTTGGCCTAC TGGCAATGGC TACTTAGGAT TGATCTAAGG GCCAAAGTGC 2160 

AGGGTGGGTG AACTTTATTG TACTTTGGAT TTGGTTAACC TGTTTTCTTC AAGCCTGAGG 2220 

TTTTATATAC AAACTCCCTG AATACTCTTT TTGCCTTGTA TCTTCTCAGC CTCCTAGCCA 2280 

AGTCCTATGT AATATGGAAA ACAAACACTG CAGACTTGAG ATTCAGTTGC CQATCAAGGC 2340 

D TCTGGCATTC AGAGAACCCT TGCAACTCGA GAAGCTGTTT TTATTTCGTT TTTGTTTTGA 2400 

TCCAGTGCTC TCCCATCTAA CAACTAAACA GGAGCCATTT CAAGGCGGGA GATATTTTAA 2460 

ACACCCAAAA TGTTGGGTCT GATTTTCAAA CTTTTAAACT CACTACTGAT GATTCTCACG 2520 

CTAGGC GAAT TTGTCCAAAC ACATAGTGTG TGTGTTTTGT ATACACTGTA TGACCCCACC 2S80 

CCAAATCTTT GTATTGTCCA CATTCTCCAA CAATAAAGCA CAGAGTGGAT TTAATTAAGC 2640 

ACACAAATGC TAAGGCAQAA TTTTGAGGGT GGGAGAGAAG AAAAGGGAAA GAAGCTGAAA 2700 

ATGTAAAACC ACACCAGGOA GGAAAAATGA CATTCAGAAC CAGCAAACAC TGAATTTCTC 2760 

TTGTTGTTTT AACTCTGCCA CAAGAATGCA ATTTCGTTAA TGGAGATGAC TTAAGTTGGC 2820- 

AGCAGTAATC TTCTTTTAGG AGCTTGTACC ACAGTCTTGC ACATAAGTGC AGATTTGGCT 2880 

CAAGTAAAGA GAATTTCCTC AACACTAACT TCACTGGGAT AATCAGCAGC GTAACTACCC 2940 

TAAAAGCATA TCACTAGCCA AAGAGGGAAA TATCTGTTCT TCTTACTGTG CCTATATTAA 3000 

GACTAGTACA AATGTGGTGT GTCTTCCAAC TTTCATTGAA AATGCCATAT CTATACCATA 3060 

TTTTATTCGA GTCACTGATG ATGTAATGAT ATATTTTTTC ATTATTATAG TAGAATATTT 3120 

TTATGGCAAG ATATTTGTGG TCTTGATCAT ACCTATTAAA ATAATGCCAA ACACCAAATA 3180 

TGAATTTTAT GATGTACACT TTGTGCTTGG CATTAAAAGA AAAAAACACA CATCCTGGAA 3240 

Z\J GTCTGTAAGT TGTTTTTTGT TACTGTAGGT CTTCAAAGTT AAGAGTGTAA GTGAAAAATC 3300 

TGGAGGAGAG GATAATTTCC ACTGTGTGGA ATGTGAATAG TTAAATGAAA AGTTATGGTT 3360 

ATTTAATGTA ATTATTACTT CAAATCCTTT GGTCACTGTG ATTTCAAGCA TGTTTTCTTT 3420 

TTCTCCTTTA TATGACTTTC TCTGAGTTGG GCAAAGAAGA AGCTGACACA CCGTATGTTG 3480 

TTAGAGTCTT TTATCTGGTC AGGGGAAACA AAATCTTGAC CCAGCTGAAC ATGTCTTCCT 3540 

GAGTCAGTGC CTGAATCTTT ATTTTTTAAA TTGAATGTTC CTTAAAGGTT AACATTTCTA 3600 

AAGCAATATT AAGAAAGACT TTAAATGTTA TTTTGGAAGA CTTACGATGC ATGTATACAA 3660 

ACGAATAGCA GATAATGATG ACTAGTTCAC ACATAAAGTC CTTTTAAGGA GAAAATCTAA 3720 

AATGAAAAGT GGATAAACAG AACATTTATA AGTGATCAGT TAATGCCTAA GAGTGAAAGT 3780 

AGTTCTATTG ACATTCCTCA AGATATTTAA TATCAACTGC ATTATGTATT ATGTCTGCTT 3840 

AAATCATTTA AAAACGGCAA AGAATTATAT AGACTATGAG GTACCTTGCT GTGTAGGAGG 3900 

ATGAAAGGGG AGTTGATAGT CTCATAAAAC TAATTTGGCT TCAAGTTTCA TGAATCTGTA 3960 

ACTAGAATTT AATTTTCACC CCAATAATGT TCTATATAGC CTTTGCTAAA GAGCAACTAA 4020 
TAAATTAAAC CTATTCTTTC AAAAAAAAA 
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Seq ID NO: 164 Protein sequence 
Protein Accession ft: NP_037504.1 

1 11 21 31 41 51 

1 I I I I I 

MSRTAYTVGA LLLLLGTLLP AAEGKKKGSQ GAIPPPDKAQ HNDSEQTQSP QQPGSRNRGR 60 
GQGRGTAMPG EEVLESSQEA LHVTERKYLK RDWCKTQPLK QTIHEEGCNS RTI INRFCYG 120 
QCNSPYIPRH IRKEEGSFQS CSFCKPKKPT TMMVTLNCPE LQPPTKKKRV TRVKQCRCIS 180 
IDLD 

Seq ID NO: 165 DNA sequence 

Nucleic Acid Accession ft: CAT cluster 

1 11 21 31 41 51 

I I I I I I 

GAATTGCATC GGACAGAGCT TCGCCATGGC CGAGTTGCGC GTGGTTGTGG CACTAACACT 60 
GCTACGTTTC CGCCTGAGCG TGGACCGAAC GCGCAAGGTG CGGCGGAAGC OGGAGCTCAT 120 
ACTGCGCACG GAGAACGGGC TCTGGCTCAA G 



Seq ID NO: 166 DNA sequence 

Nucleic Acid Accession #t FGENESH predicted 

Coding sequence: 1..1650 



/TA 1 11 21 31 41 51 

60 { | | | | | 

ATGCCACCTC TGCCATCCAG AATGTCTCGG GGGCCCTCCT TGGTGACAGG CAGGATGCTG 60 

CCCATCACAG ACCGCCTGCT GCACCTCCTG GGGCTGGAGA AGACGGCGTT CCGCATATAC 120 

GCGGTGTCCA CCCTTCTCCT CTTCCTGCTC TTCTTCCTGT TCCGCCTGCT GCTGCGGTTC 180 

^ CTGAGGCTCT GCAGGAGCTT CTACATCACC TGCCGCCGGC TGCGCTGCTT CCCCCAGCCT 240 

UJ CCCCGGCGCA ACTGGCTGCT GGGCCACCTG GGCATGTACC TTCCAAATGA GGCGGGCCTT 300 

CAAGATGAGA AGAAGGTACT GGACAACATG CACCATGTAC TCTTGGTATG GATGGGACCT 360 

GTCCTGCCGC TGTTGGTTCT GGTGCACCCT GATTACATCA AACCCCTTTT GGGAGCCTCA 420 

GCTGCCATCG CCCCCAAGGA TGACCTCTTC TATGGCTTCC TAAAACCTTG GCTAGGGGAT 480 

GGGCT GCTGC TCAGCAAAGG TGACAAGTGG AGCCGGCACC GTCGCCTGCT GACACCCGCC 540 

/U TTCCACTTTG ACATCCTGAA GCCTTACATG AAGATCTTCA ACCAGAGCGC TGACATTATG 600 

CATGCTAAAT GGCGGCATCT GGCAGAGGGC TCAGCGGTCT CCCTTGATAT GTTTGAGCAT 660 

ATCAGCCTCA TGACCCTGGA CAGTCTTCAG AAATGTGTCT TCAGCTACAA CAGCAACTGC 720 

CAAGAGAAGA TGAGTGATTA TATCTCCGCT ATCATTGAAC TGAGCGCTCT GTCTGTCCGG 780 

CGCCAGTATC GCTTGCACCA CTACCTCGAC TTCATTTACT ACCGCTCGGC GGATGGGCGG 840 

/»> AGGTTCCGGC AGGCCTGTGA CATGGTGCAC CACTTCACCA CTGAAGTCAT CCAGGAACGG 900 

CGGCGGGCAC TGCGTCAGCA GGGGGCCGAG GCCTGGCTTA AGGCCAAGCA GGGGAAGACC 960 

TTGGACTTTA TTGATGTGCT GCTCCTGGCC AGGGATGAAG ATGGAAAGGA ACTGTCAGAC 1020 

GAGGATATCC GAGCCGAAGC AGACACCTTC ATGTTTGAGG GTCACGACAC AACATCCAGT 1080 

qsn GGGATCTCTT GGATGCTGTT CAATTTGGCA AAGTATCCGG AATACCAGGA GAAATGCCGA 1140 

OU GAAGAGATTC AGGAAGTCAT GAAAGGCCGG GAGCTGGAGG AGCTGGAGTG GGACGATCTG 1200 

ACTCAGCTGC CCTTTACAAC TATGTGCATT AAGGAGAGCC TGCGCCAGTA CCCACCTGTC 1260 

ACTCTTGTCT CTCGCCAATG CACGGAGGAC ATCAAGCTCC CAGATGGGCG CATCATCCCC 1320 

AAAGGAATCA TCTGCTTGGT CAGCATCTAT GGAACCCACC ACAACCCCAC AGTGTGGCCT 1380 

GACTCCAAGG TGTACAACCC CTACCGCTTT GACCCGGACA ACCCACAGCA GCGCTCTCCA 1440 
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CTGGCCTATQ TGCCCTTCTC TGCA6GACCC AGGAATTGCA TCGGACAGAG CTTCGCCATG 1500 

GCCGAGTTGC GQGTGGTTGT GGCACTAACA CTGCTACGTT TCCGCCTGAG CGTGGACCGA 1560 

ACGOGCAAGG TGCGGCGGAA GCCGGAGCTC ATACTGCGCA CGGAGAACGG GCTCTGGCTC 1620 
AAGGTGGAGC CGCTGCCTCC GCGGGCCTGA 

Seq ID NO: 167 Protean sequence 

Protein Accession #: FGENBSH predicted 

1 11 21 31 41 51 

i i i i i i 

MPPLPSRMSR GPSLVTGRML PITDRLLHLL GLEKTAFRIY AVSTLLLFLL FFLFRLLLRF 60 

LRLCRSFYIT CRRLRCFPQP PRRNWLLGHL GMYLPNEAGL QDEKKVLDNM HHVLLVWMGP 120 

VLPLLVLVHP DYIKPLLGAS AAIAPKDDLF YGFLKPWLGD GLLLSKGDKW SRHRRLLTPA 180 

FHFDILKPYM K1FNQSADIM HAKWRHLAEG SAVSLDMFEH ISLMTLDSLQ KCVFSYNSNC 240 

QEKMSDYISA IIELSALSVR RQYRLHHYLD FIYYRSADGR RFRQACDMVH HFTTEVIQER 300 

RRALRQQGAE AWLKAKQGKT LDFIDVLLLA RDEDGKELSD EDIRAEADTF MFEGHDTTSS 360 

GISWMLFNLA KYPBYQEKCR EEIQEVMKGR ELEELEWDDL TQLPFTTMCI KESLRQYPPV 420 

TLVSRQCTED IKLPDGRIIP KGIICLVSIY GTHHNPTVWP DSKVYNPYRF DPDNPQQRSP 480 

LAYVPFSAGP RNCIGQSFAM AELRVWALT LLRFRLSVDR TRKVRRKPEL ILRTENGLWL 540 
KVEPLPPRA 

Seq ID NO i 168 DNA sequence 

Nucleic Acid Accession #: AK058088.1 

Coding sequence: 252.. 1772 

1 11 21 31 41 51 

1 I I I I I 

AGGAAACCAA GGCAAGCTCC CCCTGTCAAA GCACCTTGGC CCATAAGAAG AAAAGGGGGA 60 

GCCCCAGATG TGATGAGCGC TTCCAGGCTT CAGGCTCAGA AGGCGCCCCC AGCTCTCCTG 120 

TAACTCAGAG GCCAGTGTGA TGGGAGTTCC TCCACTCAGC ACACTTCCCC TGTAAACACG 180 

CC TGTGG TGG GCAAAAGGGC TTTGGAACGG TTGCTTGTCT TTTCTCTCCT GCGTAATTTC 240 

CACTTTCATT CATGATAATG TCGAACACGC ACAAAGCTCG GCTGGAACGC CGGGTCACTG 300 

GCTCAACCAA CCGGTGGCQT TTGCCCAAAC AGCCTTTCTC TGGGGACCTG CTCTCACTTT 360 

CCCAGATGTG CAAGGCTCTG AGCATAGACT TTGAGGAAGC TTTGAGGAAC CCAGACAGGT 420 

TATGCATTTC ACAAATCCAG AAGTTTTTCT TTGAGAATTT CAAGAACAAG GACATCCAAA 480 

GT GGGGAA GC AGATGTGATT CTCGAGTGCC TGGGCTTCAA ATGGGAGCTC CATCAGCCCC 540 

GGCTTTTTCA GTCTGAGACC TTGGCCAAGC TCTACCTGAA AGCCCTGGCG CAGGGCACCA 600 

CACACCCCCT GAGGGAGCTG GAGGAGCTTC TGCGAGCTCA ATCACCTAAG AAGACCAAAG 660 

AAAAATCCCC TGCAAAGAGG ATCATCATTT CCTTGAAGAT CAATGACCCA CTGGTCACTA 720 

AAGTCGCCTT CGCCACGGCC CTGAAGAACC TCTACATGAG TGAGGTGGAG ATTAACTTGG 780 

AAGACCTACT GGGAGTGCTG GCTTCCGCCC ACATCCTCCA GTTCAGTGGC CTGTTTCAAA 840 

GGTGCGTGGA TGTGATGATA GCCAGACTCA AGCCAAGCAC CATCAAGAAA TTCTACGAGG 900 

CCGGCTGCAA GTACAAGGAA GAGCAGCTCA CCACCGGCTG CGAGAAGTGG CTGGAAATGA 960 

ACTTGGTTCC TCTAGGGGGG ACGCAGATCC ACCTCCACAA AATCCCACAG GACCTGCTCC 1020 

ACAAAGTGCT GAAGTCCCCC AGGTTATTTA CCTTTAGTGA ATTCCATCTT CTGAAAACAA 1080 

TGCTTTTGTG GGTCTTCTTG CAACTGAACT ACAAGATTCA GGCAATTCCG ACTTATGAAA 1140 

CCGTGATGAC ATTTTTTAAG AGCTTTCCTG AGAACTGTTG CTTTCTGGAC CGGGACATAG 1200 

GACGGAGCTT GAGGCCGCTC TTCCTCTGCT TGCGTCTGCA CGGCATCACC AAAGGCAAGG 1260 

ATCTGGAGGT GCTGCGGCAC CTTAACTTCT TCCCAGAGTC ATGGCTCGAC CAGGTTACAG 1320 

TCAACCATTA CCACGCACTG GAGAATGGGG GCGACATGGT CCACCTGAAA GATCTTAACA 1380 

CCCAGGCTGT GAGATTTGGG CTGCTCTTTA ACCAGGAGAA TACAACTTAT TCGAAAACGA 1440 

T TGCTCT ATA TGGATTCTTC TTTAAGATAA AGGGACTCAA ACATGATACT ACCTCTTAIA 1500 

GTTTTTACAT GCAGAGAATA AAGCACACAG ACCTGGAATC TCCCTCTGCG GTCTACGAGC 1560 

ACAACCACGT CAGCCTGCGA GCGGCACGCC TGGTGAAGTA TGAGATCAGA GCAGAGGCCC 1620 

TGGTTGACGG CAAGTGGCAG GAGTTCAGGA CAAACCAGAT CAAGCAGAAG TTTGGGTTGA 1680 

CC ACGTCA TC CTGCAAAAGC CATACCTTGA AAATCCAAAC TGTGGGCATC CCAATCTATG 1740 

TAAGTTTTGC ATTCATCTTC CCAGCATCTT GACAGTTTCC AGAAGAATCT ATGGGATTTT 1800 
CCCCCCACTG GTCTGCATAA AAGAAAATAA AATGACATAA AAGGGAGC 

Seq ID NO: 169 Protein sequence 
Protein Accession 8: BAB71658.1 

1 11 21 31 41 51 

I I I I I I 

MIMSNTHKAR LERRVTGSTN RWRLPKQPFS GDLLSLSQMC KALSIDFEEA LRNPDRLCIS 60 

QIQKFFFENF KNKDIQSGEA DVILECLGFK WELHQPRLFQ SETLAKLYLK ALAQGTTHPL 120 

RELEELLRAQ SPKKTKEKSP AKRIIISLKI NDPLVTKVAF ATALKNLYMS EVEINLEDLL 180 

GVLASAHILQ FSGLFQRCVD VMIARLKPST IKKFYEAGCK YKEEQLTTGC EKWLEMNLVP 240 

LGGTQIHLHK IPQDLLHKVL KSPRLFTFSE FHLLKTMLLW VFLQLNYKIQ AIPTYETVKT 300 

FFKSFPENCC FLDRDIGRSL RPLFLCLRLH GITKGKDLEV LRHLNFFPES WLDQVTVNHY 360 

HALENGGDMV HLKDLNTQAV RFGLLFNQEN TTYSKTIALY GFFFKIKGLK HDTTSYSFYM 420 

QRIKHTDLES PSAVYEHNHV SLRAARLVKY EIRAEALVDG KHQEFRTNQI KQKFGLTTSS 480 
CKSHTLKIQT VGIPIYVSFA FIFPAS 

Seq ID NO: 170 DNA sequence 

Nucleic Acid Accession #: NM__007000.1 

Coding sequence: 1...777 

1 11 21 31 41 51 

I I I I I I 

ATGGCGTCTG OGGCAGCAGC GGAGGCCGAG AAGGGATCTC CAGTTGTGGT GGGCCTGCTA 60 

GTTGTGGGCA ATATCATTAT TCTGCTGTCA GGCCTGTCCC T GTT T G CTGA GACCATATGG 120 

GTGACAGCCG ACCAGTACCG TGTATACCCA CTGATGGGAG TCTCAGGCAA GGATGACGTC 180 

TTCGCTGGTG CCTGGATTGC CATCTTCTGC GGCTTCTCCT TCTTCATGGT AGCCAGTTTT 240 
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GGTGTGGGTG CCGCACTCTG CCGCCGCCGG TCCATGGTCC TCACGTACCT GGTGCTCATG 300 

CTCATCGTCT ACATCTTCGA GTGCGCCTCC TGCATCACGT CCTACACCCA CCGTGACTAC 360 

ATGGTGTCCA ACCCATCCCT GATCACCAAG CAGATGCTGA CCTTCTACAG CGCGGACACC 420 

GACCAGGGCC AGGAGCTGAC CCGCCTCTGG GACCGCGTCA TGATTGAGCA AGAATGCTGT 480 

GGCACATCTG GTCCCATGGA CTGGGTGAAC TTCACGTCAG CCTTCCGGGC GGCCACTCCG 540 

GAGGTGGTGT TCCCCTGGCC CCCACTGTGC TGTCGCCGGA CGGGAAACTT CATCCCCCTC 600 

AACGAGGAGG GCTGCCGCCT GGGGCACATG GACTACCTGT TCACCAAGGG CTGCTTCGAA 660 

CACATCGGCC ACGCCATCGA CAGCTACACG TGGGGTATCT CGTGGTTTGG GTTTGCCATC 720 
CTGATGTGGA CGCTCCCGGT CATGCTGATA GCCATGTATT TCTACACCAT GCTCTGA 

Seq ID NO: 171 Protein sequence 
Protein Accession «: NP_008931.1 

ic } 11 21 31 41 51 

MASAAAAEAE KGSPWVGLL WGNIIILLS GLSLFAETIW VTADQYRVYP LMGVSGKDDV 60 

FAGAWIAIPC GPSFFMVASP GVGAALCRRR SMVLTYLVLM LIVYIFECAS CITSYTHRDY 120 

MVSNPSLITK QMLTFYSADT DQGQELTRLW DRVMIEQECC GTSGPMDWVN FTSAFRAATP 180 

EWFPWPPLC CRRTGNFIPL NEEGCRLGHM DYLFTKGCFE HIGHAIDSYT WGISWFGFAI 240 
ZU LMWTLPVMLI AMYFYTML 
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Seq ID NO: 172 DMA sequence 

Nucleic Acid Accession #: NM_006760. 

Coding sequence.- 3 9.. 5 93 



1 11 21 31 41 51 

I ) I I I | 

GAAAGCCTGC CAGCACCTAT TCCACCTCCC AGCCCAGCAT GGCACCCCTG CTGCCCATCC 60 

-ia GGACCTTGCC CTTGATCCTG ATTCTGCTGG CTCTGCTGTC CCCAGGGGCT GCAGACTTCA 120 

jyJ ACATCTCAAG CCTCTCTGGT CTGCTGTCCC CGGCGCTAAC GGAGAGCCTG CTGGTTGCCT 180 

TGCCCCCCTG TCACCTCACA GGAGGCAATG CCACACTGAT GGTCCGGAGA GCCAATGACA 240 

GCAAAGTGGT GACGTCCAGC TTTGTGGTGC CTCCGTGCCG TGGGCGCAGG GAACTGGTGA 300 

GTGTGGTGGA CAGTGGTGCT GGCTTCACAG TCACTCGGCT CAGTGCATAC CAGGTGACAA 360 

ACCTCGTGCC AGGAACCAAA TTCTACATTT CCTACCTAGT GAAGAAGGGG ACAGCCACTG 420 

AGTCCAGCAG AGAGATCCCA ATGTCCACAC TCCCTCGAAG GAACATGGAA TCCATTGGGC 480 

TGGGTATGGC CCGCACAGGG GGCATGGTGG TCATCACGGT GCTGCTCTCT GTCGCCATGT S40 

TCCTGCTGGT GCTGGGCTTC ATCATTGCCC TGGCACTGGG CTCCCGCAAG TAAGGAGGTC 600 

TGCCGGGAGC AGCAGCTTCT CCAGGAAGCC CAGGGCACCA TCCAGCTCCC CAGCCCACCT 660 

Aft GCTCCCAGGC CCCAGGCCTG TGGCTCCCTT GGTGCCCTCG CCTCCTCCTC CTGCCCTCCT 720 

H-U CTCCCCTAGA GCCCTCTCCT CCCTCTGTCC CTCTCCTTGC CCCCAGTGCC TCACCTTCCA 780 

A CACT CCATT ATTCCTCTCA CCCCACTCCT GTCAGAGTTG ACTTTCCTCC CATTTTACCA 840 

CTTTAAACAC CCCCATAACA ATTCCCCCAT CCTTCAGTGA ACTAAGTCCC TATAATAAAG 900 
GCTGAGGCTG CATCTGCCAA AAAAAAAAAA AA 



Seq ID NO: 173 Protein sequence 
Protein Accession #j NP 006751.1 



* 11 21 31 41 51 

™ 1 1 I I 1 I 

JU MAPLLPIRTL PLILILLALL SPGAADFNIS SLSGLLSPAL TESLLVALPP CHLTGGNATL 60 
MVRRANDSKV VTSSFWPPC RGRRELVSW DSGAGFTVTR LSAYOVTNLV PGTKFYISYL 120 
VKKGTATESS REIPM5TLPR RNMESIGLGM ARTGGMWIT VLLSVAMFLL VLGFIIALAL 180 



60 



55 Seq ID NO: 174 DNA sequence 

Nucleic Acid Accession #: Eob sequence 
Coding sequence: 1..2733 

} «■ 21 31 41 si 

ATGAAAGTTG GAGTGCTGTG GCTCATTTCT TTCTTCACCT TCACTGACGG CCACGGTGGC 

TTCCTGGGGA AAAATGATGG CATCAAAACA AAAAAAGAAC TCATTGTGAA TAAGAAAAAA 120 

CATCTAGGCC CAGTCGAAGA ATATCAGCTG CTGCTTCAGG TGACCTATAG AGATTCCAAG 180 

GAGAAAAGAG ATTTGAGAAA TTTTCTGAAG CTCTTGAAGC CTCCATTATT ATGGTCACAT 240 

GGGCTAATTA GAATTATCAG AGCAAAGGCT ACCACAGACT GCAACAGCCT GAATGGAGTC 300 

CTGCAGTGTA CCTGTGAAGA CAGCTACACC TGGTTTCCTC CCTCATGCCT TGATCCCCAG 360 

AACTGCTACC TTCACACGGC TGGAGCACTC CCAAGCTGTG AATGTCATCT CAACAACCTC 420 

AGCCAGAGTG TCAATTTCTG TGAGAGAACA AAGATTTGGG GCACTTTCAA AATTAATGAA 480 

„ AGGTTTACAA ATGACCTTTT GAATTCATCT TCTGCTATAT ACTCCAAATA TGCAAATGGA 540 

/U ATTGAAATTC AACTTAAAAA AGCATATGAA AGAATTCAAG GTTTTGAGTC GGTTCAGGTC 600 

ACCCAATTTC GAAATGGAAG CATCGTTGCT GGGTATGAAG TTGTTGGCTC CAGCAGTGCA 660 

TCTGAACTGC TGTCAGCCAT TQAACATGTT GCCGAGAAGG CTAAGACAGC CCTTCACAAG 720 

CTGTTTCCAT TAGAAGACGG CTCTTTCAGA GTGTTCGGAA AAGCCCAGTG TAATGACATT 780 

nc GTCTTTGGAT TTGGGTCCAA GGATGATGAA TATACCCTGC CCTGCAGCAG TGGCTACAGG 840 

GGAAACATCA CAGCCAAGTG TGAGTCCTCT GGGTGGCAGG TCATCAGGGA GACTTGTGTG 900 

CTCTCTCTGC TTGAAGAACT GAACAAGAAT TTCAGTATGA TTGTAGGCAA TGCCACTGAG 960 

GCAGCTGTGT CATCCTTCGT GCAAAATCTT TCTGTCATCA TTCGGCAAAA CCCATCAACC 1020 

ACAGTGGGGA ATCTGGCTTC GGTGGTGTCG ATTCTGAGCA ATATTTCATC TCTGTCACTG 1080 

GCCAGCCATT TCAGGGTGTC CAATTCAACA ATGGAGGATG TCATCAGTAT AGCTGACAAT 1140 

ATCCTTAATT CAGCCTCAGT AACCAACTGG ACAGTCTTAC TGCGGGAAGA AAAGTATGCC 1200 

AGCTCACGGT TACTAGAGAC ATTAGAAAAC ATCAGCACTC TGGTGCCTCC GACAGCTCTT 1260 

CCTCTGAATT TTTCTCGGAA ATTCATTGAC TGGAAAGGGA TTCCAGTGAA CAAAAGCCAA 1320 

CTCAAAAGGG GTTACAGCTA TCAGATTAAA ATGTGTCCCC AAAATACATC TATTCCCATC 1380 

AGAGGCCGTG TGTTAATTGG GTCAGACCAA TTCCAGAGAT CCCTTCCAGA AACTATTATC 1440 
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AGCATGGCCT CGTTGACTCT GGGGAACATT CTACCCGTTT CCAAAAATGG AAATGCTCAG 1500 

GTCAATGGAC CTGTQATATC CACGGTTATT CAAAACTATT CCATAAATGA AGTTTTCCTA 1560 

TTTTTTTCCA AGATAGAGTC AAACCTGAGC CAGCCTCATT GTGTGTTTTG GGATTTCAGT 1620 

CATTTGCAGT GGAACGATGC AGGCTGCCAC CTAGTGAATG AAACTCAAGA CATCGTGACG 1680 

TGCCAATGTA CTCACTTGAC CTCCTTCTCC ATATTGATGT CACCTTTTGT CCCCTCTACA 1740 

ATCTTCCCCG TTGTAAAATG GATCACCTAT GTGGGACTGG GTATCTCCAT TGGAAGTCTC 1800 

ATTTTATGCC TGATCATCGA GGCTTTGTTT TGGAAGCAGA TTAAAAAAAG CCAAACCTCT 1B60 

CACACACGTC GTATTTGCAT GGTGAACATA GCCCTGTGCC TCTTGATTGC TGATGTCTGG 1920 

TTTATTGTTG GTGCCACAGT GGACACCACG GTGAACCCTT CTGGAGTCTG CACAGCTGCT 1980 

GTGTTCTTTA CACACTTCTT CTACCTCTCT TTGTTCTTCT GGATGCTCAT GCTTGGCATC 2040 

CTGCTG GCTT ACCGGATCAT CCTCGTGTTC CATCACATGG CCCAGCATTT GATGATGGCT 2100 

GTTGGATTTT GCCTGGGTTA TGGGTGCCCT CTCATTATAT CTGTCATTAC CATTGCTGTC 2160 

ACGCAACCTA GCAATA CCTA CAAAAGGAAA GATGTGTGTT GGCTTAACTG GTCCAATGGA 2220 

AGCAAACCAC TCCTGGCTTT TGTTGTCCCT GCACTGGCTA TTGTGGCTGT GAACTTCGTT 2280 

GTGGTGCTGC TAGTTCTCAC AAAGCTCTGG AGGCCGACTG TTGGGGAAAG ACTGAGTCGG 2340 

GATGACAAGG CCACCATCAT CCGCGTGGGG AAGAGCCTCC TCATTCTGAC CCCTCTGCTA 2400 

GGGCTCACCT GGGG CTTTGG AATAGGAACA ATAGTGGACA GCCAGAATCT GGCTTGGCAT 2460 

GTTATTTTTG CTTTACTCAA TGCATTCCAG GGATTTTTTA TCTTATGCTT TGGAATACTC 2520 

TTGGACAGTA AGCTGCGACA ACTTCTGTTC AACAAGTTGT CTGCCTTAAG TTCTTGGAAG 2580 

CAAACAGAAA AGCAAAACTC ATCAGATTTA TCTGCCAAAC CCAAATTCTC AAAGCCTTTC 2640 

AACCCACTGC AAAACAAAGG CCATTATGCA TTTTCTCATA CTGGAGATTC CTCCGACAAC 2700 
ATCATGCTAA CTCAGTTTGT CTCAAATGAA TAA 



Seq ID NO: 175 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MKVGVLVfLIS FPTFTDGHGG FLGKNDGIKT KKELIVNKKK HLGPVEEYQL LLQVTYRDSK 60 

EKRDLRNFLK LLKPPLLWSH GLIRIIRAKA TTDCNSLNGV LQCTCEDSYT WFPPSCLDPQ 120 

NCYLHTAGAL PSCECHLNNL SQSVNFCERT KIWGTFKINE RFTNDLLNSS SAIYSKYANG 180 

IEIQLKKAYE RIQGFESVQV TQFRNGSIVA GYEWGSSSA SELLS AIEHV AEKAKTALHK 240 

LFPLEDGSFR VFGKAQCNDI VFGFGSKDDE YTLPCSSGYR GNITAKCESS GWQVIRETCV 300 

LSLLEELNKN FSMIVGNATE AAVSSFVQNL SVI1RQNPST TVGNLASWS ILSNISSLSL 360 

ASHFRVSNST MEDVISIADN ILNSASVTNW TVLLREEKYA SSRLLETLEN ISTLVPPTAL 420 

PLNFSRKFID WKGIPVNKSQ LKRGYSYQIK MCPQNTSIPI RGRVLIGSDQ FQRSLPETII 480 

SMASLTLGNI LPVSKNGNAQ VNGPVISTVI QNYSINEVFL FFSKIESNLS QPHCVFWDFS 540 

HLQWNDAGCH LVNETQDIVT CQCTHLTSFS ILMSPFVPST IFPWKWITY VGLGISIGSL 600 

ILC *|* IEALF WKQIKKSQTS HTRR1CMVNI ALSLLIADVW FIVGATVDTT VNPSGVCTAA 660 

VFFTHFFYLS LFFWMLMLGI LLAYRIILVF HHMAQHLMMA VGFCLGYGCP LIISVITIAV 720 

TQPSNTYKRK DVCWLNWSNG SKPLLAFWP ALAIVAVNFV WLLVLTKLW RPTVGERLSR 780 

DDKATIIRVG KSLLILTPLL GLTWGFGIGT IVDSQNLAWH VIFALLNAFQ GFFILCFGIL 840 

LDSKLRQLLF NKLSALSSWK QTEKQNSSDL SAKPKFSKPF NPLQNKGHYA FSHTGDSSDN 900 
IMLTQFVSNE 
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Nucleic Acid Accession #: AB035089.1 

Coding sequence: 9845.. 10219 



] 11 21 31 41 51 

I I I I | | 

GGGCATGCAG CCATCGGGGA AAATCCATAG TGCAGATAAA GCAAGGAGGA AGAAGAAGGA 60 

CAGTTCTAGT AAAAGGGAGA ACATCAATAT AGGATGTTTC TTAGCAATAG AAAAAGAAGG 120 

CCAAGAGGAA TTAGGGAGAG AGTTATAAGA GATCAGCAAG GGGACAGGGT TAGATTTGGT 180 

TTGGTTTGAA AGCATACAGT AAATATGATG TCTGTCCCTG GCAGTGTTGG CAGAGTAGGA 240 

AGGAGGAAGG GAGGCAAGAG ATAATATCAT TTTCTCTGTG CTCCAACTGT ACTTACATAT 300 

GAGACTATTT CCCTCTCTGC TTTTCAAACC TTACTGGAGT TGTTTTCCCT CATGAAAACC 360 

AAGAAAGGAA AGCTAGTTAG TCTTGTTCTG AGGTTGTTCA ATGTATACAT ATCTATATCT 420 

GTAGACAGAA TCCTTGGGAA TACAGTAATT GACATATATT CTGTTATTTG ATGCTTGAAA 480 

AATCTCCTCC ACTAACCAGT TTCCCTATAG ATTGCCACAA GCACATAATA AGAAACAATA 540 

AATAAAATGT TCTCTTGACT TTGTTACTTA ACAATGCTGA GAAAACTTTA CAGCCTTCAT 600 

AAGGAAGTGA GGTCCAGGAA AATCTAGGAG ATATTTCTTA ACCAATCTAT AAAGGCATTA 660 

GTAATGACAG GATATTTCCT GAAAGTGTAA TTTCCCATTG AGGATTTGTT TTTAATTTCT 720 

GGATTCCTGG AGCCAATGAA GTTGGTGTAT GTTTATGAAA TATCAAGAGA CATAAGTTGG 780 

CAAGTGTTCA TATGCAAAAA CTTCTTGGAA TTTCTGAGTT CTCTGTGGCA ATATATGACA 840 

TCAGGATATG TCCAGTCTCA CACACCAGGA TATGTCCTTT CTAGCCTGTC TATCACATGC 900 

TAGGAGAACT ATTTAGGAAC AGAAAAAAAT GCCTGAAATG ATTTCTCATT TGAACTCATC 960 

CAAGCTTTCT CTAAATTTAA GCAAACTCCT GGTCATTTTC AGTTAGTACC TTTCCTTAAG 1020 

TTCAACCTTC AGGGCAAACC TCCGTGCCTC AGACGTTTAG CCATAGTCTG AAATTCTCTT 1080 

CCATAGATTG GTCCCCTGTA ACCCOGGTTT GTCTCAGCTT GTTATCCTGT TTTTTTCTT C 1140 

CCTCCATTCC CAGGATGAGC TTGTTGCTTC TGTCCTATGA GACATTAGAT TO CTTTT CTT 1200 

TGGTACCCGA GTAAATCCAT CCTACTCCAA TAGAGGAAGG TCCATTTTTG TCTTATAGCG 1260 

CTGGATGCAG ACT CAGCTGA GAAGACCATT ATTCATTTTT GGAATTCTTT ATCTCAGATA 1320 

TTTCCTCTTC TTTCTTTTTC TTCTATCTTT GGATTTTTAG TCCATCAACG CCCCATTAGT 1380 

CTATTCCCCG ACTTCAATCA GGGAACTTAT ACCTCTTAAA CTCATTCAGA GACTCAAAAC 1440 

ATATATATTG ATACAGGAGA CCTAAGAAGA GCATGTCTTG GGGGTTGAGG AAACAGGCAG 1500 

GTGAGAAATT TCCAGATTGG AAACACAGCT TCCTTTCTCC CATCCAGCCC CTACTTTCAG 1560 

CCTATGTGTT TCTGGCACCT TGTTGTAGAT AAATCTCCCT TGACTTTGTG ATGTGCTGAG 1620 

AAAACAAACT CACGGCTGGT GTTAAAAAGG GCCCATGACA ATACCAAGTG TTGGGGAGAA 1680 

TGTGGAGAAA TCAGAACTCT ATTCACGGTC GGTTGGAATG CACACTTGTG CAGAATTCTA 1740 

TGGAGAAGAG TCTGGCATTT CCTCAAAATG TTAACCTGGA TTTACCATAT GACCCAGCGA 1800 

TTTCATTCAT AGGTTTATAC TCAAAAGAAA TGAAGAAATA TGCCATGCAA AAAAATGTAC I860 

ATGAAAGGTC ACAACATCAT TATTCATAAT AGTAAAAGGA TGGAAACAAC ACAAATGTCC 1920 

ATCAACTTAT GATTAAAGAA AATCTGGTCT ATTCATAGAA TGGAATATTA TTCGACCACA 1980 
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AAAAGGAATG ATGTACTGAT CCATGCAATG ATGTGGACAA ACCATGAAAA TAACACTAGA 
TTAAA6AAGC CAGTCACAAA AGGACTTACT GTATGATTCC ATTTACCTGA AATGTTTGGA 
ATAGGCAAAT CCATAGAAAC AGGAGGTAGA TTCCTGGTTT CCAGGGTCTC CAGGAAGGGA 
AGAATGAAGT ACAAGATTTC TTTTGGAGGT AGTGAAATTG TTGTGGAATG AGATCATGAT 
5 GATGATAGCA CAACTTTGTG AATATAATAA AATCATTGAA TTGTACAGTT GAATTTATGG 
TATATAAATT ATATGTTAAT AAAAAGGGGG TCCACAAAAC AAACAGCCCC CCACTCTGGT 
TGTCAGGGAG ATATTGGATT AAATGGCCTT GGACAACAAC CCCTCTCCCT GGCCACAGAC 
ATTCTTCAGA TTACAAGATA TTCCAGGGGA AACACTGGAA TGAGTCTGAA GCCAGGTGCT 
AAACAGAAGG ACCATTGAGA AATGTTGTGA TCCTGACAGG TCAAGCAATT TATTTTTCGG 
10 CTTCATTTTT AAATGTAAAA TTAGAAAGCT GCCATTTAAA ATGGCCCGTC TGTTTCAATT 
GCTCTTCTCA GTGTCAGCCT GTTAACTCAA TGTGTTAGTC TGTTTTCATG CTGCTGATAA 
AAACATACCT GAGACTGGCA AGAAAAAGAG GTTTAATTGG GCTTAGAGTT CCACGTGATT 
GGGGAGGCCT CAGAATCACA GTAGGAGGCA AAAGTTATTC TTACATGGTG GCTGCAAGAG 
AAGATGAGGA AGAAGCAAAA GAAGAAACCC CTGATAAACC CATCGGATCT CCTGAGGCTT 
15 ATTAACTATC ATGAGAATAG CACAAGAAAG ACCGGCCCCC ATGATTCAAT TACCTCTACC 
TGGGTCCCTC CAATAACATG TGGAAATTCT GGTAGATACA ATTCAAGTTG AGATTTGGGT 
GGGAACACAG CCAAACCATA TCACTCAGCA AGGCAGATAA CTTTCTCACT GAGCCTATGC 
AACAGAAAAC CATCTGGGAT GGTTGTAAGG GGCACAGGAA GTGACTGGTA GGATCACTGC 
CAAAGCTGAG CACTCAGGAG AAGGCAATAG AATCCTATTC TCCATAGTAT GCTATAAGAT 
20 ACTGAAGTAC ACTTCTTCAC TATCTCTTTG GACTTAGAAT TAGCACTACA TTCCTTGTTA 
TACAGAAAAA TTACTAAGGA AATTCATAGG ATGACAAAAA CTTTCAGAAC TGAAAAACAG 
GAAATGTAAG CTTTTTAGTT CTTTGGTATT CGAAGTATGC CTAAAAGACA ATGCAAAATC 
CAAGAAAAGA ATGGTGGGGT TTTTGTTTGT TTGGTTTTGT TTTTGTTTTA CAGCTGGAGT 
AGAATACAAA GGGATGGAGT TGAAACAAAT GAGAGGAAAT TGGAATTCTA AACTTATTCT 
25 CATTGGCATT AGAAAGGCAC CTACATGTAT TTCACATGAG CCGGTGACTG CTGACTTGCA 
TTCTTATTTT TTCCCTATAG ATTAAAAAGG AGGTACAATG QTAGAACTGT AATCCTGTCC 
TTTGTCATAA ATTTTCATAT TCATAAAGGT GAGTGTTAGC CCGCTTGTGA AATCTGAAGT 
TGAGTAACTT CAAATACTAA CCACAGAGGG AAAGGCAGCA AGAGGAGAGG CATAAATTTA 
GGATCTCACC CTTCATTCCA CAGACACACA CAGCCTCTCT GCCCACCTCT G CTTCCTCTA 
30 GGAACACAGG TAAGAGCTTC AAGCCTCTCC AGCTTAATAA CATGAATTAT TTTTGAGAAT 
AATAATGATA CTGTGTTCTA TATCATGCAT CTCCTGCATT CTGTCTGATT ATATTTTACT 
TATTCTGCCA GAGCAAAATT AAAATACCTA TTTCATCTGA TTTGTCCTTT ATCTAAATTG 
CTTAGTTCCA AGTAAACCAA GGCACTTTTA GGAACACAGA GGGAGAGTGC CTTGCAGCCA 
GAGAGTCTTG AAGGAGATGT CAGGGACGCA TCTTAACAGC TGGTTGGATG TGAT CCACAG 
35 AGGTCTCCTG TTAGCATTCA TTGTAAAGCC ATCCTACCTA G CTCT AGTGT AACC AGCA AT 
GAAAGAAAGA TAAAGAGGGT CGATTACTTA TTTACAATAG TCTTTAAAAA CGTAGTTTTG 
TAAGCCTTCT AATTAGGACA TTAATATATT TAATATATGC ACATTGTAGA AAGATTGAAG 
CGTTAAAAAT AAGAGAAAAA CTTTAAATGT CAAAATCTCA CAACCCAGAT ATATCATTTC 
TTTAAGAAAA TTGTACTACA AAATACCATT CCATTTATTA AAGTCATTCT GACAGGAATC 
40 TGATGCTTTT CCAGGAGTTC CAGATCACAT CGAGTTCACC ATGAATTCAC TCAGTGAAGC 
CAACACCAAG TTCATGTTCG ATCTGTTCCA ACAGTTCAGA AAATCAAAAG AGAACAACAT 
CTTCTATTCC CCTATCAGCA TCACATCAGC ATTAGGGATG GTCCTCTTAG GAGCCAAAGA 
CAACACTGCA CAACAAATTA GCAAGGTAGC TATCAGCATC ATTACGTTGT CCTGTTGCAG 
TTTTTCTCTG GTTCCGTCGG CTAGCACGCA GATGGTAATA GATGTGGTGG TCTGATGGGT 
45 AGCACAGGGG GCTGTGCAGG AATTCCCATA ACTGTGAGAC CACTGACTTA AACAGATCTT 
TTGAGTAAAG TTTTCTTGTC CCGCTTCATG TCTCTTCCAG GTTCTTCACT TTGATCAAGT 
CACAGAGAAC ACCACAGAAA AAGCTGCAAC ATATCATGTG AGTCACAGAG CACTCTGATT 
CAGCTTTAGA TCCCTGAACA GGTCATAGTT TAAACCTGGA ACTTCACAAA AACTAAGAAA 
AGGCCAGTTT TAGGGAAAAT CTTGGACACA AAOATTGAGA CATACAGAGT GGGTTGGCAT 
50 TTCATGGCAC ATAATTATTA TTCCTCATTT CTGCGTTACT AAAAGACAGT CAGCACTGTA 
CCTCAGAGCA TAGGTCTGGA TCAGGATAGG CTGGGTTCAG ACTCCAGCTT TGCTCTTCAC 
AAATGATGAA TAAGAGCAGG ACACAACTGC TCGGAGTCCC AGTGACCTCA TCCCAGAAAA 
CTAAGGGTAA GAAAAAATCT GACTCAATAC ATG CAAAT AC ATGCAAATGT TTACAACAGT 
GCCTTGCCCA T AAAAGT CAT AATAAATGTT ATTATTATTA TAAAGTAGCT ATAATTATAC 
55 TAATCATAAT AATGTGAAAA TAATTTAATT TTCATTGAGT CATTAATGAG ATTCAGAGGA 

ATAAGCACAA GTCCAAGTAT ATTTTGGAAA ATGATTGCTA TGGAATATAT TGGTTTAGAG 
CCTTAATAGT GCAAAATGCT TTGCTGGAAG GTAGAAAGTT CT AGATTTAA ACAGGCTTAG 
GTTCAAAACT TGGCACTTCT AATTTATGTC TCTATAAACA GGGTTTTTTT CCCCATTCTC 
TGAGCTTTCT TGTGTTCATC TGAATTGAAC TAAAGACTTA GAGTTACCCA TGTAAAGTCC 
60 TTAGCCATGG ACCTGGCATA CACTCTTCTT ACGTGCAGAG AATGACCATC ATGAGGAAAG 
AGCCACAGAT CAGTCAATGT GTCCTACAAG ATAATAGCAC CAACAGGTAT AACAGGGCTT 
CCTGGCATAA TCTATTTAAA ATATCCAACC TTCAACATAC TCGTATCCTT GATGACTGTT 
AGAAGTGAAA TATGGTCCTT GCCCATAAGG AGCTGAGAGT TTAACTGGGA AGCTAAACCT 
AACCCTTTAA ACCAACAAGG AGAAAATCTA CTGGTAGACA GCGCTGCATC TTTAGTTCAG 
65 AAGAGAAAAG ATTGCAGTAC GTTAGAGCAA GAAGAATTTT CTGGAAGAAG TCAAATATAA 
GGTGGATTTT GAAGGGTATT TGAGGTGAAA TACACCAATT ATCAGGGAAT AACATCAAAG 
GTCCTCAATG AGACTACCAG CATTTAGGGA CTGATCTAAC AGACTTAGCA TGGGTTTAGT 
ATTTACATTG ATACAGCAAT TGAATGATCT CCTTTTTTGA TGTTTGAAGG TTGATAGGTC 
AGGAAATGTT CATCACCAGT TTCAAAAGCT TCTGACTGAA TTCAACAAAT CCACTGATGC 
70 ATATGAGCTG AAGATCGCCA ACAAGCTCTT CGGAGAAAAG ACGTATCAAT TTTTACAGGT 
AATTTCACCT GGCCTACCCA CATTTCATTT GCATCCTGAT GTCTGTGTCT CTGAGTGGCC 
AAATGGAAGA AAGCAAGGCA GATGAGCCTG GCCGACCCAG GTGGAGAGCA TTTACTCAGA 
GTGCATTAGC TCCATTTCCA CAACTCTCCC CCACTGGAGT GTCCCAGACC CCAACGATAC 
ATCACTGAAG TGTGGATTTA GGGATAATCT TGTGATAAAA GAGGAGGTTG TGTAATAGAG 
75 TGAGTAAGAG TAATAAGTAA TAAGATACCA TCGATAAACT GGCACTGACT CAGTCACATA 
CGATACATCT TGGTGGGAAA TGTATGACTA ATGGGATATT ATTGGAATGG GCAGGCTTGG 
GTGAGTTCCT GAGAATAGTT GAGGAAGTAC CAGGAAATAT TGAATGCACA GGATGAAAGA 
CAAAAACAAA GATCAGAAAC ATCATGGTTA AAATTACTGG AGAGAAGTCT GAGAAGCAAT 
GAATCTCCTT CAGGGAAGCC TGCTCTGCAG TTTGCAAACC ACAGCCTCTT CTGCTTCTGC 
80 CTTTTGCCAA GATGATATTG ACCTTCAGTG ACCTCTTTCT TGTGCCAGCC CACATTCCCC 
TTTTGCATTG CCTACATGAC ACCTGTATAA AAATATCCAT GGACAGGAGA TACTGCATCT 
ATTCAGGGTC TGGATTCAGC TTACTGTTGT TACAAATAAG TAAGTTTGGT AATATATAGT 
TACATAAATT ACTCCTAATT CCTACTTCTT CCTTCATATC TCAAAGGAAT ATTTAGATGC 
CATCAAGAAA TTTTACCAGA CCAGTGTGGA ATGTACTGAT TTTGCAAATG CTCCAGAAGA 
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AAGTCGAAAG AAGATTAACT CCTGGGTGGA AAGTCAAACG AATGGTAGGA GAGCCACCCA 7080 
TTATAGAAAC ACCTTTGAGA AACCTATGCC AGTGAGCCTT GTGCTTGACA CTGCATGGGG 7140 
GAACAGGTGT GGGGATTGAG ATGGGTTTGC AGGGAGGGCT GAAGAGGGCA CTCCAGATGA 7200 
AGGATTTGTC CAAATGAATA TGAAGAGAGC CTAGGGGAGC CAAGGAGGAA ATCACAGGAA 7260 
5 GCCAATTAGA TGGAAACACA TCTGGAGAAT TATTTGCTTA TGGCCCTGCA TGACAATAGC 7320 
TTTGTGGATC CCCTGTCTCC GCTCAGACCT ATTTTGAGAT CATATCCTTT ACTTTAAATC 7380 
AGACTCAAAT TTTTATGATG AATATTTAAT AGAAAACATT AGAAAGCGTC TCTCGTCTCC 7440 
TTTACTAATT GGGAAACAAG CAGCTCTCTG GTAAATCACC CTTTTGTCTC TGAGCTGGAG 7500 
CTGCCTGGAT CACATCTGTA GCCAATGTGT TCTGCAGGGA TTATCACAGC TCTCTTCCCC 7560 
10 ATCAAGGGCA AAGAGCTTGA CAAAGTCTCC ATTCTACAGA CATCTTTCTT ACCTCCCACC 7620 
TCTCATTACA GGCCAAACTT ACAGCAACTC AACATGAGAG TGAATAGGAA GATACCCCCG 7680 
GAAGTAGTGT CTGACAGCAC AGGACATGCG TTTCATATTA CAGAGCTCAA GTCACTCATC 7740 
CTAAAATGCA ATCAGGGCCT CCTTCCTCTG AATGGGGACC CCGTAGTTAA AAAAAAATAA 7800 
AAGTAGGAAG AGGAGGGAGG GAGAAAGGAA AGACACATGT TGGAAGAGTA GACAAAATCA 7860 
15 GTTTATCAGT ATTCCAAATC AGATGATTGG AGACATTCAT ACACAGAGAA CGTGAACTCC 7920 
TTCTCTATCA CAAGAAGTGA TGTCTCCATC AAGGGTAACT TTATACGACT GGAGCCTTGA 7980 
AGAAAGCTGC ATCTGGTGAA CCACTGGTCA GTGAGTCTAA CAATTCAAAG ATCAAAGTCA 8040 
GTGAGTCTCA AGCAGGGATT TGGGTCAATA ATTAACGATC AGTCACGAAC ATTTGCAAAG 8100 
CATCTTCCAG ACAAGCCATT TGTAGCTTGT GTAAAAGACT CTTTTATTCT TTCCCTTGCA 8160 
20 GAAAAAATTA AAAACCTATT TCCTGATGGG ACTATTGGCA ATGATACGAC ACTGGTTCTT 8220 
GTGAAOGCAA TCTATTTCAA AGGGCAGTGG GAGAATAAAT TTAAAAAAGA AAACACTAAA 8280 
GAGGAAAAAT TTTGGCCAAA CAAGGTATTG TCTATATTTT ATTTATATAG TGTAATATGT 8340 
TAATACATGG AATGTTAAAC ATTTCTGATG GAATGTAACA TGATAAGTAA AAAATAAAAA 8400 
TTGTTCATGT CTGTTATTTT GTTGTTTTAC TCTTATAACT TTATTTAGTT AGGAATACCT 8460 
25 GAAAAACTAT TGTTTCTAAC TCATGGAATT CCTGGGTTAT TTCTTAGAAG AAGAAGGATG 8520 
TGTTGCTATC TCAATAATAT TATCTTTTTT GTCTTGTGTT TCACGTGTTA TTTGTTGGAC 8580 
ACATTGATTT ATTGCAGAAT ACATACAAAT CTGTACAGAT GATGAGGCAA TACAATTCCT 8640 
TTAATTTTGC CTTGCTGGAG GATGTACAGG CCAAGGTCCT GGAAATACCA TACAAAGGCA 8700 
AAGATCTAAG CATGATTGTG CTGCTGCCAA ATGAAATCGA TGGTCTGCAG AAGGTAAGAA 8760 
30 CTTGCATCTA CAACTCTTCC TTCTACTGCC GGACATTTTT CCAAAGATAC CAAGTTTAAA 8820 
CAAGGTAAAA GCTTATGACC GAGTTGCCTC AAAATGATGA AAAATTCTAA ATGAGGAATG 8880 
ATGACTCACC TT CAT ATT AC AAATATTTGA GCATAGGGCC TGACACAAAC TGAAAGCTTA 8940 
GTTTTTGTTT GTTTGTTTGT TTTTATTATT ATTATTATAA TACTTTAAGC TTTAGGGTAC 9000 
ATGTGCACAA TGTGCAGGTT AGTTACATAT GTATACATGT GCCATGCTGG TGTGCTGCAC 9060 
35 CCATTAACTC ATCATTTAGC GTTAGGTATA TCTCCTAATG CTATCCCTCC CCCCTCCCCC 9120 
CACCCCACAA CAGTCCTCAG AGTGTGATGT TACCTTCCTG TGTCCAAGTG TTCTCATTGT 9180 
TCAATTCCCA TCTATGATTT AATTCCATCT ATGGCTTAGT TAATGATTAA TTTATTAGAG 9240 
TTACATGCAT TGGATATCAA TTTGATGATA TTATTATGCA GCAATTTAAA CTTGACTGGG 9300 
AGAAATATAT ACCAATGTGA GGAAAGTTTA CAAATAGGCC GAGTAGAAAA GGGAATACAA 9360 
40 ATTTAGGAAT TTAGGGAATT ACAATTTAAT AATTGCAATG TGTACTAAAT AATGTATACA 9420 
GAAAAATATG ATGAGCCTAT TAAAAATTGA CACATGTAGT AGGCTGTTGG CACAAGAAAT 9480 
AGTGATACAT ACAGTTCATT GTGTACAAAA TAATGTAATC ATATTTTACA TGTGTATCAT 9540 
ACAGTTGTAT ACATACATAT GTACACATAT ACATATACGT AAAAACATGA TTCTGTTTTT 9600 
ACATACATGT ATATACATAT ACACATATAA CCCAATGTAT TTATATATTC AGGACTCATA 9660 
45 . TTTTACCTAT TAGAATAATA ATGTCTATTA AAGTGAACCT TCTGTATTTC ACATTTATTG 9720 
CCAAAATAAC GAATCTCCAC ATAGTCAATT CATTGTTAAG GTGTATTAGA GATCGACAGT 9780 
TAGTCATATC AGTTTCTTTT TTCCATTTGT ATAGCTTGAA GAGAAACTCA CTGCTGAGAA 9840 
ATTGATGGAA TGGACAAGTT TGCAGAATAT GAGAGAGACA TGTGTCGATT TACACTTACC 9900 
TCGGTTCAAA ATGGAAGAGA GCTATGACCT CAAGGACACG TTGAGAACCA TGGGAATGGT 9960 
50 GAATATCTTC AATGGGGATG CAGACCTCTC AGGCATGACC TGGAGCCAOG GTCTCTCAGT 10020 
ATCTAAAGTC CTACACAAGG CCTTTGTGGA GGTCACTGAG GAGGGAGTGG AAGCTGCAGC 10080 
TGCCACCGCT GTAGTAGTAG TCGAATTATC ATCTCCTTCA ACTAATGAAG AGTTCTGTTG 10140 
TAATCACCCT TTCCTATTCT TCATAAGGCA AAATAAGACC AACAGCATCC TCTTCTATGG 10200 
CAGATTCTCA TCCCCATAGA TGCAATTAGT CTGTCACTCC ATTTAGAAAA TGTTCACCTA 10260 
55 GAGGTGTTCT GGTAAACTGA TTGCTGGCAA CAACAGATTC TCTTGGCTCA TATTTCTTTT 10320 
CTATCTCATC TTGATGATGA TAGTCATCAT CAAGAATTTA ATGATTAAAA TAGCATGCCT 10380 
TTCTCTCTTT CTCTTAATAA GCCCACATAT AAATGTACTT TTCCTTCCAG AAAAATTTCC 10440 
CTTGAGGAAA AATGTCCAAG ATAAGATGAA TCATTTAATA CCGTGTCTTC TAAATTTGAA 10500 
„ ATATAATTCT GTTTCTGACC TGTTTTAAAT GAACCAAACC AAATCATACT TTCTCTTCAA 10560 
60 ATTTAGCAAC CTAGAAACAC ACATTTCTTT GAATTTAGGT GATACCTAAA TCCTTCTTAT 10620 
GTTTCTAAAT TTTGTGATTC TATAAAACAC ATCATCAATA AAATAATGAC ATAAAATCAT 10680 
TTTTGCTTTA CCTGTTTTCT CTCTGGAAAG GGCAAGTGTC CAGTTACACA TAGGAAAGAT 10740 
AATTTAGAGA TATATTAATC ATATATAAAG GAAAATTAAA AACAGAGTAG TTCATGATGA 10800 
GCCTGGAGTA GAAGGCATAT CCCAGAACAG GAGGAGCCTT GTAAACCACA TAGGAACTTC 10860 
65 CTATTTTATG CTAAAGGGAT AAGAAACTCA TTACAGGCTT TGATGGTTGT TTGTCAAAGA 10920 
GGGGCATAAA ATTATCATAT CCACATCTAG AAAATACATC TCTGGCTACG CTGATATCAA 10980 
TGGATGCGAG GAAAGAACAG TGTGGTTACC ATATATAAAT TAGGAAATCA TTAGAGTATT 11040 
GGGAGTGGAA ATGGAGAGAA AGAAAGAGCC TGGGGGAATT ATTTAGGAAA TAATAGTTAC 11100 
AGAAAGACAT CTAAGTTGCT GACCTATCTG ACTGGATGGA TGGAAGAATA TCTTGTTTCT 11160 
70 GAGAGAAAAA AAGACTTTGG GTTTAAATTT GTACTTGATG AATTAAGGTA CTTTTAATAT 11220 
TCAAATGGAT TTGCCTGGCA GGCACTTGAA GATATTAGTC TAAATCTCAG AAACAGAATA 11280 
TGATCTGAAG CTCTAAATTT GTGATATTCA ATATAAATAC TTTAGAGTCA TTGGGATAAA 11340 
TATGGTAGTT GTAGCTAAAA GCAAAAATAA GATACTAGGG AGAAAGGATA AAGTTAGAAG 11400 
AAAGAAGAAT CTAGAATTGA CCTTGAAGTA TATCAGCATG TGTAAAGATC AGGAATTGAT 11460 
75 CATTTTTATT TTCCAGAAAG TAGCTTTTCT TAGGGTTCCA TATTTACTCC CATAGATTCT 11520 
TCCC 

Seq ID NO: 177 Protein sequence 
Protein Accession BAB21525.1 

80 

1 11 21 31 41 51 

I I I I I I 

MNSLSBANTK FMFDLFQQFR KSKENNIPYS PISITSALGM VLLGAKDNTA QQISKVLHPD 60 
QVTENTTEKA ATYHVDRSGN VHHQFQKLLT EFNKSTDAYE LKIANKLFGE KTYQFLQEYL 120 
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DAIKKFYQTS VESTDFANAP EESRKKINSW VESQTNEKIK NLFPDGTIGN DTTLVLVNAI 180 

YFKGQWENKP KKENTKEEKF WPNKNTYKSV QMMRQYNSFN FALLEDVQAK VLEIPYKGKD 240 

LSMIVLLPNE IDGLQKLEEK LTAEXLMEWT SLQNMRETCV DLHLPRPKME ESYDLKDTLR 300 

TMGMVNIFNG DADLSGMTWS HGLSVSKVLH KAFVEVTEEG VEAAAATAW WELSSPSTN 360 
EEFCCNHPFL FFIRQNKTNS ILFYGRFSSP 

Seq ID NO: 178 DMA sequence 

Nucleic Acid Accession ft: NM_O01910.1 

Coding sequence: 50 . .1240 

1 11 21 31 41 51 

I i I I I I 

GGAGAGAAGA AAGGAGGGGG CAAGGGAGAA GCTGCTGGTC GGACTCACAA TGAAAACGCT 60 

CCTTCTTTTG CTGCTGGTGC TCCTGGAGCT GGGAGAGGCC CAAGGATCCC TTCACAGGGT 120 

GCCCCTCAGG AGGCATCCGT CCCTCAAGAA GAAGCTGCGG GCACGGAGCC AGCTCTCTGA 180 

GTTCTGGAAA TCCCATAATT TGGACATGAT CCAGTTCACC GAGTCCTGCT CAATGGACCA 240 ' 

GAGTGCCAAG GAACCCCTCA TCAACTACTT GGATATGGAA TACTTCGGCA CTATCTCCAT 300 

TGGCTCCCCA CCACAGAACT TCACTGTCAT CTTCGACACT GGCTCCTCCA ACCTCTGGGT 360 

CCCCTCTGTG TACTGCACTA GCCCAGCCTG CAAGACGCAC AGCAGGTTCC AGCCTTCCCA 420 

GTCCAGCACA TACAGCCAGC CAGGTCAATC TTTCTCCATT CAGTATGGAA CCGGGAGCTT 480 

GTCCGGGATC ATTGGAGCOG ACCAAGTCTC TGTGGAAGGA CTAACCGTGG TTGGCCAGCA 540 

GTTTGGAGAA AGTGTCACAG AGCCAGGCCA GACCTTTGTG GATGCAGAGT TTGATGGAAT 600 

TCTGGGCCTG GGATACCCCT CCTTGGCTGT GGGAGGAGTG ACTCCAGTAT TTGACAACAT 660 

GATGGCTCAG AACCTGGTGG ACTTGCCGAT GTTTTCTGTC TACATGAGCA GTAACOCAGA 720 

AGGTGGTGCG GGGAGCGAGC TGATTTTTGG AGGCTACGAC CACTCCCATT TCTCTGGGAG 780 

CCTGAATTGG GTCCCAGTCA CCAAGCAAGC TTACTQGCAG ATTGCACTGG ATAACATCCA 840 

GGTGGGAGGC ACTGTTATGT TCTGCTCCGA GGGCTGCCAG GCCATTGTGG ACACAGGGAC 900 

TTCCCTCATC ACTGGCCCTT CCGACAAGAT TAAGCAGCTG CAAAACGCCA TTGGGGCAGC 960 

CCCCGTGGAT GGAGAATATG CTGTGGAGTG TGCCAACCTT AACGTCATGC CGGATGTCAC 1020 

CTTCACCATT AACGGAGTCC CCTATACCCT CAGCCCAACT GCCTACACCC TACTGGACTT 1080 

CGTGGATGGA ATGCAGTTCT GCAGCAGTGG CTTTCAAGGA CTTGACATCC ACCCTCCAGC 1140 

TGGGCCCCTC TGGATCCTGG GGGATGTCTT CATTGGACAG TTTTACTCAG TCTTTGACCG 1200 

TGGGAATAAC CGTGTGGGAC TGGCCCCAGC AGTCCCCTAA GGAGGGGCCT TGTGTCTGTG 1260 

CCTGCCTGTC TGACAGACCT TGAATATGTT AGGCTGGGGC ATTCTTTACA CCTACAAAAA 1320 

GTTATTTTCC AGAGAATGTA GCTGTTTCCA GGGTTGCAAC TTGAATTAAG ACCAAACAGA 1380 

ACATGAGAAT ACACACACAC ACACACATAT ACACACACAC ACACTTCACA CATACACACC 1440 

ACTCCCACCA CCGTCATGAT GGAGGAATTA CGTTATACAT TCATATTTTG TATTGATTTT 1500 

TGATTATGAA AATCAAAAAT TTTCACATTT GATTATGAAA ATCTCCAAAC ATATGCACAA 1560 

GCAGAGATCA TGGTATAATA AATCCCTTTG CAACTCCACT CAGCCCTGAC AACCCATCCA 1620 

CACACGGCCA GGCCTGTTTA TCTACACTGC TGCCCACTCC TCTCTCCAGC TCCACATGCT 1680 

GTACCTGGAT CATTCTGAAG CAAATTCCGA GCATTACATC ATTTTGTCCA TAAATATTTC 1740 

TAACATCCTT AAATATACAA TCGGAATTCA AGCATCTCCC ATTGTCCCAC AAATGTTTGG 1800 

CTGTTTTTGT AGTTGGATTG TTTGTATTAG GATTCAAGCA AGGCCCATAT ATTGCATTTA 1860 

TTTGAAATGT CTGTAAGTCT CTTTCCATCT ACAGAGTTTA GCACATTTGA ACGTTGCTGG 1920 

TTGAAATCCC GAGGTGTCAT TTGACATGGT TCTCTGAACT TATCTTTCCT ATAAAATGGT 1980 

AGTTAGATCT GOAGGTCTGA TTTTGTGGCA AAAATACTTC CTAGGTGGTG CTGGGTACTT 2040 

CTTGTTGCAT CCTGTCAGGA GGCAGATAAT GCTGGTGCCT CTCTATTGGT AATGTTAAGA 2100 
CTGCTGGGTG GGTTTGGAGT TCTTGGCTTT AATCATTCAT TACAAAGTTC AGCATTTT 

Seq ID NO: 179 Protein sequence 
Protein Accession ft: NP_00190l.l 

1 11 21 31 41 51 

I I I I I I 

MKTLLLLLLV LLELGEAQGS LHRVPLRRHP SLKKKLRARS QLSEFWKSHN LDMIQFTESC 60 

SMDQSAKEPL INYLDMEYFG TISIGSPPQN FTVI FDTGSS NLWVPSVYCT SPACKTHSRP 120 

QPSQSSTYSQ PGQSFSIQYG TGSLSGIIGA DQVSVEGLTV VGQQFGESVT EPGQTFVDAE 180 

FDGILGLGYP SLAVGGVTPV FDNMMAQNLV DLPMFSVYKS SNPEGGAGSE LIFGGYDHSH 240 

FSGSLNWVPV TKQAYWQIAL DNIQVGGTVM FCSBGOQAIV DTGTSLITGP SDKIKQLQNA 300 

IGAAPVDGEY AVECANI2JVM PDVTFTINGV PYTLSPTAYT LLDFVDGMQF CSSGFQGLDI 360 
HPPAGPLWIL GD VP IRQ FYS VFDRGNNRVG LAPAVP 

Seq ID NO: 180 DNA sequence 

Nucleic Acid Accession ft: NM_018058.1 

Coding sequence: 319.. 1575 

1 11 21 31 41 51 

I I I I I I 

TACGCGCTGC GGGACCGGCA GGGGAACGCC ATCGGGGTCA CAGCCTGCGA CATCGACGGG 60 

GACGGCCGGG AGGAGATCTA CTTCCTCAAC ACCAATAATG CCTTCTCGGG GGTGGCCACG 120 

TACACCGACA AGTTGTTCAA GTTCCGCAAT AACCGGTGGG AAGACATCCT GAGCGATGAG 180 

GTCAACGTGG CCCGTGGTGT GGCCAGCCTC TTTGCCGGAC GCTCTGTGGC CTGTGTGGAC 240 

AGAAAGGGCT CTGGACGCTA CTCTATCTAC ATTGCCAATT ACGCCTACGG TAATGTGGGC 300 

CCTGATGCCC TCATTGAAAT GGACCCTGAG GCCAGTGACC TCTCCCGGGG CATTCTGGCG 3 60 

CTCAGAGATG TGGCTGCTGA GGCTGGGGTC AGCAAATATA CAGGGGGCCG AGGCGTCAGC 420 

GTGGGCCCCA TCCTCAGCAG CAGTGCCTCG GATATCTTCT GCGACAATGA GAATGGGCCT 480 

AACTTCCTTT TCCACAACCG GGGCGATGGC ACCTTTGTGG ACGCTGCGGC CAGTGCTGGT 540 

GTGGACGACC CCCACCAGCA TGGGCGAGGT GTCGCCCTGG CTGACTTCAA CCGTGATGGC 600 

AAAGTGGACA TCGTCTATGG CAACTGGAAT GGCCCCCACC GCCTCTATCT GCAAATGAGC 660 

ACCCATGGGA AGGTCCGCTT CCGGGACATC GCCTCACCCA AGTTCTCCAT GCCCTCCCCT 720 

GTCCGCACGG TCATCACOGC CGACTTTGAC AATGACCAGG AGCTGGAGAT CTTCTTCAAC 780 

AACATTGCCT ACCGCAGCTC CTCAGCCAAC CGCCTCTTCC GOGTCATCCG TAGAGAGCAC 840 

GGAGACCCCC TCATCGAGGA GCTCAATCCC GGCGAOGCCT TGGAGCCTGA GGGCOGGGGC 900 

ACAGGGGGTG TGGTGACCGA CTTCGACGGA GACGGGATGC TGGACCTCAT CTTGTCCCAT 960 
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GGAGAGTCCA TGGCTCAGCC GCTGTCCGTC TTCCGGGGCA ATCAGGGCTT CAACAACAAC 1020 

TGGCTGCGAG TGGTGCCACG CACCCGGGTT GGGGCCTTTG CCAGGGGAGC TAAGGTCGTG 1O80 

CTCTACACCA AGAAGAGTGG GGCCCACCTG AGGATCATCG ACGGGGGCTC AGGCTACCTG 1140 

TGTGAGATGG AGCCCGTGGC ACACTTTGGC CTGGGGAAGG ATGAAGCCAG CAGTGTGGAG 1200 

GTGACGTGGC CAGATGGCAA GATGGTGAGC CGGAACGTGG CCAGCGGGGA GATGAACTCA 1260 

GTGCTGGAGA TCCTCTACCC CCGGGATGAG GACACACTTC AGGACCCAGC CCCACTGGAG 1320 

ACACCAATGA ATGCATCCAG TTCCCATTCG TGTGCCCTCG AGACAAGCCC GTATGTGTCA 1380 

ACACCTATGG AAGCTACAGG TGCCGGACCA ACAAGAAGTG CAGTCGGGGC TACGAGCCCA 1440 

ACGAGGATGG CACAGCCTGC GTGGGGACTC TCGGCCAGTC ACCGGGCCCC CGCCCCACCA 1500 

CCCCCACCGC TGCTGCTGCC ACTGCCGCTG CTGCTGCCGC TGCTGGAGCT GCCACTGCTG 1560 

CACCGGTCCT CGTAGATGGA GATCTCAATC TGGGGTCGGT GGTTAAGGAG AGCTGCGAGC 1620 

CCAGCTGCTG AGCAGGGGTG GGACATGAAC CAGCGGATGG AGTCCAGCAG GGGAGTGGGA 1680 

AAGTGGGCTT GTGCTGCTGC CTAGACAGTA GGGATGTAAA GGCCTGGGAG CTAGACCCTC 1740 

CCCAAGCCCA TCCATGCACA TTACTTAGCT AACAATTAGG GAGACTCGTA AGGCCAGGCC 1800 

CTGTGCTGGG CACATAGCTG TGATCACAGC AGACAGGGTC GCTGCCCTGA TGGCGCTTAC 1860 

ATTCCAGTGG GTCTAATGAC CATATCTTAG GACACAGATG TGCCCAGGGA GGTGGTGTCA 1920 

CTGCACAGGA AGTATGAGGA CTTTAGTGTC CTGAGTTCAA ATCCTGATTC AGGAACTCAC 1980 

AAAGCTATGT GACCTTACAC CAGTCACTTA ACTTGTTAGC CATCCATTAT CGCATCTGCA 2040 

AAATGGGGAT TAAGAATAGA ATCTTGGGGT TAGTGTGGAG ATTAGATTAA ATGTATGTAA 2100 

GACACTTGGC ACAAAACCTG GCACATAGTA AAGGCTCAAT AAAAACAAGT GCCTCTCACT 2160 
GGGCTTTGTC AACACGTG 



Seq ID NO: 181 Protein sequence 
Protein Accession tf: NP_060S2S.l 

1 11 21 31 41 51 

I II I I ! 

MDPEASDLSR GILALRDVAA EAGVSKYTGG RGVSVGPILS SSASDIFCDN ENGPNFLFHN 60 

RGDGTFVDAA ASAGVDDPHQ HGRGVALADF NRDGKVDIVY GNWNGPHRLY LQMSTHGKVR 120 

FRDIASPKPS MPSPVRTVIT ADFDNDQELE IPFNNIAYRS SSANRLPRVI RREHGDPLIE 180 

ELNPGDALEP EGRGTGGWT DFDGDGKLDL ILSHGESMAQ PLSVFRGNQG FNNNWLRWP 240 

RTRVGAFARG AKWLYTKKS GAHLRIIDGG SGYLCEMEPV AHFGLGKDEA SSVEVTWPDG 300 

KMVSRNVASG EMNSVLEILY PRDEDTLQDP APLETPMNAS SSHSCALJBTS PYVSTPMEAT 360 
GAGPTRSAVG ATSPTRMAQP AWGLSASHRA PAPPPPPLLL PLPLLLPLLE LPLLHRSS 

Seq ID NO: 182 DNA sequence 
Nucleic Acid Accession ft: AJ279016 
Coding sequence: 1..1962 

1 11 21 31 41 51 

I 1 I I I I 

ATGTCCAGGA TGTTACCGTT CCTGCTGCTG CTCTGGTTTC TGCCCATCAC TGAGGGGTCC 60 

CAGCGGGCTG AACCCATGTT CACTGCAGTC ACCAACTCAG TTCTGCCTCC TGACTATGAC 120 

AGTAATCCCA CCCAGCTCAA CTATGGTGTG GCAGTTACTG ATGTGGACCA TGATGGGGAC 180 

TTTGAGATCG TCGTGGCGGG GTACAATGGA CCCAACCTGG TTCTGAAGTA TGACCGGGCC 240 

CAGAAGCGGC TGGTGAACAT CGCGGTCGAT GAGCGCAGCT CACCCTACTA CGCGCTGCGG 300 

GACCGGCAGG GGAACGCCAT CGGGGTCACA GCCTGCGACA TCGACGGGGA CGGCCGGGAG 360 

GAGATCTACT TCCTCAACAC CAATAATGCC TTCTCGGGGG TGGCCACGTA CACCGACAAG 420 

TTGTTCAAGT TCCGCAATAA CCGGTGGGAA GACATCCTGA GCGATGAGGT CAACGTGGCC 480 

CGTGGTGTGG CCAGCCTCTT TGCCGGACGC TCTGTGGCCT GTGTGGACAG AAAGGGCTCT 540 

GGACGCTACT CTATCTACAT TGCCAATTAC GCCTACGGTA ATGTGGGCCC TGATGCCCTC 600 

ATTGAAATGG ACCCTGAGGC CAGTGACCTC TCCCGGGGCA TTCTGGCGCT CAGAGATGTG 660 

GCTGCTGAGG CTGGGGTCAG CAAATATACA GGGGGCCGAG GCGTCAGCGT GGGCCCCATC 720 

CTCAGCAGCA GTGCCTCGGA TATCTTCTGC GACAATGAGA ATGGGCCTAA CTTCCTTTTC 780 

CACAACCGGG GCGATGGCAC CTTTGTGGAC GCTGCGGCCA GTGCTGGTGT GGACGACCCC 840 

CACCAGCATG GGCGAGGTGT CGCCCTGGCT GACTTCAACC GTGATGGCAA AGTGGACATC 900 

GTCTATGGCA ACTGGAATGG CCCCCACCGC CTCTATCTGC AAATGAGCAC CCATGGGAAG 960 

GTCCGCTTCC GGGACATCGC CTCACCCAAG TTCTCCATGC CCTCCCCTGT CCGCACGGTC 1020 

ATCACCGCCG ACTTTGACAA TGACCAGGAG CTGGAGATCT TCTTCAACAA CATTGCCTAC 1080 

CGCAGCTCCT CAGCCAACCG CCTCTTCCGC GTCATCCGTA GAGAGCACGG AGACCCCCTC 1140 

ATCGAGGAGC TCAATCCCGG CGAOGCCTTQ GAGCCTGAGG GCCGGGGCAC AGGGGGTGTG 1200 

GTGACCGACT TCGACGGAGA CGGGATGCTG GACCTCATCT TGTCCCATGG AGAGTCCATG 1260 

GCTCAGCOGC TGTCCGTCTT CCGGGGCAAT CAGGGCTTCA ACAACAACTG GCTGCGAGTG 1320 

GTGCCACGCA CCCGGTTTGG GGCCTTTGCC AGGGGAGCTA AGGTCGTGCT CTACACCAAG 1380 

AAGAGTGGGG CCCACCTGAG GATCATCGAC GGGGGCTCAG GCTACCTGTG TGAGATGGAG 1440 

CCCGTGGCAC ACTTTGGCCT GGGGAAGGAT GAAGCCAGCA GTGTGGAGGT GACGTGGCCA 1500 

GATGGCAAGA TGGTGAGCCG CAACGTGGCC AGCGGGGAGA TGAACTCAGT GCTGGAGATC 1S60 

CTCTACCCCC GGGATGAGGA CACACTTCAG GACCCAGCCC CACTGGAGTG TGGCCAAGGA 1620 

TTCTCCCAGC AGGAAAATGG CCATTGCATG GACACCAATG AATGCATCCA GTTCCCATTC 1680 

GTGTGCCCTC GAGACAAGCC CGTATGTGTC AACACCTATG GAAGCTACAG GTGCCGGACC 1740 

AACAAGAAGT GCAGTCGGGG CTACGAGCCC AACGAGGATG GCACAGCCTG CGTGGGGACT 1800 

CTCGGCCAGT CACCGGGCCC CCGCCCCACC ACCCCCACCG CTGCTGCTGC CACTGCCGCT I860 

GCTGCTGCCG CTGCTGGAGC TGCCACTGCT GCACCGGTCC TCGTAGATGG AGATCTCAAT 1920 

CTGGGGTCGG TGGTTAAGGA GAGCTGCGAG CCCAGCTGCT GAGCAGGGGT GGGACATGAA 1980 

CCAGCGGATG GAGTCCAGCA GGGGAGTGGG AAAGTGGGCT TGTGCTGCTG CCTAGACAGT 2040 

AGGGATGTAA AGGCCTGGGA GCTAGACCCT CCCCAAGCCC ATCCATGCAC ATTACTTAGC 2100 

TAACAATTAG GGAGACTCGT AAGGCCAGGC CCTGTGCTGG GCACATAGCT GTGATCACAG 2160 

CAGACAGGGT CGCTGCCCTG ATGGCGCTTA CATTCCAGTG GGTCTAATGA CCATATCTTA 2220 

GGACACAGAT GTGCCCAGGG AGGTGGTGTC ACTGCACAGG AAGTATGAGG ACTTTAGTGT 2280 

CCTGAGTTCA AATCCTGATT CAGGAACTCA CAAAGCTATG TGACCTTACA CCAGTCACTT 2340 

AACTTGTTAG CCATCCATTA TCGCATCTGC AAAATGGGGA TTAAGAATAG AATCTTGGGG 2400 

TTAGTGTGGA GATTAGATTA AATGTATGTA AGACACTTGG CACAAAACCT GGCACATAGT 2460 
AAAGGCTCAA TAAAAACAAG TGCCTCTCAC TGGGCTTTGT CAACACG 
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PCTAJS02/36810 



Seq ID NO: 183 Protein sequence 
Protein Accession fh CAC08451 
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MSRMLPFLLL 
FEIWAGYNG 
EIYFLNTNNA 
GRYSIYIANY 
LSSSASDIFC 
VYGNWNGPHR 
RSSSANRLFR 
AQPLSVPRGN 
PVAHFGLGKD 
FSQQENGHCM 
LGQSPGPRPT 



11 
I 

LWFLPITEGS 
PNLVLXYDRA 
FSGVATYTDK 
AYGNVGPDAL 
DNENGPNFLF 
LYLQMSTHGK 
VIRREHGDPL 
QGFNNNWLRV 
EASSVEVTWP 
DTNECIQFPF 
TPTAAAATAA 



21 
I 

QRAEPMFTAV 
QKRIiVNIAVD 
LFKFRNNRWE 
IEMDPEASDL 
HNRGDGTPVD 
VRFRDIASPK 
IEELNPGDAL 
VPRTRFGAFA 
DGKMVSRNVA 
VCPRDKPVCV 
AAAAAGAATA 



31 
I 

TNSVLPPDYD 
ERSSPYYALR 
DILSDEVNVA 
SRGIliALRDV 
AAASAGVDDP 
FSMPSPVRTV 
EPEGRGTGGV 
RGAKWLYTK 
SGEMNSVLEI 
NTYGSYRCRT 
APVLVDGDLN 



41 
I 

SNPTQLNYGV 
DRQGNA1GVT 
RGVASLFAGR 
AAEAGVSKYT 
HQHGRGVALA 
ITADFDNDQE 
VTDFDGDGML 
KSGARLRIID 
LYPRDEDTLQ 
NKKCSRGYEP 
LGSWKESCE 



51 
I 

AVTDVDHDGD 
ACDIDGDGRB 
SVACVDRKGS 
GGRGVSVGPI 
DFNRDGKVDI 
LEIFFNNIAY 
DLILSHGESM 
GGSGYLCEME 
DPAPLECGQG 
NEDGTACVGT 
PSC 



Seq ID NO* 1B4 DNA sequence 
Nucleic Acid Accession «: FGENESHH 
Coding sequence: 1..4794 

31 



SI 
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480 
540 
600 



1 11 21 31 41 

I I I I I ! 

ATGGCGTGTC CGGGAGGACT CCCAGCCCGT TGCTCTGGTT GGATGGGACT GGGTGGGCCC 
AGCGGCTCCT CCCCAGCATC CCCTCCCCAT TCCTCCTCCA GGTACAATGG ACCCAACCTG 
GTTCTGAAGT ATGACCGGGC CCAGAAGCGG CTGGTGAACA TCGCGGTCGA TGAGCGCAGC 
TCACCCTACT AOGCGCTGCG GGACCGGCAG GGGAACGCCA TCGGGGTCAC AGCCTGCGAC 
ATCGACGGGG ACGGCCGGGA GGAGATCTAC TTCCTCAACA CCAATAATGC CTTCTCGGGC 
CACAGCAGCT CAGCGCAGGT CCCTTCTGGG CTCCACAGAA ACAGGCCTGT GCTGAAGCCT 
CCACCTACAA CCCCTGCAGG CCTCCTGGGT CTGCCTCCAC TCAGCGGAAG GGACTTTTCC 
TCCTCCCTGG GTCAGGCTTC TCCGGACAGC AGGCAGGGAG AGAGGGTGCC GGTTCCCTGC 
TGTCGGGGTG GACTGAGACC TACCCATGAA OCAGAACCAT TTCTTCTGAG ACCCAAATCA 
GGGGTGGCCA CGTACACCGA CAAGTTGTTC AAGTTCCGCA ATAACCGGTG GGAAGACATC 
CTGAGCGATG AGGTCAACGT GGCCCGTGGT GTGGCCAGCC TCTTTGCCGG ACGCTCTGTG 
GCCTGTGTGG ACAGAAAGGG CTCTGGACGC TACTCTATCT ACATTGCCAA TTAOGCCTAC 
GGTAATGTGG GCCCTGATGC CCTCATTGAA ATGGACCCTG AGGCCAGTGA CCTCTCCCGG 
GGCATTCTGG CGCTCAGAGA TGTGGCTGCT GAGGCTGGGG TCAGCAAATA TACAGAAGGC 
TTCTCCCACA CTGCCTCTCC AAGCATTGGT GAGATATCTG GCAGAACCGA GGAGCGGGAA 
GGAGGAGACC CAGAGGAGGC AGATGAGGAG CACAGTGGGG ATGGAAGCAC CAGCCAACTG 
TGCOGGCTGG GCTGGAAGGA GGGGCAGTTC AAGGAAGAAG CAGCAGCTTT GGTGGAGGAA 
CAGAGGGAGG CTGGGGCAGC TGGCGTGCCC AGAGGACGTG TTCGAACAGC TCTGCAGACT 
TCCAAAAGCC ATTTGGCTGA CAAGAACCTA TTTGGCCCAC CATGTTACTA TTCTGTCTGC 
GCGCCTTCTC CAGCCCACCC TTTCCCTGCC CGCCAAGCCC CCCAACACTA CCCTGTAGCC 
CCCCTTGTCA CTCAGCTAAT GACACATGGA CGTCTGGCTG GAAAACTAGC CCGGAGTGTC 
CCCCACCCCC GAGCCCCAGG AATGGACCCC AAATGTAAGG GCCGCCATGC TGAGCCCGGC 
CTGATGGCTG AGGCTTTGGG OGCGTGGCCA GOGCTCAGCA CCACTGTGGT GCCAGGGGGC 
CTGAGAAGCT GGGAGGAAAG CAGGCAGAAG GGGCAGGCCA TGTCGAGATG TGCACTCAGG 
GAGCTGGGAG GTCCCTGGAG CCAAGCCACA CAGCACCTGC CTGCTAGAGA GCTGTATGAC 
CTGGGAGAAC CTCCCATTTT ACAAAGAACA GACGGAGATC CAGGGAGGAG AAGGGACTCG 
CCCAAGGTCA CACAGGAGTG CCATCTAGTG GCCACCATGC CAGCTCTCGG GGGACTCGAG 
GGCCCCGGGA GGGTGGCCAA GCGAGAGATT GGGAGAGAGA CTGGGGCAGT AGGAAGACCA 
CTCTCCCATC CCCTGGTCCC CAACTTCCCC AGCTGCTTGA GGCCTCTTGA AGCCGGGACA 
GTGCCGGGAG CTGCCCTGCC TGGGAATCCT GGGAACTGGG TTCTGGACAT GGCCAAGGCC 
CTGGCGTGGA ACCAGATGGA AAAAGAGGAG GGGAAGATTC ATGGAGACCA TGAGCCCAGA 
TTTAGGCTCA GGAAAGCACG GGAAGCAGAA TTCCCCCCAG GCTCCTCTGA GGAGCCTCTG 
CTGCAGTTCC CCTCAGGCCT CAGAGGCAGC CCTGTCCTCC AGGTGGGCCT GGGGCTTGCT 
TCTGCCACTC ACTGTGGGTC GATGTCTTTT CTAGGGGGCC GAGGCGTCAG CGTGGGCCCC 
ATCCTCAGCA GCAGTGCCTC GGATATCTTC TGCGACAATG AGAATGGGCC TAACTTCCTT 
TTCCACAACC GGGGCGATGG CACCTTTGTG GACGCTGCGG CCAGTGCTGA ACGTCGTTTA 
GCCTTCATCG TTCACCTCAA ATATCACCTC TGCAGAGATT TTCCTCACTC CCTGTGCCAC 
CTAGCAGAAA CTGGTCCTTC CTCCTCCTGC TGCCCGTGGC ATGCAOGTCT TCTTCAGGCT 
CCACATTGCC ATCATGGTTT GTCTATGAGC TTTACAAGGA CCGGGTCACG GTTCTATTCA 
TTCTTGACGC AAGGCTTGGC CTCCAGTGCC CACCGGAGGA CACTCAGCCT CCAGGGTTCT 
CAGGGGGCCC CACCCTGCCT TCTGGCAAGA GCTCCCTGTG TCCTGGGGTC TCTGATCCOC 
ACTGCCTATT ACATTGTCCT GTGGTCTGCC ATCCCAGAGA GCCTGATGAC CCACAGCTAT 
TTGTCCTCTG AAAGAGTCAA CGTGGGTGTG GACGACCCCC ACCAGCATGG GCGAGGTGTC 
GCCCTGGCTG ACTTCAACCG TGATGGCAAA GTGGACATCG TCTATGGCAA CTGGAATGGC 
CCCCACCGCC TCTATCTGCA AATGAGCACC CATGGGAAGG TCCGCTTCCG GGACATCGCC 
TCACCCAAGT TCTCCATGCC CTCCCCTGTC CGCACGGTCA TCACCGCCGA CTTTGACAAT 
GACCAGGAGC TGGAGATCTT CTTCAACAAC ATTGCCTACC GCAGCTCCTC AGCCAACCGC 
CTCTTCCGAT GCTCCATCCT GGCTCGTGGC TCTTCATCCT TGACAGCTGG TGGGAGGAAC 
GGTCAGGGAG AAGGTTTAAG AATCAGAAGG GGAGGGTTCG CAGGGCCAGG GGGTCAGGCC 
AAGGTCAACA CAGGTCCCCT GATGAAGAAA CAGAAAGGAA GGAAGGACGA GGACTGGGCA 
AGAGGCTGTG GGAATGCAGG GCAAAGCCTG GCCAAGGAGC CGGCCTCTGC TATTGCAGGG 
AAAGGGAAGG GAAATGTGGC CCAAAGTGTG CCCAGAACCC AAGCGCCACA AGATACAAAG 
CCACACTACC ACAAAAAGGG GCTACAGGGT CCAATCACTA CCAGGAAAAG GGGCTACGGG 
GTCCAATCAC TACCAGGAAA AGGGGCTACG GGGTCCAATC ACTACCAGGA AAAGGGGCTA 
CGGGGTCCAA TCACTACCAG GAAAAGGGGC TACGGGGTCC AATCACTACC AGGAAAAGGG 
GCTACGGGCT CCAATCACTA CCAGGAAAAG GGGCTACAGG GTCCAATCAC TACCAGGAAA 
AGGGGCTACG GGCTCCAATC ACTACCAGGA AAAGGGGCTA CAGGGTCCAA TCACTACCAC 
AGAAAGGGGC TACGGGCTCC AATCACTACC AGGAAAAGGG GCTACGGGGT CCAATCACTA 
CCAGGAAAAG GGGCTACAGG GTCCAATCAC TACCAGGAAA AGGGGCTACG GGGTCCAATC 
ACTACCAGGA AAAGGGGCTA CGGGCTCCAA TCACTACCAG GAAAAGGGGC TACGGGGTCC 
AATCACTACC AGGAAAAGGG GCTACAGGGT CCAATCACTA CCAGGAAAAG GGGCTACAGG 
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GTCCAATCAC TACCACAGAA AGGGGCTACG GGCTCCAATC ACTACCAGGA AAAC3GGGCTA 
CGGGGTCCAA TCACTACCAG GAAAAGGGGC TACGGGCTCC AATCACTACC AGGAAAAGAG 
GCTATGGGGT CCAATCACTA CCAGGAAAAG GGGCTACGGG CTCCAATCAC TACCAGGAAA 
AGGGGCTATG GGGTCCAATC ACTACCACAG AAAGGGGCTA CGGGGTCCAA CGTCATCCGT 
AGAGAGCACG GAGACCCCCT CATCGAGGAG CTCAATCCCG GCGACGCCTT GGAGCCTGAG 
GGCCGGGGCA CAGGGGGTGT GGTGACCGAC TTCGACGGAG ACGGGATGCT GGACCTCATC 
TTGTCCCATG GAGAGTCCAT GGCTCAGCCG CTGTCCGTCT TCCGGGGCAA TCAGGGCTTC 
AACAACAACT GGCTGCGAGT GGTGCCACGC ACCCGGTTTG GGGCCTTTGC CAGGGGAGCT 
AAGGTCGTGC TCTACACCAA GAAGAGTGGG GCCCACCTGA GGATCATCGA CGGGGGCTCA 
GGCTACCTGT GTGAGATGGA GCCCGTGGCA CACTTTGGCC TGGGGAAGGA TGAAGCCAGC 
AGTGTGGAGG TGACGTGGCC AGATGGCAAG ATGGTGAGCC GGAACGTGGC CAGCGGGGAG 
ATGAACTCAG TGCTGGAGAT CCTCTACCCC CGGGATGAGG ACACACTTCA GGACCCAGCC 
CCACTGGAGT GTGGCCAAGG ATTCTCCCAG CAGGAAAATG GCCATTGCAT GGACACCAAT 
GAATGCATCC AGTTCCCATT CGTGTGCCCT CGAGACAAGC CCGTATGTGT CAACACCTAT 
GGAAGCTACA GGTGCCGGAC CAACAAGAAG TGCAGTCGGG GCTACGAGCC CAACGAGGAT 
GGCACAGCCT GCGTGGGTAC TGAGCTAGGC TCTAGGCATA CAATGACGTG GAAACCAAGG 
CCCAAAAAGG AGCTGCAACT TTCCCAAGGC ATCTGCACCC CCGTCTGGTC CTTTTTCCTG 
CCGGGTTGCC GGCTGCTCCT CAAAAGAGCT CAGCTCCAGG CTGCTCCCAG CACCCTTCTC 
CAGAAAGCTC CAGGTATTCC AGAAGCCCAA GTGTATGAAC AAGATCAGGA ATAA 

Seq ID NO: 185 Protein sequence 
Protein Accession #: FGENESHH 

1 11 21 31 41 51 

I I I I I I 

MACPGGLPAR CSGWMGLGGP SGSSPASPPH SSSRYNGPNI* VLKYDRAQKR LVNIAVDERS 60 

SPYYALRDRQ GNAIGVTACD IDGDGREEIY FLNTNNAFSG HSSSAQVPSG LHRNRPVLKP 120 

PPTTPAGIiLG LPPLSGRDFS SSLGQASPDS RQGERVPVPC CRGGLRPTHE PEPPLLRPKS 180 

GVATYTDKLF KFRNNRWEDI LSDEVNVARG VASLPAGRSV ACVDRKGSGR YSIYIANYAY 240 

GNVGPDALIE MDPBASDLSR GILALRDVAA EAGVSKYTEG FSHTASPSIG EISGRTEERE 300 

GGDPEEADEB HSGDGSTSQL CRLGWKDGQF KEBAAALVEE QREAGAAGVP RGRVRTALQT 360 

SKSHLADKNL FGPPCYYSVC APSPAHPFPA RQAPQHYPVA PLVTQLHTHG RLAGKLARSV 420 

PHPRAPGMDP KCKGRHAEPG LKAEALGAWP ALSTTWPGG LRSWEESRQK GQAMSRCAIiR 480 

ELGGPWSQAT QHLPARELYD LGEPPILQRT DGDPGRRROS PKVTQECHLV ATMPALGGI.E 540 

GPGRVAKREI GRETGAVGRP LSHPLVPNFP SCLRPLEAGT VPGAAXjPGNP GNWVLDMAKA 600 

LAWNQMEKEE GKIHGDHEPR FRLRKAREAE FPPGSSEEPL LQFPSGXiRGS PVLQVGLGLA 660 

SATHCGSMSF LGGRGVSVGP ILSSSASDIF CDNENGFNFL FHNRGDGTFV DAAASAERRL 720 

AFIVHLKYHL CRDFPHSLCH LAETGPSSSC CPWHARLLQA PHCHHGLSMS FTRTGSRFYS 780 

FLTQGLASSA HRRTLSLQGS QGAPPCLIiAR APCVLGSLIP TAYYIVLWSA IPESLMTHSY 840 

LSSBRVNVGV DDPHQHGRGV ALADFNRDGK VDIVYGNWNG PHRLYLQMST HGKVRFRDIA 900 

SPKFSMPSPV RTVITADFDN DQELEIFFNN IAYRSSSANR LFRCSILARG SSSLTAGGRN 960 

GQGEGLRIRR GGFPGPGGQA KVNTGPLMKK QKGRKDEDWA RGCGNAGQSL AREPASAIAG 1020 

KGKGNVAQSV PRTQAPQDTK PHYHKKGLOG PITTRKRGYG VQSLPGKGAT GSNHYQEKGL 1080 

RGPITTRKRG YGVQSLPGKQ ATGSNHYQEK GLCGPITTRK RGYGLQSLPG KGATGSNHYH 1140 

RKGLRAPITT RKRGYGVQSL PGKGATGSNH YQEKGLRGPI TTRKRGYGLQ SLPGKGATGS 1200 

NHYQEKGLQG PITTRKRGYR VQSLPQKGAT GSNHYQEKGL RGPITTRKRG YGLQSLPGKE 1260 

AMGSNHYQEK GLRAPITTRK RGYGVQSLPQ KGATGSNVIR REHGDPLIEE LNPGDALEPE 1320 

GRGTGGWTD FDGDGMLDLI LSHGESMAQP LSVFRGNQGF NNNWLRWPR TRFGAFARGA 1380 

KWLYTKKSG AHLRIIDGGS GYLCEMEPVA HFGLGKDEAS SVEVTWPDGK MVSRNVASGE 1440 

MNSVLEILYP RDEDTLQDPA PLECGOGFSQ QENGHCMDTN ECIQPPFVCP RDKPVCVNTY 1500 

GSYRCRTNKK CSRGYEPNED GTACVGTELG SRHTMTWKPR PKKELQLSOG ICTPVWSFFL 1S60 
PGCRLLLKRA QLQAAPSTLL QKAPGIPEAQ VYEQDQE 

Seq ID NO: 186 DMA sequence 

Nucleic Acid Accession #: NM_000584.1 

Coding sequence: 75.. 3 74 

1 11 21 31 41 51 

I I I I I I 

AGCAGAGCAC ACAAGCTTCT AGGACAAGAG CCAGGAAGAA ACCACCGGAA GGAACCATCT 60 

CACTGTGTGT AAACATGACT TCCAAGCTGG CCGTGGCTCT CTTGGCAGCC TTCCTGATTT 120 

CTGCAGCTCT GTGTGAAGGT GCAGTTTTGC CAAGGAGTGC TAAAGAACTT AGATGTCAGT 180 

GCATAAAGAC ATACTCCAAA CCTTTCCACC CCAAATTTAT CAAAGAACTG AGAGTGATTG 240 

AGAGTGGACC ACACTGCGCC AACACAGAAA TTATTGTAAA GCTTTCTGAT GGAAGAGAGC 300 

TCTGTCTGGA CCCCAAGGAA AACTGGGTGC AGAGGGTTGT GGAGAAGTTT TTGAAGAGGG 360 

CTGAGAATTC ATAAAAAAAT TCATTCTCTG TGGTATCCAA GAATCAGTGA AGATGCCAGT 420 

GAAACTTCAA GCAAATCTAC TTCAACACTT CATGTATTGT GTGGGTCTGT TGTAGGGTTG 480 

CCAGATGCAA TACAAGATTC CTGGTTAAAT TTGAATTTCA GTAAACAATG AATAGTTTTT 540 

CATTGTACCA TGAAATATCC AGAACATACT TATATGTAAA GTATTATTTA TTTGAATCTA 600 

CAAAAAACAA CAAATAATTT TTAAATATAA GGATTTTCCT AGATATTGCA CGGGAGAATA 660 

TACAAATAGC AAAATTGAGC CAAGGGCCAA GAGAATATCC GAACTTTAAT TTCAGGAATT 720 

GAATGGGTTT GCTAGAATGT GATATTTGAA GCATCACATA AAAATGATGG GACAATAAAT 780 

TTTGCCATAA AGTCAAATTT AGCTGGAAAT CCTGGATTTT TTTCTGTTAA ATCTGGCAAC 840 

CCTAGTCTGC TAGCCAGGAT CCACAAGTCC TTGTTCCACT GTGCCTTGGT TTCTCCTTTA 900 

TTTCTAAGTG GAAAAAGTAT TAGCCACCAT CTTACCTCAC AGTGATGTTG TGAGGACATG 960 

TGGAAGCACT TTAAG TT TTT TCATCATAAC ATAAATTATT TTCAAGTGTA ACTTATTAAC 1020 

CTATTTATTA TTTATGTATT TATTTAAGCA TCAAATATTT GTGCAAGAAT TTGGAAAAAT 1080 

AGAAGATGAA TCATTGATTG AATAGTTATA AAGATGTTAT AGTAAATTTA TTTTATTTTA 1140 

GATATTAAAT GATGTTTTAT TAGATAAATT TCAATCAGGG TTTTTAGATT AAACAAAGAA 1200 

ACAATTGGGT ACCCAGTTAA ATTTTCATTT CAGATAAACA ACAAATAATT TTTTAGTATA 1260 

AGTACATTAT TGTTTATCTG AAAGTTTTAA TTGAACTAAC AATCCTAGTT TGATACTCCC 1320 

AGTCTTGTCA TTGCCAGCTG TGTTGGTAGT GCTGTGTTGA ATTACGGAAT AATGAGTTAG 1380 

AACTATTAAA ACAGCCAAAA CTCCACAGTC AATATTAGTA ATTTCTTGCT GGTTGAAACT 1440 

TGTTTATTAT GTACAAATAG ATTCTTATAA TATTATTTAA ATGACTGCAT TTTTAAATAC 1500 
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AAGGCTTTAT ATTTTTAACT TTAAGATGTT TTTATGTGCT CTCCAAATTT TTTTTACTGT 1560 
TTCTGATTGT ATGGAAATAT AAAAGTAAAT ATGAAACATT TAAAATATAA TTTGTTGTCA 1620 
AAGTAAAAAA AAAAAAAAA 

5 Seq ID NO* 187 Protein Bequence 
Protein Accession ft: NP_O0O575.l 

1 11 21 31 41 51 

in I I 1 I 1 1 

10 MTSKLAVALL AAFLISAALC EGAVLPRSAK ELRCQCIKTY SKPFHPKFIK ELRVIESGPH 60 
CANTEIIVKL SDGRELCLDP KENWVQRWE KFLKRAENS 

Seq ID NO: 188 DNA sequence 
Nucleic Acid Accession 8: NM_0O366l.l 
IS Coding sequence: 1..1152 

1 11 21 31 41 51 

I I I I I I 

ATGAGTGCAC TTTTCCTTGG TGTGGGAGTG AGGGCAGAGG AAGCTGGAGC GAGGGTGCAA 60 

20 CAAAACGTTC CAAGTGGGAC AGATACTGGA GATCCTCAAA GTAAGCCCCT CGGTGACTGG 120 

GCTGCTGGCA CCATGGACCC AGAGAGCAGT ATCTTTATTG AGGATGCCAT TAAGTATTTC 180 

AAGGAAAAAG TGAGCACACA GAATCTGCTA CTCCTGCTGA CTGATAATGA GGCCTGGAAC 240 

GGATTCGTGO CTGCTGCTGA ACTGCCCAGG AATGAGGCAG ATGAGCTCCG TAAAGCTCTG 300 

GACAACCTTG CAAGACAAAT GATCATGAAA GACAAAAACT GGCACGATAA AGGCCAGCAG 360 

25 TACAGAAACT GGTTTCTGAA AGAGTTTCCT CGGTTGAAAA GTGAGCTTGA GGATAACATA 420 

AGAAGGCTCC GTGCCCTTGC AGATGGGGTT CAGAAGGTCC ACAAAGGCAC CACCATCGCC 480 

AATGTGGTGT CTGGCTCTCT CAGCATTTCC TCTGGCATCC TGACCCTCGT CGGCATGGGT 540 

CTGGCACCCT TCACAGAGGG AGGCAGCCTT GTACTCTTGG AACCTGGGAT GGAGTTGGGA 600 

ATCACAGCCG CTTTGACCGG GATTACCAGC AGTACCATGG ACTAOGGAAA GAAGTGGTGG 660 

30 ACACAAGCCC AAGCCCACGA CCTGGTCATC AAAAGCCTTG ACAAATTGAA GGAGGTGAGG 720 

GAGTTTTTGG GTGAGAACAT ATCCAACTTT CTTTCCTTAG CTGGCAATAC TTACCAACTC 780 

ACACGAGGCA TTGGGAAGGA CATCCGTGCC CTCAGACGAG CCAGAGCCAA TCTTCAGTCA 840 

GTACCGCATG CCTCAGCCTC ACGCCCCCGG GTCACTGAGC CAATCTCAGC TGAAAGCGGT 900 

GAACAGGTGG AGAGGGTTAA TGAACCCAGC ATCCTGGAAA TGAGCAGAGG AGTCAAGCTC 960 

35 ACGGATGTGG CCCCTGTAAG CTTCTTTCTT GTGCTGGATG TAGTCTACCT CGTGTACGAA 1020 

TCAAAGCACT TACATGAGGG GGCAAAGTCA GAGACAGCTG AGGAGCTGAA GAAGGTGGCT 1080 

CAGGAGCTGG AGGAGAAGCT AAACATTCTC AACAATAATT ATAAGATTCT GCAGGCGGAC 1140 
CAAGAACTGT GA 

40 Seq ID NO: 189 Protein sequence 
Protein Accession #: NP_0 03 652.1 

1 11 21 31 41 51 

AC I \ \ I I I I 

4j MSALFLGVGV RAEEAGARVQ QNVPSGTDTG DPQSKPLGDW AAGTMDPESS IFIEDAIKYF 60 

KEKVSTQNIiI* LLLTDNEAWN GFVAAAELPR NEADELRKAL DNLARQMIMK DKNWHDKGQQ 120 

YRNWFLKEFP RLKSELEDNI RRLRALADGV QKVHKGTTIA NWSGSLSIS SGILTLVGMG 180 

LAPFTEGGSI* VLLEPGMELG ITAALTGITS STMDYGKKWW TQAQAHDLVI KSLDKLKEVR 240 

BFLGENISNF LSLAGNTYQL TRGIGKDIRA LRRARANLQS VPHASASRPR VTEPISABSG 300 

50 EQVERVNEPS ILEMSRGVKL TDVAPVSFFL VLDWYLVYB SKHLHEGAKS ETAEELKKVA 360 
QELEEKLNIL NNNYKILQAD QEL 

Seq ID NO: 190 DNA sequence 
Nucleic Acid Accession #: NM_014452.1 
55 Coding sequence: 1..196B 

1 11 21 31 41 51 

I I I I I I 

ATGGGGACCT CTCCGAGCAG CAGCACCGCC CTCGCCTCCT GCAGCCGCAT CGCCCGCCGA 60 

60 GCCACAGCCA CGATGATCGC GGGCTCCCTT CTCCTGCTTG GATTCCTTAG CACCACCACA 120 

GCTCAGCCAG AACAGAAGGC CTCGAATCTC ATTGGCACAT ACCGCCATGT TGACCGTGCC 180 

ACCGGCCAGG TGCTAACCTG TGACAAGTGT CCAGCAGGAA CCTATGTCTC TGAGCATTGT 240 

ACCAACACAA GCCTGCGCGT CTGCAGCAGT TGCCCTGTGG GGACCTTTAC CAGGCATGAG 300 

AATGGCATAG AGAAATGCCA TGACTGTAGT CAGCCATGCC CATGGCCAAT GATTGAGAAA 360 

65 TTACCTTGTG CTGCCTTGAC TGACCGAGAA TGCACTTGCC CACCTGGCAT GTTCCAGTCT 420 

AACGCTACCT GTGCCCCCCA TACGGTGTGT CCTGTGGGTT GGGGTGTGCG GAAGAAAGGG 480 

ACAGAGACTG AGGATGTGCG GTGTAAGCAG TGTGCTOGGG GTACCTTCTC AGATGTGCCT 540 

TCTAGTGTGA TGAAATGCAA AGCATACACA GACTGTCTGA GTCAGAACCT GGTGGTGATC 600 

„ AAGCCGGGGA CCAAGGAGAC AGACAACGTC TGTGGCACAC TCCCGTCCTT CTCCAGCTCC 660 

70 ACCTCACCTT CCCCTGGCAC AGCCATCTTT CCACGCCCTG AGCACATGGA AACCCATGAA 720 

GTCCCTTCCT CCACTTATGT TCCCAAAGGC ATGAACTCAA CAGAATCCAA CTCTTCTGCC 780 

TCTGTTAGAC CAAAGGTACT GAGTAGCATC CAGGAAGGGA CAGTCCCTGA CAACACAAGC 840 

TCAGCAAGGG GGAAGGAAGA CGTGAACAAG ACCCTCCCAA ACCTTCAGGT AGTCAACCAC 900 

CAGCAAGGCC CCCACCACAG ACACATCCTG AAGCTGCTGC CGTCCATGGA GGCCACTGGG 960 

75 GGCGAGAAGT CCAGCACGCC CATCAAGGGC CCCAAGAGGG GACATCCTAG ACAGAACCTA 1020 

CACAAGCATT TTGACATCAA TGAGCATTTG CCCTGGATGA TTGTGCTTTT CCTGCTGCTG 1080 

GTGCTTGTGG TGATTGTGGT GTGCAGTATC CGGAAAAGCT CGAGGACTCT GAAAAAGGGG 1140 

CCCCGGCAGG ATCCCAGTGC CATTGTGGAA AAGGCAGGGC TGAAGAAATC CATGACTCCA 1200 

ACCCAGAACC GGGAGAAATG GATCTACTAC TGCAATGGCC ATGGTATCGA TATCCTGAAG 1260 

80 CTTGTAGCAG CCCAAGTGGG AAGCCAGTGG AAAGATATCT ATCAGTTTCT TTGCAATGCC 1320 

AGTGAGAGGG AGGTTGCTGC TTTCTCCAAT GGGTACACAG CCGACCACGA GCGGGCCTAC 1380 

GCAGCTCTGC AGCACTGGAC CATCCGGGGC CCCGAGGCCA GCCTCGCCCA GCTAATTAGC 1440 

GCCCTGCGCC AGCACCGGAG AAACGATGTT GTGGAGAAGA TTCGTGGGCT GATGGAAGAC 1500 

ACCACCCAGC TGGAAACTGA CAAACTAGCT CTCCCGATGA GCCCCAGCCC GCTTAGCCCG 1560 
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AGCCCCATCC CCAGCCCCAA CGCGAAACTT GAGAATTCCQ CTCTCCTGAC GGTGGAGCCT 1620 

TCCCCACAGG ACAAGAACAA GGGCTTCTTC GTGGATGAGT CGGAGCCCCT TCTCCGCTGT 1680 

GACTCTACAT CCAGCGGCTC CTCCGCGCTG AGCAGGAACG GTTCCTTTAT TACCAAAGAA 1740 

AAGAAGGACA CAGTGTTGCG GCAGGTACGC CTGGACCCCT GTGACTTGCA GCCTATCTTT 1800 

GATGACATGC TCCACTTTCT AAATCCTGAG GAGCTGCGGG TGATTGAAGA GATTCCCCAG 1860 

GCTGAGGACA AACTAGACCG GCTATTCGAA ATTATTGGAG TCAAGAGCCA GGAAGCCAGC 1920 
CAGACCCTCC TGGACTCTGT TTATAGCCAT CTTCCTGACC TGCTGTAG 

Seq ID NO: 191 Protein sequence 
Protein Accession #: NP_0SS267.1 

1 11 21 31 41 51 

I 1 I I I I 

MGTSPSSSTA LASCSRIARR ATATMIAGSL LLLGFLSTTT AQPBQKASNL IGTYRHVDRA 60 

TGQVLTCDKC PAGTYVSEHC TNTSLRVCSS CPVGTFTRHE NGIEKCHDCS QPCPWPMIEK 120 

XjPCAALTDRE CTCPPGMFQS NATCAPHTVC PVGWGVRKKG TETEDVRCKQ CARGTFSDVP 180 

SSVMKCKAYT DCLSQNLWI KPGTKETDNV CGTLPSFSSS TSPSPGTAIF PRPEHMETHE 240 

VPSSTYVPKG MNSTESNSSA SVRPKVLSSI QEGTVPDNTS SARGKEDVNK TLPNLQWNH 300 

QQGPHHRHIL KLLPSMEATG GEKSSTPIKG PKRGHPRQNL HKHFDINEHL PWMIVLFLLL 360 

VLWIWCSI RKSSRTLKKG PRQDPSAIVE KAGLKKSMTP TQNREKWIYY CNGHGIDILK 420 

LVAAQVGSQW KDIYQFLCNA SEREVAAFSN GYTADHERAY AALQHWTIRG PEASLAQLIS 480 

ALRQHRRNDV VEKIRGLMED TTQLETDKLA liPMSPSPLSP SPIPSPNAKL ENSALLTVEP 540 

SPQDKNKGFF VDESEPLLRC DSTSSGSSAL SRNGSFITKE KKDTVLRQVR LDPCDLQPIF 600 
DDMLHFLNPE ELRVIEEIPQ AEDKLDRLFE IIGVKSQEAS QTLLDSVYSH LPDLL 

Seq ID NO: 192 DNA sequence 

Nucleic Acid Accession #: XM_044533 

Coding sequence: 238.. 2751 

1 11 21 31 41 51 

I I I I I I 

GCTCTGCCCA AGCCGAGGCT GCGGGGCCGG CGCCGGCGGG AGGACTGCGG TGCCCCGCGG 60 

AGGGGCTGAG TTTGCCAGGG CCCACTTGAC CCTGTTTCCC ACCTCCCGCC CCCCAGGTCC 120 

GGAGGCGGGG GCCCCCGGGG CGACTCGGGG GCGGACCGCG GGGCGGAGCT GCCGCCCGTG 180 

AGTCCGGCCG AGCCACCTGA GCCCGAGCCG CGGGACACCG TCGCTCCTGC TCTCCGAATG 240 

CTGCGCACOG CGATGGGCCT GAGGAGCTGG CTCGCCGCCC CATGGGGCGC GCTGCCGCCT 300 

CGGCCACCGC TGCTGCTGCT CCTGCTGCTG CTGCTCCTGC TGCAGCCGCC GCCTCCGACC 360 

TGGGCGCTCA GCCCCCGGAT CAGCCTGCCT CTGGGCTCTG AAGAGCGGCC ATTCCTCAGA 420 

TTCGAAGCTG AACACATCTC CAACTACACA GCCCTTCTGC TGAGCAGGGA TGGCAGGACC 480 

CTGTACGTGG GTGCTCGAGA GGCCCTCTTT GCACTCAGTA GCAACCTCAG CTTCCTGCCA 540 

GGCGGGGAGT ACCAGGAGCT GCTTTGGGGT GCAGACGCAG AGAAGAAACA GCAGTGCAGC 600 

TTCAAGGGCA AGGACCCACA GCGCGACTGT CAAAACTACA TCAAGATCCT CCTGCCGCTC 660 

AGCGGCAGTC ACCTGTTCAC CTGTGGCACA GCAGCCTTCA GCCCCATGTG TACCTACATC 720 

AACATGGAGA ACTTCACCCT GGCAAGGGAC GAGAAGGGGA ATGTCCTCCT GGAAGATGGC 780 

AAGGGCCGTT GTCCCTTGGA CCCGAATTTC AAGTCCACTG CCCTGGTGGT TGATGGCGAG 840 

CTCTACACTG GAACAGTCAG CAGCTTCCAA GGGAATGACC CGGCCATCTC GCGGAGCCAA 900 

AGCCTTCGCC CCACCAAGAC CGAGAGCTCC CTCAACTGGC TGCAAGACCC AGCTTTTGTG 960 

GCCTCAGCCT ACATTCCTGA GAGCCTGGGC AGCTTGCAAG GCGATGATGA CAAGATCTAC 1020 

TTTTTCTTCA GCGAGACTGG OCAGGAATTT GAGTTCTTTG AGAACACCAT TGTGTCCCGC 1080 

ATTGCCCGCA TCTGCAAGGG CGATGAGGGT GGAGAGCGGG TGCTACAGCA GCGCTGGACC 1140 

TCCTTCCTCA AGGCCCAGCT GCTGTGCTCA CGGCCCGACG ATGGCTTCCC CTTCAACGTG 1200 

CTGCAGGATG TCTTCACGCT GAGCCCCAGC CCCCAGGACT GGCGTGACAC CCTTTTCTAT 1260 

GGGGTCTTCA CTTCCCAGTG GCACAGGGGA ACTACAGAAG GCTCTGCCGT CTGTGTCTTC 1320 

ACAATGAAGG ATGTGCAGAG AGTCTTCAGC GGCCTCTACA AGGAGGTGAA CGGTGAGACA 1380 

CAGCAGTGGT ACACCGTGAC CCACCCGGTG CCCACACCCC GGCCTGGAGC GTGCATCACC 1440 

AACAGTGCCC GGGAAAGGAA GATCAACTCA TCCCTGCAGC TCCCAGACCG CGTGCTGAAC 1500 

TTCCTCAAGG ACCACTTCCT GATGGACGGG CAGGTCCGAA GCCGCATGCT GCTGCTGCAG 1560 

CCCCAGGCTC GCTACCAGCG CGTGGCTGTA CACCGCGTCC CTGGCCTGCA CCACACCTAC 1620 

GATGTCCTCT TCCTGGGCAC TGGTGACGGC CGGCTCCACA AGGCAGTGAG CGTGGGCCCC 1680 

CGGGTGCACA TCATTGAGGA GCTGCAGATC TTCTCATCGG GACAGCCCGT GCAGAATCTG 1740 

CTCCTGGACA CCCACAGGGG GCTGCTGTAT GCGGCCTCAC ACTCGGGCGT AGTCCAGGTG 1800 

CCCATGGCCA ACTGCAGCCT GTACAGGAGC TGTGGGGACT GCCTCCTCGC CCGGGACCCC 1860 

TACTGTGCTT GGAGCGGCTC CAGCTGCAAG CACGTCAGCC TCTACCAGCC TCAGCTGGCC 1920 

ACCAGGCCGT GGATCCAGGA CATCGAGGGA GCCAGCGCCA AGGACCTTTG CAGCGCGTCT 1980 

TCGGTTGTGT CCCCGTCTTT TGTACCAACA GGGGAGAAGC CATGTGAGCA AGTCCAGTTC 2040 

CAGCCCAACA CAGTGAACAC TTTGGCCTGC CCGCTCCTCT CCAACCTGGC GACCCGACTC 2100 

TGGCTACGCA AOGGGGCCCC CGTCAATGCC TCGGCCTCCT GCCACGTGCT ACCCACTGGG 2160 

GACCTGCTGC TGGTGGGCAC CCAACAGCTG GGGGAGTTCC AGTGCTGGTC ACTAGAGGAG 2220 

GGCTTCCAGC AGCTGGTAGC CAGCTACTGC CCAGAGGTGG TGGAGGACGG GGTGGCAGAC 2280 

CAAACAGATG AGGGTGGCAG TGTACCOGTC ATTATCAGCA CATCGCGTGT GAGTGCACCA 2340 

GCTGGTGGCA AGGCCAGCTG GGGTGCAGAC AGGTCCTACT GGAAGGAGTT CCTGGTGATG 2400 

TGCACGCTCT TTGTGCTGGC CGTGCTGCTC CCAGTTTTAT TCTTGCTCTA CCGGCACCGG 2460 

AACAGCATGA AAGTCTTCCT GAAGCAGGGG GAATGTGCCA GCGTGCACCC CAAGACCTGC 2520 

CCTGTGGTGC TGCCCCCTGA GACCCGCCCA CTCAAOGGCC TAGGGCOCCC TAGCACCCCG 2580 

CTCGATCACC GAGGGTACCA GTCCCTGTCA GACAGCCCCC CGGGGTCCCG AGTCTTCACT 2640 

GAGTCAGAGA AGAGGCCACT CAGCATCCAA GACAGCTTCG TGGAGGTATC CCCAGTGTGC 2700 

CCCCGGCCCC GGGTCCGCCT TGGCTCGGAG ATCCGTGACT CTGTGGTGTG AGAGCTGACT 2760 

TCCAGAGGAC GCTGCCCTGG CTTCAGGGGC TGTGAATGCT CGGAGAGGGT CAACTGGACC 2820 

TCCCCTCCGC TCTGCTCTTC GTGGAACACG ACCGTGGTGC CCGGCCCTTG GGAGCCTTGG 2880 

GGCCAGCTGG CCTGCTGCTC TCCAGTCAAG TAGOQAAGCT CCTACCACCC AGACACCCAA 2940 

• ACAGCCGTGG CCCCAGAGGT CCTGGCCAAA TATGGGGGCC TGCCTAGGTT GGTGGAACAG 3000 

TGCTCCTTAT GTAAACTGAG CCCTTTGTTT AAAAAACAAT TCCAAATGTG AAACTAGAAT 3060 

GAGAGGGAAG AGATAGCATG GCATGCAGCA CACACGGCTG CTCCAGTTCA TGGCCTCCCA 3120 

GGGGTGCTGG GGATGCATCC AAAGTGGTTG TCTGAGACAG AGTTGGAAAC CCTCACCAAC 3180 
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TGGCCTCTTC 
CAGGACCAGC 
CTGCCGTCGT 
GGTCCTGGGC 
GAGAGGACAG 
CAGGGAAGAG 
CCACCATATC 
GTCCTCTCCC 
TCAACACTGC 
ATGCACTTTA 



ACCTTCCACA 
TTGGGCTGCG 
CCCACCACCT 
TCGGACCCAA 
CGCQAGCTCA 
ACTGTOGCXrr 
CACCCTCGCT 
CAGTCCCCAG 
CCAGCACAGG 
TGTCATTTTT 



TTATCCCGCT 
TGCGTTCTGC 
CAGGGACCAG 
CTCCTGGACC 
GGAGAGATTT 
GCCTTCCTCC 
CCATCTTTGA 
TTCACCCTCC 
GGCCCTGAAT 
TAATAAAGTC 



GCCACOGGCT 
CTTGCCAGTC 
AGGGCTAGGT 
TTTCCAGCCT 
CGTGACAATG 
GTTGTTGCGT 
ACTCAAACAC 
ATCCCTCACC 
TTATGTGGTT 
TGAAGAATTA 



GCCCTGTCTC 
AGCCGAGGAT 
TGGCACTGCG 
GTATCAGGCT 
TACGCCTTTC 
GAGAACCCGT 
GAGGAACTAA 
TTCCTCCACT 
TTTATACATT 
CTGTTT 



ACTGCAGATT 
GTAGTTGTTG 
GCCCTCACCA 
GTGGCCACAC 
CCTCAGAATT 
GTGCCCCTTC 
CTGCACCCTG 
CTAAGGGATA 
TTTTAATAAG 



Seq ID NO: 193 Protein sequence 
Protein Accession #: XP 044533.3 



MLRTAMGLRS 
RFEAEHISNY 
SFKGKDPQRD 
GKGRCPFDPN 
VASAYIPESL 
TSFLKAQLLC 
FTMKDVQRVF 
NPLKDHFLMD 
PRVHIIEELQ 
PYCAWSGSSC 
FQPNTVNTLA 
EGFQQLVASY 
MCTLFVLAVL 
PLDHRGYQSL 



11 
I 

WIAAPWGALP 
TALLLSRDGR 
CQNYIKILLP 
FKSTALWDG 
GSLQGDDDKI 
SRPDDGFPFN 
SGLYKEVNRE 
GQVRSRMLLL 
IFSSGQPVQN 
KHVSLYQPQIj 
CPLLSNLATR 
CPBWBDGVA 
LPVLPLLYRH 
SDSPPGSRVP 



21 
I 

PRPPLLLIiLL 
TLYVGAREAL 
LSGSHLPTCG 
ELYTGTVSSF 
YFFFSETGQE 
VLQDVFTLSP 
TQQWYTVTHP 
QPQARYQRVA 
LLLDTHRGLL 
ATRPWIQDIE 
LWLRNGAPVN 
DQTDEGGSVP 
RNSMKVFLKQ 
TESEKRPLSI 



31 
I 

LLLLLQPPPP 
FALSSNLSFL 
TAAFSPMCTY 
QGNDPAISRS 
FEFFENTIVS 
SPQDWRDTLF 
VPTPRPGACI 
VHRVPGLHHT 
YAASHSGWQ 
GASAKDLCSA 
ASASCHVLPT 
VIISTSRVSA 



QDSFVEVSPV 



41 
I 

TWALSPRISL 
PGGEYQELLW 
INMENFTLAR 
QSLRPTKTES 
RIARICKGDE 
YGVFTSQWHR 
TNSARERKIN 
YDVLPLGTGD 
VPMANCSLYR 
SSWSPSPVP 
GDLLLVGTQQ 
PAGGKASWGA 
CPWLPPETR 
CPRPRVRLGS 



Seq ID NO: 194 DNA sequence 

Nucleic Acid Accession #: NM_022819.1 

Coding sequence: 1..635 



1 
I 

ATGGCAGATG 
TTCTCTGGGT 
AGCCTGGGTA 
GCTCACGGCA 
ATCCTGTCCT 
GATGAGGTGG 
GGCTGTCACC 
TGCAGTGACC 
GTTCTGTGCC 
TGCCAGGGCC 
AGTCACCAAT 



11 
I 

GGGCAAAGGC 
GGAGGGGCCC 
TGAAGAAGTT 
GCCTGCTCAA 
TCGTGGGCTA 
ACTGGTGCTG 
CCTATGTGGA 
TCAACAAGAC 
TCATGAACCA 
CCACGCCCAA 
CCCCAGCGCC 



21 
I 

CAACCCCAAA 
ACGCTTCGGG 
CTTCACCGTG 
CCTGAAGGCC 
CGGTTGCTAC 
CCACGCCCAC 
CCACTATGAT 
AGAGTGTGAC 
GACGTACCGA 
CTGCAGCATC 
CCCCGCCCCT 



31 
I 

GGGTTCAAAA 
GCCTCCTGTC 
GCCATCCTTG 
ATGGTGGAGG 
TGTGGGCTGG 
GACTGCTGCT 
CACACCATCG 
AAGCAGACAT 
GAGGAGTACC 
TATGAACCGC 
CCCTAG 



41 
I 

AGAAGGTGCT 
CTTCAAGAAC 
CTGGCAGCGT 
CCGTCACAGG 
GGGGCCGTGG 
ACCAGGAACT 
AGAACAACAC 
GCATGTGTGA 
GTGGCTTCCT 
CCCCTGAGGA 



3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 



51 
I 

PLGSEERPFL 
GADAEKKQQC 
DEKGNVLLED 
SUIWLQDPAF 
GGERVLQQRW 
GTTEGSAVCV 
SSLQLPDRVL 
GRLHKAVSVG 
SCGDCLIiARD 
TGEKPCEQVQ 
LGEFQCWSLE 
DRSYHKEFLV 
PLNGLGPPST 
EIRDSW 



51 
I 

GGATAGATGC 
CTCCAGGTCT 
TCTGTCCACA 
GAGGAGCGCC 
CCAGCCCAAG 
CTTTGACCAA 
TGAGATAGTC 
CAAGAACATG 
CAATGTCTAC 
GGTCACCTGC 



Seq ID NO: 195 Protein sequence 
Protein Accession #: NP_073730 

1 11 21 31 41 SI 

1 i I I I I 

MADGAKANPK GFKKKVLDRC FSGWRGPRFG ASCPSRTSRS SLGMKKFFTV AILAGSVLST 
AHG SLLNLKA KVEAVTGRSA ILSFVGYGCY CGLGGRGQPK DEVDWCCHAH DCCYQELFDQ 
GCHPYVDHYD HTIENNTEIV CSDLNKTECD KQTCMCDKNM VLCLMNQTYR EEYRGFLNVY 
CQGPTPNCSI YEPPPEEVTC SHQSPAPPAP P 

Seq ID NO: 196 DNA sequence 

Nucleic Acid Accession #: XM_028196.1 

Coding sequence: 13 15.. 17 91 " 



GGCATTGATG 
GTGTGTGTGT 
AGACTGGGGG 
ATCGCCTGGG 
CTCTTGTGGG 
CTGCTCTCTG 
GGGTCACCGT 
CCCAGCCAGG 

GAGTCCAGGC 
CTGCCCCTAC 
CAGGAATGCA 
CTCCCGCCAT 
ATGCTCCACC 
AGAGGCAAGG 
AGGCAGGGCC 
TGGAGGACGC 
AGGCTGCCCA 
GCTGGGCCTG 





ACCTTCCTTC 
AGCGGCTGAG 
CTGCCACCAC 
CCTTTAGCCC 
GACCCTGCAG 
TGCATGGCTG 
AAGTGATGGG 
CAGGGTCCAG 
ATCCTACAGT 
GGCCCTGCCT 
GGCAGCTAAG 



ATGGCAGGGT 
TGAGGGTATC 
GAGGCCAGGC 
GAGGCGCCAA 
AGTCAGGGAA 
TAGCACCCTG 
CCCCAGGCCC 
CCACCCACAT 
TCAGTGTGTG 
CTCACCTTCA 
AGGCCTGCTC 
ACCCCTCTGG 
GCAAACCATG 
ACCGCAGAGA 
GGCCCAGGAG 
GGGGGCAAGG 
GCTTGGCTGG 
CTGGAGCTTT 



TGGCCTCAAC 
TGACTTGGAC 
TGTGGAGGAA 
GAGCTGAGCT 
GTTCCCCTGC 
TAGGCAGGCG 
GCCGAAGGGT 
TGGAATGTTC 
TCCTCAGGCG 
AGTGAGCTCC 
GCTTOCAAGT 
GTGGGCCCCA 
TGAGACCCCC 
AGAGAAGCAG 
GTGCTCTGAG 
GGCTGGGGGC 
GGCCAGGGTC 




CTGCGGCCCT 
GCCGACAGCC 
TCCTGGGGGC 
GCTGTGGTGC 
AGGGATGAGA 
GGAGGGAGAG 
GTCCGGTGAA 
TGCTGGGAGG 
CAGAGCCTCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



60 
120 
180 



51 
I 

GTGTGTGTGT 
GCTCTGCCCC 
TCCCGATGGG 
GGGTCCCAGG 
TGGCTGGCTG 
CCTGGGCCTG 
CAGGAGCCAG 
CCAGCAGCTG 
GCAGGTGGAC 
CAGCTGCACC 
GAGCCCCTGC 
AGCCCTGCTC 
TGCAGTGAAC 
GTGCTGGGGT 
TGGGACCCCC 
AGCTTCCTGG 
GGCAGGGACT 
TGGCTGGGAG 
CTCCCTTCAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



1031 



WO 03/042661 



PCT/US02/36810 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



CTTCCTGCTG 
ACCCAGCACT 
TCCCAGCTGA 
CACCCACCAG 
ATTCCAGACC 
GATAACCCAG 
CCAGGGCCCC 
TGCCTCCTCT 
GAGTCCGTGG 
TCCTTGCTCA 
GGCGAGTTCA 
TGGCAGGGAA 
GGGCCCGAGG 
TTTGGGTGGG 
CTGGCCTGGA 
CTCTGATGAG 
CCTGATGTGG 
CTCTCCCTGG 
CGGTTCTGCG 
TGGGGAGCTG 
GTGGGCCTGA 
GTCAGCGTCT 
TGCCCCGTGT 
GCCTGGACGG 
GCTGGGTGGG 
AGGCGGAGCT 
GGCATGAGCC 
TGGAGCACTG 
GCCACAGCCC 
CTGATGGGCA 
TGAAGCCCCT 
TACGTGCCCA 
GGACTTGCAG 
AGAAAGACAG 
CTGGTGCCCT 
CTGCCGCTCC 
CCCCTGCAGC 
CCCCTGCGGC 
CGCCTGCCCT 
GCACTTGCCC 



CACAGAACCC 
GGCTGCTGCT 
CCCAGCCTCC 
TCTCTCCCAG 
TTGTCGCCGG 
CCAAGTGGGG 
GCTGGGCTCT 
GTGCTGCCTG 
GTCTGGGCAG 
CTCAGTCCAG 
GCCCCAGGGA 
AGTGGGTGAA 
GAGCCACAGC 
TTTGGCCGGT 
GGCGGGGGGT 
GCATGATGTC 
ATGGCCTGGA 
AGTTCGACTT 
AGTTTCCGGA 
ACAGGGCAGG 
GGCAGGCAGC 
CCACCCAGGC 
TTGACGAGAC 
CTGGATGGGC 
CCTGAGCTAG 
GCCAGGGGCC 
CCTGGGTGAG 
GTACCTGCTG 
AAGGCAGAGC 
GCATTTTCGG 
CTTGCTGCCC 
GCTCAGGCCG 
AGCCCTACGT 
CCACCAAAAA 
TCAGCCAGGT 
GAACTGAGCC 
ACTGGGCAGA 
CAGCCAGGGA 
TGCCCCACTC 
AGGCCGCCCT 



GGCCCGCCGG 
TGGCAGGGAC 
GGGTCTGAGC 
ACAGCCCGAG 
GCTGACCGTG 
GAAGGTCCAG 
GGGCACGGCG 
CCAGAATGTG 
CGTAGGCAAG 
CATGCTGGCC 
GGTGGACCGC 
CTGAATGCAC 
GCAGGACCAC 



CCACCCCGTG 
GGTAGCGGGC 
CTTCCAAACC 
CCAGCTGGCA 
GAGTTGTGGG 
TCCACAGATG 
GCCCTTGCCG 
CGCCGCCACA 
ACCACCACCA 
AAATCCAGGC 
CCCACAGAGG 
AAGGAGGCCA 
GGAAGGCAGG 
GAAGCCGACG 
GTCATGCAAG 
GCCCCTTGTC 
GGGGATGCTC 
GAGGTGAAGG 
GGGGAAGGGC 
TGAGCCCACC 
CCTGGGGGCA 
CATGAGACAA 
CACGTGAGTC 
TGGGCCTGGG 
CCTGGCTCAC 
TGCAGCTTTT 
CACTGGGCAC 
CTGCCACTCA 
CCTGCCCTAT 
CCCAACTCGG 
CAGGTCGGGG 
GTGGTGCTGG 
CTCATGCTGA 
GCCCCCTACT 
GACCTGGTGC 
GTGCACCTGG 
CACGCCCGGC 
ATGCTGGCCC 
CACATGCCTC 
TGCAATAAAC 



CTGCCTCCTT 
AGGGGCCGGA 
AGCAGGGTAG 
CCACAGCTAT 
ATTCCCAAGA 
CACTCAGCCT 
CGGGCGTCCT 
GGAAGAAGCC 
CCCACCTGGT 
TCCAGAGCCC 
CAGGGCGTTG 
TGCAACAGGG 
GGGTACCCCA 
ATTTGTGCCT 
GGCTGCCCGG 
CCAACTCACT 
AGCAATGGGG 
GCCCCGCTGC 
AGACCCCATG 
CCGCTGGCTC 
CCGTGGACCC 
AAGTGCACCG 
AGGGATGGTC 
CAGCTGGGTG 
GCCGCTGCCT 
CAACTTCAAG 
CGTGGATCTG 
GGTGAGGTGC 
GGGCCATCGG 
CCAGAATCAC 
AGCTGTGCTT 
AGGCTCGAGG 
ACCAGAGGAA 
TCAATGAGGC 
TGGCTGTCTG 
GTGCCCGGGC 
GGCCCATTGC 
TGCAGCCCCG 
TGTCTCCCCG 
GCCTTCTCCT 




CCAGATCAGG 
CTATGCCCGG 
AGGCACGCTC 
GGCTGGGTGG 
GGCCTGGGCA 
CAGATCCCGC 
CGCTTCTCGG 
CAGCATGTTC 
TGGTCACCAG 
AAAGACAGGC 
CCTCCOGGGC 
CTCTCTCCGG 
CCTGCGTCCA 
GTGGAAGAAG 
CTTCACCTTC 
GGACCGCAGC 
CTCGGGGCAG 
CCAGCGGCAC 
CCTTOGCCTG 
CTGAGCCCAG 
GCC 



1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
16B0 
1740 
1B00 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 



Seq ID NO: 197 Protein sequence 
Protein Accession #: XP_028196.1 

1 11 21 31 41 51 

I I I I I I 

MGHPPVSPSA PAPAGTTAIP GLIPDLVAGT PCELWDSQEG CGDNPAKWGL QLSTDALSLA 
STPGPRWALI AGALAAGVLL VSCLLCAACC CCRRHRKKPR DKESVGLGSA RG T TTT HL VR 
SGSLLTQSRE GLKSRLQSPG QRGEFSPRDG LTPTEAGR 

Seq ID NO: 198 DNA sequence 

Nucleic Acid Accession 8: NM_000612.2 

Coding sequence: 553.. 1095 



TTCTCCCGCA 
CTCCCCCCCT 
TTCTGTTTCT 
TCTCCCCCTC 

CCGAAAAGTA 
CAGACGAATT 
ACATTCGGCC 
CCAACGCCCG 
TTCCAGACAC 
GCCTTCGCCT 
CTGGTGGACA 
AGCCGTGTGA 
CTGGCCCTCC 
CCTCCGACCG 
GACACCTGGA 
CGCCGGGGTC 
CCCCTGATTG 
AGCAATCGGA 
CTCCTCCTGA 
CCCCCTGGGG 
CCTCGGCCCC 
GATTGGCTTT 



11 
I 

ACCTTCCCTT 
CACGCCCGCC 
CTCCGTGCTG 
GCCCTCTCTT 
CTCTATCCTT 
CAACATCTGG 
CTCCCCCCCC 
CCCGCGACTC 
CTGTTCGGTT 
CAATGGGAAT 
CGTGCTGCAT 
CCCTCCAGTT 
GCCGTCGCAG 
TGGAGACGTA 
TGCTTCCGGA 
AGCAGTCCAC 
ACGTGCTCGC 
CTCTACCCAC 
AGTGAGCAAA 
CCACGGACGT 
CTTCTCCTGA 
CTCCATCGGG 
AAACACCCTT 



21 
I 

OGCTCCCTCC 
CTCTCGCCTT 
TTCTCTCCCG 
CGGCCCCCCC 
GATACAACAG 
CCOGCCCCAG 
CCCCAAAAAA 
GGCCAGAGCG 
TGCGACACGC 
CCCAATGGGG 
TGCTGCTTAC 
CGTCTGTGGG 
CCGTGGCATC 
CTGTGCTACC 
CAACTTCCCC 
CCAGCGCCTG 
CAAGGAGCTC 
CCAAGACCCC 
ACTGCCGCAA 
TTCCATCAGG 
CCCAGTCCCC 
CTGAGGAAGC 
CACATACCCT 



31 
I 

CGTCCCCCCC 
CGCCGAACCA 
CTGTGCGCCT 
CTTTCACGTT 
CTGACCTCAT 
CCCGAAGACA 
AAAAGCCATC 
GCGCTGGCAG 
AGCAGGGAGG 
AAGTCGATGC 
CGCCCCAGTG 
GACCGCGGCT 
GTTGAGGAGT 
CCCGCCAAGT 
AGATACCCCG 
CGCAGGGGCC 
GAGGCGTTCA 
GCCCACGGGG 
GTCTGCAGCC 
TTCCATCCCG 
GTGCCCCGCC 
ACAGCAGCAT 
CCCCCC 




CCCCCGCTCT 
AGGAGTGTCC 
TGGGCGGCAG 
TGGTGCTTCT 
AGACCCTGTG 
TCTACTTCAG 
GCTGTTTCCG 
CCGAGAGGGA 
TGGGCAAGTT 
TGCCTGCCCT 
GGGAGGCCAA 
GCGCCCCCCC 
CGGCGCCACC 
AAAATCTCTC 
TCCCCGAAAC 
CTTCAAACAT 



CACCTTCTTG 
CGGCGGGGAG 
CAGGCCCGCA 
CAGCTGTGAC 
CGTGTCGACC 
CnCCAATAT 
CCTGCGTGCC 
ACGTCACCGT 
AGAGATGGCC 
ATCCTGCAGC 
GGTTCCACGT 
AGGCTACTCT 
GTACAAAATC 



60 
120 



60 
120 
180 
240 
300 
360 
420 
4B0 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



Seq ID NO: 199 Protein sequence 
Protein Accession #: NPJ)00603.1 



11 



21 



31 



51 

1032 



WO 03/042661 



PCT/US02/36810 



10 
15 
20 
25 
30 
35 
40 
45 
50 



60 
65 
70 
75 
80 



I I I I I I 

MGIPMGKSML VLLTFLAFAS CCIAAYRPSE TLCGGBLVDT LQFVCGDRGP YPSRPASRVS 60 

RRSRGIVEEC CFRSCD1ALL BTYCATPAKS ERDVSTPPTV LPDNFPRYPV GKFFQYDTWK 120 
QSTQRLRRGL. PALLRARRGH VLAKELEAPR EAKRHRPLIA LPTQDPAHGG APPEMASNRK 

Seq ID NO: 200 DNA sequence 

Nucleic Acid Accession #; AK057131.1 

Coding sequence: 61.. 1146 

1 11 21 31 41 51 

I I I I I I 

AGTCTGGGCG TTTAGGTCAG AACTACCCCG GTAGCCTGAC AGCAGGAGCT GGAGAGAAGC 60 

ATGGCTCAGC GGTGOGTTTG CGTGCTGGCC CTGGTGGCTA TGCTGCTCCT AGTTTTCCCT 120 

ACCGTCTCCA GATCGATGGG CCCGAGGAGC GGGGAGCATC AAAGGGCGTC GCGAATCCCT 180 

TCTCAGTTCA GCAAAGAGGA ACGCGTCGCG ATGAAAGAGG CGCTGAAAGG TGCCATCCAG 240 

ATTCCAACAG TGACTTTTAG CTCTGAGAAG TCCAATACTA CAGCCCTGGC TGAGTTCGGA 300 

AAATACATTC ATAAAGTCTT TCCTACAGTG GTCAGCACCA GCTTTATCCA GCATGAAGTC 360 

GTGGAAGAGT ATAGCCACCT GTTCACTATC CAAGGCTCGG ACCCCAGCTT GCAGCCCTAC 420 

CTGCTGATGG CTCACTTTGA TGTGGTGCCT GCCCCTGAAG AAGGCTGGGA GGTGCCCCCA 480 

TTCTCTGGGT TGGAGCGTGA TGGCGTCATC TATGGTTGGG GCACACTGGA CGACAAGAAC . 540 

TCTGTGATGG CATTACTGCA GGCCTTGGAG CTCCTGCTGA TCAGGAAGTA CATCCCCCGA 600 

AGATCTTTCT TCATTTCTCT GGGCCATGAT GAGGAGTCAT CAGGGACAGG GGCTCAGAGG 660 

ATCTCAGCCC TGCTACAGTC AAGGGGCGTC CAGCTAGCCT TCATTGTGGA CGAGGGGGGC 720 

TTCATCTTGG ATGATTTCAT TCCTAACTTC AAGAAGCCCA TCGCCTTGAT TGCAGTCTCA 780 

GAGAAGGGTT CCATGAACCT CATGCTGCAA GTAAACATGA CTTCAGGCCA CTCTTCAGCT 840 

CCTCCAAAGG AGACAAGCAT TGGCATCCTT GCAGCTGCTG TCAGCCGATT GGAGCAGACA 900 

CCAATGCCTA TCATATTTGG AAGCGGGACA GTGGTGACTG TATTGCAGCA ACTGGCAAAT 960 

GAGGTTTATG GAGAGAAATC CCTTAACCAA TGCAATAATC AGGACCACCA CGGCACTCAC 1020 

CATATTCAAA GCAGGGGTCA AGTTCAATGT CATCCCCCCA GTGGCCCAGG CCACAGTCAA 1080 

CTTCCGGATT CACCCTGGAC AGACAGTCCA AGAGGTCCTA GAACTCACGA AGAACATTGT 1140 

GGCTGATAAC AGAGTCCAGT TCCATGTGTT GAGTGCCTTT GACCCCCTCC CCGTCAGCCC 1200 

TTCTGATGAC AAGGCCTTGG GCTACCAGCT GCTCCGCCAG ACCGTACAGT CCGTCTTCCC 1260 

GGAAGTCAAT ATTACTGCCC CAGTTACTTC TATTGGCAAC ACAGACAGCC GATTCTTTAC 1320 

AAACCTCACC ACTGGCATCT ACAGGTTCTA CCCCATCTAC ATACAGCCTG AAGACTTCAA 1380 

ACGCATCCAT GGAGTCAACG AGAAAATCTC AGTCCAAGCC TATGAGACCC AAGTGAAATT 1440 

CATCTTTGAG TTGATTCAGA ATGCTGACAC AGACCAGGAG CCAGTTTCTC ACCTGCACAA 1500 

ACTGTGAGGT CAAGGGGCCT GCTGGGTTAG GCATGCCCGA CCCCGGGACA GGACTAACCC 1560 

AAGGGGGAAA GCTAGTGTTG ATGAAACTTT TGATCAAAAC CACATTGTAA AACATTGCCC 1620 

ATCTGTCTTG CTCACTCTTA AACTCTCCCA AGAACAAGGC CGGGGTAAGG TAAAGTCAGC 1680 

AGAAATCTGG CTTCTCCCTT CCTCCCGACA TCTGCATCCC TTGATCCACT GGCATTTGCT 1740 

GCCCTCTTGT CCCTTATCTG TCTTATGCTG GTTATTTCAC TGCTTCACCT TCCAGGCTTG 1800 

ACTTAACAAA TGTAGATTTG AGAAATCTCA ACCAGTTGTT ACCTGATAGG AGTCTTTAAT 1860 

TTAGGGCACT CTTGCTGGGA TGCTTTCTCC AGAGCTTATA TATTTCTTCT TACTAGAACT 1920 

TTCTTCCCCC TTTTATTCCC CTCTCTTCTT GGACTCATGA GCTGTCTCTT CATCTCTCCT 1980 

CTCTCTCCTG CATCTCTCCC CTTACTCTTC AATTTATTCT ACTTCTGGAC CTGGACTTAC 2040 

CCAAACTGTG ATACTACCAT AATTGTCACC ATAATCAGTC AAATAAAGTG ATCTGTGCAT 2100 
C 

Seq ID NO: 201 Protein sequence 
Protein Accession : BAB71368.1 



1 11 21 31 41 51 

I I I I I I 

MAQRCVCVLA LVAMLLLVFP TVSRSMGPRS GEHQRASRIP SQFSKEERVA MKEALKGAIQ 60 

55 IPTVTPSSEK SNTTAUVEFG KYIHKVFPTV VSTSFIQHEV VEEYSHLFTI OGSDPSLQPY 120 

LLMAHFDWP APEEGWEVPP FSGLERDGVI YGWGTLDDKN SVMALLQALE LLLIRKYIPR 180 

RSFFISLGHD EESSGTGAQR ISALLQSRGV QLAFIVDEGG FILDDFIPNF KKPIALIAVS 240 

EKGSMNLMLQ VNMTSGHSSA PPKETSIGIL AAAVSRLEQT PMPIIFGSGT WTVLQQLAN 300 

EVYGEKSLNQ CNNQDHHGTH HIQSRGCYQC HPPSGPGHSQ LPDSPWTDSP RGPRTHEEHC 360 
G 



Seq ID NO: 202 DNA sequence 

Nucleic Acid Accession #: NM_004217.1 

Coding sequence : 5 B . . 10 92 

1 11 21 31 41 51 

I I I I I I 

GGCCGGGAGA GTAGCAGTGC CTTGGACCCC AGCTCTCCTC CCCCTTTCTC TCTAAGGATG 60 

GCCCAGAAGG AGAACTCCTA CCCCTGGCCC TACGGCCGAC AGACGGCTCC ATCTGGCCTG 120 

AGCACCCTGC CCCAGCGAGT CCTCCGGAAA GAGCCTGTCA CCCCATCTGC ACTTGTCCTC 180 

ATGAGCCGCT CCAATGTCCA GCCCACAGCT GCCCCTGGCC AGAAGGTGAT GGAGAATAGC 240 

AGTGGGACAC CCGACATCTT AACGCGGCAC TTCACAATTG ATGACTTTGA GATTGGGCGT 300 

CCTCTGGGCA AAGGCAAGTT TGGAAACGTG TACTTGGCTC GGGAGAAGAA AAGCCATTTC 360 

ATCGTGGOGC TCAAGGTCCT CTTCAAGTCC CAGATAGAGA AGGAGGGCGT GGAGCATCAG 420 

CTGCGCAGAG AGATCGAAAT CCAGGCCCAC CTGCACCATC CCAACATCCT GCGTCTCTAC 480 

AACTATTTTT ATGACCGGAG GAGGATCTAC TTGATTCTAG AGTATGCCCC CCGCGGGGAG 540 

CTCTACAAGG AGCTGCAGAA GAGCTGCACA TTTGACGAGC AGCGAACAGC CACGATCATG 600 

GAGGAGTTGG CAGATGCTCT AATGTACTGC CATGGGAAGA AGGTGATTCA CAGAGACATA 660 
AAGCCAGAAA ATCTGCTCTT AGGGCTCAAG GGAGAGCTGA AGATTGCTGA CTTCGGCTGG , 720 

TCTGTGCATG CGCCCTCCCT GAGGAGGAAG ACAATGTGTG GCACCCTGGA CTACCTGCCC 780 

CCAGAGATGA TTGAGGGGCG CATGCACAAT GAGAAGGTGG ATCTGTGGTG CATTGGAGTG 840 

CTTTGCTATG AGCTGCTGGT GGGGAACCCA CCCTTTGAGA GTGCATCACA CAACGAGACC 900 

TATCGCCGCA TCGTCAAGGT GGACCTAAAG TTCCCCGCTT CTGTGCCCAC GGGAGCCCAG 960 

GACCTCATCT CCAAACTGCT CAGGCATAAC CCCTCGGAAC GGCTGCCCCT GGCCCAGGTC 1020 

1033 



WO 03/042661 



PCT/US02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



TCAGCCCACC CTTGGGTCCG GGCCAACTCT CGGAGGGTGC TGCCTCCCTC TGCCCTTCAA 1080 

TCTGTCGCCT GATGGTCCCT GTCATTCACT CGGGTGCGTG TGTTTGTATG TCTGTGTATG 1140 

TATAGGGGAA AGAAGGGATC CCTAACTGTT CCCTTATCTG TTTTCTACCT CCTCCTTTGT 1200 
TTAATAAAGG CTGAAGCTTT TTGT 

Seq ID NO: 203 Protein sequence 
Protein Accession #: NP 004208 



MAQKENSYPW 
SSGTPDILTR 
QLRREIEIQA 
MEELADALMY 
PPEMIEGRMH 
QDLISKLLRH 



11 
I 

PYGRQTAPSG 
HFTIDDPEIG 
HLHHPNILRL 
CHGKKVIHRD 
NEKVDLWCIG 
NPSERLPLAQ 



21 
I 

LSTLPQRVLR 
RPLGKGKFGN 
YNYFYDRRRI 
IKPENLLLGL 
VLCYELLVGN 
VSAHPWVRAN 



31 
I . 

KEPVTPSALV 
VYLAREKKSH 
YLILEYAPRG 
KGELKIADPG 
PPFESASHNE 
SRRVLPPSAL 



41 



51 



LMSRSNVQPT AAPGQKVMEN 
FIVALKVLFK SQIEKEGVEH 
ELYKELQKSC TFDEQRTATI 
WSVHAPSLRR KTMCGTLDYL 
TYRRIVKVDL KFPASVPTGA 
QSVA 



Seq ID NO: 204 DNA sequence 
Nucleic Acid Accession fh AK05S663 
Coding sequence : 38.. 1423 



1 
I 

AGAACGGCTT 
AAAACCACAA 
CCGAAGGTCC 
GCTTATGTGG 
TTTTGATCTT 
TAGCCCTGTC 
AGTCTTGGCA 
ACAGCCCGAG 
CCTGTTCACG 
TACGAGCTGG 
GGGACTTAGC 
AGCATTTGCT 
CACTGCCTCT 
GTACAGTGGG 
ACTCATCAGA 
GACCCTAGGT 
TGAACAAATG 
TGTTCAAATT 
CAATGTCCTA 
TGATTTGAAC 
ATTTAACACT 
TTATGGTTTT 
TGGAGTTCCA 
TAGATATGGA 
TTATAAGGAA 
TGTTTAATCA 
TATGAAACTA 
GCTTTAAATA 
GTTTTGTAGT 
AGATCTGTCA 
TGCCACTGTG 
CTTAGTTTTT 
ATGCAGTGGC 
TGCCTCAGCC 
TGTATTTTTA 
ACCTCATGAT 
CCTGGCCGAT 
GGGAAAGGGA 
AATTGCTAAA 
TTTTTAGCAG 
GATTTTTGTT 



11 
I 

CCGGCGGGAG 
AGATCCTTTT 
TGGAAGATAC 
TGCAGTTCTA 
TTTAGTTTAA 
TATTCATTTG 
CAGTTGGGAG 
ATACACACGG 
ATGCTTTCTA 
CTTCAAGAGC 
AGTATCTTCC 
CTTTGTATTA 
GCTATAGCTA 
AAAGTCTTAC 
GAGGTATCTA 
TTTGGCTCAT 
GTTCTTGCTC 
TTCAAGGATG 
AACTTTTCAG 
CCAGTTACAT 
CCTGGGAAAA 
GGTCTCAATC 
GGAATTGGAG 
ACTAATAATA 
TATTGACTCC 
TTTACTCTAA 
TATTTTTGTA 
GGCTTCCTTT 
TGACTGCAGT 
CATTACTAAG 
CCCGGCCAAT 
GTTTTGTTTT 
ATGATCTCAG 
TCCCGAGTAG 
GTAAAGACGG 
CCACCCACCT 
ATTTTCTTTA 
AAAATGTCTG 
TTTTTCTTTG 
AAATTTTGGA 
AAAGTTTCTC 



21 
I 

CTGTGCAGCT 
TTGGCAAGTT 
TGCTCTTTGG 
CTAATAGTAT 
TGACATGTTT 
GGTTTGAAAG 
CTCTCTTTAT 
GAAGATTATT 
TTCGGAATAA 
ATGTTGCAGA 
TTCCCCGAAT 
CATATATGCT 
TTGCCTTGAT 
TCCAGACAAC 
CCTTAGATGG 
TGGCTGGATC 
ATGTGACCAA 
ACTGGATTAG 
ATCATCACGT 
CAACTCCAGC 
ATGTGAACCC 
ATGGACACAC 
CAACTCAAGG 
GAATTGGACA 
TTGGCTTCCA 
ATGTTAGATA 
AAATGTATTT 
AGAAAATGTG 
GTGATGTGAC 
ATACGATATT 
ACATTATTAT 
GTTTTTTGAG 
CTCACTGCAA 
CTGGGATTAC 
GGGATTTCAC 
TAGCCTCCCA 
ATGAAATTTA 
TTCAAAAAGT 
AGGTTCTCCT 
ATACATTCTA 
TCCTTTAAAA 



31 
I 

CCTTATCATG 
GTTACGGGAA 
TGTAATAAAC 
AGCTTTAACT 
AATAAGTTAC 
ATTAGAAGTC 
ATTAAAAGAA 
AGTTGGTACT 
ACCTTTTGCT 
TCTTAGTCGA 
GAATCCATTT 
CATTGAAATT 
GACATTTGGC 
ACCACCCCAT 
AGTTTTAGAA 
AGTGCATGTA 
CAGGCTGTAC 
GCCTGCCTTA 
AATCCCAATG 
TAAACCTAGT 
AGTTATTCTT 
ACCTXACAGC 
ATTGAGGACT 
ACCAAGACCA 
ATTTATTTAG 
ATAGTAGTCT 
GTGAGAGTGA 
TTTCTTTAAA 
CTTACCTTTA 
TCTTTTTTTT 
TAACTTAAGG 
ATGGAGTCTC 
CCTCTGCCTC 
AGGCACCTGC 
CATGTTGGCC 
AAGTGCTGGG 
TAAATATGCT 
AAAGGTCTCT 
GAATTATGTC 
TCTAGCACAA 
ATTTTAGTAC 



41 
I 

GGGACAATTC 
TTTAGACTTG 
TTGATATGTA 
GCCTATACTT 
TGGGTAACAT 
CTGGCTGTAT 
AGTGCAGAAC 

TATGTCTCAG 
AGCTTGTGTG 
GTTTTGATTG 
AATAATTATT 
ACTATGTATC 
GTTATTGGTC 
GTCCGAAATG 
AGAATTOGAC 
ACTCTAGTGT 
TTGTCTGGGC 
CCTCTTTTAA 
AGTCCACCTC 
CTAAACACAC 
AGCATGCTTA 
GGTTTTACAA 
TGATAGACTC 
TAATCCAACT 
TGTTCACATT 
AATCCTCGTA 
TTTGGATTTT 
TAAGAGCCAC 
TCCGAGACGG 
CTGTACTTTA 
ACTCTGTCGC 
CTGAGTTCAA 
CACCACGCCC 
AGGCTGGTCT 
ATTAGGTGTG 
TCTTGAATAA 
TTTATAGCTT 
TTACAAACTA 
TTTGAATTTT 
ATTTGTAAAT 



51 
I 

ATCTCTTTCG 
TAGCAGCTGA 
CTGGCTTCCT 
ACCTGACCAT 
TGAGGAAACC 
TTGCCTCCAC 
GCTTTTTGGA 
TTTGTTTCAA 
AAGCTGCTAG 
GAATTATTCC 
ATCTTGCTGG 
TJGCCGTAGA 
CCATGAGTGT 
AGTTGGACAA 
AACATTTTTG 
GAGATGCCAA 
CTACTCTAAC 
CTGTTGCAGC 
AGGGTACTGA 
CAGAATTTTC 
AAACAAGGCC 
ATCAAGGACT 
ATATACCAAO 
TAACTTATTT 
TTGCATTGAC 
TCATGAAACC 
AATGTTAAAG 
GGTATCTTTG 
TTGATGGAGT 
AGTCTTGCTC 
TTAAGGCTTC 
CCAGGCTGGA 
ATGATTCTCC 
AGCTAATTTT 
TGAACTCCTG 
AGCCACCGCA 
TACACATTTT 
TTCCAAACTT 
AAAGCAAAAA 
TAATTATCAA 



Seq ID NO: 205 Protein sequence 
Protein Accession #: BAB70980.1 



1 
I 

MGTIHLPRKP 
TAYTYLTIFD 
ESAERFLEQP 
RSLCGIIPGL 
GTMYPMSVYS 
VRIRRDANEQ 
MPLLKGTDDL 
SSMLNQGLGV 



11 
I 

QRSFFGKLLR 
LFSLMTCXIS 
EIHTGRLLVG 
SSIFLPRMNP 
GKVLLQTTPP 
MVLAHVTNRL 
NPVTSTPAKP 
PGIGATQGLR 



21 
I 

EFRLVAADRR 
YWVTLRKPSP 
TFVALCFNIiF 
FVLIDLAGAF 
HVIGQLDKLI 
YTLVSTLTVQ 
SSPPPEFSFN 
TGFTNIPSRY 



31 
I 

SWKILLFGVI 
VYSFGFERLE 
TMLSIRNKPF 
AI/CITYMLIE 
REVSTLDGVL 
IFKDDWIRPA 
TPGKNVNPVI 
GTNNRIGQPR 



41 
I 

NLICTGFLLM 
VLAVFASTVL 
AYVSEAASTS 
INNYFAVDTA 
EVRNEHFWTL 
LLSGPVAANV 
LLNTQTRPYG 
P 



Seq ID NO: 206 DNA sequence 

Nucleic Acid Accession #: NM_016361.1 

Coding sequence : 3 97 1 6 6 2 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 



51 
I 

WCSSTNSIAL 
AQLGALFILK 
WLQEHVADIiS 
SAIAIALMTF 
GFGSLAGSVH 
LNFSDHHVIP 
FGLNHGHTPY 



60 
120 
180 
240 
300 
360 
420 



11 



21 



31 



51 

1034 



WO 03/042661 



PCT/US02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



GGAACTCAGG 
TTTAATGTTC 
AGGAGCGACC 
TTGTTCTCTC 
TGCTTGCTGA 
TGAGCCCCGG 
CGAGGGCCTA 
GTCCTGACCT 
GAGGCCGATG 
GTGTTTCGAC 
TGGAACCCCC 
CTAGCTGGTG 
AAGGGGGGCA 
GGAGAGAGAC 
CCACAGGAGG 
TTGCTGGCTG 
GCAGATTCAG 
AGAGGCCGGA 
AAGGACAGGA 
GTGGCTGCCG 
ATGATCGAAC 
AGTCTTCAGA 
ATGGACTCTO 
GTGACCTTCA 
GCTGTTGACC 
CTCTATTACC 
GACATGTTCT 
TGCTCTCAAA 
ATGTGTTGAT 



I 

GCCGGCTCCT 
CTCGTCGGGC 
AACCCCGACC 
CCCCGTTCCC 
AGGGCTGGAT 
CGCCCGCAGA 
CCACCATGAT 
CGCTGGCGTA 
GCCAGTGTCC 
ACGGGGCTCG 
AGCTATTAGA 
GTCCGAAACC 
TGTTTGCTGG 
TGAGGAAGAA 
TCTTTATTCG 
GGCTTTTCCA 
AAGTCTTGTA 
GGCAGACTGC 
TGGGCATTGA 
AGCAGGCACA 
AGAGAGCTGT 
TGGCAGTAGG 
CCACTGCCCC 
TACCGCTCTT 
TGACCATGGA 
ACGGGAAGGA 
TGAATGCCAT 
CTCAGGTGAT 
TTTAAAATAA 



GTTCCTTCAA 
AAAAGATAAG 
CGGGTTAAAA 
ACTCGGGGTC 
GTACGCATCC 
AGACTTGTGT 
CACTGGTGTG 
CTGCCTGCAC 
GGTCGACCGC 
GAGTCCTCTC 
GGTCCCACCC 
ATATTCTCCT 
GCAGCTGACC 
CTATGTGGAA 
TTCCACTAAC 
GTGTCAGAAA 
TCCCAACTAC 
CTCTTTACAG 
CAGTAGTGAT 
CAACCTCCCA 
GGACACATCC 
CCCATTCCTC 
CGACAAGATC 
AATGACCCTG 
ACTTTACCAG 
GCAGGTGCCG 
GTCAGTTTAT 
GGAAGTTGGA 
AGTGCCTTTA 



GAGTGCTGGA 
GATCCGATCT 
CTCCCAGGGA 
TCCCTCAGGG 
GCAGGTTCCC 
TTGCCTCCTG 
TTCAGCATGC 
CAGCGGCGGG 
AGCCTGCTGA 
AAGCCGCTCC 
CAAACTCAGT 
TACGACTCTC 
AAGGTGGGCA 
GACATTCCCT 
ATTTTTCGGA 
GAAGGACCCA 
CAAAGCTGCT 
CCAGGAATCT 
AAAGTGGACT 
AGCTGCCCCA 
TTGTACATAC 
CACATCCTAG 
AGAAAGCTGT 
GGGATTTTTG 
CACCTGGAAT 
AGAGGTTGCC 
ACCTTAAGCC 
AATGAAGAGT 
TACAAAAAAA 



GGCCAAACTT 
CCCCCGGCCC 
CTCTTCGCTG 
CCGGGAGGCA 
GCGGACTTGG 
CAGCCTCAAC 
GCTTGTGGAC 
TGGCCCTGGC 
AGTTGAAAAT 
CGCTGGAGGA 
TTGATTACAC 
AATACCATGA 
TGCAGCAAAT 
TTCTTTCACC 
ATCTGGAGTC 
TCATCATCCA 
GGAGCCTGAG 
CAGAGGATTT 
TCTTCATCCT 
TGCTGAAGAG 
TGCCCAAGGA 
AGAGCAACCT 
ATCTCTATGC 
ACCACAAATG 
CTAAGGAGTG 
CTGATGGGCT 
CAGAAAAATA 
AACTGATTTA 
AAAAAAAAAA 



I 

GAAATACAAG 
GGTGTGCAGC 
CTGCCACCTC 
CAGCGGTCCC 
GGGCGCCCGC 
CCGGAGGCAG 
CCCAGTGGGC 
CGAGCTGCAG 
GGTGCAGGTC 
GCAGGTAGAG 
AGTCACCAAT 
GACCACCCTG 
GTTTGCCTTG 
AACCTTCAAC 
CACCCGTTGT 
CACTGATGAA 
GCAGAGAACC 
GAAAAAGGTG 
CCTGGACAAC 
ATTTGCACGG 
AGACAGGGAA 
GCTGAAAGCC 
GGCTCATGAT 
GCCACCGTTT 
GTTTGTGCAG 
CTGCCCGCTG 
CCATGCACTC 
TAAAAGCAGG 
A 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 



Seq ID NO: 207 Protein sequence 
Protein Accession #: NP_057445-1 



1 
I 

MRLWTPVGVL 
LPLEEQVEWN 
GMQQMFALGE 
PIIIHTDEAD 
DPFILLDNVA 
LESNLLKAMD 
ESKEWFVQLY 
E 



11 
I 

TSLAYCLHQR 
PQLLEVPPQT 
RLRKNYVEDI 
SEVLYPNYQS 
AEQAHNLPSC 
SATAPDKIRK 
YKGKEQVPRG 



21 
I 

RVALAELQEA 
QFDYTVTNLA 
PFLSPTFNPQ 
CWSLRQRTRG 
PMLKRPARMI 
LYIiYAAHDVT 
CPDGLCPLDM 



31 
I 

DGQCPVDRSL 
GGPKPYSPYD 
EVFIRSTNIF 
RRQTASLQPG 
EQRAVDTSLY 
FIPLLMTLGI 
FLNAMSVYTL 



41 
I 

LKLKMVQWP 
SQYHETTLKG 
RNLESTRCLL 
ISEDLKKVKD 
ILPKEDRESL 
PDHKWPPPAV 
SPEKYHALCS 



51 
I 

RHGARSPLKP 
GMFAGQLTKV 
AGLFQCQKEG 
RMGIDSSDKV 
QMAVGPFLHI 
DLTMEIiYQHb 
QTQVMEVGNE 



Seq ID NO; 208 DNA sequence 

Nucleic Acid Accession #: CAT cluster 



TTTGAGGGGG 
TAACAGCCTG 
TGCTGGGGGC 
TAGTGGCAGA 
TCTGGCTGCT 
GGAGCCCTCC 
GGTCTTTATT 
GCAGGTTCCT 
TGAGCTGGCA 



CCCTTGGAGA 
CAGTGTTAGG 
GCCTGGAGTG 
CTTGCATTTG 
GCAGATCAGT 
GGCAAATGGG 
CACTGCTCCT 
GCGCAG 



21 
I 

AGTTTAATTC 
GAAGTCCTTC 
CTTCAGGCCA 
ATGAGTGGGA 
CATTTGCCAC 
CCGCTCAGCT 
TCATTGGCTT 
TCACCAGTGT 



31 
1 

ATAAAGAAGC 
CTTGAGGATA 
TCCCTGGAGG 
TGGCCTTCTC 
TCAGAACTGC 
GCAGGTTTTT 
CCAAGGCAGT 
CCTGCGAGGT 



41 
I 

CTCCTGATCA 
AGGCCTCCCA 
CCAGTCCTGT 
AGGTACAGGA 
CGCGATCCCA 
CCAGTCATAG 
CAGGCCAACT 
CACCTTGGCG 



Seq ID NO: 209 DNA sequence 
Nucleic Acid Accession #: 
Coding sequence: 1..564 



FGENESH predicted 



60 
120 
180 
240 
300 
360 
420 



51 
I 

GAAAGGGGCC 
GGGGAGGAGG 
GCTCAGCAAG 
CTGTGCTGCT 
GCAATGGCCA 
TAGAAGGGAT 
GTGTGACTCT 
AGGGCTCACC 



60 
120 
180 
240 
300 
360 
420 
480 



ATGGAGCCCT 
TCAGATCCGT 
CAGAGTCACA 
GATCCCTTCT 
CTGGOCATTG 
AAGCAGCACA 
ACCTTGGCCA 
AGCTACACAT 
GATGCTGCTG 
GGAACGAAGG 



11 

I 

GGGCGTGGCT 
TCTCTGCGCT 
CAGTTGGCCT 
ACTATGACTG 
CTGGGATCGC 
GTCCTGTACC 
AATCAAATAA 
CAGTCTTCCG 
CCCTCACATC 
GCTGTACCTA 



21 
I 

GCAGGGTTTA 
GCCAGCTCAG 
GACTGCCTTG 
GAAAAACCTG 
GGCAGTTCTG 
TGAGAAGGCC 
ACCTTTATCT 
AGTGCCTCTT 
AGGCCATCCA 
ATGA 



31 
I 

AAGAGCCGAC 
GACACTGGTG 
GAAGCCAATG 
CAGCTGAGCG 
AGTGGCAAAT 
ATCCCACTCA 
CCAAGCACCT 
TCTGCGTCCC 
AGCATGCAGA 



41 



51 



CCACGTGCCC 
AAGGAGCAGT 
ACCCATTTGC 
GACTGATCTG 
GCAAATGCAA 
TCACTCCAGG 
TTGTCTTGGT 
TGTACCCTGC 
ACATAAGCAT 



AGCAGCCTCC 
GAGGAACCTG 
CAATAAAGAC 
CGGAGGGCTC 
GAGCAGCCAG 
CAGATTTCTC 
GTTTGGCATC 
CATTCCTGGT 
GCAGAACACT 



60 
120 
180 
240 
300 
360 
420 
480 
540 



Seq ID NO: 210 Protein sequence 
Protein Accession ft: FGENESH predicted 

1 11 21 31 41 si 

I I I 1 I I 

MEPWAWLQGL KSRPTCPAAS SDPFSALPAQ DTGEGAVRNL QSHTVGLTAL EANDPFANKD 
DPFYYDWKNL QLSGLICGGL LAIAGIAAVL. SGKCKCKSSQ KQHSPVPEKA IPLITPGRFL 

1035 



60 
120 



WO 03/042661 



TLAKSNKPLS PSTFVLVFGI SYTSVFRVPL SASLYPAIPG DAAALTSGHP SMQN2SMQNT 1BO 
QTKGCT 



Seq ID NO: 211 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted 

Coding sequence: 1..318 



ATGCCCGGCC ACCCCGTCTG TGAAGTGAGG AGCACCTCTG CCCGGCTGCC CCGTCTGGGA 60 

AGTGAGGAGC GCCTCTGCCC GGCTGCCACC CCGTCTGTGA GTGCCTGCTG CGCTGGGCCC 120 

AGGCCGCCCG TGCCCTGCCA GGCCCTCCGG CCCCCCACCT TCCACCCCAG GGCCTGCTCC 180 

TCACCCCAGG GTTCCATCTC CCTAGTTTCC ACCAGAGACT GGGTCTTCAT TCTCACCCTG 240 

CTACACAGCC CCTACCAGAA CGTTCTGAAA TGCAAACCTA ACAACTGTCT CACCCCAGCA 300 

GGAAACTCCC CAGGGTCCCG GGCCCCCTGC GGGGTTGCAG GCCTCACTCT TCGCGCCCAT 360 

CCCTCCGCCC TGACCGCCCT GAGCTCGCCC CCAGTGCTGG CCCTTCACGT CCAGTTATCC 420 

CTCCCAGCCT CCAAGGTCCC CGTTACCGAA GACCGCCACC ATCACGACAT AGCGCAGCAC 480 

ATATGGGACA CTGGTGAAGG AGCAGTGAGG AACCTGCAGA GTCACACAGT TGGCCTGACT 540 

GCCTTGGAAG CCAATGACCC ATTTGCCAAT AAAGACGATC CCTTCTACTA TGACTGGAAA 600 

AACCTGCAGC TGAGCGGACT GATCTGCGGA GGGCTCCTGG CCATTGCTGG GATCGCGGCA 660 

GTTCTGAGTG GCAAATGCAA ATGCAAGAGC AGCCAGAAGC AGCACAGTCC TGTACCTGAG 720 

AAGGCCATCC CACTCATCAC TCCAGGCAGA TTTCTCACCT TGGCCAAATC AAATAAACCT 780 

TTATCTCCAA GCACCTTTGT CTTGGTGTTT GGCATCAGCT ACACATCAGT CTTCCGAGTG 840 

CCTCTTTCTG CGTCCCTGTA CCCTGCCATT CCTGGTGATG CTGCTGCCCT CACATCAGGC 900 
CATCCAAGCA TGCAGAACAT AAGCATGCAG AACACTGGAA CGAAGGGCTG TACCTAA 

Seq ID NO: 212 Protein sequence 
Protein Accession #: FGENESH predicted 



1 11 21 31 41 si 

I I I I I I 

MPGHPVCEVR STSARLPRLG SEERLCPAAT PSVSACCAGP RPPVPCQALR PPTFHPRACS 60 
SPQGSISLVS TRDWVFILTL LHSPYQNVLK CKPNNCLTPA GNSPGSRAPC GVAGLTLRAH 120 
PSALTALSSP PVLALHVQLS LPASKVPVTE DRHHHDIAQH IWDTGEGAVR NLQSHTVGLT 160 
ALEANDPFAN KDDPFYYDWK NLQLSGLICG GLLAIAGIAA VLSGKCKCKS SQKQHSPVPE 240 
KAIPLITPGR PLTLAKSNKP LSPSTFVLVF GISYTSVFRV PLSASLYPAI PGDAAALTSG 300 
HPSMQNISMQ NTGTKGCT 

Seq ID NO: 213 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted 

Coding sequence: 1..1758 



1 11 21 

I I ! 

ATGATGGGGT CTCATGTTGC CCAGGCTGGT 
GCCTTGGCCT CCGAAAGTGC TGGGATTACA 
GAGGATGTGG CTGTGGACTT CACTTGGGAG 
GACCTGTACC GGGATGTGAT GTTGGAGAAC 
GCCAGCAAAC CGGATGCACT CTTCAAGTTG 
GATGGAATCC ACAGTGGAGC CTGTTCAGGT 
TCTGTGCCCT TCACTCTTCA AAATTGCCTT 
CTGGAGCGCT TGCAGAGTGA AAGCCTGGTG 
GCATTTGAAA ATATTGTTCA TTGCAGCAAA 
ATATTTGACT TACGTGGAAA AAGTTTGAAA 
GGCTATGAAA TAAAGAACTC TGTTGAGTTT 
AACCATGAAC GACTTCATAC TGCAATTAAA 
AAGTCCCAAT TCATCAGTCC CAAGCATCAG 
TGCAGTGAAT GTGGGAAAGC CTTCATCAAG 
CATACAGGAG AGAAACCCCA CAGATGTAGT 
ATGCTTACTG AACATCAGCG AACTCATACA 
GGCAAAGCCT TTCTCAAGAA ATCACGGCTC 
AAACCCTATA TATGCAGTGA ATGTGGAAAA 
CACCAGCGAA TTCATACAGG TGAGAAACCT 
ATTCAGAAGA CGTGTCTCAT AGCACATCAG 
TGCAGTGAAT GTGGAAAATC CTGTTCTCAG 
CACACAGGAG AGAAACCCTT TGAATGTAGT 
AAGCTCATTG TCCATCAAAG GACTCATACA 
GGGAAAGCGT TTGCGTATAT GTCGTGTCTG 
AAACAAGAGG CAGCCAAGGT GGAAAATCCT 
AGTGATGTCA TGCAGGAGAA AAACTCTGCT 
GCCCCTCAGA CATCATTAAA CATCAGCGGC 
GGACAGCCAG TGGTCAGATG TGCAGCCTCA 
AACCTTGTGA ATGCAGTGAA TGTGGTTGTG 
GTTACAGAAA ACCCATAG 



31 41 51 

t I i 

CTTGAACTCC TGGGCTCGAG TGACCCTCCT 60 

GGACTGTTAT TACAGGAATC CATAACACTG 120 

GAGTGGCAAC TCCTGGGCGC TGCTCAGAAG 180 

TACAGCAACC TGGTGGCAGT GGGGTATCAA 240 

GAACAAGGAG AACAACTGTG GACAATTGAA 300 

TCTCCAAAGG TCCCGTTCTC CATTTTCTCA 360 

CATTCTAACA TATGGAAAGT TGATCATGTG 420 

AACAGAAGGA AACCATGTCA TGAACATGAT 480 

AGTCAGTTTC TGTTAGGGCA AAATCATGAT 540 

TCCAATTTAA CTTTAGTTAA CCAGAGCAAA 600 

ACTGGAAATG GGGACTCCTT TCTTCATGCT 660 

TTCCCTGCAA GTCAAAAACT CATCAGCACT 720 

AAAACACGAA AATTAGAGAA GCATCATGTG 780 

AAGTCTTGGC TAACTGATCA CCAGGTAATG 840 

CTATGTGAGA AAGCCTTCTC CAGAAAGTTC 900 

GGAGAAAAAC CTTATGAATG OCCTGAATGT 960 

AACATACATC AGAAAACACA TACOGGAGAG 1020 

GGCTTCATCC AGAAAGGAAA TCTCATTGTA 1080 

TATATATGCA ATGAATGTGG AAAAGGCTTC 1140 

AGATTTCACA CAGGAAAGAC GCCCTTTGTG 1200 

AAATCAGGTC TCATTAAACA TCAAAGAATT 1260 

GAATGTGGGA AAGCCTTTAG CACAAAGCAA 1320 

GGAGAGAGAC CCTATGGCTG TAACGAGTGT 1380 

GTTAAGCATA AGAGAATACA CACAAGGGAG 1440 

CCTGCAGAGA GGCACAGCTC ATTACACACC 1500 

AACGGGGCGA CTACACAAGT GCCTTCTGTG 1560 

CTCCTCGCAA ACAGGAACGT AGTCCTTGTG 1620 

GGAGATAACA GAGGATTTGC ACAGGACAGA 1680 

CCTTCCGTGA TCAATTATGT CTTATTTTAT 1740 



Seq ID NO: 214 Protein sequence 
Protein Accession #: FGENESH predicted 

} 11 21 31 41 si 

I I I I | | 

MMGSHVAQAG LELLGSSDPP ALASESAGIT GLLLQESITL EDVAVDFTWE EWQLLGAAQK 60 
DLYRDVMLEN YSNLVAVGYQ ASKPDALFKL EQGEQLWTIE DGIHSGACSG SPKVPFSIFS 120 
SVPPTLQNCL HSNIWKVDHV LERLQSESLV NRRKPCHEHD AFENIVHCSK SQFLLGQNHD 180 
IFDLRGKSLK SNLTLVNQSK GYEIKNSVEF TGNGDSFLHA NHERLHTAIK FPASQKLIST 240 

1036 



WO 03/042661 



KSQFISPKHQ KTRKLEKHHV CSEOGKAFIK KSWLTDHQVM HTGEKPHRCS LCEKAFSRKF 300 

MLTEHQRTHT GEKPYECPEC GKAFLKKSRL NIHQKTHTGE KPYICSECGK GFIQKGNLIV 360 

HQRIHTGEKP YICNECGKGP IQKTCLIAHQ RFHTGKTPFV CSECGKSCSQ KSGLIKHQRI 420 

HTGEKPFECS ECGKAFSTKQ KLIVHQRTHT GERPYGCNEC GKAFAYMSCL VKHKRIHTRE 480 

KQEAAKVENP PAERHSSLHT SDVMQEKNSA NGATTQVPSV APQTSLNISG LLANRNWLV 540 
GQPWRCAAS GDNRGFAQDR NLVNAVNWV PSVINYVLFY VTENP 

Seq ID NO: 215 DNA sequence 

Kucleic Acid Accession NM_032190.1 

Coding sequence: 502.. 1332 

1 11 21 31 41 51 

I I I I I I 

GATTCCGTGT TCTTGGCCAT GTTAGCCATA ATATCCTGTG CAGTATGTTT TTCCT GTGCA 60 

GAGGCAAAAA CATATTGGGC ATATGTTCCC AAGCCCCCAG CAGTATGACC CATACTTTGG 120 

AGTGACACTC CTCCTAAGAT TTATCATGAT TAAGGAGCAT GGGCTCCAGG ACCCCTAACT 180 

CCACCTGACA TAGAACAGTT AGACTCTCAG AATAATGTCA TTAATTATAC OGCTCCATTG 240 

GAAGGACTTC CTTTGTGTGT CACCACAAAG ACATCACTCA GCCATAGCTG TCTTACAGTT 300 

CAAGCTCACA CATGGTTGAG TCACTATGGG AAAATCATGT ACTTATTAAG TCTTGGTTAT 360 

ATTAATGTAA CCGGTGTGCT AACCAACCAT TCCTGGCCCA ATCGCCTTCA TTGTGCTGAC 420 

TATACAGAAT GGATTCCCTT CAATAGTTCC TACCCCCCTC CATAGACCCA GTGTCTTGGC 480 

CCACTGGCTA GAAAACAATC TATGTTAACT GGAGACATTG TGGATTGGGG ACCTAAAGGC 540 

CAATTAGATG GAAAAGAAGA AAATCAGAAA TCGTGGCACA AACTTTGCTG GCATTGGTGG 600 

CAAGCTTTTA ATGCTTCTTC TTTATATAAC ACTGGGATCC AATCCCAGTC GGCCGCCCAG 660 

ATTGCTTGGC ATGGAGCAGG CTTTAGCCCG CCTCTTCCTC AGTGGCATTA TCTAGGGAGG 720 

AAAGGACCAA TTCAAAAGAT GATATGGAAG GCAGCATTCC CATTTATGAA TGGCAACATC 780 

TGGGTTGCCA TAATACTATC CAATAATAGC AATAGTAAGC AACACAGTCT TAATGTTACA 840 

TTTGTAAAGA ATATCACCAC TCAATTTACA GTTTGTGTTT TTAATCCTTA TGTGTTTTTG 900 

GCAGCTAAGA AGGACCAGCT CCAGGTAAAC AATACCCAAT TGACCTGTAA ATCTTGCCAG 960 

TTATATCACT GCATTAATCA TAGCACATTG CAAACACATA ATATCTCTAC TTTGATGATT 1020 

TTAGGTTGCA TCCCTGGGCT ATGGATTCCT GTTAATCTGT CTGAGCCATG GGCTGCCACA ' 1080 

ATTGCTTTAC ATTTTGTGAA ACTTCTTCTA ACTCAGTTTA CTCATTGTGT CCGTAGAGGC 1140 

TTAGGCATGA TAATTTTTGC TATTGTTTAC TTGGTCACAC TAATAATTTC TGTTGTGATG 1200 

TCCTCTGTAG CTTTGCATAG TTCTATTCAA ACAGCTCAGT ATGTGGAGAA CTGGACACGC 1260 

ACAGTCAACC AAGGGTGGCT ACTTGAGAAT AAAATTAACA CTGAGTTACA AACTGAAGTG 1320 

GCAGTGTTAT AATCCACGAT TCTATGGTTA GGGGAACAAG TACAAAGCTT GCAATTGCAG 1380 

CAGTAATTGT GTTGTCATTT TAATCACACT CATATTTGTG TAACCAACTT AGAATATAAC 1440 

CAAAGTGAGT ATCCATGGGA TCTTGTGAAA GCCCATTTGC AGGGAGCTTT CACATCCGAC 1500 

ATCACCTTTG ATATTGGTGA ATTACAAAAC AAAATTCTTG ATTTAAATAA ACAAATTCCA 1560 

GAGTTTCAGC CTTCTTTAGA AGACTGGACT GAATTCCAGC AAGGCCTGGA GAGCGTCAAC 1620 

CCTTGGACCT ATCTAAAGCA CCACATTAAC ATCTTATATA TAGTTCTTGG AATAATGTTG 1680 

TTTTGTCTCT GTCTTCTGTT CATAGTCTGT AAAATCGGAT GGACTGCCAA TCGGAGAATG 1740 

AAAGCTACCC AGCCTGGCCT TACATTCTTT CACTTAATAC ATAAACAAGA AGGGGGAAAT 1800 

GTTGGGAGCC AAAAAGGCCA AAGGGATGGT GACCAACTCA GCATTCCACT GGAGGCTACA 1860 

TGATCAAACA GCAAACTGTT TATCATGAAT ACAGAATGTG GGCAAACTCG CTTCTGTGCC 1920 

TGCCCAGAAG GTTTGCTGAG GGCCATCGCT CCCTGGCCCC GGCTCCTTGA GGTTATCTAC 1980 

TGGGACATCT AGAGCCTATT GTTCGAGGAA TGCAGTCTTG CAAGCCTACT CTGGACCGAG 2040 

CAGCTGACCT CTTCTTCCAC ACCCCTTCTC ACTATCTCTT TTGCCTAATA AATATGGAGG 2100 

GCTGTGTAAA GCTCAGGGCC CTTGTCCACT AGAGGCAAGG TGTCCCCTGA CCCTTCTTCC 2160 
AAACAT 

Seq ID NO: 216 Protein sequence 
Protein Accession #» NP_115S66.1 

1 11 21 31 41 51 

I I I I I I 

MLTGDIVDWG PKGQLDGKEE NQKSKHKLCW HWWQAFNASS LYNTGIQSQS AAQIAWHGAG 60 

FSPPLPQWHY LGRKGPIQKM IWKAAFPFMN GNIWVAIILS NNSNSKQHSL NVTPVKNITT 120 

QFTVCVFNPY VFLAAKKDQL QVNNTQLTCK SCQLYHCINH STLQTHNIST LMILGCIPGL 180 

WIPVNLSEPW AATIALHPVK LLLTQFTHCV RRGLGMIIFA IVYLVTLIIS WMSSVALH3 240 
SIQTAQYVEN WTRTVNQGWL LENKINTELQ TEVAVL 

Seq ID NO: 217 DNA sequence 

Nucleic Acid Accession ft: FGENESH predicted 

Coding sequence: 1..1566 

1 11 21 31 41 51 
I I I I I I 

ATGGTGAACC CCAAATCCAC TTCCTCCCTC TTCAGGTTAT GTTTTTTGCT CCTGAGGAGT 60 

CAGAACCTGT GGGTTGAAGA GCAAATTCAA TGCAAAAACA TATTGGGCAT ATGTTCCCAA 120 

TCCCCCAGCA GTATGGCCTA TACTTTGGAG CTCACTCCTC CTGAGATTTA TCACGATCAG 180 

GGAGAGTGGG CTCCAGGACC CCTAACTCCC CGTGACATAG AAAAGTTAGA CTCTCAGAAC 240 

AATGTCATTA ATTATACCAC TCCACTGGAA GGACTCCCTT TGTTTATCAC CACAAAGACG 300 

TCGCTCAGCC ATAGCTGTCT TGCAATTCAA GCTCAAACAT GGTTGAGTCA CTATGGAAAA 360 

ATTATGTACT TATTAGGTCT TGGTTCTATT AATGTAACTG GTGTGCTAAC CAATCATTCC 420 

CAGTCCAGTC ACCCTAATTG TGCTGATTAT ACAGAATGGA TTCCATTCAA TAGTTCCTAC 480 

CCCACTCTGT GGACCCAGTG TCTTGATCCA CTGGCTAGTA AACAATATAT GTCAACTGAA 540 

GACACTGTGG ATTGGGAACC TAAAGGTCAA TTAGATGGAA AAGGTGAAAG TCAGAAATCA 600 

TGGCACAAAC TTCACTGGCA TTGGCGGCAA GCTTTTAATG CTTCTTCTTT ATACAACAGC 660 

AGAATCCAAT CCCAGTCTGC TGCTCAGATT GCTTGGCATG GAGCAGGCTT TAGCCCACCT 720 

CTTCCTCAGT TGCATTATCT GGGGAGGAAA GGACCAATTC AAGAAACTAT ATGGAAGGCA 780 

GCACTCCCAT TTATGAATGG CAACATCTGG ATTGGAACAC TGTCTAATAA TAGCAATAGT 840 

AAGCAACACA GTCTTAATGT TGCATTTGTA AAGAATATCA CCACTCAGTT TACAGTTTGT 900 

GTTTTTAATC CTTATGCCTT TTTGGCAGCT AAGAAGAACC AGCTTCAGGT GGAGAACTGG 960 
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ACACGCACAG CTGACCAAGC GAGGCTACTT CAGAATAAAA TTAACACTGA GTTACAAACT 1020 

GAAGTGGCAA TGTTGAAATC CATGGTTCTG TGGTTAGGAG AACAGGTACA AAGCTTGCAG 1080 

TTGCAGCAGC AATTGCGTCA TCATTTTAAT CACATTCATA TTTGCGTAAC TAACTCAGAA 1140 

TATAACCAAA GTGAGTATCC GTGGGACCTT GTGAAAGCCC ATTTGCAAGG AGCTTTCACA 1200 

5 TCCAACATCA CCTTTGATAT TGGTGAATTA CAAAACAAAA TTATTGATTT AAATAGGCAA 1260 

ACTCAAGAAT TTCAGCCTTC TTTAGAAGAC TGGACCGAAT TCCAGGAAGG CCTGGAGAGC 1320 

CTCAACCCTT GGACCTATCT AAGGCACCAC ATTAACATCT TATATGTAGT TCTTGGAATA 1380 

ATGTTGTTTT GTCTCTGTCT TCGGTTTATA GTCTGTAAAA TCGGATGGAC CACCAATTGG 1440 

AGAATGAGAG CCTCCCAGCC CAGCCTTACA TTCTTTCAAT TAATACATAA ACAGAAAGGG 1500 

10 GGATATGCAG GGAGCCAAAG GCCTGTGGGA CGTGACCAAC TCAGCATTCT GCTGGAGGCT 1560 
ATATGA 

Seq ID NO: 218 Protein sequence 
Protein Accession #: FGENESH predicted 

1 11 21 31 41 51 

I I I I I I 

MVNPKSTSSL FRLCFIiLLRS QNLWVEEQIQ CKNILGICSQ SPSSMAYTLE LTPPEIYHDQ 60 

GEWAPGPLTP RDIEKLDSQN NVINYTTPLE GLPLFITTKT SLSHSCLAIQ AQTWLSHYGK 120 

20 IMYLLGLGSI NVTGVL.TNHS QSSHPNCADY TEWIPFMSSY PTLWTQCLDP LASKQYMSTE 180 

DTVDWEPKGQ LDGKGESQKS WHKLHWHWRQ AFNASSLYNS RIQSQSAAQI AWHGAGFSPP 240 

LPQLHYLGRK GPIQETIWKA ALPFMNGNIW IGTLSNNSNS KQHSLNVAFV KNITTQFTVC 300 

VFNPYAFLAA KKNQLQVENW TRTADQARLL QMKINTELQT EVAMLKSMVL WLGEQVQSLQ 360 

^ _ LQQQLRHHFN HIHICVTNSE YNQSEYPWDL VKAHLQGAPT SNITFDIGEL QNKIIDLNRQ 420 

25 TQEFQPSLED WTEFQEGLES LNPWTYLRHH INILYWLGI HLFCLCLRFI VCKIGWTTNW 480 
RMRASQPSLT FFQL1HKQKG GYAGSQRPVG RDQLSILLEA I 

Seq ID NO: 219 DNA sequence 
Nucleic Acid Accession #: FGENESH predicted 
30 Coding sequence: 1..900 

1 11 21 31 41 51 

I I I I I I 

ATGCCGCCGC GGGAGCTGAG CGAGGCCGAG CCGCCCCCGC TCCGGGCCCC GACCCCTCCC 60 

35 CCGCGGCGGC GTAGCGCGCC CCCAGAGCTG GGCATCAAGT GCGTGCTGGT GGGCGACGGC 120 

GCCGTGGGCA AGAGCAGCCT CATCGTCAGC TACACCTGCA ATGGGTACCC CGCGCGCTAC 180 

CGGCCCACTG CGCTGGACAC CTTCTCTGGT ACGTACGTTC AATCGCCCGT GCGGCCGCGT 240 

GGCTGCGGCG GGGCTGTGCA CCGGGGAGCT GGGGCGGGCG TCTCGGCGGG AGGGCGCAGA 300 

GGACCCCGGG GAGGAGACTG GAGCAGGCCC CGAGGTGGCG CTGGTGCGGC CCAGGACGCT 360 

40 CTTCCTAACT CAGGCTCTCC CCGCCCCGCC CCTGCAGTGC AAGTCCTGGT GGATGGAGCT 420 

CCGGTGCGCA TTGAGCTCTG GGACACAGCG GGACAGGAGG ATTTTGACCG ACTTCGTTCC 480 

CTTTGCTACC CGGATACOGA TGTCTTCCTG GCGTGCTTCA GCGTGGTGCA GCCCAGCTCC 540 

TTTCAAAACA TCACAGAGAA ATGGCTGCCC GAGATCCGCA CGCACAACCC CCAGGCGCCT 600 

GTGCTGCTGG TGGGCACCCA GGCCGACCTG AGGGACGATG TCAACGTACT AATTCAGCTG 660 

45 GACCAGGGGG GCCGGGAGGG CCCCGTGCCC CAACCCCAGG CTCAGGGTCT GGCCGAGAAG 720 

ATCCGAGCCT GCTGCTACCT TGAGTGCTCA GCCTTGACGC AGAAGAACTT GAAGGAAGTA 780 

TTTGACTCGG CTATTCTCAG TGCCATTGAG CACAAAGCCC GGCTGGAGAA GAAACTGAAT 840 
GCCAAAGGTG TGCGCACCCT CTCCCGCTGC CGCTGGAAGA AGTTCTTCTG CTTCGTTTGA 

50 Seq ID NO: 220 Protein sequence 

Protein Accession #: FGENESH predicted 

1 11 21 31 41 51 

„ I I I I I I 

J J MPPRELSEAE PPPLRAPTPP PRRRSAPPEL GIKCVLVGDG AVGKSSLIVS YTCNGYPARY 60 
RPTALDTFSG TYVQSPVRPR GCGGAVHRGA GAGVSAGGRR GPRGGDWSRP RGGAGAAQDA 120 
LPNSGSPRPA PAVQVLVDGA PVRIELWDTA GQEDFDRLRS LCYPDTDVFL ACFSWQPSS 180 
FQNITEKWIiP EIRTHNPQAP VLLVGTQADL RDDVNVLIQL DQGGREGPVP QPQAQGLAEK 240 
IRACCYLECS ALTQKNLKEV FDSAILSAIB HKARLKKKLN AKGVRTLSRC RWKKFFCFV 

60 

Seq ID NO: 221 DNA sequence 

Nucleic Acid Accession #: XM_063832.2 

Coding sequence: 1. .711 

65 1 11 21 31 41 51 

I ! I I I I 

ATGCCGCCGC GGGAGCTGAG CGAGGCCGAG CCGCCCCCGC TCCGGGCCCC GACCCCTCCC 60 

CCGCGGCGGC GTAGCGCGCC CCCAGAGCTG GGCATCAAGT GCGTGCTGGT GGGCGACGGC 120 

GCCGTGGGCA AGAGCAGCCT CATCGTCAGC TACACCTGCA ATGGGTACCC CGCGCGCTAC 180 

70 CGGCCCACTG CGCTGGACAC CTTCTCTGTG CAAGTCCTGG TGGATGGAGC TCCGGTGCGC 240 

ATTGAGCTCT GGGACACAGC GGGACAGGAG GATTTTGACC GACTTCGTTC CCTTTGCTAC 300 

CCGGATACOG ATGTCTTCCT GGCGTGCTTC AGCGTGGTGC AGCCCAGCTC CTTTCAAAAC 360 

ATCACAGAGA AATGGCTGCC CGAGATCCGC ACGCACAACC CCCAGGCGCC TGTGCTGCTG 420 

GTGGGCACCC AGGCCGACCT GAGGGACGAT GTCAACGTAC TAATTCAGCT GGACCAGGGG 480 

75 GGCCGGGAGG GCCCCGTGCC CCAACCCCAG GCTCAGGGTC TGGCCGAGAA GATCCGAGCC 540 

TGCTGCTACC TTGAGTGCTC AGCCTTGACG CAGAAGAACT TGAAGGAAGT ATTTGACTCG 600 

GCTATTCTCA GTGCCATTGA GCACAAAGCC CGGCTGGAGA AGAAACTGAA TGCCAAAGGT 660 
GTGCGCACCC TCTCCCGCTG CCGCTGGAAG AAGTTCTTCT GCTTCGTTTG A 

80 Seq ID NO: 222 Protein sequence 
Protein Accession #: XP_063832.1 

1 11 21 31 41 51 

1 I I 1 I I 
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MPPRELSEAE PPPLRAPTPP PRRRSAPPEL GIKCVLVGDG AVGKSSLIVS YTCNGYPARY 60 

RPTALDTFSV QVLVDGAPVR IELWDTAGQE DFDRLRSLCY PDTDVPLACF SWQPSSPQN 120 

ITEKWLPEIR THNPQAPVLL VGTQADLRDD VNVLIQLDQG GREGPVPQPQ AQGLAEKIRA 180 
CCYLECSALT QKNLKEVFDS AILSAIEHKA RLEKKLNAKG VRTLSRCRWK KFPCFV 

Seq ID NO: 223 DNA sequence 

Nucleic Acid Accession 8: Eos sequence 

Coding sequence: 

1 11 21 31 41 51 

I I 1 I I I 

ATGAATOGGC ACCATCTGCA GGATCACTTT CTGGAAATAG ACAAGAAGAA CTGCTGTGTG 60 

TTCCGAGATG ACTTCATTGC CAAGGTGTTG CCGCCGGTGT TGGGGCTGGA GTTTATCTTT 120 

GGGCTTCTGG GCAATGGCCT TGCCCTGTGG ATTTTCTGTT TCCACCTCAA GTCCTGGAAA 180 

TCCAGCCGGA TTTTCCTGTT CAACCTGGCA GTAGCTGACT TTCTACTGAT CATCTGCCTG 240 

CCGTTCGTGA TGGACTACTA TGTGCGGCGT TCAGACTGGA AGTTTGGGGA CATCCCTTGC 300 

CGGCTGGTGC TCTTCATGTT TGCCATGAAC CGCCAGGGCA GCATCATCTT CCTCACGGTG 360 

GTGGCGGTAG ACAGGTATTT CCGGGTGGTC CATCCCCACC ACGCCCTGAA CAAGATCTCC 420 

AATTGGACAG CAGCCATCAT CTCTTGCCTT CTGTGGGGCA TCACTGTTGG CCTAACAGTC 480 

CACCTCCTGA AGAAGAAGTT GCTGATCCAG AATGGCCCTG CAAATGTGTG CATCAGCTTC 540 

AGCATCTGCC ATACCTTCCG GTGGCACGAA GCTATGTTCC TCCTGGAGTT CCTCCTGCCC 600 

CTGGGCATCA TCCTGTTCTG CTCAGCCAGA ATTATCTGGA GCCTGCGGCA GAGACAAATG 660 

GACCGGCATG CCAAGATCAA GAGAGCCATC ACCTTCATCA TGGTGGTGGC CATCGTCTTT 720 

GTCATCTGCT TCCTTCCCAG CGTGGTTGTG CGGATCCGCA TCTTCTGGCT CCTGCACACT 780 

TCGGGCACGC AGAATTGTGA AGTGTACCGC TCGGTGGACC TGO CGTTCTT TATCACTCTC 840 

AGCTTCACCT ACATGAACAG CATGCTGGAC CCCGTGGTGT ACTACTTCTC CAGCCCATCC 900 

TTTCCCAACT TCTTCTCCAC TTTGATCAAC CGCTGCCTCC AGAGGAAGAT GACAGGTGAG 960 

CCAGATAATA ACCGCAGCAC GAGCGTCGAG CTCACAGGGG ACCCCAACAA AACCAGAGGC 1020 

GCTCCAGAGG CGTTAATGGC CAACTCCGGT GAGCCATGGA GCCCCTCTTA TCTGGGCCCA 1080 

ACCTCAAATA ACGATTCCAA GAAGGGACAT TGTCACCAAG AACCAGCATC TCTGGAGAAA 1140 
CAGTTGGGAT GTTGCATCGA G 



Seq ID NO: 224 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I 1 I I 

KNRHHLQDHF LEIDKKNCCV FRDDFIAKVL PPVLGLEFIF GLLGNGLALW IFCFHLKSMK 60 

SSRIFLFNIA VADFLLIICL PFVMDYYVRR SDWKFGDIPC RLVLFMFAMN RQGSIIPLTV 120 

VAVDRYFRW HPHHALNKIS NWTAAIISCL LWGITVGLTV HLLKKKLLIQ NGPANVCISF 180 

SICHTFRWHE AMFLLEFLLP LGIILFCSAR IIWSLRQRQM DRHAKIKRAI TFIMWAIVF 240 

VICFLPSVW RIRIFWLLHT SGTQNCEVYR SVDLAFFITL SFTYMNSMLD PWYYFSSPS 300 

FPNFFSTLIN RCLQRKMTGE PDNNRSTSVE LTGDPNKTRG APEALMANSG EPWSPSYLGP 360 
TSNNHSKKGH CHQEPASLEK QLGCCIE 

Seq ID NO i 225 DNA sequence 
Nucleic Acid Accession # : Eos sequence 
Coding sequence : 1 . . 1092 

1 11 21 31 41 51 

I I I I I I 

ATGAATCGGC ACCATCTGCA GGATCACTTT CTGGAAATAG ACAAGAAGAA CTGCTGTGTG 60 

TTCCGAGATG ACTTCATTGT CAAGGTGTTG CCGCCGGTGT TGGGGCTGGA GTTTATCTTC 120 

GGGCTTCTGG GCAATGGCCT TGCCCTGTGG ATTTTCTGTT TCCACCTCAA GTCCTGGAAA 1B0 

TCCAGCCGGA TTTTCCTGTT CAACCTGGCA GTGGCTGACT TTCTACTGAT CATCTGCCTG 240 

CCCTTCCTGA TGGACAACTA TGTGAGGCGT TGGGACTGGA AGTTTGGGGA CATCCCTTGC 300 

CGGCTGATGC TCTTCATGTT GGCTATGAAC CGCCAGGGCA GCATCATCTT CCTCACGGTG 360 

GTGGCGGTAG ACAGGTATTT CCGGGTGGTC CATCCCCACC ACGCCCTGAA CAAGATCTCC 420 

AATCGGACAG CAGCCATCAT CTCTTGCCTT CTGTGGGGCA TCACTATTGG CCTGACAGTC 480 

CACCTCCTGA AGAAGAAGAT GCCGATCCAG AATGGCGGTG CAAATTTGTG CAGCAGCTTC 540 

AGCATCTGCC ATACCTTCCA GTGGCACGAA GCCATGTTCC TCCTGGAGTT CTTCCTGCCC 600 

CTGGGCATCA TCCTGTTCTG CTCAGCCAGA ATTATCTGGA GCCTGCGGCA GAGACAAATG 660 

GACCGGCATG CCAAGATCAA GAGAGCCATC ACCTTCATCA TGGTGGTGGC CATCGTCTTT 720 

GTCATCTGCT TCCTTCCCAG CGTGGTTGTG CGGATCCGCA TCTTCTGGCT CCTGCACACT 780 

TCGGGCACGC AGAATTGTGA AGTGTACCGC TCGGTGGACC TGGOGTTCTT TATCACTCTC 840 

AGCTTCACCT ACATGAACAG CATGCTGGAC CCCGTGGTGT ACTACTTCTC CAGCCCATCC 900 

TTTCCCAACT TCTTCTCCAC TTTGATCAAC CGCTGCCTCC AGAGGAAGAT GACAGGTGAG 960 

CCAGATAATA ACCGCAGCAC GAGCGTCGAG CTCACAGGGG ACCCCAACAA AACCAGAGGC 1020 

GCTCCAGAGG CGTTAATGGC CAACTCCGGT GAGCCATGGA GCCCCTCTTA TCTGGGCCCA 108 0 

Seq ID NO: 226 Protein sequence 
Protein Accession #: Eos sequence 

75 i 11 21 31 41 SI 

I I I I I I 

MNRHHLQDHF LEIDKKNCCV FRDDFIVKVL PPVLGLEFIF GLLGNGLALW IFCFHLKSHK 60 

SSRIFLFNLA VADFLLIICL PFLMDNYVRR WDWKFGDIPC RLMLFMLAMN RQGSIIPLTV 120 

- ~ VAVDRYFRW HPHHALNKIS NRTAAIISCL LWGITIGLTV HLLKKKMPIQ NGGANLCSSF 180 

OU SICHTFQWHE AMFLLEFFLP LGIILFCSAR IIWSLRQRQM DRHAKIKRAI TFIMWAIVF 240 

VICFLPSVW RIRIFWLLHT SGTQNCEVYR SVDLAFFITL SFTYMNSMLD PWYYFSSPS 300 

FPNFFSTLIN RCLQRKMTGE PDNNRSTSVE LTGDPNKTRG APEALMANSG EPWSPSYLGP 360 
TSP 
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Seq ID NO: 227 DMA sequence 

Nucleic Acid Accession #: NM_006018 

Coding sequence: 61.. 1224 

1 11 21 31 41 51 

I I I I I I 

CGCCACTTTG CTGGAGCATT CACTAGGCGA GGCGCTCCAT CGGACTCACT AGCCGCACTC 60 

ATGAATCGGC ACCATCTGCA GQATCACTTT CTGGAAATAG ACAAGAAGAA CTGCTGTGTG 120 

TTCCGAGATG ACTTCATTGC CAAGGTGTTG CCGCCGGTGT TGGGGCTGGA GTTTATCTTT 180 

GGGCTTCTGG GCAATGGCCT TGCCCTGTGG ATTTTCTGTT TCCACCTCAA GTCCTGGAAA 240 

TCCAGCCGGA TTTTCCTGTT CAACCTGGCA GTAGCTGACT TTCTACTGAT CATCTGCCTG 300 

CCGTTCGTGA TGGACTACTA TGTGGGGCGT TCAGACTGGA ACTTTGGGGA CATCCCTTGC 360 

CGGCTGGTGC TCTTCATGTT TGCCATGAAC CGCCAGGGCA GCATCATCTT CCTCACGGTG 420 

GTGGCGGTAG ACAGGTATTT CCGGGTGGTC CATCCCCACC ACGCCCTGAA CAAGATCTCC 480 

AATTGGACAG CAGCCATCAT CTCTTGCCTT CTGTGGGGCA TCACTGTTGG CCTAACAGTC S40 

CACCTCCTGA AGAAGAAGTT GCTGATCCAG AATGGCCCTG CAAATGTGTG CATCAGCTTC 600 

AGCATCTGCC ATACCTTCCG GTGGCACGAA GCTATGTTCC TCCTGGAGTT CCTCCTGCCC 660 

CTGGGCATCA TCCTGTTCTG CTCAGCCAGA ATTATCTGGA GCCTGCGGCA GAGACAAATG 720 

GACCGGCATG CCAAGATCAA GAGAGCCATC ACCTTCATCA TGGTGGTGGC CATCGTCTTT 780 

GTCATCTGCT TCCTTCCCAG CGTGGTTGTG CGGATCCGCA TCTTCTGGCT CCTGCACACT 840 

TCGGGCACGC AGAATTGTGA AGTGTACCGC TCGGTGGACC TGGCGTTCTT TATCACTCTC 900 

AGCTTCACCT ACATGAACAG CATGCTGGAC CCCGTGGTGT ACTACTTCTC CAGCCCATCC 960 

TTTCCCAACT TCTTCTCCAC TTTGATCAAC CGCTGCCTCC AGAGGAAGAT GACAGGTGAG 1020 

CCAGATAATA ACCGCAGCAC GAGCGTCGAG CTCACAGGGG ACCCCAACAA AACCAGAGGC 1080 

GCTCCAGAGG CGTTAATGGC CAACTCCGGT GAGCCATGGA GCCCCTCTTA TCTGGGCCCA 1140 

ACCTCAAATA ACCATTCCAA GAAGGGACAT TGTCACCAAG AACCAGCATC TCTGGAGAAA 1200 

CAGTTGGGCT GTTGCATCGA GTAATGTCAC TGGACTCGGC CTAAGGTTTC CTGGAACTTC 1260 

CAGATTCAGA GAATCTGATT TAGGGAAACT GTGGCAGATG AGTGGGAGAC TGGTTGCAAG 1320 

GTGTGACCAC AGGAATCCTG GAGGAACAGA GAGTAAAGCT TCTAGGCATC TGAAACTTGC 1380 

TTCATCTCTG ACGCTCGCAG GACTGAAGAT GGGCAAATTG TAGGCGTTTC TGCTGAGCAG 1440 

AGTTGGAGCC AGAGATCTAC TTGTGACTTG TTGGCCTTCT TCCCACATCT GCCTCAGACT 1500 

GGGGGGGGCT CAGCTCCTCG GGTGATATCT AGCCTGCTTG TGAGCTCTAG CAGGGATAAG 1560 

GAGAGCTGAG ATTGGAGGGA ATTGTGTTGC TCCTGGAGGA AGCCCAGGCA TCATTAAACA 1620 

AGCCAGTAGG TCACCTGGCT TCCGTGGACC AATTCATCTT TCAGACAAGC TTTAGAGAAA 1680 

TGGACTCAGG GAAGAGACXC ACATGCTTTG GTTAGTATCT GTGTTTCCGG TGGGTGTAAT 1740 

AGGGGATTAG CCCCAGAAGG GACTGAGCTA AACAGTGTTA TTATGGGAAA GGAAATGGCA 1800 

TTGCTGCTTT CAACCAGCGA CTAATGCAAT CCATTCCTCT CTTGTTTATA GTAATCTAAG 1860 

GGTTGAGCAG TTAAAACGGC TTCAGGATAG AAAGCTGTTT OCCACCTGTT TCGTTTTACC 1920 

ATTAAAAGGG AAACGTGCCT CTGCCCCACG GGTAGAGGGG GTGCAOGTTC CTCCTGGTTC 1980 

CTTCGCTTGT GTTTCTGTAC TTACCAAAAA TCTACCACTT CAATAAATTT TGATAGGAGA 2040 
CAAAAAAAAA A 

Seq ID NO: 228 Protein sequence 
Protein Accession #: NP_006009.l 

1 11 21 31 41 SI 

I I t I I I 

MNRHHLQDHF LEIDKKNCCV FRDDFIAKVL PPVLGLEFIF GLLGNGLALW IFCFHLKSWK 60 

SSRIFLFNLA VADFLLIICL PFVMDYYVRR SDWNFGDIPC RLVLFMFAMN RQGSIIFLTV 120 

VAVDRYFRW HPHHALNKIS NWTAAIISCL LWGITVGLTV HLLKKKLLIQ NGPANVCISF 180 

SICHTFRWHE AMFLLEFLLP LGIILFCSAR IIWSLRQRQM DRHAKIKRAI TFIKWAIVF 240 

VICFLPSVW RIRIFWLLHT SGTQNCEVYR SVDLAFFITL SFTYMNSMLD PWYYFSSPS 300 

FPNFFSTLIN RCLQRKMTGE PDNNRSTSVE LTGDPNKTRG AP SALMAN SG EPWSPSYLGP 360 
TSNNHSKKGH CHQEPASLEK QLGCCIE 

Seq ID NO: 229 UNA sequence 

Nucleic Acid Accession #: NMJU4398.1 

Coding sequence: 64.. 1314 

1 11 21 31 41 51 

I I 1 I I I 

GGCACCGATT CGGGGCCTGC CCGGACTTCG CCGCACGCTG CAGAACCTCG CCCAGCGCCC 60 

ACCATGCCCC GGCAGCTCAG CGCGGCGGCC GCGCTCTTCG CGTCCCTGGC CGTAATTTTQ 120 

CACGATGGCA GTCAAATGAG AGCAAAAGCA TTTCCAGAAA CCAGAGATTA TTCTCAACCT 180 

ACTGCAGCAG CAACAGTACA GGACATAAAA AAACCTGTCC AGCAACCAGC TAAGCAAGCA 240 

CCTCACCAAA CTTTAGCAGC AAGATTCATG GATGGTCATA TCACCTTTCA AACAGCGGCC 300 

ACAGTAAAAA TTCCAACAAC TACCCCAGCA ACTACAAAAA ACACTGCAAC CACCAGCCCA 360 

ATTACCTACA CCCTGGTCAC AACCCAGGCC ACACCCAACA ACTCACACAC AGCTCCTCCA 420 

GTTACTGAAG TTACAGTCGG CCCTAGCTTA GCCCCTTATT CACTGCCACC CACCATCACC 480 

CCACCAGCTC ATACAGCTGG AACCAGTTCA TCAACCGTCA GCCACACAAC TGGGAACACC 540 

ACTCAACCCA GTAACCAGAC CACCCTTCCA GCAACTTTAT OGATAGCACT GCACAAAAGC 600 

ACAACCGGTC AGAAGCCTGA TCAACCCACC CATGCCCCAG GAACAACGGC AGCTGCCCAC 660 

AATACCACCC GCACAGCTGC ACCTGCCTCC ACGGTTCCTG GGCCCACCCT TGCACCTCAG 720 

. CCATCGTCAG TCAAGACTGG AATTTATCAG GTTCTAAACG GAAGCAGACT CTGTATAAAA 780 

GCAGAGATGG GGATACAGCT GATTGTTCAA GACAAGGAGT CGGTTTTTTC ACCTCGGAGA 840 

TACTTCAACA TCGACCCCAA CGCAACGCAA GCCTCTGGGA ACTGTGGCAC COGAAAATCC 900 

AACCTTCTGT TGAATTTTCA GGGCGGATTT GTGAATCTCA CATTTACCAA GGATGAAGAA 960 

TCATATTATA TCAGTGAAGT GGGAGCCTAT TTGACCGTCT CAGATCCAGA GACAGTTTAC 1020 

CAAGGAATCA AACATGCGGT GGTGATGTTC CAGACAGCAG TCGGGCATTC CTTCAAGTGC 1080 

GTGAGTGAAC AGAGCCTCCA GTTGTCAGCC CACCTGCAGG TGAAAACAAC CGATGTCCAA 1140 

CTTCAAGCCT TTGATTTTGA AGATGACCAC TTTGGAAATG TGGATGAGTG CTCGTCTGAC 1200 

TACACAATTG TGCTTCCTGT GATTGGGGCC ATCGTGGTTG GTCTCTGCCT TATGGGTATG 1260 

GGTGTCTATA AAATCCGCCT AAGGTGTCAA TCATCTGGAT ACCAGAGAAT CTAATTGTTG 1320 

CCCGGGGGGA ATGAAAATAA TGGAATTTAG AGAACTCTTT CATCCCTTCC AGGATGGATG 1380 
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TTGGGAAATT ccctcagaqt gtgggtcctt caaacaatgt aaaccaccat cttctattca 

AATGAAGTGA GTCATGTGTG ATTTAAGTTC AGGCAGCACA TCAATTTCTA AATACTTTTT 
GTTTATTTTA TGAAAGATAT AGTGAGCTGT TTATTTTCTA GTTTCCTTTA GAATATTTTA 
GCCACTCAAA GTCAACATTT GAGATATGTT GAATTAACAT AATATATGTA AAGTAGAATA 
AGCCTTCAAA TTATAAACCA AGGGTCAATT GTAACTAATA CTACTGTGTG TGCATTGAAG 
ATTTTATTTT ACCCTTGATC TTAACAAAGC CTTTGCTTTG TTATCAAATG GACTTTCAGT 
GCTTTTACTA TCTGTGTTTT ATGGTTTCAT GTAACATACA TATTCCTGGT GTAGCACTTA 
ACTCCTTTTC CACTTTAAAT TTGTTTTTGT TTTTTGAGAC GGAGTTTCAC TCTTGTCACC 
CAGGCTGGAG TACAGTGGCA CGATCTCGGC TTATGGCAAC CTCCGCCTCC CGGGTTCAAG 
TGATTCTCCT GCTTCAGCTT CCCGAGTAGC TGGGATTACA GGCACACACT ACCACGCCTG 
GCTAATTTTT GTATTTTTAT TATAGACGGG TTTCACCATG TTGGCCAGAC TGGTCTTGAA 
CTCTTGACCT CAGGTGATCC ACCCACCTCA GCCTCCCAAA GTGCTGGGAT TACAGGCATG 
AGCCATTGCG CCCGGCCTTA AATGTTTTTT TTAATCATCA AAAAGAACAA CATATCTCAG 
GTTGTCTAAG TGTTTTTATG TAAAACCAAC AAAAAGAACA AATCAGCTTA TATTTTTTAT 
CTTGATGACT CCTGCTCCAG AATTGCTAGA CTAAGAATTA GGTGGCTACA GATGGTAGAA 
CTAAACAATA AGCAAGAGAC AATAATAATG GCCCTTAATT ATTAACAAAG TGCCAGAGTC 
TAGGCTAAGC ACTTTATCTA TATCTCATTT CATTCTCACA ACTTATAAGT GAATGAGTAA 
ACTGAGACTT AAGGGAACTG AATCACTTAA ATGTCACCTG GCTAACTGAT GGCAGAGCCA 
GAGCTTGAAT TCATGTTGGT CTGACATCAA GGTCTTTGGT CTTCTCCCTA CACCAAGTTA 
CCTACAAGAA CAATGACACC ACACTCTGCC TGAAGGCTCA CACCTCATAC CAGCATACGC 
TCACCTTACA GGGAAATGGG TTTATCCAGG ATCATGAGAC ATTAGGGTAG ATGAAAGGAG 
AGCTTTGCAG ATAACAAAAT AGCCTATCCT TAATAAATCC TCCACTCTCT GGAAGGAGAC 
TGAGGGGCTT TGTAAAACAT TAGTCAGTTG CTCATTTTTA TGGGATTGCT TAGCTGGGCT 
GTAAAGATGA AGGCATCAAA TAAACTCAAA GTATTTTTAA ATTTTTTTGA TAATAGAGAA 
ACTTCGCTAA CCAACTGTTC TTTCTTGAGT GTATAGCCCC ATCTTGTGGT AACTTGCTGC 
TTCTGCACTT CATATCCATA TTTCCTATTG TTCACTTTAT TCTGTAGAGC AGCCTGCCAA 
GAATTTTATT TCTGCTGTTT TTTTTGCTGC TAAAGAAAGG AACTAAGTCA GGATGTTAAC 
AGAAAAGTCC ACATAACCCT AGAATTCTTA GTCAAGGAAT AATTCAAGTC AGCCTAGAGA 
CCATGTTGAC TTTCCTCATG TGTTTCCTTA TGACTCAGTA AGTTGGCAAG GTCCTGACTT 
TAGTCTTAAT AAAACATTGA ATTGTAGTAA AGGTTTTTGC AATAAAAACT TACTTTGG 

Seq ID NO: 230 Protein sequence 
Protein Accession #: NP_055213.1 

1 11 21 31 41 51 

I I I I I I 

MPRQLSAAAA LFASLAVILH DGSQMRAKAF PETRDYSQPT AAATVQDIKK PVQQPAKQAP €0 

HQTLAARFMD GHITPQTAAT VKIPTTTPAT TKNTATTSPI TYTLVTTQAT PNNSHTAPPV 120 

TEVTVGPSliA PYSLPPTITP PAHTAGTSSS TVSHTTGNTT QPSNQTTLPA TLSIALHKST 180 

TGQKPDQPTH APGTTAAAHN TTRTAAPAST VPGPTIAPQP SSVKTGIYQV LNGSRLCIKA 240 

EMGIQLIVOD KESVFSPRRY FNIDPNATQA SGNCGTRKSN LLLNFQGGFV NLTFTKDEES 300 

YYISBVGAYL TVSDPETVYQ GIKHAWMFQ TAVGHSFKCV SEQSLQLSAH LQVKTTDVQL 360 
QAFDFEDDHF GNVDECSSDY TIVLPVIGAI WGLCLMGMG VYKIRLRCQS SGYQRI 

Seq ID NO: 231 DNA sequence 

Nucleic Acid Accession #: NM J) 0540 9. 3 

Coding sequence: 94.. 378 

1 11 21 31 41 51 

I I I I I I 

TTCCTTTCAT GTTCAGCATT TCTACTCCTT CCAAGAAGAG CAGCAAAGCT GAAGTAGCAG 60 

CAACAGCACC AGCAGCAACA GCAAAAAACA AACATGAGTG TGAAGGGCAT GGCTATAGCC 120 

TTGGCTGTGA TATTGTGTGC TACAGTTGTT CAAGGCTTCC CCATGTTCAA AAGAGGACGC 180 

TGTCTTTGCA TAGGCCCTGG GGTAAAAGCA GTGAAAGTGG CAGATATTGA GAAAGCCTCC 240 

ATAATGTACC CAAGTAACAA CTGTGACAAA ATAGAAGTGA TTATTACCCT GAAAGAAAAT 300 

AAAGGACAAC GATGCCTAAA TCCCAAATCG AAGCAAGCAA GGCTTATAAT CAAAAAAGTT 360 

GAAAGAAAGA ATTTTTAAAA ATATCAAAAC ATATGAAGTC CTGGAAAAGG GCATCTGAAA 420 

AACCTAGAAC AAGTTTAACT GTGACTACTG AAATGACAAG AATTCTACAG TAGGAAACTG 480 

AGACTTTTCT ATGGTTTTGT GACTTTCAAC TTTTGTACAG TTATGTGAAG GATGAAAGGT 540 

GGGTGAAAGG ACCAAAAACA GAAATACAGT CTTCCTGAAT GAATGACAAT CAGAATTCCA 600 

CTGCCCAAAG GAGTCCAGCA ATTAAATGGA TTTCTAGGAA AAGCTACCTT AAGAAAGGCT 660 

GGTTACCATC GGAGTTTACA AAGTGCTTTC ACGTTCTTAC TTGTTGTATT ATACATTCAT 720 

GCATTTCTAG GCTAGAGAAC CTTCTAGATT TGATGCTTAC AACTATTCTG TTGTGACTAT 780 

GAGAACATTT CTGTCTCTAG AAGTTATCTG TCTGTATTGA TCTTTATGCT ATATTACTAT 840 

CTGTGGTTAC AGTGGAGACA TTGACATTAT TACTGGAGTC AAGCCCTTAT AAGTCAAAAG 900 

CATCTATGTG TCGTAAAGCA TTCCTCAAAC ATTTTTTCAT GCAAATACAC ACTTCTTTCC 960 

CCAAATATCA TGTAGCACAT CAATATGTAG GGAAACATTC TTATGCATCA TTTGGTTTGT 1020 

TTTATAACCA ATTCATTAAA TGTAATTCAT AAAATGTACT ATGAAAAAAA TTATACGCTA 1080 

TGGGATACTG GCAACAGTGC ACATATTTCA TAACCAAATT AGCAGCACOG GTCTTAATTT 1140 

GATGTTTTTC AACTTTTATT CATTGAGATG TTTTGAAGCA ATTAGGATAT GTGTGTTTAC 1200 

TGTACTTTTT GTTTTGATCC GTTTGTATAA ATGATAGCAA TATCTTGGAC ACATTTGAAA 1260 

TACAAAATGT TTTTGTCTAC CAAAGAAAAA TGTTGAAAAA TAAGCAAATG TATACCTAGC 1320 

AATCACTTTT ACTTTTTGTA ATTCTGTCTC TTAGAAAAAT ACATAATCTA ATCAATTTCT 1380 

TTGTTCATGC CTATATACTG TAAAATTTAG GTATACTCAA GACTAGTTTA AAGAATCAAA 1440 
GTCATTTTTT TCTCTAATAA ACTACCACAA CCTTTCTTTT TTAAAAAAAA AAA 

Seq ID NO: 232 Protein sequence 
Protein Accession *h NPJJ05400.1 

1 11 21 31 41 51 

I I I I I I 

MSVKGMAIAL AVILCATWQ GFPMFKRGRC LCIGPGVKAV KVADIEKASI MYPSNNCDKI 60 
EVIITLKENK GQRCLNPKSK QARLIIKKVE RKNF 
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Seq ID NO: 233 DNA sequence 
Nucleic Acid Accession ft: NM_000S77.1 
Coding sequence: 41.. 520 

1 11 21 31 41 51 

I I I I t I 

GGCACGAGGG GAAGACCTCC TGTCCTATCA GGCCCTCCCC ATGGCTTTAG AGACGATCTG 60 

CCGACCCTCT GGGAGAAAAT CCAGCAAGAT GCAAGCCTTC AGAATCTGGG ATGTTAACCA 120 

GAAGACCTTC TATCTGAGGA ACAACCAACT AGTTGCCGGA TACTTGCAAG GACCAAATGT 180 

CAATTTAGAA GAAAAGATAG ATGTGGTACC CATTGAGCCT CATGCTCTGT TCTTGGGAAT 240 

CCATGGAGGG AAGATGTGCC TGTCCTGTGT CAAGTCTGGT GATGAGACCA GACTCCAGCT 300 

GGAGGCAGTT AACATCACTG ACCTGAGCGA GAACAGAAAG CAGGACAAGC GCTTCGCCTT 360 

CATCCGCTCA GACAGTGGCC CCACCACCAG TTTTGAGTCT GCCGCCTGCC CCGGTTGGTT 420 

CCTCTGCACA GCGATGGAAG CTGACCAGCC CGTCAGCCTC ACCAATATGC CTGACGAAGG 480 

CGTCATGGTC ACCAAATTCT ACTTCCAGGA GGACGAGTAG TACTGCCCAG GCCTGCCTGT 540 

TCCCATTCTT GCATGGCAAG GACTGCAGGG ACTGCCAGTC CCCCTGCCCC AGGGCTCCOG 600 

GCTATGGGGG CACTGAGGAC CAGCCATTGA GGGGTGGACC CTCAGAAGGC GTCACAACAA 660 

CCTGGTCACA GGACTCTGCC TCCTCTTCAA CTGACCAGCC TCCATGCTGC CTCCAGAATG 720 

GTCTTTCTAA TGTGTGAATC AGAGCACAGC AGCCCCTGCA CAAAGCCCTT CCATGTCGCC 780 

TCTGCATTCA GGATCAAACC CCGACCACCT GCCCAACCTG CTCTCCTCTT GCCACTGCCT 840 

CTTCCTCCCT CATTCCACCT TCCCATGCCC TGGATCCATC AGGCCACTTG ATGACCCCCA 900 

ACCAAGTGGC TCCCACACCC TGTTTTACAA AAAAGAAAAG ACCAGTCCAT GAGGGAGGTT 960 

TTTAAGGGTT TGTGGAAAAT GAAAATTAGG ATTTCATGAT TlTTTTnT T CAGTCCCCGT 1020 

GAAGGAGAGC CCTTCATTTG GAGATTATGT TCTTTCGGGG AGAGGCTGAG GACTTAAAAT 1080 

ATTCCTGCAT TTGTGAAATG ATGGTGAAAG TAAGTGGTAG CTTTTCCCTT CTTTTTCTTC 1140 

TTTTTTTGTG ATGTCCCAAC TTGTAAAAAT TAAAAGTTAT GGTACTATGT TAGCCCCATA 1200 

ATTTTTTTTT TCCTTTTAAA ACACTTCCAT AATCTGGACT CCTCTGTCCA GGCACTGCTG 1260 

CCCAGCCTCC AAGCTCCATC TCCACTCCAG ATTTTTTACA GCTGCCTGCA GTACTTTACC 1320 

TCCTATCAGA AGTTTCTCAG CTCCCAAGGC TCTGAGCAAA TGTGGCTCCT GGGGGTTCTT 1380 

30 TCTTCCTCTG CTGAAGGAAT AAATTGCTCC TTGACATTGT AGAGCTTCTG GCACTTGGAG 1440 

ACTTGTATGA AAGATGGCTG TGCCTCTGCC TGTCTCCCCC ACCAGGCTGG GAGCTCTGCA 1500 

GAGCAGGAAA CATGACTCGT ATATGTCTCA GGTCCCTGCA GGGCCAAGCA CCTAGCCTCG 1560 

CTCTTGGCAG GTACTCAGCG AATGAATGCT GTATATGTTG GGTGCAAAGT TCCCTACTTC 1620 

CTGTGACTTC AGCTCTGTTT TACAATAAAA TCTTGAAAAT GCCTAAAAAA AAAAAAAAAA 1680 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAA 
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Seq ID NO: 234 Protein sequence 
Protein Accession #: NP_00056B.l 

40 1 11 21 31 41 51 

I I I I I I 

MALETICRPS GRKSSKMQAF RIWDVNQKTF YLRNNQLVAG YLOGPNVNLB EKIDWPIEP 60 

HALFLGIHGG KMCLSCVKSG DETRLQLEAV NITDLSENRK QDKRFAFIRS DSGPTTSFES 120 
AACPGWFLCT AMEADQPVSL TNMPDEGVMV TKFYFQEDE 

45 

Seq ID NO: 235 DNA sequence 

Nucleic Acid Accession #: NM_001840-1 

Coding sequence : 149 . . 1567 

50 i 11 21 31 41 51 

I 1 I I I I 

GGGGACTACG GAGAGCTCTG CAGGGAGCCG AGGCCCCCGC CCGGGCCAAG GGAGCTTCTG 60 

TCCCGAGGAC CAGGGGATGC GAAGGGATTG CCCCCTGTGG GTCACTTTCT CAGTCATTTT 120 

GAGCTCAGCC TAATCAAAGA CTGAGGTTAT GAAGTCGATC CTAGATGGCC TTGCAGATAC 180 

55 CACCTTCCGC ACCATCACCA CTGACCTCCT GTACGTGGGC TCAAATGACA TTCAGTACGA 240 

AGACATCAAA GGTGACATGG CATCCAAATT AGGGTACTTC CCACAGAAAT TCCCTTTAAC 300 

TTCCTTTAGG GGAAGTCCCT TCCAAGAGAA GATGACTGCG GGAGACAACC CCCAGCTAGT 360 

CCCAGCAGAC CAGGTGAACA TTACAGAATT TTACAACAAG TCTCTCTCGT CCTTCAAGGA 420 

/rA GAATGAGGAG AACATCCAGT GTGGGGAGAA CTTCATGGAC ATAGAGTGTT TCATGGTCCT 480 

60 GAACCCCAGC CAGCAGCTGG CCATTGCAGT CCTGTCCCTC ACGCTGGGCA CCTTCACGGT 540 

CCTGGAGAAC CTCCTGGTGC TGTGCGTCAT CCTCCACTCC CGCAGCCTCC GCTGCAGGCC 600 

TTCCTACCAC TTCATCGGCA GCCTGGCGGT GGCAGACCTC CTGGGGAGTG TCATTTTTGT 660 

CTACAGCTTC ATTGACTTCC ACGTGTTCCA CCGCAAAGAT AGCCGCAACG TGTTTCTGTT 720 

CAAACTGGGT GGGGTCACGG CCTCCTTCAC TGCCTCCGTG GGCAGCCTGT TCCTCACAGC 780 

65 CATCGACAGG TACATATCCA TTCACAGGCC CCTGGCCTAT AAGAGGATTG TCACCAGGCC 840 

CAAGGCCGTG GTGGOGTTTT GCCTGATGTG GACCATAGCC ATTGTGATCG COGTGCTGCC 900 

TCTCCTGGGC TGGAACTGCG AGAAACTGCA ATCTGTTTGC TCAGACATTT TCCCACACAT 960 

TGATGAAACC TACCTGATGT TCTGGATCGG GGTCACCAGC GTACTGCTTC TGTTCATCGT 1020 

_ A GTATGCGTAC ATGTATATTC TCTGGAAGGC TCACAGCCAC GCCGTCCGCA TGATTCAGCG 1080 

70 TGGCACCCAG AAGAGCATCA TCATCCACAC GTCTGAGGAT GGGAAGGTAC AGGTGACCCG 1140 

GCCAGACCAA GCCCGCATGG ACATTAGGTT AGCCAAGACC CTGGTCCTGA TCCTGGTGGT 1200 

GTTGATCATC TGCTGGGGCC CTCTGCTTGC AATCATGGTG TATGATGTCT TTGGGAAGAT 1260 

GAACAAGCTC ATTAAGACGG TGTTTGCATT CTGCAGTATG CTCTGCCTGC TGAACTCCAC 1320 

CGTGAACCCC ATCATCTATG CTCTGAGGAG TAAGGACCTG CGACACGCTT TCCGGAGCAT 1380 

75 GTTTCCCTCT TGTGAAGGCA CTGCGCAGCC TCTGGATAAC AGCATGGGGG ACTCGGACTG 1440 

CCTGCACAAA CACGCAAACA ATGCAGCCAG TGTTCACAGG GCCGCAGAAA GCTGCATCAA 1500 

GAGCACGGTC AAGATTGCCA AGGTAACCAT GTCTGTGTCC ACAGACACGT CTGCCGAGGC 15 SO 

TCTGTGAGCC TGATGCCTCC CTGGCAGCAC AGGAAAAGAA TTTTTTTTTT TAAGCTCAAA 1620 

OA ATCTAGAAGA GTCTATTGTC TCCTTGGTTA TATTTTTTTA ACTTTACCAT GCTCAATGAA 1680 

00 AAGGTGATTG CCACATGTCA CTTATTTGCT TAGTTTCCGT TTGGGCTAAT CTTCCGGGGT 1740 
TOGTAGGAAA CCTTT 

Seq ID NO: 236 Protein sequence 
Protein Accession #: NP_001831.1 
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1 11 21 31 41 51 

1 I I I I I 

MKSILDGLAD TTFRTITTDIj LYVGSNDIQY EDIKGDMASK LGYFPQKFPL TSFRGSPFQE 60 

KMTAGDNPQL VPADQVNITE FYNKSLSSFK ENEENIQCGE NFMDIECFMV LNPSQQLAIA 120 

VLSIiTLGTFT VLENLLVLCV ILHSRSLRCR PSYHFIGSLA VADLLGSVIF VYSFIDFHVF 1BO 

HRKDSRNVFL FKU3GVTASF TASVGSLFLT AIDRYISIHR PLAYKRIVTR PKAWAFCLM 240 

WTIA1VIAVL PLLGWNCEKL QSVCSDIFPH IDETYLMFWI GVTSVLLLFI VYAYMYILWK 300 

AHSHAVRMIQ RGTQKSIIIH TSEDGKVQVT RPDQARMDIR LAKTLVLILV VLIICWGPLL 360 

AIMVYDVFGK MNKLIKTVFA FCSMLCLLNS TVNPIIYALR SKDLRHAFRS MFPSCEGTAQ 420 
PLDNSMGDSD CLHKHANNAA SVHRAAESCI KSTVKIAKVT MSVSTDTSAE AL 



Seq ID NO: 237 DNA sequence 
Nucleic Acid Accession #; NM_016083. 
15 Coding sequence: 64.. 1482 



1 11 21 31 41 SI 

I I I I I I 

GATTGCCCCC TGTGGGTCAC TTTCTCAGTC ATTTTGAGCT CAGCCTAATC AAAGACTGAG 60 

GTTATGAAGT CGATCCTAGA TGGCCTTGCA GATACCACCT TCCGCACCAT CACCACTGAC 120 

CTCCTGTACG TGGGCTCAAA TGACATTCAG TAOGAAGACA TCAAAGGTGA CATGGCATCC 180 

AAATTAGGGT ACTTCCCACA GAAATTCCCT TTAACTTCCT TTAGGGGAAG TCCCTTCCAA 240 

GAGAAGATGA CTGCGGGAGA CAACCCCCAG CTAGTCCCAG CAGACCAGGT GAACATTACA 300 

GAATTTTACA ACAAGTCTCT CTCGTCCTTC AAGGAGAATG AGGAGAACAT CCAGTGTGGG 360 

GAGAACTTCA TGGACATAGA GTGTTTCATG GTCCTGAACC CCAGCCAGCA GCTGGCCATT 420 

GCAGTCCTGT CCCTCACGCT GGGCACCTTC ACGGTCCTGG AGAACCTCCT GGTGCTGTGC 480 

GTCATCCTCC ACTCCCGCAG CCTCGGCTGC AGGCCTTCCT ACCACTTCAT CGGCAGCCTG 540 

GCGGTGGCAG ACCTCCTGGG GAGTGTCATT TTTGTCTACA GCTTCATTGA CTTCCACGTG 600 

TTCCACCGCA AAGATAGCCG CAACGTGTTT CTGTTCAAAC TGGGTGGGGT CACGGCCTCC 660 

CTCACTGCCT CCGTGGGCAG CCTGTTCCTC ACAGCCATCG ACAGGTACGT ATCCATTCAC 720 

AGGCCCCTGG CCTATAAGAG GATTGTCACC AGGCCCAAGG CCGTGGTGGC GTTTTGCCTG 780 

ATGTGGACCA TAGCCATTGC GATCGCCGTG CTGCCTCTCC TGGGCTGGAA CTGCGAGAAA 840 

CTGCAATCTG TTTGCTCAGA CATTTTCCCA CACATTGATG AAACCTACCT GATGTTCTGG 900 

ATCGGGGTCA CCAGCGTACT GCTTCTGTTC ATCGTGTATG CGTACATGTA TATTCTCTGG 960 

AAGGCTCACA GCCACGCCGT CCGCATGATT CAGCGTGGCA CCCAGAAGAG CATCATCATC 1020 

CACACGTCTG AGGATGGGAA GGTACAGGTG ACCCGGCCAG ACCAAGCCCG CATGGACATT 1080 

AGGTTAGCCA AGACCCTGGT CCTGATCCTG GTGGTGTTGA TCATCTGCTG GGGCCCTTTG 1140 

CTTGCAATCA TGGTGTATGA TGTCTTTGGG AAGATGAACA AGCTCATTAA GACX3GTGTTT 1200 

GCATTCTGCA GTATGCTCTG CCTGCTGAAC TCCACCGTGA ACCCCATCAT CTATGCTCTG 1260 

AGGAGTAAGG ACCTGCGACA CGCTTTCCGG AGCATGTTTC CCTCTTGTGA AGGCACTGCG 1320 

CAGCCTCTGG ATAACAGCAT GGGGGACTCG GACTGCCTGC ACAAACACGC AAACAATGCA 1380 

GCCAGTGTTC ACAGGGCCGC AGAAAGCTGC ATCAAGAGCA CGGTCAAGAT TGCCAAGGTA 1440 

ACCATGTCTG TGTCCACAGA CACGTCTGCC GAGGCTCTGT GAGOCTGATG CCTCCCTGGC 1500 

AGCACAGGAA AAGAATTTTT TTTTTTAAGC TCAAAATCTA GAAGAGTCTA TTGTCTCCTT 1560 

GGTTATATTT TTTTAACTTT ACCATGCTCA ATGAAAAGGT GATTGTCACC ATGATCACTT 1620 

ATCAGTTTGC TAATGTTTCC ATAGTTTAGG TACTCAAACT CCATTCTCCA GGGGTTTACA 16B0 

GTGAAGAAAG CCTGTTGTTT AAGTGACTGA ACGATCCTTC AAAGTCTCAA TGAAATAGGA 1740 

GGGAAACCTT TGGCTACACA ATTGGAAGTC TAAGAACCCA TGGAAAAATG CCATCAAATG 1800 

AATAATGCCT TTGTAACCAC AACTTTCACT ATAATGTGAA ATGTAACTGT CCGTAGTATC 1860 

AGAGATGTCC ATTTTTACAA GTTATAGTAC TAGAGATATT TTGTAAAATG TATTATGTCC 1920 

TGTGAGATGT GTATCAGTGT TTATGTGCTA TTAATATTTG TTTAGTTCAG CAAAACTGAA 1980 

AGGTAGACTT TTATGAGAAC AATGGACAAG CAGTGGATAC GTGTCAATGT GTGCACTTTT 2040 

TTTCTATATT ATTGCCCATG ATATAACTTT AGAAATAAAC CTTAATATTT CTTCAAATAT 2100 

CTCTATTTAA TTTTGACACT GAAATAACCG TAAAGGTTTA TTTTTCTGTT ACCTCAACAA 2160 

GAAGAATTTG AAGACTTCAA AATATTGAGC AGAATTCATT CATACTTAAA AATTTATTAG 2220 

CCCTGCATTT TCATAGGAAG ACACATTATC TTCTGGACTA TAGCTGTTCT AATGGATTAT 2280 

AATCAGAATG GAAGAGAGAA AGCATATTGA CTTTTTTTGA GCGACATCTC TGACTTTCTT 2340 

TAGTCTTTAG CTATTACTGG ATCTCTTAAG ACAGCATGTG TTAATCTTAA TGTATATCGT 2400 

TATCACTGTG CAGTTGCTGT TTACTTGAAT AGTATTGTGT TCCTATATTC CAGGTTTAAG 2460 

TAGATTTCAT GCCTGGGTGG CCAAACAACA GTCTTCATTT TTTTTAATTG AAAAGAAGTA 2520 

GTGTCTGGAT CAGTAAAATT ATACTGTGTG TGAGTGTGAA TATAAATGTG TGTATGTGTG 2580 

TTTCTGTCCG TAACTGTTAC AGTAATGTCA TAAAGTGAGA AAACTGTGAC CAAGTATAAA 2640 

CTTTTACCAC TTGCTGCACT CTTGCACATG GATTCAGTTT CTAAAATTGA GTTCTTCCTG 2700 

TAATCTTGTT GATAAAAATA CTGACTCCAA CCATTCAAAA ATTTCACCCC ATCCCTCCTT 2760 

AAGAGATTGG ATCAAGTATT ACTAAATTGA CCTTTAGGTA TTACACAAGA CCAGTGCTTA 2820 

GCAAAAAATA ATGACAGGCA TCCAAGGAAG GGATGTATTT GTAGTGTTAT TGCCAGGAAA 2880 

GGAGAGTACT TTGGTTTCTG AGCACCGAAT ATTGAGCAAT ATGTCAGTCA CTAAAAGGAA 2940 

GACAGTTCTA CAGAAAAACA AATGGTAACA TTTTTCAATA GCGTGTGTAG ATAGTATGCA 3000 

CTATATACAT CACGTTAAAG TAGGACTATC ACACCCAGCC CATGTGGCTA AAAAAGCTGA 3060 

ATCAGACAGT GGATGAGACA CACAACGGCA GTGAAGAACC GATACACTTG GCATTGACGT 3120 

CTAGCTATGC TGTATCTGTG CTTTGCCCAC ATGCCCTTGG TGACAGCTGA GCACCCAGCT 3180 

CTGTCTTGGT AGGTTTGGGC TAAGGAACAA ATCTCTCCTT TGCTCGTGGT TAGCAAGATA 3240 

CACTCAAGCA TGAAGATAAA CACAGCTGCT TTCTTCTTAC ACCCCGGTCT CATGCTCCTT 3300 

AATGGCGCCA TGGGTGCTTG TTGGGCCTTT TTCCAGTAAG GAATGATATT GCTGAAGAAT 3360 

CTACTTAACC CTGACAAATT TTAATTATAA TCTCTTCTTA TACAGATAAA ACATGACTCC 3420 

TACAAGGCCC CAAGGTTTAC ATAGTCTGAA GTGAAGTACA GAGCTGGCAT CTATCTGGTG 3480 

ATTTCTAGCT CTCGAGATAC CCAAGCAGCC TGATGGGGCA GTTCCCCTTC TTACGGTTCA 3540 

CGCTCTAAGG CAGGATGTGG CTTATGAGAT ACTTTGCATT GTCTGTCTGC ACACCTTGAA 3600 

TCTGCCTGCT GGCTCCCTTA CTTTACCTCT CTGTCATGTG CAGATGAAGG CTCAGGGTGC 3660 

TAGAGGATTA GTAAGATCTC TTTCTAAAGA CAGGAGAGAT TATTTACAAG AAGAACTCAC 3720 

CAGGGTTTAG TTTGCATTTA AGAATTGCCA GTCTTTTGTC CTGCATCATC TTGAACATTA 3780 

ATCCACATGT TTCAGAGCTC ACCAGGCAGT ACCAATGCTC TTTTCACAGC TATGAAGAGC 3840 

TAGAGAAATT CTTGTTATGG TAGAAAAATT TCACGGTTCA TTTTTGAAAC TGCATTTGTG 3900 

CGTATGCAGT GTAGATTTTA TAGTGTGTTG TGCTTTCAAG ATCTAAATCA TATATAATAA 3960 
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ATTAAGGGAC AATGGGGCTG ACAGCACTAA ACTTGGTGCT TATTGATATT CTAAGAAATA 4020 

TCTGTGAAAT ATCATCACGT ATGTTATACA ACCTTCATTT AAAAAGGTTT AAAACTAGTT 4090 

AGATTCACTT TGACACTTTT CATATCATTT CTTAACCCAA GTGACGAAAA CATTGTCCCC 4140 

AATGAATATA CTCATTAGAA TTACCATTTG TTAATATCAC TCATTAATTA ACCCCATAAT 4200 

5 TAGATCCATT AATTTAAATG ATTTAAATTT AAGTAAGTTT TATAAGGTCT GACATCAGAG 4260 

GTATCTTACT TTCCTCTGAG GATGATGTAC TTGCCCTGAC CATGCATTTT ACCATCACAC 4320 

ATGTTCAGAA AGGGCCAAAT TCCCAACCTG CTCATTTTTT TTTTTATCAG AGTCATGATG 4380 

AATCAGTCCT AGAATGTTTC ATTTGCACAA GTAGGGCTGC CTCCAAGAGG AACCTCTGAT 4440 

TTATTTTGTA TGAAATATAT GTGAAAGGAT ATGAATCTGA GAGATGCTGT AGACATCTGT 4S00 

10 CCTACACTTG AGATGATTTC CAAGCCTCTC TGGCACTTTG AGTTAAGTCT ATCTGGTATT 45 60 

AAATGCCAAG GACCTTTTGC TGCCTAAATC CACTCTGCAG GAAATAGGCC CAACCACCAG 4620 

ATGAGAATTA GGCCCTGGAT GAGTAGCGCT ATAGTTACTG TCCTGTTGAT TAATTTCTGC 4680 

CATTTCATGT CCATAAAAGA GACCACCCAT ATCATGCACA CAATTAGATT TCTCACACTC 4740 

TAACTGTATA TTTGTATGAT ATTTTAAAAT CTCCTAAATG CTGGGCAATG GCTATTAACA 4800 

15 ATTAATTGTC TTGCACTGGC CTTCTGATGA AATGTTAACA ATGCCTATTG TAATATAGAA 4860 

AAAAACATTC TATCTACTGA TTTGGGCTGA ATGTATGTAA ATAGGTTTCT AAAAAGTCAG 4920 

ATGTTTGAGC AGTGGCCTAC AAATCAGTAA TTTTCGGGTG GGAGAGTTTC TTTACATTGC 4980 

CGTGGCATCT TAAAAGCTAT CTTCATGTAA ATTGACTGTA CTAGGCCTAC TGGGGATCAG 5040 

AGTTCCCAAG AAAGGAAACC TTTTCTTGTA TCTGGATTCA AATTTATTTC CAATGTTTCA 5100 

20 AGCGGGAAAC ATGACTCTTT ATTGTCTGTA AATCTAACAT TATTACTTTT CCTCTTAGAA 5160 

GAATATTGTA TTGTTAGATG TTTGTTGAGC TGGTAACATC GTTGCAACCA CTGCAATATC 5220 

TTCGTTAGTA ATCTGTATAA TACTTTGTAT ACAAGTACTG GTAAGATTGT TATTAAATGT 5280 

AGCTTCAGTC ATTAAATTAC TATAGCAAAG TAGTACTTCT TCTGTAATAT TTACAATGTA 5340 

TTAAGCCCAC AGTATATTTT ATTTCAATGT AATTAAACTG TTAACTTATT CAAAGAGAAA 5400 

25 ACATCTCATC ATGTCTATTG TCCAAAGTTA CCTGGAATCA AATAAAAATT CTAGATTACC 5460 
ATGAAGAACA TA 

Seq ID NO: 238 Protein sequence 
Protein Accession #i NP 057167.1 

30 

1 11 21 31 41 51 

I I I I I I 

MKSILDGLAD TTFRTITTDL LYVGSNDIQY EDIKGDMASK LGYPPQKFPIi TSFRGSPFQK 60 

KMTAGDNPQL VPADQVN1TE PYNKSLSSFK ENEENIQCGE NPMDIECFMV LNPSQQLAIA 120 

35 VLSLTLGTFT VL.ENLLVLCV ILHSRSLRCR PSYHPIGSIiA VADLLGSVIP VYSFIDFHVF 180 

HRKDSRNVFL FKLGGVTASL TASVGSLFLT AIDRYVSIHR PLAYKRIVTR PKAWAFCLM 240 

WTIAIAIAVL PLLGWNCEKL QSVCSDIFPH IDETYLMFWI GVTSVLLLFI VYAYMYILWK 300 

AHSHAVRMIQ RGTQK3IIIH TSEDGKVQVT RPDQARMDIR LAKTLVLILV VLIICWGPLL 360 

AIMVYDVFGK MNKLIKTVFA FCSMLCLLNS TVNPIIYALR SKDLRHAFRS MFPSCEGTAQ 420 

40 PLDNSMGDSD CLHKHANNAA SVHRAABSCI KSTVKIAKVT MSVSTDTSAE AL 

Seq ID NO: 239 DNA sequence 
Nucleic Acid Accession ft: NM_03318l.l 
. _ Coding sequence: 17.. 1252 

45 

1 11 21 31 41 51 

I I I I I I 

ATGAAGTGGA TCCTAGATGG CCTTGCAGAT ACCACCTTCC GCACCATCAC CACTGACCTC 60 

CTGTACGTGG GCTCAAATGA CATTCAGTAC GAAGACATCA AAGGAGAATG AGGAGAACAT 120 

50 CCAGTGTGGG GAGAACTTCA TGGACATAGA GTGTTTCATG GTCCTGAACC CCAGCCAGCA 180 

GCTGGCCATT GCAGTCCTGT CCCTCACGCT GGGCACCTTC AOGGTCCTGG AGAACCTCCT 240 

GGTGCTGTGC GTCATCCTCC ACTCCCGCAG CCTCCGCTGC AGGCCTTCCT ACCACTTCAT 300 

CGGCAGCCTG GCGGTGGCAG ACCTCCTGGG GAGTGTCATT TTTGTCTACA GCTTCATTGA 360 

CTTCCAOGTG TTCCACCGCA AAGATAGCCG CAACGTGTTT CTGTTCAAAC TGGGTGGGGT 420 

55 CACGGCCTCC TTCACTGCCT CCGTGGGCAG CCTGTTCCTC ACAGCCATCG ACAGGTACAT 480 

ATCCATTCAC AGGCCCCTGG CCTATAAGAG GATTGTCACC AGGCCCAAGG CCGTGGTGGC 540 

GTTTTGCCTG ATGTGGACCA TAGCCATTGT GATCGCCGTG CTGCCTCTCC TGGGCTGGAA 600 

CTGCGAGAAA CTGCAATCTG TTTGCTCAGA CATTTTCCCA CACATTGATG AAACCTACCT 660 

GATGTTCTGG ATCGGGGTCA CCAGCGTACT GCTTCTGTTC ATCGTGTATG CGTACATGTA 720 

60 TATTCTCTGG AAGGCTCACA GCCACGCCGT CCGCATGATT CAGOGTGGCA CCCAGAAGAG 780 

CATCATCATC CACACGTCTG AGGATGGGAA GGTACAGGTG ACCCGGCCAG ACCAAGCCCG 840 

CATGGACATT AGGTTAGCCA AGACCCTGGT CCTGATCCTG GTGGTGTTGA TCATCTGCTG 900 

GGGCCCTCTG CTTGCAATCA TGGTGTATGA TGTCTTTGGG AAGATGAACA AGCTCATTAA 960 

GACGGTGTTT GCATTCTGCA GTATGCTCTG CCTGCTGAAC TCCACCGTGA ACCCCATCAT 1020 

65 CTATGCTCTG AGGAGTAAGG ACCTGCGACA CGCTTTCCGG AGCATGTTTC CCTCTTGTGA 1080 

AGGCACTGCG CAGCCTCTGG ATAACAGCAT GGGGGACTCG GACTGCCTGC ACAAACACGC 1140 

AAACAATGCA GCCAGTGTTC ACAGGGCCGC AGAAAGCTGC ATCAAGAGCA CGGTCAAGAT 1200 
TGCCAAGGTA ACCATGTCTG TGTCCACAGA CACOTCTGCC GAGGCTCTGT GA 

70 Seq ID NO: 240 Protein sequence 
Protein Accession <h NP_149421.1 

1 11 21 31 41 51 

nc' \ I I I I I 

/ J MALQIPPSAP SPLTSCTWAQ MTFSTKTSKE NEENIQCGEN FMDIECFMVL NPSQQLAIAV 60 
LSLTLGTFTV LENLLVLCVI LHSRSLRCRP SYHFIGSLAV ADLLGSVIFV YSFIDFHVFH 120 
RKDSRNVFLF KLGGVTASFT ASVGSLFLTA IDRYISIHRP LAYKRIVTRP KAWAFCLMW 180 
TIAIVIAVLP LLGWNCEKLQ SVCSDIFPHI DETYLMFWIG VTSVLLLFIV YAYMYILWKA 240 
OA HSHAVRMIQR GTQKSIIIHT SEDGKVQVTR PDQARMDIRL AKTLVLILW LIICWGPLLA 300 
80 IMVYDVFGKM NKLIKTVFAP CSMLCLLNST VNPIIYALRS KDLRHAFRSM PPSCEGTAQP 360 
LDNSMGDSDC LHKHANNAAS VHRAAESCIK STVKIAKVTM SVSTDTSAEA L 

Seq ID NO: 241 DNA sequence 

Nucleic Acid Accession #i NM_003 596.1 



1044 



WO 03/042661 



Coding sequence: 82 . . 1194 



1 11 21 31 41 51 

I I I I I I 

GTAGACTGTC CATGGCCTGA ACATTTTCCG AAAATCATTT TGAGCAAAAT ATCTGTTTAA 60 

TAACAAGATA ACCACATCAA GATGGTTGGA AAGCTGAAGC AGAACTTACT ATTGGCATGT 120 

CTGGTGATTA GTTCTGTGAC TGTGTTTTAC CTGGGCCAGC ATGCCATGGA ATGCCATCAC 180 

CGGATAGAGG AACGTAGCCA GCCAGTCAAA TTGGAGAGCA CAAGGACCAC TGTGAGAACT 240 

GGCCTGGACC TCAAAGCCAA CAAAACCTTT GCCTATCACA AAGATATGCC TTTAATATTT 300 

ATTGGRGGTG TGCCTCGGAG TGGAACCACA CTCATGAGGG CCATGCTGGA CGCACATCCT 360 

GACATTCGCT GTGGAGAGGA AACCAGGGTC ATTCCCCGAA TCCTGGCCCT GAAGCAGATG 420 

TGGTCACGGT CAAGTAAAGA GAAGATCCGC CTGGATGAGG CTGGTGTTAC TGATGAAGTG 480 

CTGGATTCTG CCATGCAAGC CTTCTTACTA GAAATTATCG TTAAGCATGG GGAGCCAGCC 540 

CCTTATTTAT GTAATAAAGA TCCTTTTGCC CTGAAATCTT TAACTTACCT TTCTAGGTTA 600 

TTCCCCAATG CCAAATTTCT CCTGATGGTC CGAGATGGCC GGGCATCAGT ACATTCAATG 660 

ATTTCTCGAA AAGTTACTAT AGCTGGATTT GATCTGAACA GCTATAGGGA CTGTTTGACA 720 

AAGTGGAATC GTGCTATAGA GACCATGTAT AACCAGTGTA TGGAGGTTGG TTATAAAAAG 780 

TGCATGTTGG TTCACTATGA ACAACTTGTC TTACATCCTG AACGGTGGAT GAGAACACTC 840 

TTAAAGTTCC TCCAGATTCC ATGGAACCAC TCAGTATTGC ACCATGAAGA GATGATTGGG 900 

AAAGCTGGGG GAGTGTCTCT GTCAAAAGTG GAGAGATCTA CAGACCAAGT AATCAAGCCA 960 

GTCAATGTAG GAGCTCTATC AAAATGGGTT GGGAAGATAC CGCCAGATGT TTTACAAGAC 1020 

ATGGCAGTGA TTGCTCCTAT GCTTGCCAAG CTTGGATATG ACCCATATGC CAACCCACCT 1080 

AACTACGGAA AACCTGATCC CAAAATTATT GAAAACACTC GAAGGGTCTA TAAGGGAGAA 1140 

TTCCAACTAC CTGACTTTCT TAAAGAAAAA CCACAGACTG AGCAAGTGGA GTAGCAGAAC 1200 

CAGGAGCCTC TTCCATACAT GAGGAAAGAT TGCTGCCTTT TCAGCAGAAG GGAAATTCCT 1260 

AGGATTGGCT GTCC CCTGCC AAGCTTGGTG GAGCGTCTGC ACCTTGGCTG CGCCGCCTGT 1320 

GCATTTGCCA GTTTCCTCCC ACTGAGAGGA TGGAGGTGTC CGCACAGCTT TGGGCCTCGT 1380 

GAGGGATCTG CCTCCTGAGC AAAGAGCTCT TGATCCCGAT TTCATGCACA GCCCTGCAGT 1440 

AAGGAGCCCA GAAGGAACAT GTGTTTCCTG TTAAAACTCC TCTTGTTCTC TTTTCTTACA 1500 

TTATGA CGTT TGTTTTCAAG GAGAGGGTTT AAAAATGGGA TCCTGTAAGC AGACTTGGGC 1560 

AGTCTCCTTT TGAAATAGGT TGTCTGTACA TGTTCTAATG TTTTGTAGAA CACGTGTGCC 1620 

TGTTTAAGTG TATTGATGTG AATAATATTA AATATCCTAA TTATTTAATT CATTGTATTG 1680 

TTTCT GAGAA GTTGGGAAAT TACCATTATA CATTTACAAC CTAATGACTT TTGTATTTTA 1740 
TTTTTCAAAA TAAAAGCTTT CAATGTGA 



Seq ID NO: 242 Protein sequence 
Protein Accession #: NP_003587.1 

1 11 21 31 41 51 

I I I I I I 

MVGKLKQNLL LACLVISSVT VPYLGQHAME CHHRIEERSQ PVKLESTRTT VRTGLDLKAN 60 
KTFAYHKDMP LIPIGGVPRS GTTLMRAMLD AHPDIRCGEE TRVIPRILAL KQMWSRSSKE 120 
KIRLDEAGVT DEVLDSAMQA FLLEIIVKHG EPAPYLCNKD PFALKSLTYL SRLFPNAKFL 180 
LMVRDGRASV HSMISRKVTI AGFDLNSYRD CLTKWNRAIE TMYNQCMEVG YKKCMLVHYE 240 
QLVLHPERWM RTLLKFLQIP WNHSVLHHEE MIGKAGGVSL SKVERSTDQV IKPVNVGALS 300 
KWVGKIPPDV LQDMAVIAPM LAKLGYDPYA NPPNYGKPDP KIIENTRRVY KGEFQLPDFL 360 
KEKPQTEQVE 



Seq ID NO: 243 DNA sequence 

Nucleic Acid Accession #: NM_001492.3 

Coding sequence: 1395.. 2513 



1 11 21 

I I I 

AOGCGGGGCG CGCGGCTCCG TCGGCTACCG 
CGAGCGGGCG GTATGGCGGC GGCGGGGCCC 
CCGAGCTACG CGCAGCTAGT GCAGCGCGGC 
TGCACGGACT GCGGCTGGGG GCTGGCGCGT 
CCGCCCGAGC TGCTGCTGCT GGCGCTCGGC 
GCCACTGCGC GCCTCTTTCG GCCCCTGGCG 
GCCAAGATGC CCGAGAGCGC TTGGAAGTTT 
GCCTACCTGC TGTTTGGCAC CGACTACCCC 
GACTGGACGC CGGGCATGGC AGTGCCACGG 
AGCTTCTATG GCCACTCCAT CTACGCTACG 
GTGGTCATGC TCCTCCACCA CGTGGTCACT 
CGGTACCACA ATGTGGGCAT CCTTGTGCTC 
GAGTTCACCA AGCTCAACAT TTACTTCAAG 
GCCTTGGCAG CAGACTTGGG CTGCCTCAGC 
TACTGGTTCC CGCTCAAGGT CCTGTATGCC 
GACATCCCCT TCTACTTCTT CTTCAATGCG 
TACTGGTTCC TGTACATCGT GGCGTTTGCA 
CTGAAGGACC TGCGGGAGTA TGACACAGCC 
GAGAAGCCAC TGAGGAACGG CCTGGTGAAG 
CCCCGTGGAC CCGGCCCCAC CCCGAATACC 
TCCACCCCCT CCAACTCTGC TCCTCTAGGG 
CTCATCCTGC CTCCATTTCC CGGCCACGCC 
GGCCCCGCCC TCAGCCCACT GGTCCCGGGC 
CGCCTGGGAG GAAGATGCCA CCGCCGCAGC 
TCCTGGCCCT GCTGCTGCCC TCGCTGCCCC 
CCGCCGCCCT GCTCCAGGCT CTAGGACTGC 
GGCCGGTTCC CCCGGTCATG TGGCGCCTGT 
CTGGCTCGCG GCGGACGTCC CCAGGGGTCA 
GGGTCGCCGG AAACATCGTG CGCCACATCC 
AGCCTGTCTC GGCCGCGGGG CATTGCCCTG 



31 41 51 

I I I 

CGGGCGGGCG CAGGCGACGG GCACGGCGGG 60 

GCGGCGGGGC CGACGGGGCC CGAGCCCATG 120 

TGGGGCAGCG CGCTGGCGGC GGCGCGGGGC 180 

CGCGGCCTGG CTGAGCACGC GCACCTGGCG 240 

GCGCTGGGCT GGACCGCGCT GCGCTCCGCG 300 

AAGCGGTGCT GCCTCCAGCC CAGAGATGCC 360 

CTCTTCTACC TGGGCAGCTG GAGCTACAGT 420 

TTCTTCCATG ACCCACCATC TGTCTTCTAC 480 

GACATTGCAG CCGCCTACCT GCTCCAGGGA 540 

CTATACATGG ACACCTGGCG CAAGGACTCG 600 

CTCATCCTCA TCGTCTCCTC CTACGCCTTC 660 

TTCCTGCACG ATATCAGTGA CGTGCAGCTT 720 

TCCCGCGGCG GCTCCTACCA TCGGCTGCAT 7 B0 

TTCGGCTTCA GCTGGTTCTG GTTCCGCCTC 840 

ACCAGTCACT GCAGTCTGCG CACGGTGCCT 900 

CTCCTGCTGC TGCTCACCCT TATGAACCTC 960 

GCCAAGGTGT TGACAGGCCA GGTGCACGAG 1020 

GAGGCCCAGA GCCTGAAGCC CAGCAAAGCC 1080 

GACAAGCGCT TCTGAACCCC TCGGCCCCGC 1140 

CCGGCCACGC TCCCCGTCCT TGGCCGCCCC 1200 

CCGCCGCCAC CTCCCCTGGG ACCCCGCCCC 1260 

CCCCAGGACC CCTGCCCCTC CGGGGACACC 1320 

CGCCGCGGAC CCTGCGCACT CTCTGGTCAT 1380 

AAGGTCCCTG CGGCCACCAC CTCCTCCTCC 1440 

TGACCCGCGC CCCCGTGCCC CCAGGCCCAG 1500 

GCGATGAGCC CCAGGGTGCC CCCAGGCTCC 1560 

TTCGACGCCG GGACCCCCAG GAGACCAGGT 1620 

CCCTGCAACC GTGCCACGTG GAGGAGCTGG 1680 

CGGACCGCGG TGCGCCCACC CGGGCCTCGG 1740 

AGTGGACAGT OGTCTTCGAC CTGTCGGCTG 1800 



1045 



WO 03/042661 



PCT/US02/36810 



TGGAACCCGC TGAGCGCCCG AGCCGGGCCC GCCTGGAGCT GCGTTTCGCG GCGGCGGCGG 1860 

CGGCAGCCCC GGAGGGCGGC TGGGAGCTGA GCGTGGCGCA AGCGGGCCAG GGCGCGGGCG 1920 

CGGACCCCGG GCCGGTGCTG CTCCGCCAGT TGGTGCCCGC CCTGGGGCCG CCAGTGCGCG 1980 

CGGAGCTGCT GGGCGCCGCT TGGGCTCGCA ACGCCTCATG GCCGCGCAGC CTCCGCCTGG 2040 

5 CGCTGGCGCT ACGCCCCCGG GCCCCTGCCG CCTGCGCGCG CCTGGCCGAG GCCTCGCTGC 2100 

TGCTGGTGAC CCTCGACCCG CGCCTGTGCC ACCCCCTGGC CCGGCCGCGG CGCGACGCCG 2160 

AACCCGTGTT GGGCGGCGGC CCCGGGGGCG CTTGTCGCGC GCGGCGGCTG TACGTGAGCT 2220 

TCCGCGAGGT GGGCTGGCAC CGCTGGGTCA TCGCGCCGCG CGGCTTCCTG GCCAACTACT 2280 

GCCAGGGTCA GTGCGCGCTG CCCGTCGCGC TGTCGGGGTC CGGGGGGCCG CCGGCGCTCA 2340 

10 ACCACGCTGT GCTGCGCGCG CTCATGCACG CGGCCGCCCC GGGAGCCGCC GACCTGCCCT 2400 

GCTGCGTGCC CGCGCGCCTG TCGCCCATCT CCGTGCTCTT CTTTGACAAC AGCGACAACG 2460 

TGGTGCTGCG GCAGTATOAG GACATGGTGG TGGACGAGTG CGGCTGCCGC TAACCCGGGG 2520 
CGGGCAGGGA CGCGGGCCCA ACAATAAATG CCGOGTGGTC TGCTC 

15 Seq ID NO: 244 Protein sequence 
Protein Accession #: NP_001483.2 

1 11 21 31 41 51 

OA 1 I I I I I 

Z\J MPPPQQGPCG HHLLLLLALL LPSLPLTRAP VPPGPAAALI* QALGLRDEPQ GAPRLRPVPP 60 

VMWRLFRRRD PQETRSGSRR TSPGVTLQPC HVEELGVAGN IVRHIPDRGA PTRASEPVSA 120 

AGHCPEWTW FDLSAVEPAE RPSRARLELR FAAAAAAAPE GGWELSVAQA GQGAGADPGP 180 

VLIjRQLVPAL GPPVRAELLG AAWARNASWP RSLRLALALR PRAPAACARL AEASLLLVTL 240 

DPRLCHPLAR PRRDAEPVLG GGPGGACRAR RLYVSFREVG WHRWVIAPRG FLANYCQGQC 300 

25 ALPVALSGSG GPPALNHAVL RALMHAAAPG AADLPCCVPA RLSPISVLFF DNSDNWLRQ 360 
YEDMWDECG CR 



Seq ID NO: 245 DNA sequence 
Nucleic Acid Accession #: NM_021267,1 
30 Coding sequence: 17.. 1125 

1 11 21 31 41 51 

I I I I I I 

ACGCGGGGCG CGCGGCTCCG TCGGCTACCG CGGGCGGGCG CAGGCGACGG GCACGGCGGG 60 

35 CGAGCGGGCO GTATGGCGGC GGCGGGGCCC GCGGCGGGGC CGACGGGGCC CGAGCCCATG 120 

CCGAGCTACG CGCAGCTAGT GCAGCGCGGC TGGGGCAGCG CGCTGGCGGC GGCGCGGGGC 180 

TGCACGGACT GCGGCTGGGG GCTGGCGCGT CGCGGCCTGG CTGAGCACGC GCACCTGGCG 240 

CCGCCCGAGC TGCTGCTGCT GGCGCTCGGC GCGCTGGGCT GGACCGCGCT GCGCTCCGCG 300 

GCCACTGCGC GCCTCTTTCG GCCCCTGGCG AAGCGGTGCT GCCTCCAGCC CAGAGATGCC 360 

40 GCCAAGATGC CCGAGAGCGC TTGGAAGTTT CTCTTCTACC TGGGCAGCTG GAGCTACAGT 420 

GCCTACCTGC TGTTTGGCAC CGACTACCCC TTCTTCCATG ACCCACCATC TGTCTTCTAC 480 

GACTGGACGC CGGGCATGGC AGTGCCACGG GACATTGCAG CCGCCTACCT GCTCCAGGGA 540 

AGCTTCTATG GCCACTCCAT CTACGCTACG CTATACATGG ACACCTGGCG CAAGGACTOG 600 

GTGGTCATGC TGCTCCACCA CGTGGTCACT CTCATCCTCA TCGTCTOCTC CTACGCCTTC 660 

45 CGGTACCACA ATGTGGGCAT CCTTGTGCTC TTCCTGCACG ATATCAGTGA CGTGCAGCTT 720 

GAGTTCACCA AGCTCAACAT TTACTTCAAG TCCCGCGGCG GCTCCTACCA TCGGCTGCAT 780 

GCCTTGGCAG CAGACTTGGG CTGCCTCAGC TTCGGCTTCA GCTGGTTCTG GTTCCGCCTC 840 

TACTGGTTCC CGCTCAAGGT CCTGTATGCC ACCAGTCACT GCAGTCTGCG CACGGTGCCT 900 

GACATCCCCT TCTACTTCTT CTTCAATGCG CTCCTGCTGC TGCTCACCCT TATGAACCTC 960 

50 TACTGGTTCC TGTACATCGT GGCGTTTGCA GCCAAGGTGT TGACAGGCCA GGTGCACGAG 1020 

CTGAAGGACC TGCGGGAGTA TGACACAGCC GAGGCCCAGA GCCTGAAGCC CAGCAAAGCC 1080 

GAGAAGCCAC TGAGGAACGG CCTGGTGAAG GACAAGCGCT TCTGAACCCC TOGGCCCCGC 1140 

CCCCGTGGAC CCGGCCCCAC CCOGAATACC CCGGCCACGC TCCCCGTCCT TGGCCGCCCC 1200 

TCCACCCCCT CCAACTCTGC TCCTCTAGGG CCGCCGCCAC CTCCCCTGGG ACCCCGCCCC 1260 

55 CTCATCCTGC CTCCATTTCC OGGCCACGCC CCCCAGGACC CCTGCCCCTC CGGGGACACC 1320 

GGCCCCGCCC TCAGCCCACT GGTCCCGGGC CGCCGCGGAC CCTGCGCACT CTCTGGTCAT 1380 

CGCCTGGGAG GAAGATGCCA CCGCCGCAGC AAGGTCCCTG CGGCCACCAC CTCCTCCTCC 1440 

TCCTGGCCCT GCTGCTGCCC TCGCTGCCCC TGACCCGCGC CCCCGTGCCC CCAGGCCCAG 1500 

CCGCCGCCCT GCTCCAGGCT CTAGGACTGC GCGATGAGCC CCAGGGTGCC CCCAGGCTCC 1560 

60 GGCCGGTTCC CCCGGTCATG TGGCGCCTGT TTCGACGCCG GGACCCCCAG GAGACCAGGT 1620 

CTGGCTCGCG GCGGACGTCC CCAGGGGTCA CCCTGCAACC GTGCCACGTG GAGGAGCTGG 1680 

GGGTCGCCGG AAACATCGTG CGCCACATCC OGGACCGCGG TGCGCCCACC CGGGCCTCGG 1740 

AGCCTGTCTC GGCOGCGGGG CATTGCCCTG AGTGGACAGT CGTCTTCGAC CTGTCGGCTG 1800 

TGGAACCCGC TGAGCGCCCG AGCCGGGCCC GCCTGGAGCT GCGTTTCGCG GCGGCGGCGG 1860 

65 CGGCAGCCCC GGAGGGCGGC TGGGAGCTGA GCGTGGCGCA AGCGGGCCAG GGCGCGGGCG 1920 

CGGACCCCGG GCCGGTGCTG CTCCGCCAGT TGGTGCCCGC CCTGGGGCCG CCAGTGCGCG 1980 

CGGAGCTGCT GGGCGCCGCT TGGGCTCGCA ACGCCTCATG GCCGCGCAGC CTCCGCCTGG 2040 

CGCTGGCGCT ACGCCCCCGG GCCCCTGCCG CCTGCGCGCG CCTGGCCGAG GCCTCGCTGC 2100 

TGCTGGTGAC CCTCGACCCG CGCCTGTGCC ACCCCCTGGC CCGGCCGCGG CGCGACGCCG 2160 

70 AACCCGTGTT GGGCGGCGGC CCCGGGGGCG CTTGTCGCGC GCGGCGGCTG TACGTGAGCT 2220 

TCCGCGAGGT GGGCTGGCAC CGCTGGGTCA TCGCGCCGCG CGGCTTCCTG GCCAACTACT 2280 

GCCAGGGTCA GTGCGCGCTG CCCGTCGCGC TGTCGGGGTC CGGGGGGCCG CCGGCGCTCA 2340 

ACCACGCTGT GCTGCGCGCG CTCATGCACG CGGCCGCCCC GGGAGCCGCC GACCTGCCCT 2400 

GCTGCGTGCC CGCGCGCCTG TCGCCCATCT CCGTGCTCTT CTTTGACAAC AGCGACAACG 2460 
75 TGGTGCTGCG GCAGTATGAG GACATGGTGG TGGACGAGTG CGGCTGCCGC TAACCCGGGG 



Seq ID NO: 246 Protein sequence 
Protein Accession ft: NP_067090.1 

80 1 11 21 31 41 51 

I I I I I I 

MAAAGPAAGP TGPEPMPSYA OLVQRGWGSA LAAARGCTDC GWGLARRGLA EHAHLAPPEL 60 
LLLALGALGW TALRSAATAR LFRPLAKRCC LQPRDAAKMP ESAWKFLFYL GSWSYSAYLL 120 
FGTDYPFFHD PPSVFYDWTP GMAVPRDIAA AYUjQGSFYG HSIYATLYMD TWRKDSWML 180 
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LHHWTLILI VSSYAFRYHN VGILVLFLHD ISDVQLEFTK LNIYFKSRGG SYHRLHALAA 240 
DLGCLSFGFS WFWFRLYWFP LKVLYATSHC SLRTVPDIPF YFFFNALLLL LTLMNLYWFL 300 
YIVAFAAKVL TGQVHELKDL REYDTAEAQS LKPSKAEKPL RNGLVKDKRF 

Seq ID NO: 247 DKA sequence 

Nucleic Acid Accession NM_002081.1 

Coding sequence: 222.. 1898 

1 11 21 31 41 51 

I I I I 1 I 

GGCTGCCCGA GCGAGCGTTC GGACCTCGCA CCCCGCGCGC CCCGCGCCGC CGCCGCCGCC 
GGCTTTTGTT GTCTCCGCCT CCTCGGCCGC CGCCGCCTCT GGACCGCGAG CCCCGCGCGC 
CGGGACCTTG GCTCTGCCCT TCGCGGGCGG GAACTGCGCA GGACCCGGCC AGGATCCGAG 
AGAGGCGCGG GCGGGTGGCC GGGGGCGCCG CCGGCCCCGC CATGGAGCTC CGGGCCCGAG 
GCTGGTGGCT GCTATGTGCG GCCGCAGCGC TGGTCGCCTG CGCCCGCGGG GACCCGGOCA 
GCAAGAGCCG GAGCTGCGGC GAGGTCCGCC AGATCTACGG AGCCAAGGGC TTCAGCCTGA 
GCGACGTGCC CCAGGCGGAG ATCTCGGGTG AGCACCTGCG GATCTGTCCC CAGGGCTACA 
CCTGCTGCAC CAGCGAGATG GAGGAGAACC TGGCCAACCG CAGCCATGCC GAGCTGGAGA 
CCGCGCTCCG GGACAGCAGC OGCGTCCTGC AGGCCATGCT TGCCACCCAG CTGCGCAGCT 
TCGATGACCA CTTCCAGCAC CTGCTGAACG ACTCGGAGCG GACGCTGCAG GCCACCTTCC 
CCGGCGCCTT CGGAGAGCTG TACACGCAGA ACGCGAGGGC CTTCCGGGAC CTGTACTCAG 
AGCTGCGCCT GTACTACCGC GGTGCCAACC TGCACCTGGA GGAGACGCTG GCCGAGTTCT 
GGGCCCGCCT GCTCGAGCGC CTCTTCAAGC AGCTGCACCC CCAGCTGCTG CTGCCTGATG 
ACTACCTGGA CTGCCTGGGC AAGCAGGCCG AGGCGCTGCG GCCCTTCGGG GAGGCCCCGA 
GAGAGCTGCG CCTGCGGGCC ACCCGTGCCT TCGTGGCTGC TCGCTCCTTT GTGCAGGGCC 
TGGGCGTGGC CAGCGACGTG GTCCGGAAAG TGGCTCAGGT CCCCCTGGGC CCGGAGTGCT 
CGAGAGCTGT CATGAAGCTG GTCTACTGTG CTCACTGCCT GGGAGTCCCC GG CGCCAGGC 
CCTGCCCTGA CTATTGCCGA AATGTGCTCA AGGGCTGCCT TGCCAACCAG GCCGACCTGG 
ACGCCGAGTG GAGGAACCTC CTGGACTCCA TGGTGCTCAT CACCGACAAG TTCTGGGGTA 
CATCGGGTGT GGAGAGTGTC ATCGGCAGOG TGCACACGTG GCTGGCGGAG GCCATCAACG 
CCCTCCAGGA CAACAGGGAC ACGCTCACGG CCAAGGTCAT CCAGGGCTGC GGGAACCCCA 
AGGTCAACCC CCAGGGCCCT GGGCCTGAGG AGAAGCGGCG CCGGGGCAAG CTGGCCCOGC 
GGGAGAGGCC ACCTTCAGGC ACGCTGGAGA AGCTGGTCTC TGAAGCCAAG GCCCAGCTCC 
GCGACGTCCA GGACTTCTGG ATCAGCCTCC CAGGGACACT GTGCAGTGAG AAGATGGOCC 
TGAGCACTGC CAGTGATGAC CGCTGCTGGA ACGGGATGGC CAGAGGCCGG TACCTCCCCG 
AGGTCATGGG TGACGGCCTG GCCAACCAGA TCAACAACCC CGAGGTGGAG GTGGACATCA 
CCAAGCCGGA CATGACCATC CGGCAGCAGA TCATGCAGCT GAAGATCATG ACCAACCGGC 
TGCGCAGCGC CTACAACGGC AACGACGTGG ACTTCCAGGA CGCCAGTGAC GACGGCAGCG 
GCTCGGGCAG CGGTGATGGC TGTCTGGATG ACCTCTGCGG CCGGAAGGTC AGCAGGAAGA 
GCTCCAGCTC CCGGACGCCC TTGACCCATG CGCTCCCAGG CCTGTCAGAG CAGGAAGGAC 
AGAAGACCTC GGCTGCCAGC TGCCCCCAGC CCCCGACCTT CCTCCTGCCC CTCCTCCTCT 
TCCTGGCCCT TACAGTAGCC AGGCCCCGGT GGCGGTAACT GCCCCAAGGC CCCAGGGACA 
GAGGCCAAGG ACTGACTTTG CCAAAAATAC AACACAGACG ATATTTAATT CACCTCAGCC 
TGGAGAGGCC TGGGGTGGGA CAGGGAGGGC CGGCGGCTCT GAGCAGGGGC AGGCGCAGAG 
GTCCCAGCCC CAGGCCTGGC CTCGCCTGCC TTTCTGCCTT TTAATTTTGT ATGAGGTCCT 
CAGGTCAGCT GGGAGCCAGT GTGCCCAAAA GCCATGTATT TCAGGGACCT CAGGGGCACC 
TCCGGCTGCC TAGCCCTCCC CCCAGCTCCC TGCACCGCCG CAGAAGCAGC OCCTCGAGGC 
CTACAGAGGA GGCCTCAAAG CAACCCGCTG GAGCCCACAG CGAGCCTGTG CCTTCCTCCC 
CGCCTCCTCC CACTGGGACT CCCAGCAGAG CCCACCAGCC AGCCCTGGCC CACCCCCCAG 
CCTCCAGAGA AGCCCCGCAC GGGCTGTCTG GGTGTCCGCC ATCCAGGGTC TGGCAGAGCC 
TCTGAGATGA TGCATGATGC CCTCCCCTCA GCGCAGGCTG CAGAGCCCGG CCCCACCTCC 
CTGCGCCCTT GAGGGGCCCC AGCGTCTGCA GGGTGAOGCC TGAGACAGCA CCACTGCTGA 
GGAGTCTGAG GACTGTCCTC CCACAGACCC TGCAGTGAGG GGCCCTCCAT GCGCAGATGA 
GGGGCCACTG ACCCACCTGC GCTTCTGCTG GAGGAGGGGA AGCTGGGCCC AAAGGCCCAG 
GGAGGCAGCG TGGGCTCTGC CAATGTGGGC TGCCCCTCGC ACACAGGGCT CACAGGGCAG 
GCCTTGCTGG GGTCCAGGGC TGTTGGAGGA CCCCGAGGGC TGAGGAGCAG CCAGGACCCG 
CCTGCTCCCA TCCTCACCCA GATCAGGAAC CAGGGCCTCC CTGTTCACGG TGACACAGGT 
CAGGGCTCAG AGTGACCCTC GGCTGTCACC TGCTCACAGQ GATGCTGGTG GCTGGTGAGA 
CCCCGCACTG CACACGGGAA TGCCTAGGTC CCTTCCCGAC CCAGCCAGCT GCACTGCAGG 
GCACGGGGAC CTGGATAGTT AAGGGCTTTT CCAAACATGC ATCCATTTAC TGACACTTCC 
TGTCCTTGTT CATGGAGAGC TGTTCGCTCC TCCCAGATGG CTTCGGAGGC CCX3CAGGGCC 
CACCTTGGAC CCTGGTGACC TCCTOTCACT CACTGAGGCC ATCAGGGCCC TGCCCCAGGC 
CTGGACGGGC CCTCCTTCCC TCCTGTGCCC CAGCTGCCAG GTGGCCCTGG GGAGGGGTGG 
TGTGGTGTTG GGAAGGGGTC CTGCAGGGGG AGGAGGACTT GGAGGGTCTG GGGGCAGCTG 
TCCTGAACCG ACTGACCCTG AGGAGGCCGC TTAGTGCTGC TTTGCTTTTC ATCACCGTCC 
CGCACAGTGG AOGGAGGTCC CCGGTTGCTG GTCAGGTCCC CATGGCTTGT TCTCTGGAAC 
CTGACTTTAG ATGTTTTGGG ATCAGGAGCC CCCAACACAG GCAAGTCCAC CCCATAATAA 
CCCTGCCAGT GCCAGGGTGG GCTGGGGACT CTGGCACAGT GATGCCGGGC GCCAGGACAG 
CAGCACTCCC GCTGCACACA GACGGCCTAG GGGTGGCGCT CAGACCCCAC CCTACGCTCA 
TCTCTGGAAG GGGCAGCCCT GAGTGGTCAC TGGTCAGGGC AGTGGCCAAG CCTGCTGTGT 
CCTTCCTCCA CAAGGTCCCC CCACCGCTCA GTGTCAGCGG GTGACGTGTG TTCTTTTGAG 
TCCTTGTATG AATAAAAGGC TGGAAACCTA AA 

Seq ID NO: 248 Protein sequence 
Protein Accession #: NP 002072.1 
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1 11 21 31 

till 
MELRARGWWL LCAAAALVAC ARGDPASKSR SCGEVRQIYG 
ICPQGYTCCT SEMEENLANR SHABLBTALR DSSRVLQAML 
TLQATFPGAF GELYTQNARA FRDLYSELRL YYRGANLHLE 
QLLLPDDYLD CLGKQAEALR PFGBAPRELR LRATRAFVAA 
PLGPBCSRAV MKLVYCAHCL GVPGARPCPD YCRNVLKGCL 
TDKFWGTSGV ESVIGSVHTW LAKAINALQD NRDTLTAKVI 



41 



51 



AKGFSLSDVP QAEISGEHLR 60 

ATQLRSFDDH FQHLLNDSER 120 

ETLAEFWARL LERLFKQLHP 180 

RSFVQGLGVA SDWRKVAQV 240 

ANQADLDAEW RNLLDSMVLI 300 

QGCGNPKVNP QGPGPEBKRR 360 



1047 



WO 03/042661 



PCT/US02/36810 



RQKLAPRERP PSGTLEKLVS EAKAQLRDVQ DFWISLPGTL CSEKMALSTA SDDRCWNGMA 
RGRYLPEVMG DGLANQINNP EVEVDITKPD MTIRQQIMQL KIMTNRLRSA YNGNDVDFQD 



420 
460 



Seq ID NOs 249 DNA sequence 

Nucleic Acid Accession #: NM_001492.3 

Coding sequence : 8 . . 1864 



10 

15 

20 

25 

30 

35 

40 

45 
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75 



GAAGGCCATQ 
CATTTTCCTC 
GCCGGGTCCA 
CTTCTTCAGA 
CCTGGGCGCG 
GCCTGATCTC 
CACCTTCTCT 
CGCCTGGAGC 
GGCCCGGGAC 
CGAGCCGCCT 
GCGGTGCGGT 
GGTGTGCCGA 
ATGCCTAGAG 
CAGCCCCAGG 
TGACGGGAAC 
CACCTGCCCG 
TGGACCCTGC 
CTGCCACTGC 
CCTGCAGCCA 
CTGCCGCGCC 
CGCCTGCGCT 
GCTGGGCTTC 
TGCTCACGGC 
CTACATGGGA 
GGCCCCGCCG 
ACTGCTCGTG 
TGGCCACTCC 
CGCACTCCOG 
CTCGTCCGTA 
TGCTCCTTCC 
GCGOGCTGGG 
ATGAATTGGG 



11 
I 

GTCTCCCCAC 
CCCCAGACAC 
GGCCCTGGGG 
GTCTGCCTGA 
GCGCTGAGTG 
CCACTGCCCG 
TTCATCATCG 
CTGCTGGCGC 
ATTCAGCGCG 
GCCGTCGGGA 
CCGGGACTGC 
GCAGGCTGCA 
GGCTGGACTG 
GGCCCGTCCT 
CCGTGTGCCA 
CGTGGGTTCT 
TTCAACGGCG 
CCACCTGGTT 
TGCCGCAATG 
GGCTTCGCGG 
AACGGCGGCA 
GGCGGCCGCG 
GGCCGCTGCT 
GCGCGGTGTG 
GGCCTCAGGC 
GCCGCGGGCG 
CAGGATGCTG 
GATGCACTCA 
GATTGGAATC 
ATCTACGCTC 
CAGAGGCAGC 
TAGAGTCrCT 



21 
I 

GGATGTCCGG 
GGCCCGCTGG 
CCCCGCGGTC 
AGCCTGGGCT 
CGCGCGGACC 
ACGGGCTCTT 
AAACCTGGAG 
GCGTGGCTGG 
CAGGCGCCTG 
CCGCGTGCAC 
GCCCCTGCGC 
GCCCTGAGCA 
GACCCCTCTG 
CTGCTACCAC 
ATGGAGGCAG 
ACGGGCTGCG 

TCCAAGGCTC 
GCGGACTCTG 
GTCCTCGCTG 
CGTGTGTGGA 
ACTGCCGOGA 
ACGCCCACTT 
AGTTCCCAGT 
CCGGGGACCC 
TGGCOGGCGC 
GGTCTCGCTT 
ACAACCTAAG 
GCCCTGAAGA 
GGGAGGTAGC 
ACCTGCTTTT 
GGAAGGTTTT 



31 
I 

GCTCCTCTCC 
CGTCTTCGAG 
CCCCTGCAGC 
CTCAGAGGAG 
GGTCTACACC 
GCAGGTGCCC 
AGAGGAGTTA 
CAGGCGGCGC 
GGAGCTGCGC 
GCGCCTCTGC 
ACCGCTCGAG 
TGGCTTCTGT 
CACGGTCCCT 
CGGATGCCTT 
CTGTAGTGAG 
GTGTGAGGTG 
CGGGGGTGCA 
CAACTGTGAG 
CCTGGACCTG 
CGAGCACGAC 
GGGCGGCGGC 
GCGCGCGGAC 
CTCCGGCCTC 
GCACCCCGAC 
TCAGCGCTAC 
TGCGCTCTTG 
GCTGGCTGGG 
GACGCAGGAG 
TGTAGACCCT 
GACGCCCCTT 
TCCCTACCCT 
AAGCCCATTT 




GCCGCCGAGT 
GAGCAGCCCG 
TTCCGGGACG 
GGAGACCAGA 
TTGGCAGCCG 
TTCTCGTACC 
CGTCCGCGCA 
GACGAATGTG 
GAACAGCCCG 
GTCTCCACCA 
GTCCCTGGGC 
ACACCCAGGT 
AGCGGGGTGA 
GACCCTGACT 
AAGAGGGTGG 
GGCCACGCCC 
CTGGACGACT 
GCGCACCGCT 
CCGTGCGCCG 
GTCTGCGCTT 
GGCGCAAGOG 
CTTTTGCCTC 
CTGGTCCACG 
ACCCCGGAGC 
GGTTCCGGGG 
CAAGGGATTT 
TTCCCCCCGC 
TCCTOGATTC 
TCAGTTCTAA 



51 
I 

TCCTAGCGCT 
ACTCTTTCGG 
CCTGCCGCCT 
CCCCGTGCGC 
GAGCGCCCGC 
CCTGGCCTGG 
TTGGAGGGCC 
GAGGCCCGTG 
GCGCGCGCTG 
GCGCCCCCTC 
AGGCGCCGCT 
GTGAATGCCG 
GCAGCTGCCT 
CTGGGCCCTG 
CCTTTGAATG 
CATGTGCAGA 
CTGCCTACAT 
ACCGGTGCAG 
TGCGCTGCCG 
GCGCGGGCCG 
GCTCCTGCGC 
CGCGCCCCTG 
GCGCTCCCGG 
CCTTGCCCGC 
CGGCTCTGGG 
TGCGCCGCCG 
CGTCAGTCCA 
ATGGTCCGAG 
ATGTCATATC 
TACACACTGG 
TGTCCGTGAA 
CTTACTTTCA 



Seq ID NO: 250 Protein sequence 
Protein Accession #: NP 058637.1 



1 
I 

MVSPRMSGLL 
RVCLKPGLSB 
SFIIBTWREB 
PAVGTACTRL 
EGWTGPLCTV 
PRGFYGIiRCK 
PCRNGGLCLD 
FGGRDCRERA 
PGURPGDPQR 
PDALNNLRTQ 
GQRQHLLFPY 



11 
I 

SQTVILALIP 
EAAESPCALG 
LGDQIGGPAW 
CRPRSAPSRC 
PVSTSSCLSP 
VSGVTCADGP 
LGHALRCRCR 
DPCAARPCAH 
YLLPPALGLL 
EGSGDGPSSS 
PSSILSVK 



21 
I 

LPQTRPAGVP 
AALSARGPVY 
SLIiARVAGRR 
GPGLRPCAPL 
RGPSSATTGC 
CFNGGLCVGG 
AGFAGPRCEH 
GGRCYAHPSG 
VAAGVAGAAL 
VDWNRPEDVD 



31 
I 

ELQIHSFGPG 
TEQPGAPAPD 
RLAAGGFWAR 
EDECEAPLVC 
LVPGPGPCDG 
ADPDSAYICH 
DLDDCAGRAC 
LVCACAPGYM 
LIfVHVRRRGH 
PQGIYVISAP 



41 
I 

PGPGAPRSPC 
LPLPDGLLQV 
DIQRAGAWEL 
RAGCSPEHGP 
NPCANGGSCS 
CPPGFQGSNC 
ANGGTCVEGG 
GARCEPPVHP 
SQDAGSRLLA 
SIYAREVATP 



Seq ID NO: 251 DNA sequence 

Nucleic Acid Accession #: CAT cluster 



GAAATATAAC 
AGATTATTGA 
GCTTCAATTC 
TTAGAAAGTC 
AGTTTTACTA 
TTGTGCAAAC 
GGGTGCTATT 
GGTTTTCTTT 
GTTTACTGAA 
TTATGTAAAT 
TGCCATTGTC 
TAAAAATACA 



11 
1 

CATTGCAATT 
CAATCCCAAA 
CTTATATGAC 
TGAGAGACTT 
CTGTAGTGAC 
AACCTTTCAT 
TACCTAGAAC 
TTCAAGGAGG 
AGCACTCAAT 
ATGTTAAATA 
ATATCAAACT 
TTTCAATGTT 



21 
I 

AGAAAATATC 
TATACAATTT 
TTCAGACTGG 
TATACATAAA 
CGTTGAGAAG 
TGTTAAGTGC 
CATTGTCTAC 
TTCAATTAAG 
GAAGTCAAAT 
TAACATCTAA 
TGCTTTGTAT 
AAAAAAAAAA 



31 
I 

CAAAATAGCC 
TTCTTTAAAA 
AGAAGCCTGT 
TTCTCAATTT 
ACCCTTGTTT 
CTGTATTCCT 
TACAATTAAC 
GCAATAAGAT 
TACTGAAGCT 
GGAAAATAAA 
CATACTAATG 
AAAAAAAA 



41 
I 

TGTTATCTTC 
GTAGTACAAT 
TAAACCACTG 
GGCTGCTGTA 
ATTTACATTT 
TTCATTTACT 
ATTTACATTA 
GTTTGCTGGA 
TTTGCCTACA 
CAATATTATA 
TTACATAACT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
I860 



51 
I 

SARLPCRLFF 
PFRDAWPGTF 
RFSYRARCEP 
CEQPGECRCL 
ETPRSFECTC 
EKRVDRCSLQ 
GAHRCSCALG 
DGASALPAAP 
GTPEPSVHAL 
LFPPLHTGRA 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 



51 
I 

CACGTGGCCT 
TTCCTTTGTA 
TTAGTTTCAG 
CACGTGCCAG 
GAAGCACTGT 
TCATGTCCAG 
CAAAGTGTGT 
GAAACCTATT 
TCTTGGTCTT 
ATTATGTGTT 
TATCGATCAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



80 



Seq ID NO: 252 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

AGGTACTGCC AGAAAGGATC AGGACCTGGA GTCTGGCAAG AGGAAGACAG AGGCCTGTGT 



1048 



WO 03/042661 



PCTAJS02/36810 
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GGGAAGCGAG TTGTTATCTT TGGTTATCTA GCTGTATGAG TGTATTGGTC TTCATAAAGC 120 

TAGATAACCG AAAGTAAAAA CTTCTTCAAG ATCGCCGGGG AGCGTGTGAG AATGAAAGAC 180 

TACAGCCGAG AGACAGTAAA AACCAGAAAG GTCAGGAATA CTTATTGAAT CTAACTTTGT 240 

TTTTGTTTTG TTTTTTTCCT TATGATTAAA GGTGGGATGA GAGAAAATTA AATGACACAC 300 

ACATGCTAAA ATATCAAGGT TCCAGATATG TCTTGAGAGG GGTTGTTGCA GCTGCAAAGG 360 

AGAAGTGTAT AGTGATAATG AGTAAAGATG CATGTGCAGT TTGTTCTATT TTAAGGCAAA 420 

AGTTATATCA GGGATTTTTT TCTTAGAAAG GTGTTGCAGA GATGTCTGGT ACCTAGTTTA 480 

AAAATGATTC CATAATATGT AGACTTGGGC AGTTCCTTTG GGAGGCACCT CCCTCTCAAA 540 

ATTTGAAGAT TGTGCTTGGA AATTACTTTA CATGTATTTG GGCTGTATGT CAATTTGTAC 600 

GAAATTAGAC TTTCAGAAAA GTTTATACTG GAAGGTTAAT AATTTGTATC TACTGAGGAC 660 

TTAGAGCTAG CAGGCAAAAT GAAAAAAAAA AAAAAAAARG CAAGGGCTGA TTTTTATTCT 720 

TCTATTCAAA ATACAAGGAC AGATGCTTCT CTGTTCCAAG AGGGTTTCCT TGAGGAAGCT 780 

ACTGAAGCAG AAAGACATGA TGGAGACGAG ATCGCCTCCC CTCTTGTCAA AGTGTTAAAA 840 

AAATGTTCTG TCTTACTCTG CGCCTAGCAT TGGAAATGAA AGTGACATTT AOGCCACAAC 900 

CCACGTGTGC GCCTCCTCTC TTTTTGTTTA AGGATGATCA GGTCTATCCA GGAAACAGCT 960 

CTGGCATCCC AAACTGAAAT AATTAGGACG TATATAGACC TGACAAAAAT GGAAAGGGGG 1020 

TGGGGAATCT GAGGGTCTGT CTTGCCTAAT TGATTCCGCT AAACGGAATG CAGGAGATGT 1080 

GAACGGCAGG ACGCTCCGAT TCCCACGCTC GGGGGCAAGT QATAAAGCGG GGCCGGGCAG 1140 

CCTATGACAG ACAGCCCTGT TGGGGGGTGG GGGTATGAAA AAAACATCAA GTGCACACAC 1200 

CATACTCATC TCCATCGCTT AAGAAAGTAA AGGCATTTCC CACCCACAGC CATCTGCAGC 1260 

TTCCCAAATG GCAGCACCAA CTGGTCTGTA GCTGCTACAT AGTCTGCTTC TGTTAATTTT 1320 

TTAACCACTG TTTAATCTGG CCATAATTAA GTTTGGCTTT CTTCGTTGTT TGAGATTTTC 1380 

AGAATTCAAG GCAAGCTAGT AGAAAGCAAT TCCAAGAAAG TCCCATGACT GCCTGCCCCT 1440 

AATGTCAAAA TCTCAGTCCA TGAGATTATG GCCTTGTGAC CACATTTTTG CTTTTGTTTT 1500 

GGGTGGGCAA ATGTGTATAG AGATAAAATA CATATCTCTA TATAACAGTC GTTATTATAA 1560 

TTTCATGAGG CTTTTTACCT CTTAACATGA TACATCTAGG AACTTGGTCT AATTGTGCTA 1620 

GTAGATATAC ACTAGAAGTA AAACTATAAA GTCATTAGCC TGTAGAAAGT TGATTATGAT 1680 

AACAATATGA TAAAAAGTTT GTTTTGGAAT AGTCTCAGCT AGATGGGTCT AAATAGCCAT 1740 

TTTAATGTAA TCTAAAAATA ACACTATGCC TAGCAGAAAC TTTGGCGCTT TGGAGGTCCC 1800 

CATTGTCGCC TTTTCATAAA AGTCCTTAAG TTTTCCATAT GTCACCAAGC AAACATTTGT 1860 

ATAGGCATTT ACACAGAAAT ATAACCATTG CAATTAGAAA ATATCCAAAA TAGCCTGTTA 1920 

TCTTCCACGT GGCCTAGATT ATTGACAATC CCAAATATAC AATTTTTCTT TAAAAGTAGT 1980 

ACAATTTCCT TTGTAGCTTC AATTCCTTAT ATGACTTCAG ACTGGAGAAG CCTGTTAAAC 2040 

CACTGTTAGT TTCAGTTAGA AAGTCTGAGA GACTTTATAC ATAAATTCTC AATTTGGCTG 2100 

CTGTACACGT GCCAGAGTTT TACTACTGTA GTGACOGTTG AGAAGACCCT TGTTTATTTA 2160 

CATTTGAAGC ACTGTTTGTG CAAACAACCT TTCATTGTTA AGTGCCTGTA TTCCTTTCAT 2220 

TTACTTCATG TCCAGGGGTG CTATTTACCT AGAACCATTG TCTACTACAA TTAACATTTA 2280 

CATTACAAAG TGTGTGGTTT TCTTTTTCAA GGAGGTTCAA TTAAGGCAAT AAGATGTTTG 2340 

CTGGAGAAAC CTATTGTTTA CTGAAAGCAC TCAATGAAGT CAAATTACTG AAGCTTTTGC 2400 

CTACATCTTG GTCTTTTATG TAAATATGTT AAATATAACA TCTAAGGAAA ATAAACAATA 2460 

TTATAATTAT GTGTTTGCCA TTGTCATATC AAACTTGCTT TGTATCATAC TAATGTTACA 2520 
TAACTTATCG ATCAATAAAA ATACATTTCA ATGTT 

Seq ID NO: 253 DNA sequence 

Nucleic Acid Accession #: NM_001650.2 

Coding sequence: 40.1011 

1 11 21 31 41 SI 

I I I I I I 

GGGGCAGGCA ATGAGAGCTG CACTCTGGCT GGGGAAGGCA TGAGTGACAG ACCCACAGCA 60 

AGGCGGTGGG GTAAGTGTGG ACCTTTGTGT ACCAGAGAGA ACATCATGGT GGCTTTCAAA 120 

GGGGTCTGGA CTCAAGCTTT CTGGAAAGCA GTCACAGCGG AATTTCTGGC CATGCTTATT 180 

TTTGTTCTCC TCAGCCTGGG ATCCACCATC AACTGGGGTG GAACAGAAAA GCCTTTACCG 240 

GTCGACATGG TTCTCATCTC CCTTTGCTTT GGACTCAGCA TTGCAACCAT GGTGCAGTGC 300 

TTTGGCCATA TCAGOSGTGG CCACATCAAC CCTGCAGTGA CTGTGGCCAT GGTGTGCACC 360 

AGGAAGATCA GCATCGCCAA GTCTGTCTTC TACATCGCAG CCCAGTGCCT GGGGGCCATC 420 

ATTGGAGCAG GAATCCTCTA TCTGGTCACA CCTCCCAGTG TGGTGGGAGG CCTGGGAGTC 480 

ACCATGGTTC ATGGAAATCT TACCGCTGGT CATGGTCTCC TGGTTGAGTT GATAATCACA 540 

TTTCAATTGG TGTTTACTAT CTTTGCCAGC TGTGATTCCA AACGGACTGA TGTCACTGGC 600 

TCAATAGCTT TAGCAATTGG ATTTTCTGTT GCAATTGGAC ATTTATTTGC AATCAATTAT 660 

ACTGGTGCCA GCATGAATCC CGCCCGATCC TTTGGACCTG CAGTTATCAT GGGAAATTGG 720 

GAAAACCATT GGATATATTG GGTTGGGCCC ATCATAGGAG CTGTCCTCGC TGGTGGCCTT 780 

TATGAGTATG TCTTCTGTCC AGATGTTGAA TTCAAACGTC GTTTTAAAGA AGCCTTCAGC 840 

AAAGCTGCCC AGCAAACAAA AGGAAGCTAC ATGGAGGTGG AGGACAACAG GAGTCAGGTA 900 

GAGACGGATG ACCTGATTCT AAAACCTGGA GTGGTGCATG TGATTGACGT TGACCGGGGA 960 

GAGGAGAAGA AGGGGAAAGA CCAATCTGGA GAGGTATTGT CTTCAGTATG ACTAGAAGAT 1020 

CGCACTGAAA GCAGACAAGA CTCCTTAGAA CTGTCCTCAG ATTTCCTTCC ACCCATTAAG 1080 

GAAACAGATT TGTTATAAAT TAGAAATGTG CAGGTTTGTT GTTTCATGTC ATATTACTCA 1140 

GTCTAAACAA TAAATATTTC ATAATTTACA AAGGAGGAAC GGAAGAAACC TATTGTGAAT 1200 

TCCAAATCTA AAAAAAGAAA TATTTTTAAG ATGTTCTTAA GCAAATATAT ACCTATTTTA 1260 

TCTAGTTACC TTTCATTAAC AACCAATTTT AACCGTGTGT CAAGATTTGG TTAAGTCTTG 1320 

CCTGACAGAA CTCAAAGACA CGTCTATCAG CTTATTCCTT CTCTACTGGA ATATTGGTAT 1380 
AGTCAATTCT TATTTGAATA TTTATTCTAT TAAACTGAGT TTAACAATGG C ' 

Seq id NO: 254 Protein sequence 
Protein Accession #: NP_001641.1 

1 11 21 31 41 51 

I I I I I I 

MSDRFTARRW GKCGPLCTRE NIMVAFKGVW TQAFWKAVTA EFLAMLIFVL LSLGSTINWG 60 

GTEKPLPVDM VLISLCFGLS IATMVQCFGH ISGGHINPAV TVAMVCTRKI SIAKSVPYIA 120 

AQCLGAIIGA GILYLVTPPS WGGLGVTMV HGNLTAGHGL LVELI ITFQI* VPTIFASCDS 180 

KRTDVTGSIA LAIGFSVAIG HLFAINYTGA SMNPARSFGP AVIMGNWENH WIYWVGPIIG 240 

AVLAGGLYEY VFCPDVEFKR RFKEAFSKAA QQTKGSYMEV EDNRSQVETD DLILKPGWH 300 



1049 



WO 03/042661 



PCT/US02/36810 



VIDVDRGEEK KGKDQSGEVL SSV 



Seq ID NO: 255 DNA sequence 
Nucleic Acid Accession ft: U26742.1 
Coding sequence: 325.. 1449 
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i 
I 

CAGGAAACCC 
GGCAGCGGAC 
CTCAAACCTC 
GCGCCCGTCC 
TCTTTAAGTG 
GCTGTCCCTT 
GCAGAAAGAA 
TCCACCTACA 
GTGGACATAT 
AACACTGAAC 
AACAAACGGA 
AACTTCCTGC 
AAAATGGCTT 
TCAATGATTT 
GAAGTTCTCA 
TCAGCCAGAT 
CTTATGTCAG 
AATGTGGAAA 
GGATTTCGCT 
AGGGGACATG 
AAATCACCTG 
CGTGAACCTT 
ATCGTTGATA 

TGCGTCTAGA 



TTTCCCATTT 
TCACATTTCT 
CAATAAAGCT 



11 

I 

TGGTACTGGC 
CCGGCACTTC 
CCTGCAGACC 
ATTCCAAGAT 
CCATAATCAA 
CATTGAATTT 
GACAGCTGTT 
GAACAGCATG 
GGAATGTCAT 
TCAACGTGTC 
TGCCAACCAC 
TTGCAGCGTT 
TAGCCACATT 
CTGACTCCAG 
AACTACCCAC 
CCTGTTTCTC 
ATCCTCCCCC 
ATGTCTTCCA 
ACCGATGCCA 
CCGGTGGTTC 
CTAAGAAGCT 
TGCACCCCAT 
CTTGGCCTCC 
CCTCAGGAAG 
TGGATAACAT 
CCTCCAGGGT 
TCAGTCATTT 
TTCTGTCACC 
GTGTACACTA 



21 
I 

AGCAGCCAGC 
CAACATTATT 
AATGGACACC 
AAATAGGATT 
CTGCCATTTC 
TAGAATGATT 
TGCAGAGATG 
CAAGCTTAGG 
AGAAGCATTG 
CCGCTTAGAG 
TCACCAAATC 
TGATCCGGAA 
GTGTQGAGGG 
TGGGGTGATG 
GGCAGTTTTT 
CCAACAGAAA 
GCAGTGTCTG 
TCCGGTTGAG 
ACAGTGTCAC 
TCATAGCAAC 
GACTAATGCA 
GTTCCCAGAT 
CAGACCTGTA 
TCCTTTTATT 
GACTTCTTCT 
GCATGGTACC 
TGGAATGTTC 
ACAGAGATTG 
AATGTCC 



31 
I 

CTCTGCTGTG 
AAATAATAAG 
TTCTAAGAGT 
TACCAATCCT 
AAAGAATATA 
GAAGATAGTG 
AGGGCTCAAG 
TTTGTTCAGA 
CGGGAAAATG 
GCTGTGCTCT 
CATGTGGAGC 
GGCCATGGTA 
AAGATCATGG 
GTTTATGGAC 
GAAGGTCCTT 
AAAGTCACGT 
GTCTGGTTGC 
TGTTCCTACT 
AATTACCAGC 
CAGCACCAAA 
TTAAGCAAGT 
CAGCCTGAGA 
ACCAGCATGA 
ACCAGGAGCT 
ACCCTAAAAT 
CATTAACCCA 
TCTGTGAACC 
GCCTACGGTT 



41 
I 

CCCACATGAC 
AAAGCGGCTC 
TTGGCGAGTC 
TGGATGAAGT 
GATGGTTTTG 
GGAAAAGAGG 
ATCTGGATCG 
AGAAATGCAA 
CTCTGAACAA 
CCACTATTTT 
AGTCCATCAG 
AAATTTCAGT 
ACAAATTAAG 
GATATGACCA 
CATTTGGTTA 
TAAATGGTTT 
CTCTTCTGCA 
GCCACAGTGA 
TCTGTCAGGA 
TGAAAGAGTA 
CCCTGAGCTG 
AGCCACTCAA 
ACGACACCCT 
CGGACGGTGC 
ATTCCTATAA 
AAATATGATT 
ACAGTTGGGT 
TCTGTTTTGA 



51 
I 

CCACAACTCT 
CTACTCCAGG 
AGTGACTGAA 
GCTTGGGAAG 
AAAAGTTCAT 
AAATACCATG 
CATCCGACTC 
TTTGCACCTG 
CCTGGACCCA 
TTACCAGCTC 
CCTCCTCCTT 
ATTTGCTGTC 
ATATATTTTC 
ATTCCTTCGG 
CACAGAACAG 
CTTGGACACG 
TCGACTAGCA 
GAGTATGATG 
CTGCTTCTGG 
CACGTCATGG 
TGCTTCCAGC 
CTTGGCTCAC 
GTTCTCCCAC 
TTTTGGTGGA 
TACTTTGAGC 
ATTTCCCTTT 
TGTTTAAAGC 
GGGTGCTGTT 



1 
I 

GCGGATTGTT 
CACCCCTTAC 
TGTGGGTGAT 
GACTAAAAGC 
AGTCCGTAAA 
TAAAAGTTAC 
TACAATCCTT 
CTCCTTTCCT 
CTCCAGTCTT 
GCGAGCTGTA 
TGTCATCATC 
TGTACGAGTG 
TCTGGTAGAA 
GCCCATCCAG 
GGAGACTCTT 
CAATGCCCTG 
GGAACAGGGG 
GGTAGCAGTA 
TGTGGAGATG 
CATTGTTGGC 
ACGGAAAAAC 
GACCTCTTCA 
CGTGGACAAG 
GACTGCCCTC 
GAACTTCGGA 
TGGAATTCCT 
CACTGACGAC 



11 

I 

GCTCCGTTGT 
AAAATCAGAA 
TCCCAGACAC 
AATGGAGAAG 
CGCACACTTT 
CTGTTTCGGA 
GGATTTACCC 
GGGGAACTTC 
GTCACAGGAA 
GTCTATTATA 
ATCCATCCTG 
ACAGCTGCAG 
GCCTGCTTTA 
GCCAACGAAA 
ACCCGAATCA 
GGTCTAGTTG 
CAGGCCCTGA 
ATAATGTGGT 
GAAGACATGG 
TTACTCATTC 

AGTTCTGCCA 
CGCGTCACCA 
TATGAGGCTT 
CAAATTATTA 
CAGGCGGGCC 
ATCACGCTCA 



21 
I 

ACCTGCTGGG 
AAGTTGTGTT 
TGAAGTGCAA 
AGCCCAAGAT 
TGGCCAAGAA 
ATGCTTTTGT 
TCCGACCATA 
TGATGAGGAT 
TGGCGGCGCT 
TGACTACCAC 
GGAAGGGCAC 
ATGCCTTCCT 
AACAGTTTAA 
CGCTTGTGGG 
CAGAGGAGCT 
TCTTCTCCAT 
GAGAGTTCTT 
ATGCCCCCGT 
GTGTGATTGG 
ACGCAGTCAT 
TTATTGGAGG 
CCCTACCCAT 
GATTOGTGCT 
TGGCTGCCAT 
CAATCAGCAT 
TGGTCACTAT 
TCATCGCGGT 



31 
I 

GAATTCACCT 
TTCTAATACC 
AGAAGAGACC 
GGGGGGCAGG 
GAAAGTGCAG 
GCTGCTCACA 
CAGAATGAGC 
GTTACAGATG 
AGATAGTAAG 
CATCATTGCT 
AAAGGAAAAC 
GGACTTGATC 
AACCAACTAT 
TGCTGTGATA 
GGTCCCAGTT 
GTGCTTCGGT 
TGATTCTCTT 
GGGTATTCTC 
GGGGCAGCTT 
0GTCTT6CCA 
GTTGCTGCAA 
CACCTTCAAG 
CCCCGTAGGA 
TTTCATTGCT 
CACAGCCACA 
GGTCATTGTG 
GGACTGGTTC 



41 
I 

CGTTACTGCT 
AAAGAGGAGG 
CTCCTAGAAA 
ATGGAGAGAT 
AACATTACAA 
GTCACCGCTG 
TACCGGGAAG 
CTGGTCTTAC 
GCATCAGGGA 
GTGGTGATTG 
ATGCACAGAG 
AGGAACATGT 
GAGAAGAGAA 
AACAATGTGT 
CCAGGATCTG 
TTTGTGATTG 
AACGAAGCCA 
TTCCTGATTG 
GCCATGTACA 
CTCCTCTACT 
GCACTCATCA 
TGCCTGGAAG 
GCCACCATTA 
CAAGTTAACA 
GCTGCCAGTA 
CTGACATCTG 
CTGGATCGCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 



Seq ID NO: 256 Protein sequence 
Protein Accession 8: AAC50424.1 

1 11 21 31 41 51 

I II I I I 

MIEDSGKRGN TMAERRQLFA EMRAQDLDRI RLSTYRTACK LRFVQKKCNL HLVDIWNVIE 
ALRENALNNL DPNTELNVSR LEAVLSTIFY QLNKRMPTTH QIHVEQSISL LLNFLLAAFD 
PEGHGKISVF AVKMALATLC GGK3MDKLRY IFSMISDSSG VMVYGRYDQF LREVLKLPTA 
VFEGPSFGYT EQSARSCFSQ QKKVTLNGFL DTLMSDPPPQ CLVWLPLLHR LANVENVFHP 
VECSYCHSES MMGFRYRCQQ CHNYQLCQDC FWRGHAGGSH SNQHQMKBYT SWKSPAKKLT 
NALSKSLSCA SSREPLHPMF PDQPEKPLNL AHIVDTWPPR PVTSMNDTLF SHSVPSSGSP 
FITRSSDGAF GGCV 

Seq ID NO: 257 DNA sequence 

Nucleic Acid Accession #: NM_004172.1 

Coding sequence: 179.. 1807 



60 
120 
180 
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300 
360 



51 
I 

TGATATCTTC 
TTTGGCTTTC 
AGTAAAATAT 
TCCAGCAGGG 
AGGAGGATGT 
TCATTGTGGG 
TCAAGTACTT 
CACTTATCAT 
AGATGGGAAT 
GCATAATCAT 
AAGGCAAAAT 
TCCCTCCAAA 
GCTTTAAAGT 
CTGAGGCCAT 
TGAATGGAGT 
GAAACATGAA 
TCATGAGACT 
CTGGGAAGAT 
CCGTGACTGT 
TCTTGGTAAC 
CGGCTCTGGG 
AGAACAATGG 
ACATGGATGG 
ACTTTGAACT 
TTGGGGCAGC 
TCGGCCTGCC 
TCCGGACCAC 
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1440 
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CACCAACGTA CTGGGAGACT CCCTGGGAGC TGGGATTGTG GAGCACTTGT CACGACATGA 1680 

ACTGAAGAAC AGAGATGTTG AAATGGGTAA CTCAGTGATT GAAGAGAATG AAATGAAGAA 1740 

ACCATATCAA CTGATTGCAC AGGACAATGA AACTGAGAAA CCCATCGACA GTGAAACCAA 1800 

GATGTAGACT AACATAAAGA AACACTTTCT TGAGCACCAG GTGTTAAAAA CCATTATAAA I860 

ATCTTTCCAT CTCATTACAG CTCATTCGCT CCAGCAAGCC CGTCATCTTC CCTTTCCTCC 1920 

CTTCTGATAA GACTGGAAAA TAGTCCTCCA AAACACAAGG GAGGATTTTG GGTGGCCAAA 1980 

GTGTACAATT TTCATCCCAC AATTGAAATT TTTAAATCAT TTCATGTTAG TCTTACCGAA 2040 

TAAGGTACCA AGATCACAAA TAGTGTTGAT CAGATCTTAC AAGTTTATGT GGCACACAAT 2100 

TCCTATAAAT GTGATTTTTT TATATAAGTT AAAGAGACAA ATAGTAGGCT AAAAACATTT 2160 

TAAAATCAAC TTTTGAAATT TAAAAATCTT TCAGAATACA ATTCAGTTTT AGTTTCAAAA 2220 

TGTTAACAAC TTGAATTACA ACCGGTTATC AGTTGGACAG TAAGATTTTA TCCCTTTCTC 2280 

TTCTGACTGG TATACCTATT TCATTAGTAG CTAGGTGCAC ATATACATCT AGCACAGCTG 2340 

TGAGGACAGA CAGAAGGCAA AGTTTCCATG TGGCCTTGAG CAAGTCCCAT CTCACCTCTA 2400 

GGCCTCAGTG TCCTCATCTA TAAAATGAGG GACTTCCCTA GAAGTCTTCA TGGTCTCTTC 2460 

CAGCCCAGAC ATCCTGTGAT GTCATGAAAG CACCTGCCCT CTGTTTCCCC TCAGAACACC 2520 

CTGTACCATC CATGGAGCAC GAGGCCTTCA GAAAAGACAC TTCAATGGGA GTGAACATTT 2580 . 

CTAACTAAGG ACAGGATGGC TGTGTGTGGT GGTCACCAGG TCCTGTGAGC AAAGTGCAGG 2640 

TTATGCAAGT CGCCAGGCAG GAGGCCATTC CAGGAGTGGG ATTATTCATC AAACTCTTTG 2700 

CCCAGTTCAT CCCAATGGGG GAAGTATTCC CTTCTTTCCT ACTCTGGGAA GAATGTCTCC 2760 

TGCCACTCCT CAACTGATGA TAGACTTCGA AAACAGATGA GAAGACTAGC AGCTAGCAAG 2820 

GGTGCTTGTA GTCACACTGT GGAACACTAA AGAGCTAGGA AAGAGTTGAG CACAGGCAAC 2880 

ATTACAAACA AAGGATTTGA AAACACCAAG AGTACAGGTC TTCTTTAAGG AAGAATAAAA 2940 

AAGAAGAGGT TCATTTTTCT GGCTTTTTTT TTCACCTGAA ACACTTTTTC TOGAGTCCAA 3000 

AATCATTCCC CCCGTGAAGT CTGCTTACCA AAACATAAGA CGACTTATAT ATTTGAAAGA 3060 

AGTCAAATGA ATGAGCTCTC TAATAGAAGT CCATGAGTTG AGTGGGTATT TCTTATTTGA 3120 

AAGTGTTTTT CTTTAATCAA AAGTCCTTAG AATGAGGGAA ACAAAATATT TATTTGTTTT 3180 

GGAATCCCAC TTATCAAATC ATTCAAAACT TTCAGCTGGA GTGGGGTTTG CTTTTGTTTT 3240 

GTTTGTGTCC ATAAGAGAAA TGGTAGAAGA TGAATCAGTA TGAAGACACT GTCAATGAGG 3300 

TTATGAGAAA AAAACAGCAG GGGCATTAGT TTCAGGCAAG GCAGCTCCCA GGTTTAGAGA 3360 

TTAATTTTTA CCCCCTAAGG AATATCCAGT CAAAGACGCT GAGTGGGAGC TGTCAGGCAG 3420 

TAGCAGCTGT GTTTGAGTTT CTGGCTGAAA ATGGTGAAGA ATGGACTTAA TTATGCTAAC 3480 

AAACTGAAAA ATCTAGACAT AGATCCTCTG ATATACAATT AGAGATATTT TTATATAGAC 3540 

CCCAAGCATT CTGTGCATAA AAGTTAACAT TAGGCTGTGG TGCAGTAACC ATTTAATGTC 3600 

GAGGCTCTAT TTCGGAAATA CACTACAAAT GTTAAAGTAC GTGGCTGTCC TCTTAAGACA 3660 

CTAGTAGAGC AAAGACTTAA TCATATCAAC TTAATTCTGT TACACAATAT GTGTTTTTTA 3720 

ATATACTAAC CATTTCTTAT GGAAAGGTCC TGTGGGGAGC CCATCCTCTC GOCAAGCCAT 3780 

CACAGGCTCT GCATACACAT GCACTCAGTG TGGACTGGGA AGCATTACTT TGTAGATGTA 3840 
TTTTCAATAA AGAAAAAAAT AGTTTTACAT T 

Seq ID NO: 258 Protein sequence 
Protein Accession #: NPJJ04163.1 

1 11 21 31 41 51 

i I I I ! I 

MTKSNGEEPK MGGRMERFQQ GVRKRTLLAK KKVQNITKED VKSYLPRNAF VLL.TVTAVIV 60 

GTILGPTLRP YRMSYREVKY FSFPGELLMR MLQMLVLPLI ISSLVTGMAA LDSKASGKMG 120 

MRAWYYMTT TIIAWIGII IVIIIHPGKG TKENMHREGK IVRVTAADAF LDLIRNMFPP 180 

NLVEACFKQF KTNYEKRSFK VPIQANETLV GAVINNVSEA METLTRITEE LVPVPGSVNG 240 

VNALGLWFS MCFGFVIGNM KEQGQALREF PDSLNEA1MR LVAVIMWYAP VGILFLIAGK 300 

IVEMEDMGV1 GGQLAMYTVT VIVGLLIHAV IVLPLLYFLV TRKNPWVFIG GLIjQALITAL 360 

GTSSSSATLP ITFKCLEENN GVDKRVTRFV LPVGATINMD GTALYEALAA IFIAQVNNFE 420 

LNFGQIITIS ITATAASIGA AGIPQAGLVT MVIVLTSVGL PTDDXTLIIA VDWFUDRLRT 480 

TTNVLGDSLG AGIVEHLSRH ELKNRDVEMG NSVIEENEMK KPYQLIAQDN ETEKPIDSET 540 
KM 

Seq ID NOt 259 DNA sequence 

Nucleic Acid Accession #: NMJ>21948.1 

Coding sequence: 48.. 2783 

1 11 21 31 41 51 

I I I I I I 

TGTGGCACTG CCTGOGTACC CAACCCCAGC CCTGGGTAGC CTGCAGCATG GCCCAGCTGT 60 

TCCTGCCCCT GCTGGCAGCC CTGGTCCTGG CCCAGGCTCC TGCAGCTTTA GCAGATGTTC 120 

TGGAAGGAGA CAGCTCAGAG GACCGCGCTT TTCGCGTGCG CATCGCGGGC GACGCGCCAC 180 

TGCAGGGCGT GCTCGGCGGC GCCCTCACCA TCCCTTGCCA CGTCCACTAC CTGCGGCCAC 240 

CGCCGAGCCG CCGGGCTGTG CTGGGCTCTC CGCGGGTCAA GTGGACTTTC CTGTCCCGGG 300 

GCCGGGAGGC AGAGGTGCTG GTGGOGCGGG GAGTGCGCGT CAAGGTGAAC GAGGCCTACC 360 

GGTTCCGCGT GGCACTGCCT GCGTACCCAG CGTCGCTCAC CGACGTCTCC CTGGCGCTGA 420 

GCGAGCTGCG CCCCAACGAC TCAGGTATCT ATCGCTGTGA GGTCCAGCAC GGCATCGATG 480 

ACAGCAGCGA CGCTGTGGAG GTCAAGGTCA AAGGGGTCGT CTTTCTCTAC CGAGAGGGCT 540 

CTGCCCGCTA TGCTTTCTCC TTTTCTGGGG CCCAGGAGGC CTGTGCCCGC ATTGGAGCCC 600 

ACATCGCCAC CCCGGAGCAG CTCTATGCCG CCTACCTTGG GGGCTATGAG CAATGTGATG 660 

CTGGCTGGCT GTCGGATCAG ACCGTGAGGT ATCCCATCCA GACCCCACGA GAGGCCTGTT 720 

ACGGAGACAT GGATGGCTTC CCCGGGGTCC GGAACTATGG TGTGGTGGAC CCGGATGACC 780 

TCTATGATGT GTACTGTTAT GCTGAAGACC TAAATGGAGA ATTGTTCCTG GGTGACCCTC 840 

CAGAGAAGCT GACATTGGAG GAAGCACGGG CGTACTGCCA GGAGCGGGGT GCAGAGATTG 900 

CCACCACGGG CCAACTGTAT GCAGCCTGGG ATGGTGGCCT GGACCACTGC AGCCCAGGGT 960 

GGCTAGCTGA TGGCAGTGTG CGCTACCCCA TCGTCACACC CAGCCAGCGC TGTGGTGGGG 1020 

GCTTGCCTGG TGTCAAGACT CTCTTCCTCT TCCCCAACCA GACTGGCTTC CCCAATAAGC 1080 

ACAGCCGCTT CAACGTCTAC TGCTTCCGAG ACTCGGCCCA GCCTTCTGCC ATCCCTGAGG 1140 

CCTCCAACCC AGCCTCCAAC CCAGCCTCTG ATGGACTAGA GGCTATCGTC ACAGTGACAG 1200 

AGACCCTGGA GGAACTGCAG CTGCCTCAGG AAGCCACAGA GAGTGAATCC CGTGGGGCCA 1260 

TCTACTCCAT CCCCATCATG GAGGACGGAG GAGGTGGAAG CTCCACTCCA GAAGACCCAG 1320 

CAGAGGCCCC TAGGACGCTC CTAGAATTTG AAACACAATC CATGGTACCG CCCACGGGGT 1380 
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TCTCAGAAGA GGAAGGTAAG GCATTGGAGG AAGAAGAGAA ATATGAAGAT GAAGAAGAGA 1440 

AAGAGGAGGA AGAAGAAGAG GAGGAGGTGG AGGATQAGGC TCTGTGGGCA TGGCCCAGCG 1500 

AGCTCAGCAG CCCGGGCCCT GAGGCCTCTC TCCCCACTGA GCCAGCAGCC CAGGAGGAGT 1560 

CACTCTCCCA GGCGCCAGCA AGGGCAGTCC TGCAGCCTGG TGCATCACCA CTTCCTGATG 1620 

GAGAGTCAGA AGCTTCCAGG CCTCCAAGGG TCCATGGACC ACCTACTGAG ACTCTGCCCA 1680 

CTCCCAGGGA GAGGAACCTA GCATCCCCAT CACCTTCCAC TCTGGTTGAG GCAAGAGAGG 1740 

TGGGGGAGGC AACTGGTGGT CCTGAGCTAT CTGGGGTCCC TCGAGGAGAG AGCGAGGAGA 1800 

CAGGAAGCTC CGAGGGTGCC CCTTCCCTGC TTCCAGCCAC ACGGGCCCCT GAGGGTACCA 1860 

GGGAGCTGGA GGCCCCCTCT GAAGATAATT CTGGAAGAAC TGCCCCAGCA GGGACCTCAG 1920 

TGCAGGCCCA GCCAGTGCTG CCCACTGACA GCGCCAGCCG AGGTGGAGTG GCCGTGGTCC 1980 

CCGCATCAGG TGACTGTGTC CCCAGCCCCT GCCACAATGG TGGGACATGC TTGGAGGAGG 2040 

AGGAAGGGGT CCGCTGCCTA TGTCTGCCTG GCTATGGGGG GGACCTGTGC GATGTTGGCC 2100 

TCCGCTTCTG CAACCCCGGC TGGGACGCCT TCCAGGGCGC CTGCTACAAG CACTTTTCCA 2160 

CACGAAGGAG CTGGGAGGAG GCAGAGACCC AGTGCCGGAT GTACGGOGCG CATCTGGCCA 2220 

GCATCAGCAC ACCCGAGGAA CAGGACTTCA TCAACAACCG GTACCGGGAG TACCAGTGGA 2280 

TCGGACTCAA CGACAGGACC ATCGAAGGCG ACTTCTTGTQ GTCGGATGGC GTCCCCCTGC 2340 

TCTATGAGAA CTGGAACCCT GGGCAGCCTG ACAGCTACTT CCTGTCTGGA GAGAACTGCG 2400 

TGGTCATGGT GTGGCATGAT CAGGGACAAT GGAGTGACGT GCCCTGCAAC TACCACCTGT 2460 

CCTACACCTG CAAGATGGGG CTGGTGTCCT GTGGGCCGCC ACCGGAGCTG CCCCTGGCTC 2520 

AAGTGTTCGG CCGCCCACGG CTGCGCTATG AGGTGGACAC TGTGCTTCGC TACCGGTGCC 2580 

GGGAAGGACT GGCCCAGCGC AATCTGCCGC TGATCCGATG CCAAGAGAAC GGTCGTTGGG 2640 

AGGCCCCCCA GATCTCCTGT GTGCCCAGAA GACCTGCCCG AGCTCTGCAC CCAGAGGAGG 2700 

ACCCAGAAGG ACGTCAGGGG AGGCTACTGG GACGCTGGAA GGCGCTGTTG ATCCCCCCTT 2760 

CCAGCCCCAT GCCAGGTCCC TAGGGGGCAA GGCCTTGAAC ACTGCCGGCC ACAGCACTGC 2820 
CCTGTCACCC AAATTTTCCC TCACACCCTG CGCTCACCAC AGGAAGTGAC AACATGAC 

Seq ID MO: 260 Protein sequence 
Protein Accession #: NP_068767.1 

J 11 21 31 41 51 
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MAQLFLPLLA ALVLAQAPAA LADVLEGDSS EDRAFRVRIA GDAPLQGVLG GALTIPCHVH 60 

YLRPPPSRRA VLGSPRVKWT FLSRGREAEV LVARGVRVKV NEAYRFRVAL PAYPASLTDV 120 

SLALSBLRPN DSGIYRCEVQ HGIDDSSDAV EVKVKGWFL YREGSARYAF SFSGAQBACA 180 

RIGAHIATPE QLYAAYLGGY EQCDAGWLSD QTVRYPIQTP REACYGDMDG FPGVRNYGW 240 

DPDDLYDVYC YAEDLNGELF LGDPPEKLTL EEARAYCQER GAEIATTGQL YAAWDGGLDH 300 

CSPGWIiADGS VRYPIVTPSQ RCGGGLPGVK TLFLFPNQTG PPNKHSRPNV YCFRDSAQPS 360 

AIPEASNPAS NPASDGLEAI VTVTETLEEL QLPQEATESE SRGAIYSIPI MEDGGGGSST 420 

PEDPAEAPRT LLEFETQSMV PPTGFSEEEG KALEEEEKYE DEEEKEEEEE EEEVEDEALW 480 

AWPSELSSPG PEASLPTEPA AQEESLSQAP ARAVLQPGAS PLPDGESEAS RPPRVHGPPT 540 

ETLPTPRERN LASPSPSTLV EAREVGEATG GPELSGVPRG ESEETGSSEG APSLLPATRA 600 

PEGTRELEAP SEDNSGRTAP AGTSVQAQPV LPTDSASRGG VAWPASGDC VPSPCHNGGT 660 

CLEEEEGVRC LCLPGYGGDL CDVGLRFCNP GWDAFQGACY KHFSTRRSWE EAETQCRMYG 720 

AHLASISTPE EQDFINNRYR EYQWIGLNDR TIEGDFLWSD GVPLLYENWN PGQPDSYFLS 780 

GENCWMVWH DQGQWSDVPC NYHLSYTCKM GLVSCGPPPE LPLAQVFGRP RLRYEVDTVL 840 

RYRCREGLAQ RNLPLIRCQE NGRWEAPQIS CVPRRPARAL HPEEDPEGRQ GRtiLGRWKAL 900 
LIPPSSPMPG P 

Seq ID NO; 261 DMA sequence 

Nucleic Acid Accession #: NMJ) 04 386.1 

Coding sequence: 2.. 3967 

1 11 21 31 41 51 

i ! I I | | 

GATGGGGGCC CCGTTTGTCT GGGCCTTGGG CCTTTTGATG CTGCAGATGC TGCTCTTTGT 60 

GGCTGGGGAA CAGGGCACAC AGGATATCAC CGATGCCAGC GAAAGGGGGC TCCACATGCA 120 

GAAGCTGGGG TCTGGGTCAG TGCAGGCTGC GCTGGCGGAG CTGGTGGCCC TGCCCTGTCT 180 

CTTTACCCTG CAGCCACGGC CAAGCGCA6C CCGAGATGCC CCTCGGATAA AGTGGACCAA 240 

GGTGCGGACT GCGTCGGGCC AGCGACAGGA CTTGCCCATC CTGGTGGCCA AGGACAATGT 300 

CGTGAGGGTG GCCAAAAGCT GGCAGGGACG AGTGTCACTG CCTTCCTACC CCCGGCGCCG 360 

AGCCAACGCC ACGCTACTTC TGGGGCCACT GAGGGCCAGT OACTCTGGGC TGTACCGCTG 420 

CCAGGTGGTG AGGGGCATCO AGGATGAGCA GGACCTGGTG CCCTTGGAGG TGACAGGTGT 480 

TGTGTTCCAC TACCGATCAG CCCGGGACCG CTATGCACTG ACCTTCGCTG AGGCCCAGGA S40 

GGCCTGCOGT CTCAGCTCAG CCATCATTGC AGCCCCTCGG CATCTACAGG CTGCCTTTGA 600 

GGATGGCTTT GACAACTGTG ATGCTGGCTG GCTCTCTGAC CGCACTGTTC GGTATCCTAT 660 

CACCCAGTCC CGTCCTGGTT GCTATGGCGA CCGTAGCAGC CTTCCAGGGG TTCGGAGCTA 720 

TGGGAGGCGC AACCCACAGG AACTCTACGA TGTGTATTGC TTTGCCCGGG AGCTGGGGGG 780 

OGAGGTCTTC TACGTGGGCC CGGCCCGCCG CCTGACACTG GCCGGCGCGC GTGCACAGTG 840 

CCGCCGCCAG GGTGCCGCGC TGGCCTCGGT GGGACAGCTG CACCTGGCCT GGCATGAGGG 900 

CCTGGACCAG TGCGACCCGG GCTGGCTGGC CGACGGCAGC GTGCGCTACC CGATCCAGAC 960 

GCCGCGCCGG CGCTGCGGGG GCCCAGCCCC GGGOGTGCGC ACCGTCTACC GCTTCGCTAA 1020 

CCGGACCGGC TTCCCCTCAC CCGCCGAGCG CTTCGACGCC TACTGCTTCC GAGCTCATCA 1080 

CCCCACGTCA CAACATGGAG ACCTAGAGAC CCCATCCTCT GGGGATGAGG GGGAGATTCT 1140 

GTCAGCAGAG GGGCCCCCAG TTAGAGAACT GGAGCCCACC CTGGAGGAGG AAGAGGTGGT 1200 

CACCCCTGAC TTCCAGGAGC CTCTGGTGTC CAGTGGGGAA GAAGAAACCC TGATTTTGGA 1260 

GGAGAAGCAG GAGTCTCAAC AGACCCTCAG CCCTACCCCT GGGGACCCCA TGCTGGCCTC 1320 

ATGGCCCACT GGGGAAGTGT GGCTAAGCAC GGTGGCCCCC AGCCCTAGCG ACATGGGGGC 1380 

AGGCACTGCA GCAAGTTCAC ACACGGAGGT GGCCCCAACT GACCCTATGC CTAGGAGAAG 1440 

GGGGCGCTTC AAAGGGTTGA ATGGGCGCTA CTTCCAGCAG CAGGAACOGG AGCCGGGGCT 1500 

GCAAGGGGGG ATGGAGGCCA GCGCCCAGCC CCCCACCTCA GAGGCTGCAG TGAACCAAAT 1560 

GGAGCCTCCG TTGGCCATGG CAGTCACAGA GATGTTGGGC AGTGGCCAGA GCCGGAGCCC 1620 

CTGGGCTGAT CTGACCAATG AGGTGGATAT GCCTGGAGCT GGTTCTGCTG GTGGCAAGAG 1680 

CTCCCCAGAG CCCTGGCTGT GGCCCCCTAC CATGGTCCCA CCCAGCATCT CAGGCCACAG 1740 

CAGGGCCCCT GTCCTGGAGC TAGAGAAAGC OGAGGGCCCC AGTGCCAGGC CAGCCACCCC 1800 
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AGACCTGTTT TGGTCCCCCT TGGAGGCCAC TGTCTCAGCT CCCAGCCCTG CCCCCTGGGA 1860 

GGCATTCCCT GTGGCCACCT CCCCAGATCT CCCTATGATG GCCATQCTGC GTGGTCCCAA 1920 

AGAGTGGATG CTACCACACC CCACCCCCAT CTCCACOGAG GCCAATAOAG TTGAGGCACA 1980 

TGGTGAGGCC ACCGCCACGG CTCCACCCTC CCCTGCTGCA GAGACCAAGG TGTATTCCCT 2040 

GCCTCTCTCT TTGACCCCAA CAGGACAGGG TGGAGAGGCC ATGCCCACAA CACCTGAGTC 2100 

CCCCAGGGCA GACTTCAGAG AAACTGGGGA GACCAGCCCT GCTCAGGTCA ACAAAGCTGA 2160 

GCACTCCAGC TCCAGCCCAT GGCCTTCTGT AAACAGGAAT GTGGCTGTAG GTTTTGTCCC 2220 

CACTGAGACT GCCACTGAGC CAACGGGCCT CAGGGGTATC CCGGGGTCTG AGTCTGGGGT 2280 

CTTCGACACA GCAGAAAGCC CCACTTCTGG CTTGCAGGCC ACTGTAGATG AGGTGCAGGA 2340 

CCCCTGGCCC TCAGTGTACA GCAAAGGGCT GGATGCAAGT TCCCCATCTG CCCCCCTGGG 2400 

GAGCCCTGGA GTCTTCTTGG TACCCAAAGT CACCCCAAAT TTGGAGCCTT GGGTTGCTAC 2460 

AGATGAAGGA CCCACTGTGA ATCCCATGGA TTCCACAGTC ACGCCGGCCC CCAGTGATGC 2520 

TAGTGGAATT TGGGAACCTG GATCCCAGGT GTTTGAAGAA GCCGAAAGCA CCACCTTGAG 2580 

CCCTCAGGTG GCCCTGGATA CAAGCATTGT GACGCCCCTC ACGACCCTGG AGCAGGGGGA 2640 

CAAGGTTGGA GTTCCAGCCA TGTCTACACT GGGCTCCTCA AGCTCCCAAC CCCACCCAGA 2700 

GCCAGAGGAT CAGGTGGAGA CCCAGGGAAC ATCAGGAGCT TCAGTGCCTC CGCATCAGAG 2760 

CAGTCCCCTA GGGAAACCGG CTGTTCCTCC TGGGACACCG ACTGCAGCCA GTGTGGGCGA 2820 

GTCTGCCTCA GTTTCCTCAG GGGAGCCTAC GGTACCGTGG GACCCCTCCA GCACCCTGCT 2880 

GCCTGTCACC CTGGGCATAG AGGACTTCGA ACTGGAGGTC CTGGCAGGGA GCCCGGGTGT 2940 

AGAGAGCTTC TGGGAGGAGG TGGCAAGTGG AGAGGAGCCA GCCCTGCCAG GGACCCCTAT 3000 

GAATGCAGGT GCGGAGGAGG TGCACTCAGA TCCCTGTGAG AACAACCCTT GTCTTCATGG 3060 

AGGGACATGT AATGCCAATG GCACCATGTA TGGCTGTAGC TGTGATCAGG GCTTCGCCGG 3120 

GGAGAACTGT GAGATTGACA TTGATGACTG CCTCTGCAGC CCCTGTGAGA ATGGAGGCAC 3180 

CTGTATTGAT GAGGTCAATG GCTTTGTCTG CCTTTGCCTC CCCAGCTATG GGGGCAGCTT 3240 

TTGTGAGAAA GACACCGAGG GCTGTGACCG CGGCTGGCAT AAGTTCCAGG GCCACTGTTA 3300 

CCGCTATTTT GCCCACCGGA GGGCATGGGA AGATGCCGAG AAGGACTGCC GCCGCCGCTC 3360 

CGGCCACCTG ACCAGCGTCC ACTCACCGGA GGAACACAGC TTCATTAATA GCTTTGGGCA 3420 

TGAAAACACG TGGATCGGCC TGAACGACAG GATCGTGGAG AGAGATTTCC AGTGGACGGA 3480 

CAACACCGGG CTGCAATTTG AGAACTGGCG AGAGAACCAG CCGGACAATT TCTTCGCGGG 3540 

TGGCGAGGAC TGTGTGGTGA TGGTGGCGCA TGAAAGCGGG GGCTGGAACG ATGTCCCCTG 3600 

CAACTACAAC CTACCCTATG TCTGCAAGAA GGGCACAGTG CTCTGTGGTC CCCCTCCGGC 3660 

AGTGGAGAAT GCCTCACTCA TCGGTGCCCG CAAGGCCAAG AACAATGTCC ATGCCACTGT 3720 

AAGGTACCAG TGCAATGAAG GATTTGCCCA GCACCATGTG GTCACCATTC GATGCCGGAG 3780 

CAATGGCAAG TGGGACAGGC CCCAAATTGT CTGCACCAAA CCCAGACGTT CACATCGGAT 3840 

GCGGGGACAC CACCACCACC ACCAACACCA CCACCAGCAT CACCACCACA AATCCCGCAA 3900 

GGAGCGCAGA AAACACAAGA AACACCCAAC GGAGGACTGG GAGAAGGACG AAGGGAATTT 3960, 

TTGCTGAAGA ACCAGAAAAA AGAAAGCACA ACACCTTTCC CATGCCTCCT CTGGAGCCTT 4020* 

CGCCTGGGGA GACAGAACCC AGAGAGAAAC AAGAGAGTCC AGAAGTCCCT GAACGCCAAA 4080 

CTGTTCTCGC AAAAAAAATA TTCCTTTGAA CAAAGGTCTT CTTTTCCTTT TTTTACATAC 4140 

ACAAGATCTT CTTGGCAGGT GGAGCCAGGT GTCTGAAAAG TTCATTCTCG TCTGGCTGAA 4200 

CTCTGGGAGT GTGTCCCAGC TGAGGGAAGC ACAAGTAGCA AAGCTCATTG GTCTGGTCTC 4260 

TTGTTTGCCA GGCTGATTGA AGCAGGCCTT GATGAGGGTG CATGAGTGTA TGTTTGCATT 4320 

CACATGAAGG AATTGCTTTT CACACCAGAA ATTCAGACTT AGTCAATGTT GGCTGAATTC 4380 

CTAAATCCAG GAAGAAGCCT GGACGTAGGG TCATTAGCTT TGGGAATAGA AGGCTACACA 4440 

GAAGCACACT GTTTTTGAAC TTGACAACAG CTCTCCCTTT ACCCTGGACT TCAGCCCAAG 4500 

TTCCGTCTTT GGTCTTGGTG GATAAACACA CAGTGTGGAG ATCCCACGTA CTGCATTTTA 4560 

GGGATGTTTT TAGGACAACC TCCCTCCATG CCTTCAGAGT TAGGAGTGAG AATGATCAAA 4620 

GCAATATGTA GGTGATGGAG GGAGAGTGTA TTGCTAACCC TTCCAGGTCT AGTCCAGCGC 4680 

TGAGATTTGG TGGTTCTGCA TGTGTGATGA ATCTCTTTCA CACAAATAGA CGAGAGGATA 4740 

TTTAGGGCTA GATGAGCCCA GATTTCTTCC CCCTCCATCT CTCAGGGAGA CAAAGAACCT 4800 

CCTTCCTGGA CCAAGGAGGT GCTGCCAAGT TTTCTAGCCC AGTGCACATA CCCAGTCCTT 4860 

AAGCAGACAT TGGTAGTGCC CCTGCCCTGG GTCCCACTCC TGCCCCACCC CACCCTTGTC 4920 

CCTGGCCATT GCCTGGTGGT CTAGAAACAC TTAAAACTTG AAGTAGTGAC ACCTACCTGC 4980 

GGTCATATTG TAGAGAGATG CTCAGTGTTA AAACTGAAAC ACACAAACAC ACACACACAC 5040 

ACATTTTTCT CTTGTAGATT TTAATTTTTT AAGTGGGAAA GAACTCACCT TGCCTTCCTC 5100 

CCCCAAATGT GCAACCTGTA AAAGGTCTCT CCACACCAGG GGCCAGGATC CAGTTCCCTC 5160 

ATCTCTGGCA GGAAAGATCC ACAGCTTTTC CTCCATGTCT GTTACTCACT TTCAGCAGTC 5220 

CGGGTAAAAT CTGTGGATCA GGGTTAAAAA AGCACCGTGG AGAATGGCCC TCTTCAGGAA 5280 

AGAAAAATAA GCAAATGAAT GGTCCACCTA GGGGTTCAGT AAAGAAAGAA ATGTGTTAAC 5340 

TGAGCCTGAA TCCCTTCTGG GAAGTAATAA TGACCATTGA CAACTAAGAA GTAGACACCA 5400 

TGCTAAAGAC TTACATACAA TCTCCTTGAA TCTTCTCAAT AGCCCATTGA CTTAGAAACT 5460 

GTTACTTTCC CATTTTACAC ACAGTGAAAC TGAGGCTCAG ATATAAAGGA AAGGTACTGG 5520 

CTTGAAGTCA CAACCACGAC AGGAGTAAGG ATTTGGAATA AGGATTTGGT CCTGTTTTCT 5S80 

GGACCAAATC CTTACTCTGG CTCTGCTTAC ACTTTCTCTC CATCACCAAA TCCTTACTCC 5640 

AAATCCAGAA GTCAGAGCCA ACTCCCATCT TGGTTCTGAC CCAAATCCTG CTCTGGACTC 5700 

TGGAGAGGAG ATTGAAATAT AATTGCACCC TCATACACAT TTAGGAAATG GTTAAGAAGT 5760 

GTAAACTGAA CCCTTATCCT TGTCTTCAAT CTTCCTCCCT GTAGACATCT ATCTTATTAT 5820 

GGTTATTATT CAGAAAACCC AGGGATACAG GTTTGTCTTC TTACTTTGAT AACTCTTCTT 5880 

AGTTTAAAAT AATAATAATA ACACATCTTT GGTCATCTAT GTCACACAAA AATTTTCCTT 5940 

TGTTTGCGGG GGGCTGGGGA TGCAGTGTTT TTTGGGGGGT CTTGGTTTAT GCTCCCTGCC 6000 

CTTGAGCCCC TCAGCCGTTT GCCCTGCCCC CACCTCGGCT CCATGGTGGG AGGGGGCTCT 6060 

GGTCTTTTCT AAAGTGGGCG GTTTGTCTTT TGATCTTTCC CTTTTGGATG TGOGTGTGTG 6120 

TCTGCGTGTG CCATGTGCGT GGCACGCATA TGAGTGTGTG TGCGTGTGAA CGGCTTTGGG 6180 

TCCTGCTGGT TTTGCTGTGA GCTGCAGTGT TCTGTGGGTC TGTGGTATCT GACACTGTGQ 6240 

ACATTAATGT ACTTCTTGGA CATTTTAATA AATTTTTTAA CAGTTCAAAA AAAAAAAAAA 6300 
AAAAAAAAAA 

Seq ID NO: 262 Protein sequence 
Protein Accession #t NP 004377.1 



1 11 21 31 
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MGAPFVWALG LLMLQMLLFV AGEQGTQDIT 
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FTLQPRPSAA 
ANATLLLGPL 
ACRLSSAIIA 
GRRNPQELYD 
LDQCDPGWLA 
' PTSQHGDLET 
EKQESQQTLS 
GRFKGLNGRY 
HADLTNEVDM 
DLFWSPLEAT 
GEATATAPPS 
HSSSSPWPSV 
PWPSVYSKGL 
SGIWEPGSQV 
PEDQVETQGT 
PVTLGIEDFE 
GTCNANGTMY 
CEKDTEGCDR 
ENTWIGLNDR 
NYNLPYVCKK 
NGKWDRPQIV 
C 



RDAPRIKWTK 
RASDSGLYRC 
APRHLQAAFE 
VYCPARELGG 
DGSVRYPIQT 
PSSGDEGBIL 
PTPGDPHLAS 
FQQQEPEPGL 
PGAGSAGGKS 
VSAPSPAPWE 
PAAETKVYSL 
NRNVAVGFVP 
DASSPSAPLG 
FEEAESTTLS 
SGASVPPHQS 
LEVLAGSPGV 
GCSCDQGFAG 
GWHKFQGHCY 
IVERDFQWTD 
GTVLCGPPPA 
CTKPRRSHRM 



VRTASGQRQD 
QWRGIEDEQ 
DGFDNCDAGW 
EVFYVGPARR 
PRRRCGGPAP 
SAEGPPVREL 
WPTGEVWLST 
QGGMEASAQP 
SPEPWLWPPT 
AFPVATSPDL 
PLSLTPTGQG 
TETATEPTGL 
SPGVFLVPKV 
POVALDTSIV 
SPLGKPAVPP 
ESFWEEVASG 
ENCEIDIDDC 
RYFAHRRAWE 
NTGLQFENWR 
VENASLIGAR 



LPILVAKDNV 
DLVPJJEVTGV 
LSDRTVRYPI 
LTLAGARAQC 
GVRTVYRFAN 
EPTLEEEEW 
VAPSPSDKGA 
PTSEAAVNQM 
MVPPSISGHS 
PMMAMLRGPK 
GEAMPTTPES 
RGIPGSESGV 
TPNLEPWVAT 
TPLTTLEQGD 
GTPTAASVGE 
EEPALPGTPM 
LCSPCENGGT 
DAEKDCRRRS 
ENQPDNFFAG 
KAKNNVHATV 



VRVAKSWQGR 
VFHYRSARDR 
TQSRPGCYGD 
RRQGAALASV 
RTGFPSPAER 
TPDFQEPLVS 
GTAASSHTEV 
EPPLAMAVTE 
RAPVLELEKA 
EWMLPHPTPI 
PRADFRETGE 
FDTAESPTSG 
DEGPTVNPMD 
KVGVPAMSTL 
SASVSSGEPT 
HAGAEEVHSD 
CIDEVNGFVC 
GHLTSVHSPE 
GEDCWMVAH 
RYQCNEGFAQ 
ERRKHKKHPT 



VSLPSYPRRR 
YALTFAEAQE 
RSSLPGVRSY 
GQLHLAWHEG 
FDAYCFRAHH 
SGEEETLILE 
APTDPMPRRR 
MLGSGQSRSP 
EGPSARPATP 
STEANRVEAH 
TSPAQVNKAE 
LQATVDEVQD 
STVTPAPSDA 
GSSSSQPHPE 
VPWDPSSTLL 
PCENNPCLHG 
LCLPSYGGSF 
EHSFINSFGH 
ESGRWNDVPC 
HHWTIRCRS 
EDWEKDEGNF 



120 
1BO ■ 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



Seq ID NO: 263 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequences 7-. 2085 



11 



21 



31 



41 51 
I I 

GCCGCGATGG CCAGCACCAG GAGTATCGAG CTGGAGCACT TTGAGGAACG GGACAAAAGG 60 

CCGCGGCCGG GGTCGCGGAG AGGGGCCCCC AGCTCCTCCG GGGGCAGCAG CAGCTCGGGC 120 

CCCAAGGGGA ACGGGCTCAT CCCCAGTCCG GCGCACAGTG CCCACTGCAG CTTCTACCGC 1B0 

ACGCGGACCC TGCAGGCCCT CAGCTCGGAG AAGAAGGCCA AGAAGGCGCG CTTCTACCGG 240 

AACGGGGACC GCTACTTCAA GGGCCTGGTG TTTGCCATCT CCAGCGACGG CTTCCGGTCC 300 

TTCGATGCGC TCCTCATAGA GCTCACCCGC TCCCTGTCGG ACAACGTGAA CCTGCCCCAG 360 

GGTGTCCGCA CTATCTACAC CATCGACGGC AGCCGGAAGG TCACCAGCCT GGACGAGCTG 420 

CTGGAAGGTG AGAGTTACGT GTGTGCATCC AATGAACCAT TTCGTAAAGT CGATTACACC 480 

AAAAATATTA ATCCAAACTG GTCTGTGAAC ATCAAGGGTG GGACATCCCG AGOGCTGGCT 540 

GCTGCCTCCT CTGTGAAAAG TGAAGTAAAA GAAAGTAAAG ATTTCATCAA ACCCAAGTTA 600 

GTGACTGTGA TTCGAAGTGG AGTGAAGCCT AGAAAAGCCG TGOGGATCCT TCTGAATAAA 660 

AAGACTGCTC ATTCCTTTGA ACAAGTCTTA ACAGATATCA CCGAAGCCAT TAAACNAGCC 720 

TCAGGAGTCG TCAAGAGGCT CTGCACCCTG GATGGAAAGC AGGTGAGAGT TACGTGTGTG 780 

CATCTGCCAG ACTTTTTTGG TGATGACGAT GTTTTTATTG CATGTGGACC AGAAAAATTT 840 

CGTTATGCCC AAGATGACTT TGTCCTGGAT CATAGTGAAT GTCGTGTCCT GAAGTCATCT 900 

TATTCTOGAT CCTCAGCTGT TAAGTATTCT GGATCCAAAA GCCCTGGGCC CTCTCGACGC 960 

AGCCAGATTT CTGCTCATGG CAGATCTTCT TCCAATGTAA ACGGTGGACC TGAGCTTGAC 1020 

CGTTGCATAA GTCCTGAAGG TGTGAATGGA AACAGATGCT CTGAATCATC AACTCTTCTT 1080 

GAGAAATACA AAATTGGAAA GGTCATTGGT GATGGCAATT TTGCAGTAGT CAAAGAGTGT 1140 

ATAGACAGGT CCACTGGAAA GGAGTTTGCC CTAAAGATTA TAGACAAAGC CAAATGTTGT 1200 

GGAAAGGAAC ACCTGATTGA GAATGAAGTG TCAATACTGC GCCGAGTGAA ACATCCCAAT 1260 

ATCATTATGC TGGTCGAGGA GATGGAAACA GCAACTGAGC TCTTTCTGGT GATGGAATTG 1320 

GTCAAAGGTG GAGATCTCTT TGATGCAATT ACTTCGTCGA CCAAGTACAC TGAGAGAGAT 1380 

GGCAGTGCCA TGGTGTACAA CTTAGCCAAT GCCCTCAGGT ATCTCCATGG CCTCAGCATC 1440 

GTGCACAGAG ACATCAAACC AGAGAATCTC TTGGTGTGTG AATATCCTGA TGGAACCAAG 1500 

TCTTTGAAAC TGGGAGACTT TGGGCTTGCG ACTGTGGTAG AAGGCCCTTT ATACACAGTC 156 0 

TGTGGCACAC CCACTTATGT GGCTCCARAA ATCATTGCTG AAACTGGCTA TGGCCTGAAG 1620 

GTGGACATTT GGGCAGCTGG TGTGATCACA TACATACTTC TCTGTGGATT CCCACCATTC 1680 

CGAAGTGAGA ACAATCTCCA GGAAGATCTC TTCGACCAGA TCTTGGCTGG GAAGCTGGAG 1740 

TTTCCGGCCC CCTACTGGGA TAACATCACG GACTCTGCCA AGGAATTAAT CAGTCAAATG 1800 

CTTCAGGTAA ATGTTGAAGC TCGGTGTACC GCGGGACAAA TCCTGAGTCA CCCCTGGGTG I860 

TCAGATGATG CCTCCCAGGA GAATAACATG CAAGCTGAGG TGACAGGTAA ACTAAAACAG 1920 

CACTTTAATA ATGCGCTCCC CAAACAGAAC AGCACTACCA CCGGGGTCTC CGTCATCATG 1980 

GTGAGTGGAA GGCGGCAGGT CTGGCCTGAC TGCGGAGCCG GCCTTGAAGT TTTTGAATTA 2040 

GGTAGCCGGG AGCTGCCCTC ACATGGAAGT TGGTGCCTTC OGTAGTCCTA TTTCATATGA 2100 

AGATTGGCTT GGCATGTGGA GGGCACTCAT TCGGCAACTC CCAGGCTTTG GGCACTGTGT 2160 

GGAGGGGCTT GTGTAGGGAC CAGCAGGCCT GGTGTGAGGG GTCCAGGCGT CAAGGAGCTC 2220 

CTGGCTGGGC CCTCTGGGCA GCTGCTTCCA CTCTTGTCTC TGCCTTCTCA TCTAGAGAGA 2280 

CTCCCAAGCC CTGGAGGGGT GTGTTGTGTT AGGAATTAAC TCCCTGCCTA CCCCAAGGCC 2340 

TCAGAAATAG ATTATTAGAG ATGTGAATTA TTCTTTGAGA CTTGGGATAA GAAACAGCCA 2400 

AAGCTAAACA TATTTCAGTT TTAAAAAATC AGTGTTTTAT AAAACACAGT TTGGGGCTTT 2460 

TAAAGGTACA TAATCAAGGA AAAAAATATA TATTCATTTT TCAGGGTTGG TAACATTTTA 2520 

TGAGATGTCA GTGACAACGA TGGCCTTATT TTTTTCAGCC TTTTCTTCTT CCAAAATGTT 2580 

TCTTAAGGCA ACTCTCCTAA ATACATAAAC ACAACAAATT AAAATGAAAA GTGACATGAG 2640 

AGTAAATGAA TCAAAAGGAA AAAACATTGA ACCAGAGGTG AGGGCAGCAC ACCCGCAGCA 2700 

GCTGTCCAGG CCTGAGCCAA TGCAACCCTG GGOGGGAAGG CCAGCTCACC GTGAGCAGGT 2760 

AGAAGCCAGC CAGCCACCCA GGCAGGGACC TTGGTTCTCC CCACACACTC CCAGGAGCAG 2820 

GGAACAGGGG TGGAGTGGCC TTTCCCAGAG CTGGAGTTGG CTGCAGCAGC TTTCGAATCA 2880 

GACCTGCCAA GGTGATGGGC GTCTGAGTTT CACATCTGGG CCCCCCGTGA CCCCACTGAG 2940 

TCCTGAGAGC TAAGGATGGG CCACCTCCAC AGCTCCGTCA CTCGTACTTG GGACAGGCCT 3000 

CTCATCCTCT GGGAAGGTCC TCCTTGTTTC CTACCCAACT AGAAGGGAAA CAGTGGCATA 3060 

TTCTCATGGT ACATGGTTGT CTGAAAGCCT TACCTAGGAA GACGCAGGGT CTAGATAGAA 3120 

GCTATAAGGA AGCCACACAC ATAACCCACA TCCCCACACC CCCAACATCC CCCACACTCC 3180 
CCACACCCCC CACACCCCCC ACATCCCCAC CATAATTACC CCCACCTCCA AAT AT CTCAT 
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MASTRSIELE 
TLQALSSEKK 
RTIYTIDGSR 
SSVKSEVKES 
WKRLCTLDG 
RSSAVKYSGS 
YKIGKVIGDG 
MLVEEMETAT 
RDIKPENLLV 
IWAAGVITYI 
VNVEARCTAG 
GRRQVWPDCG 



11 
I 

HFEERDKRPR 
AKKARFYRNG 
KVTSLDELLE 
KDFIKPKLVT 
KQVRVTCVHL 
KSPGPSRRSQ 
NFAWKECID 
ELFLVMELVK 
CEYPDGTKSL 
LLCGFPPFRS 
QILSHPWVSD 
AGLEVFELGS 



DRYFKGLVFA 
GESYVCASNE 
VIRSGVKPRK 
PDFFGDDDVF 
ISAHGRSSSN 
RSTGKEFALK 
GGDLFDAITS 
KLGDFGLATV 
ENNLQEDLFD 
DASQENNMQA 
RELPSHGSWC 



31 
I 

SGGSSSSGPK 
ISSDRFRSFD 
PFRKVDYTKN 
AVRILLNKKT 
IACGPEKFRY 
VKGGPELDRC 
IIDKAKCCGK 
STKYTERDGS 
VEGPLYTVCG 
QILAGKLEFP 
EVTGKLKQHF 
LP 



41 
I 

GNGLIPSPAH 
ALLIELTRSL 
INPNWSVNIK 
AHSFEQVLTD 
AQDDFVLDHS 
ISPEGVNGNR 
EHLIENEVSI 
AMVYNIiANAL 
TPTYVAPXII 
APYWDNITDS 
NNALPKQNST 



51 
I 

SAHCSFYRTR 
SDNVNIiPQGV 
GGTSRALAAA 
ITEAIKXASG 
ECRVLKSSYS 
CSESSTLLEK 
LRRVKHPNI I 
RYLHGLSIVH 
AETGYGLKVD 
AKELISQMLQ 
TTGVSVIMVS 



60 
120 
1B0 
240 
300 
360 
420 
480 
540 
600 
660 



Seq ID NO: 265 DNA sequence 

Nucleic Acid Accession #: AB020684.1 

Coding sequence: 1..1744 



11 



21 



31 41 51 

I i I I I 1 

CCCCCTTGTC ATTAATACAT TAAAAAGATT CAATCTTTAC CCTGAGGTAA TTTTGGCCAG 60 

TTGGTACCGG ATTTATACCA AAATAATGGA CTTGATTGGT ATTCAAACCA AGATATGTTG 120 

GACGGTTACC AGAGGAGAAG GACTCAGTCC TATTGAAAGC TGTGAAGGAT TGGGAGATCC 180 

TGCTTGCTTT TATGTTGCTG TAATTTTTAT TTTAAATGGA CTAATGATGG CATTATTCTT 240 

CATATATGGC ACATATTTAA GTGGCAGCCG ATTAGGAGGC CTGGTTACAG TGTTGTGCTT 300 

CTTTTTCAAT CATGGAGAGT GTACCCGTGT AATGTGGACA CCACCTCTCC GTGAAAGCTT 360 

CTCATATCCA TTTCTTGTTC TTCAGATGTT GCTAGTGACT CATATTCTCA GGGCTACAAA 420 

ACTTTATAGA GGAAGCTTGA TTGCACTCTG CATTTCCAAT GTATTTTTCA TGCTTCCTTG 480 

GCAGTTTGCT CAGTTTGTAC TTCTTACTCA GATTGCATCA TTATTTGCAG TATATGTTGT 540 

CGGGTACATT GATATATGTA AATTACGGAA GATCATTTAT ATACACATGA TTTCTCTTGC 600 

ACTTTGTTTT GTTTTGATGT TTGGGAACTC AATGTTATTA ACTTCTTATT ATGCTTCTTC 660 

TTTGGTAATT ATTTGGGGTA TTCTGGCAAT GAAACCACAT TTCCTGAAAA TAAATGTATC 720 

TGAACTTAGT TTATGGGTTA TTCAAGGATG TTTTTGGTTA TTTGGAACTG TCATACTTAA 780 

ATACTTGACA TCTAAAATTT TTGGTATTGC AGATGACGCT CATATTGGCA ACTTACTAAC 840 

ATCAAAATTC TTTAGTTATA AGGATTTTGA TACTTTATTG TATACCTGTG CAGCGGAGTT 900 

TGACTTTATG GAAAAAGAGA CTCCACTGAG ATACACAAAG ACATTATTGC TTCCAGTTGT 960 

TCTTGTAGTG TTTGTTGCTA TTGTTAGAAA GATTATTAGT GATATGTGGG GTGTCTTAGC 1020 

TAAACAACAG ACACATGTAA GAAAACACCA GTTTGATCAT GGAGAGCTGG TTTACCATGC 1080 

ATTGCAATTG TTAGCATATA CAGCCCTTGG TATTTTAATT ATGAGACTAA AACTCTTCTT 1140 

GACACCACAC ATGTGTGTTA TGGCATCACT GATCTGCTCA AGACAGCTAT TTGGATGGCT 1200 

CTTTTGCAAA GTACATCCTG GTGCTATTGT GTTTGCTATA TTAGCAGCAA TGTCAATACA 1260 

AGGTTCAGCA AATCTGCAAA CCCAGTGGAA TATTGTAGGG GAGTTCAGCA ATTTGCCCCA 1320 

AGAAGAACTT ATAGAATGGA TCAAATATAG TACTAAACCA GATGCAGTGT TTGCGGGTGC 1380 

CATGCCCACG ATGGCAAGTG TTAAGCTCTC TGCACTTCGG CCCATTGTGA ATCATCCACA 1440 

TTATGAAGAC GCAGGCTTGA GAGCCAGAAC AAAAATAGTA TACTCAATGT ATAGTCGGAA 1500 

AGCAGCCGAA GAAGTGAAGC GAGAACTGAT AAAGTTAAAA GTGAACTATT ACATTCTAGA 1560 

AGAGTCATGG TGTGTAAGAA GATCGAAGCC TGGTTGCAGT ATGCCTGAAA TTTGGGATGT 1620 

AGAAGATCCT GCCAATGCTG GGAAAACTCC CTTATGTAAC CTCTTGGTGA AGGATTCCAA 1680 

ACCTCACTTC ACCACTGTAT TCCAGAACAG TGTTTACAAA GTCCTAGAAG TTGTAAAAGA 1740 

ATGACTGCTA CATGACCTGC TGCCTACGGA GAACTACATC TGTAATGGTT TTAATGTTTT 1800 

GCTAAGTCAT GTGTTGTTCA TATCCCAAAA ACTTTTATAG GTAACTGTTT TCAAATAGAA 1860 

AACGTTTTAT TTGGTCAATT TGAATGTCAT TCTAATTATA AAAATGACTT ACACCTTTAT 1920 

CAATTGGTTA CTATTTCAAT GCACCCTTTA AAATTTGCTA TGCAAATGAG TATATGCTTG 1980 

TACTTGACTT TAATATTTGT GCTAAAGTGA GCAAAGCTAC CTGTATAAAG AAAACACAGT 2040 

GGGTTGTGAC AAGGATGACA TGAAAATACA GGACAATTCT GACAATGTAG GGGCTGATTT 2100 

TATAGTGTAA GAACTATTAA TGCCCCTTGC TTCTTTTTTC TGCCTCTTGC TCTTGTCTTT 2160 

TGGACATTTC AGTGATTGTA AGTTCTTCGG TCATGTCAGC CCCTGTCATC AACTTGAGTT 2220 

ACAGTAGATG GGGCAGACAT GGAGTGTTTG CTATATAAAA CTAT CTGTTT GTTTTACTTC 2280 

CTTGTGCGCT TTTTGTTCTC TGTTCTCTTG TTAATGAAGC TTTTCCTGCC CATTATTAAT 2340 

CCAAACTCTT GGACCTTGTG GTTAGGAAAT TCCCTTAACT TCCAGCCATA TGGCATTATC 2400 

GTGTCTCTTT CTCTCTCTCT CTTGCTCTCT CTCTTCTCCT CTTCCCCATA TTTTCTGTCA 2460 

AATAAGTACT GTTTACTCAT TTAGTTGCTT ATCAAGTACT TATTCTTGGT TTTAAAAAAA 2520 

ATTAATGGTA ACTGTATTTT TCTCATTTTT AGCATTATTC AAATGTTTAT ATTTTAATAC 2580 

CTTTAAACCA CTTTAAAGTT TTTTCATGTT TAATTATAGT TTTAAGAAAA ACTATTTTGA 2640 

ACAACCCCAA ATATAGTGCA TCTAGAAACT AATGTATATT TGATTAGACA TCATTTATAG 2700 

TGGAACAGTA GACTGTAGTA CATGGTAATT TTTCTTTTAC TATTAAGATA CAATAAAACA 2760 

TGACTAATTT TGCTGTCAAA AATGTAAAGA ATAATGATAA ATGGAGTTTT TTATATTTTA 2820 

CTTTTAAGAT TGCCTGTCTT TAATAAGACA AAGCCTTAAG CCTTATGTTA TAATTTTGGT 2880 

TCTAAAAACC ATCATTTCAG TATAAGGAAT AAGTATATTT CGTCCTCCTC TTTAGTTTTT 2940 

TTCTTCCTAT TTATTTTTAT TTTGAAAAAT TTCTACACCT TCTTTGAATT CCTTGTATGA 3000 

ATTTTTGTTT CTTAGAAGTT AATTTGTGTG AAATGAGATT CTTCAAAAOG ATGAAACCTC 3060 

ATAGCTCTGA GAAAAGGTTT TAGGGTTTTA AATTCTAAGC AAAGCGTGAC TATGGCTGAC 3120 

AGACTACACA TTTAATTATA CAGCTTCTCT TTCTTAACCA CAGGCAGATT AACCTCATTG 3180 

TGGATTGTCC TTCAGACCTT AGTCCTCAGG CATGGTTTCT GGTGCCCACT CCTGGAAGCC 3240 

GCTGTTCCCT TTCTACCTTC TTACCAGAGC CCAAGGGCAG GCCTGGTCCC GGGGAAGCAG 3300 

CAGCTTGCTG ACATAAGTCA GCTGCAAAGG CTGAGGAGTG TGCCCTCAGA GAAGCACCGC 3360 

CCCCCAGTCT TGTGCCAGCG CCTAGAGCCG CAGCTCCCAG GGATGCTCCT TCCCTGGAGG 3420 

CAGCCCAGGA GAGGGACTCT GGCAGCGTTC TTCAGATTTG TGGCCACTGT TTCTCATTTG 3480 
CTGGTTGACT GTTTTTATTT CTTAGGCTTT TGCTAGTTTT AGAAAATAGG GAAGCAGCCC 3540 

TTGATTTGTG GATTAAAAGC AACATTTGAG CGATGATGCA CAACAGTCCA GGAAAATGGG 3600 
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CGGTGGACAC TTGAGGCTGA GGATGGGAGT TGACATGAGC AGGGAGAGGG AGGTGCGCGC 3660 

TGCTTATCTG TGATTGTTGC TCACCTGAGT GTGGCTGATT GTGTACATCC AGCAGTTACA 3720 

ATTTTTAAAA ATTATACTTT TACATTTATT TTATATTTTT CTCACCCCCA GTAATTTCCT 3780 

TCCAAAGAAG TTCACATGTA ATAAGTAGAA ATTCTGTATA GGAAAAAAGC ATTAAAAATA 3840 

CTATTATAAC TGCTTCATTT GCTGGGAACC ATTAAAAGTA ATATAAATTA GCTTTTTCCA 3900 

GAAGGATCCT TTTGTAGCAG TGTTTATGAA TGTAACCCCC AGCAAAATAT GGCTATATAT 3960 

TAGGGGAGCC AGTTTGGAGC AGAGGCCTGA AGGTCCCTGC TATGCAGCCG TGGCCACAGC 4020 

TCGCAGCCCA AGCACTGTGG AGCATCCACA CCTTTGATGG CAATGCAGAT TGGTAGCAGG 4080 

TTCCATAGGC GTACAAAACA GTATTAAAGC TCAGTGTTTT GCATATTGTT AGCATTTACA 4140 

AATATTTTTG CTTTAGTATG AGGAAAGTAA GGATGGGCAA AGAAGCGATC AAAATAGCTA 4200 

TTGCTACAAC ATTTTCGAAA ACAAAGTTGG GGCTGTATTT CTTTAAAAAG ATAAGCCTCT 4260 

AAAAATGCTT GGCAAAAAAA ATATAGTGTT AAAATAGGCC AGTGATATTA ATGAGAAAAT 4320 

GAAAGTATGT ATCAGGAATA AAGTGATATT GCATAGGAGT ATTGTATTTT TATGAATTTT 4380 
ATGCCAGTTG TTTACATGTA CTATATATGT TAAATTAAAA AAAATCATGA GAAATG 

Seq ID NO: 266 Protein sequence 
Protein Accession fl: BAA74900.1 

1 11 21 31 41 51 

I I I I I I 

PLVINTLKRF NLYPEVILAS WYRIYTKIMD LIGIQTKICW TVTRGEGLSP IESCEGLGDP 60 

ACPYVAVIFI LNGLMMALPF IYGTYLSGSR LGGLVTVLCP FFNHGECTRV MWTPPLRESF 120 

SYPFLVLQML LVTHILRATK LYRGSLIALC ISNVFFMLPW QPAQFVLLTQ IASLPAVYW 180 

GYIDICKLRK IIYIHMISLA LCFVLMFGNS MLLTSYYASS LVIIWGILAM KPHPLKINVS 240 

ELSLWVIQGC FWLFGTVILK YLTSKIFGIA DDAHIGNLLT SKFFSYKDFD TLLYTCAAEF 300 

DFMEKETPLR YTKTLLLPW LWPVAIVRK IISDMWGVLA KQQTHVRKHQ FDHGELVYHA 360 

LQLLAYTALG ILIMRLKLFL TPHMCVMASL ICSRQLFGWL FCKVHPGAIV PAILAAMSIQ 420 

GSANLQTQWN IVGBFSNLPQ EEI*IEWIKYS TKPDAVPAGA MPTMASVKLS ALRPIVNHPH 480 

YEDAGLRART KIVYSMYSRK AAEEVKRELI KLKVNYYILE ESWCVRRSKP GCSMPEIWDV 540 
EDPANAGKTP LCNLL.VKDSK PHFTTVFQNS VYKVLEWKE 

Seq ID NOt 267 DNA sequence 
Nucleic Acid Accession #: U26744.1 
Coding sequence: 59. .1600 

1 11 21 31 41 51 

I I I I ! I 

CTTCAAAGAA TATAGATGGT TTTGAAAAGT TCATGCTGTC CCTTCATTGA ATTTTAGAAT 60 

GATTGAAGAT AGTGGGAAAA GAGGAAATAC CATGGCAGAA AGAAGACAGC TGTTTGCAGA 120 

GATGAGGGCT CAAGATCTGG ATCGCATCCG ACTCTCCACC TACAGAACAG CATGCAAGCT 180 

TAGGTTTGTT CAGAAGAAAT GCAATTTGCA CCTGGTGGAC ATATGGAATG TCATAGAAGC 240 

ATTGCGGGAA AATGCTCTGA ACAACCTGGA CCCAAACACT GAACTCAACG TGTCCCGCTT 300 

AGAGGCTGTG CTCTCCACTA TTTTTTACCA GCTCAACAAA CGGATGCCAA CCACTCACCA 360 

AATCCATGTG GAGCAGTCCA TCAGCCTCCT CCTTAACTTC CTGCTTGCAG CGTTTGATCC 420 

GGAAGGCCAT GGTAAAATTT CAGTATTTGC TGTCAAAATG GCTTTAGCCA CATTGTGTGG 480 

AGGGAAGATC ATGGACAAAT TAAGATATAT TTTCTCAATG ATTTCTGACT CCAGTGGGGT 540 

GATGGTTTAT GGACGATATG ACCAATTCCT TCGGGAAGTT CTCAAACTAC CCACGGAAGT 600 

TTTGGAGGGT CCTTCATTTG GTTACACAGA ACAGTCAGCC AGATCCTGTT TCTCCCAACA 660 

GAAAAAAGTC ACGTTAAATG GTTTCTTGQA CACGCTTATG TCAGATCCTC CCCCGCAGTG 720 

TCTGGTCTGG TTGCCTCTTC TGCATCGACT AGCAAATGTG GAAAATGTCT TCCATCCGGT 780 

TGAGTGTTCC TACTGCCACA GTGAGAGTAT GATGGGATTT CGCTACCGAT GCCAACAGTG 840 

TCACAATTAC CAGCTCTGTC AGGACTGCTT CTGGAGGGGA CATGCCGGTG GTTCTCATAG 900 

CAACCAGCAC GAAATGAAAG AGTACACGTC ATGGAAATCA CCTGCTAAGA AGCTGACTAA 960 

TGCATTAAGC AAGTCCCTGA GCTGTGCTTC CAGCCGTGAA CCTTTGCACC CCATGTTCCC 1020 

AGATCAGCCT GAGAAGCCAC TCAACTTGGC TCACATCGTT GATACTTGGC CTCCCAGACC 1080 

TGTAACCAGC ATGAACGACA CCCTGTTCTC CCACTCTGTT CCCTCCTCAG GAAGTCCTTT 1140 

TATTACCAGG AGCATGCTTG AGAGTTCAAA CCGGCTTGAT GAAGAACACA GGCTAATTGC 1200 

CAGGTATGCG GCAAGGCTGG CAGCAGAGTC CTCTTCGTCT CAGCCACCTC AGCAGAGAAG 1260 

TGCTCCTGAC ATCTCTTTCA CCATCGATGC GAATAAGCAG CAAAGGCAGC TGATTGCTGA 1320 

GCTAGAAAAC AAGAACAGAG AAATCTTACA GGAGATCCAG AGACTTCGGC TAGAGCATGA 1380 

ACAAGCTTCT CAGCCCACGC CAGAGAAGGC ACAGCAAAAC CCCACCCTGC TGGCAGAACT 1440 

CCGGCTCCTC AGACAGCGCA AAGATGAGCT GGAACAGAGA ATGTCTGCTC TCCAGGAGAG 1500 

CCGGAGAGAG CTAATGGTCC AGTTGGAGGG TCTCATGAAG CTACTAAAGG AAGAAGAACT 1560 

GAAGCAGGGA GTAAGTTATG TCCCCTACTG CAGGTCTTAA CTAACAGTGG AGGGGCCTGC 1620 

CGACCTGCGG TTTTCTCATT GCTTTTGCTC TAATGTATGT TCATGCTTCR GTTTGGAAAG 1680 

AGAAAAAAGT CATACTAATT TGCTTCTTTT TCAATGTAGT GCTTGAATTG AGATATATAA 1740 

ATTTAGCATT TTTTATAACT ATCACTACTA TCCACATCAA AAGAAGAACT ATGACATCTT 1800 

TTAGAAAAGG GAACGAATTG TCATTTATTG GAAACATTTT AGATCCCCAG AGGTATAAGT 1860 

TTCAAACCAG TCTTAGCTTT TCAAGTTGTT GATCAGACCC TTCTCTTAAC AGAGAGATAC 1920 

CACAGTCACT AGAGATACCC TGAGGTTCAT GTCATCCCAA AACCCACAGC ACTCAGAAGC 1980 

TAACCTCTAC ACCCACTCAC ACTGTGAGTA TTCAGTTCGG TTTCATTTTA CXGAAAACCT 2040 

GTGAAACCTC TTTTTATAAA AATCAGGCAA TTAAATCCCT TTTCATCACA CAATTATTGA 2100 

GCCTTGTTCC CCATGGCTCA CCAAAATGTG CTCAATTTTG TGAGAGAAAG ACTGTACTCC 2160 

ATAACTGACT ATTCACGTCC CATCTTTTTG GCTCTTCCCC AAAGCAGAAT CCTTACTGTT 2220 

GGTTGACAGT AATCTCTTTT TAAAAAGTAA CTCTCAGCTT TTTCCTTAGC ACCAGAGCCT 2280 

TTCGGCTCCG GGAGACGAGA GGGTCATTAC ATACTTTTTT TTTTTTCTGG AAATAGGGGC 2340 

ATTGTGACTT TATAGCCTAA ACTGGAGCTG TCTGAACCTG TGGTCAGGCT CAAGAGCCAG 2400 
CAGGGGGAGC AGCAAACTC 

Seq ID NO: 268 Protein sequence 
Protein Accession 8: AAC50426.1 

1 11 21 31 41 51 

1 I I 1 1 I 
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MIEDSGKRGN TMAERRQLFA EMRAQDLDRI RLSTYRTACK LRFVQKKCNL HLVDIWNVIE 60 

ALRENALNNL DPNTELNVSR LEAVLSTIFY QLNKRMPTTH QIHVEQSISL LLNFLLAAFD 120 

PEGHGKISVF AVKMALATLC GGKIMDKLRY IFSMISDSSG VMVYGRYDQF LREVLKLPTE 160 

VLEGPSFGYT EQSARSCFSQ QKKVTLNGFL DTLMSDPPPQ CLVWLPLLHR LANVENVFHP 240 

VECSYCHSES MMGFRYRCQQ CHNYQLCQDC FWRGHAGGSH SNQHQMKEYT SWKSPAKKLT 300 

NALSKSLSCA SSRBPLHPMF PDQPEKPLNL AHIVDTWPPR PVTSMNDTLP SHSVPSSGSP 360 

FITRSMLESS NRLDEEHRLI ARYAARLAAE SSSSQPPQQR SAPDISFTID ANKQQRQLIA 420 

ELENKNREIL QEIQRLRLEH EQASQPTPEK AQQNPTLLAE LRLLRQRKDE LEQRMSALQE 480 
SRRELMVQIiE GLMKLLKEEE LKQGVSYVPY CRS 

Seq ID NO: 269 DMA sequence 

Nucleic Acid Accession ftt NM_001276.1 

Coding sequence: 127.. 1278 

1 11 21 31 41 51 

I I I I I I 

AGTGGAGTGG GACAGGTATA TAAAGGAAGT ACAGGGCCTG GGGAAGAGGC CCTGTCTAGG 60 

TAGCTGGCAC CAGGAGCCGT GGGCAAGGGA AGAGGCCACA CCCTGCCCTG CTCTGCTGCA 120 

GCCAGAATGG GTGTGAAGGC GTCTCAAACA GGCTTTGTGG TCCTGGTGCT GCTCCAGTGC 1B0 

TGCTCTGCAT ACAAACTGGT CTGCTACTAC ACCAGCTGGT CCCAGTACCG GGAAGGCGAT 240 

GGGAGCTGCT TCCCAGATGC CCTTGACCGC TTCCTCTGTA CCCACATCAT CTACAGCTTT 300 

GCCAATATAA GCAACGATCA CATCGACACC TGGGAGTGGA ATGATGTGAC GCTCTACGGC 360 

ATGCTCAACA CACTCAAGAA CAGGAACCCC AACCTGAAGA CTCTCTTGTC TGTCGGAGGA 420 

TGGAACTTTG GGTCTCAAAG ATTTTCCAAG ATAGCCTCCA ACACCCAGAG TCGCCGGACT 480 

TTCATCAAGT CAGTACCGCC ATTCCTGCGC ACCCATGGCT TTGATGGGCT GGACCTTGCC 540 

TGGCTCTACC CTGGACGGAG AGACAAACAG CATTTTACCA CCCTAATCAA GGAAATGAAG 600 

GCCGAATTTA TAAAGGAAGC CCAGCCAGGG AAAAAGCAGC TCCTGCTCAG CGCAGCACTG 660 

TCTGCGGGGA AGGTCACCAT TGACAGCAGC TATGACATTG CCAAGATATC CCAACACCTG 720 

GATTTCATTA GCATCATGAC CTACGATTTT CATGGAGCCT GGCGTGGGAC CACAGGCCAT 780 

CACAGTCCCC TGTTCCGAGG TCAGGAGGAT GCAAGTCCTG ACAGATTCAG CAACACTGAC 840 

TATGCTGTGG GGTACATGTT GAGGCTGGGG GCTCCTGCCA GTAAGCTGGT GATGGGCATC 900 

CCCACCTTCG GGAGGAGCTT CACTCTGGCT TCTTCTGAGA CTGGTGTTGG AGCCCCAATC 960 

TCAGGACCGG GAATTCCAGG CCGGTTCACC AAGGAGGCAG GGACCCTTGC CTACTATGAG 1020 

ATCTGTGACT TCCTCCGCGG AGCCACAGTC CATAGAACCC TCGGCCAGCA GGTCCCCTAT 1080 

GCCACCAAGG GCAACCAGTG GGTAGGATAC GACGACCAGG AAAGCGTCAA AAGCAAGGTG 1140 

CAGTACCTGA AGGATAGGCA GCTGGCAGGC GCCATGGTAT GGGCCCTGGA CCTGGATGAC 1200 

TTCCAGGGCT CCTTCTGCGG CCAGGATCTG OGCTTCCCTC TCACCAATGC CATCAAGGAT 1260 

GCACTCGCTG CAACGTAGCC CTCTGTTCTG CACACAGCAC GGGGGCCAAG GATGCCCCGT 1320 

CCCCCTCTGG CTCCAGCTGG CCGGGAGCCT GATCACCTGC CCTGCTGAGT CCCAGGCTGA 1380 

GCCTCAGTCT CCCTCCCTTG GGGCCTATGC AGAGGTCCAC AACACACAGA TTTGAGCTCA 1440 

GCCCTGGTGG GCAGAGAGGT AGGGATGGGG CTGTGGGGAT AGTGAGGCAT CGCAATGTAA 1500 

GACTCGGGAT TAGTACACAC TTGTTGATGA TTAATGGAAA TGTTTACAGA TCCCCAAGCC 1560 

TGGCAAGGGA ATTTCTTCAA CTCCCTGCCC CCTAGCCCTC CTTATCAAAG GACACCATTT 1620 

TGGCAAGCTC TATCACCAAG GAGCCAAACA TCCTACAAGA CACAGTGACC ATACTAATTA 1680 

TACCCCCTGC AAAGCCAGCT TGAAACCTTC ACTTAGGAAC GTAATCGTGT CCCCTATCCT 1740 

ACTTCCCCTT CCTAATTCCA CAGCTGCTCA ATAAAGTACA AGAGTTTAAC AGTGTGTTGG 1800 

CGCTTTGCTT TGGTCTATCT TTGAGCGCCC ACTAGACCCA CTGGACTCAC CTCCCCCATC 1860 

TCTTCTGGGT TCCTTCCTCT GAGCCTTGGG ACCCCTGAGC TTGCAGAGAT GAAGGCCGCC 1920 
ATGTT 

Seq ID NO: 270 Protein .sequence 
Protein Accession #: NP_001267.1 

I 11 21 31 41 51 

II (111 

MGVKASQTGF WLVLLQCCS AYKLVCYYTS WSQYREGDGS CFPDALDRFL CTHIIYSFAN 60 

ISNDHIDTWB WNDVTLYGML NTLKNRNPNL KTLLSVGGWN FGSQRFSKIA SNTQSRRTFI 120 

KSVPPFLRTH GFDGLDLAWI* YPGRRDKQHF TTLIKEMKAE FIKEAQPGKK QLLLSAALSA 180 

GKVTIDSSYD IAKISQHLDF ISIMTYDFHG AWRGTTGHHS PLFRGQEDAS PDRFSNTDYA 240 

VGYMLRLGAP ASKLVMGIPT FGRSFTLASS ETGVGAPISG PGIPGRFTKE AGTIAYYEIC 300 

DFLRGATVHR TLGQQVPYAT KGNQWVGYDD QESVKSKVQY LKDRQIiAGAM VWALDLDDFQ 360 
GSFCGQDLRF PLTNAIKDAL AAT 

Seq ID NO: 271 DNA sequence 

Nucleic Acid Accession fl: NM_006474.1 

Coding sequence: 181.. 669 

1 11 21 31 41 51 

|| | 1 I i 

GCTGCCTAGG GTCTGGAAAG CTCGGGCACC CTCCCTCTCC GGGGCTCCTG CTCCCACCCC 60 

TCCGGCCCCC CCACCGTCGC GCTCCTCCAG GCTGGGCCTG TGGCCGCGGT GCTTTTAATT 120 

TTCCCCCAGC TCAGAATCTT GCTGCTCGGC CCCCAGGAGA GCAACAACTC AACGGGAACG 180 

ATGTGGAAGG TGTCAGCTCT GCTCTTCGTT TTGGGAAGCG CGTCGC TCTG GGTCCTGGCA 240 

GAAGGAGCCA GCACAGGCCA GCCAGAAGAT GACACTGAGA CTACAGGTTT GGAAGGCGGC 300 

GTTGCCATGC CAGGTGCCGA AGATGATGTG GTGACTCCAG GAACCAGCGA AGACCGCTAT 360 

AAGTCTGGCT TGACAACTCT GGTGGCAACA AGTGTCAACA GTGTAACAGG CATTCGCATC 420 

GAGGATCTGC CAACTTCAGA AAGCACAGTC CACGOGCAAG AACAAAGTCC AAGCGCCACA 480 

GCCTCAAACG TGGCCACCAG TCACTCCACG GAGAAAGTGG ATGGAGACAC ACAGACAACA 540 

GTTGAGAAAG ATGGTTTGTC AACAGTGACC CTGGTTGGAA TCATAGTTGG GGTCTTACTA 600 

GCCATCGGTT TCATTGGTGG AATCATCGTT GTGGTTATGC GAAAAATGTC GGGAAGGTAC 660 

TCGCCCTAAA GAGCTGAAGG GTTACGCCCT GCTTGCCAAC GTGCTTTAAA AAAAGACCGT 720 

TTCTGACTCT GTGGCCCTGT CCCTGAGCTC GTGGGGAGAA GATGACCCTG GGAACATTTG 780 

CGGGCCCATT CAGATTCCAC GGTGACTTTC CGTTTGCCAA ATTAACCGAG GAAAGACCTT 840 
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TCACCAGATT TGGTTCTTAA ACTTT 

Seq ID NO: 272 Protein sequence 
Protein Accession #: NP_00646S.l 

1 11 21 31 41 51 

• 1 1 1 1 1 

MWKVSALLFV LGSASLWVLA EGASTGQPED DTETTGLEGG VAHPGAEDDV VTPGTSEDRY 60 
KSGLTTLVAT SVNSVTGIRI EOLPTSESTV HAQEQSPSAT ASNVATSHST BKVDGDTQTT 120 
VEKDGLSTVT LVGIIVGVLL AIGFIGGIIV WMRKMSGRY SP 

Seq ID NO: 273 DNA sequence 

Nucleic Acid Accession 8: CAT cluster 

1 11 21 31 41 51 

I I I I I 1 

GCGGCCGCCA GCTTGCAAAG CCGAAGTCTG GCCGCGCTCT TCGACTCGCT GCGCCACGTC 60 

CCCGGGGGTG CCGAGCCGGC GGGGGGTGAG GTGGCTGCGC CGGCGGCCGG GCTAGGAGGT 120 

GCGGGCACTG GGGGCGCGGG AGGGGACGTG GCAGGCCCCG CGGGGGCCAC GGCGATCCCA 180 

GGGGCCAGGA AGGTCCCGCT GCGGGCAOGC AATCTGCCTC CGTCCTTCTT CACGGAGCCG 240 

TCCCGGGCAG GCGGCGGCGG GTGTGGCCCG TCGGGGCCGG ACGTGAGCTT GGGCGACCTG 300 

GAGAAGGGCG CGGAGGCCGT GGAGTTCTTT GAGCTGCTGG GGCCCX3ACTA CGGCGCCGGC 360 

ACGGAGGCGG CAGTCTTGCT TGCCGCCGAG CCTCTCGACG TGTTCCCCGC CGGAGCCTCC 420 

GTACTGCGGG GACCCCCGGA GCTGGAGCCC GGCCTCTTTG AGCCGCCGCC GGCAGTGGTG 480 

GGAAACCTAC TGTACCCCGA GCCCTGGAGC GTCCCGGGCT GCTCCCCGAC CAAAAAGAGC 540 

CCCTTGACTG CCCTCCGCGG CGGGTTGACC TTGAACGAGC CCTTGAGCCC CCTGT ACCCC 600 

GCCGCTGCGA ATTTCTCCCG GCGGGGAGGA CGGGCCGGGC CATTTGGCTT CTTTCGCCCC 660 
CTTCTTTCCA GACTGOGCTT TGC 

Seq ID NO: 274 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

1 11 21 31 41 51 

I I I 1 I 1 

CAAAGAGGCC GGGCTCCAGC TCCGGGGGTC CCCGCAGTAC GGAGGCTCCG GCGGGGAACA 60 
CGTCGAGAGG CTCGGCGGCA AGCAAGACTG CCGCCTCCGT GCOGGCGCCG TAGTCGGGCC 120 
CCAGCAGCTC AAAGAACTCC ACGGCCTCCG CGCCCTTCTC CAGGTCGCCC AAGCTCACGT 180 
COGGCCCCGA CGGGCCACAC COGCCGCCGC CTGCCCGGGA CGGCTCCGTG AAGAAGGAOG 240 
GAGGCAGATT GCGTGCCCGC AGCGGGACCT TCCTGGCCCC TGGGATCGCC GTGGCCCCCG 300 
CGGGGCCTGC CACGTCCCCT CCCGCGCCCC CAGTGCCCGC ACCTCCTAGC CCGGCCGCCG 360 
GCGCAGCCAC CTCACCCCCC GCCGGCTCGG CACCCCCGGG GAOGTGGCGC AGCGAGTCGA 420 
AGAGCGCGGC CAGACTTCGG CTTTGCAAGC TGGCGGCCGC 

Seq ID NO: 27S DNA sequence 

Nucleic Acid Accession #: NMJ>01118.1 

Coding sequence: 74.. 1651 

1 11 21 31 41 51 

I I I I I 1 

AGCCCAGAGA CACATTGGGG CTGACCTGCC GCTGCTGTCA GTGGGAGGCC AGTGGTGCTG 60 

GCCAAGAAGT GTCATGGCTG GTGTCGTGCA CGTTTCCCTG GCTGCTCACT GOGGGGCCTG 120 

TCCGTGGGGC CGGGGCAGAC TCCGCAAAGG ACGCGCAGCC TGCAAGTCCG CGGCCCAGAG 180 

ACACATTGGG GCTOACCTGC CGCTGCTGTC AGTGGGAGGC CAGTGGTGCT GGCCAAGAAG 240 

TGTCATGGCT GGTGTOGTGC ACGTTTCCCT GGCTGCTCTC CTCCTGCTGC CTATGGCCCC 300 

TGCCATGCAT TCTGACTGCA TCTTCAAGAA GGAGCAAGCC ATGTGCCTGG AGAAGATCCA 360 

GAGGGCCAAT GAGCTGATGG GCTTCAATGA TTCCTCTCCA GGCTGTCCTG GGATGTGGGA 420 

CAACATCACG TGTTGGAAGC CCGCCCATGT GGGTGAGATG GTCCTGGTCA GCTGCCCTGA 480 

GCTCTTCCGA ATCTTCAACC CAGACCAAGT CTGGGAGACC GAAACCATTG GAGAGTCTGA 540 

TTTTGGTGAC AGTAACTCCT TAGATCTCTC AGACATGGGA GTGGTGAGCC GGAACTGCAC 600 

GGAGGATGGC TGGTCGGAAC CCTTCCCTCA TTACTTTGAT GCCTGTGGGT TTGATGAATA 660 

TGAATCTGAG ACTGGGGACC AGGATTATTA CTACCTGTCA GTGAAGGCCC TCTACACGGT 720 

TGGCTACAGC ACATCCCTCG TCACCCTCAC CACTGCCATG GTCATCCTTT GTCGCTTCCG 780 

GAAGCTGCAC TGCACACGCA ACTTCATCCA CATGAACCTG TTTGTGTCGT TCATGCTGAG 840 

GGCGATCTCC GTCTTCATCA AAGACTGGAT TCTGTATGCG GAGCAGGACA GCAACCACTG 900 

CTTCATCTCC ACTGTGGAAT GTAAGGCCGT CATGGTTTTC TTCCACTACT GTGTTGTGTC 960 

CAACTACTTC TGGCTGTTCA TCGAGGGCCT GTACCTCTTC ACTCTGCTGG TGGAGACCTT 1020 

CTTCCCTGAA AGGAGATACT TCTACTGGTA CACCATCATT GGCTGGGGGA CCCCAACTGT 1080 

GTGTGTGACA GTGTGGGCTA CGCTGAGACT CTACTTTGAT GACACAGGCT GCTGGGATAT 1140 

GAATGACAGC ACAGCTCTGT GGTGGGTGAT CAAAGGCCCT GTGGTTGGCT CTATCATGGT 1200 

TAACTTTGTG CTTTTTATTG GCATTATCGT CATCCTTGTG CAGAAACTTC AGTCTCCAGA 1260 

CATGGGAGGC AATGAGTCCA GCATCTACTT GCGACTGGCC CGGTCCACCC TGCTGCTCAT 1320 

CCCACTATTC GGAATCCACT ACACAGTATT TGCCTTCTCC CCAGAGAATG TCAGCAAAAG 1380 

GGAAAGACTC GTGTTTGAGC TGGGGCTGGG CTCCTTCCAG GGCTTTGTGG TGGCTGTTCT 1440 

CTACTGTTTT CTGAATGGTG AGGTACAAGC GGAGATCAAG CGAAAATGGC GAAGCTGGAA 1500 

GGTGAACCGT TACTTCGCTG TGGACTTCAA GCACCGACAC CCGTCTCTGG CCAGCAGTGG 1560 

GGTGAATGGG GGCACCCAGC TCTCCATCCT GAGCAAGAGC AGCTCCCAAA TCCGCATGTC 1620 
TGGCCTCCCT GCTGACAATC TGGCCACCTG AGCCATGCTC CCCT 

Seq ID NO: 276 Protein sequence 
Protein Accession #: NP_001109.1 

1 11 21 31 41 51 

I I 1 I I I 

MAGWHVSLA AHCGACPWGR GRLRKGRAAC KSAAQRHIGA DLPLLSVGGQ WCWPRSVMAG 60 
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PCT/US02/36810 



WHVSLAALL LLPMAPAMHS DCIFKKEQAM CLEKIQRANE LMGFNDSSPG CPGMWDNITC 120 

WKPAHVGEMV LVSCPELPRI FNPDQVWETE TIGESDFGDS NSLDLSDMGV VSRNCTEDGW 180 

SEPFPHYFDA CGFDEYESET GDQDYYYLSV KALYTVGYST SLVTLTTAMV ILCRFRKLHC 240 

TRNFIHMNLF VSFMLRAISV FIKDWILYAE QDSNHCFIST VECKAVMVFP HYCWSNYFW 300 

5 LFIEGLYLFT LLVETFFPER RYFYWYTIIG WGTPTVCVTV WATLRLYFDD TGCWDMNDST 360 

ALWWVIKGPV VGSIMVNFVL FIGIIVILVQ KLQSPDMGGN ESSIYLRLAR STIiLLIPLFG 420 

IHYTVFAFSP ENVSKRERLV FELGLGSFQG FWAVLYCFL NGEVQAEIKR KWRSWKVNRY 480 
FAVDFKHRHP SLASSGVNGG TQLSILSKSS SQIRMSGIiPA DNLAT 

10 Seq ID NO: 277 DNA sequence 

Nucleic Acid Accession #: NM_004000.1 
Coding sequence: 36.. 1193 

1 11 21 31 41 51 

15 | | | | 1 | 

AGAAGAAGCT GGCCAAGGAT ATGGGAGCAA CCACCATGGA CCAGAAGTCT CTCTGGGCAG 60 

GTGTAGTGGT CTTGCTGCTT CTCCAGGGAG GATCTGCCTA CAAACTGGTT TGCTACTTTA 120 

CCAACTGGTC CCAGGACCGG CAGGAACCAG GAAAATTCAC CCCTGAGAAT ATTGACCCCT 180 

TCCTATGCTC TCATCTCATC TATTCATTCG CCAGCATCGA AAACAACAAG GTTATCATCA 240 

AGGACAAGAG TGAAGTGATG CTCTACCAGA CCATCAACAG TCTCAAAACC AAGAATCCCA 300 

AACTGAAAAT TCTCTTGTCC ATTGGAGGGT ACCTGTTTGG TTCCAAAGGG TTCCACCCTA 360 

TGGTGGATTC TTCTACATCA CGCTTGGAAT TCATTAACTC CATAATCCTG TTTCTGAGGA 420 

ACCATAACTT TGATGGACTG GATGTAAGCT GGATCTACCC AGATCAGAAA GAAAACACTC 480 

ATTTCACTGT GCTGATTCAT GAGTTAGCAG AAGCCTTTCA GAAGGACTTC ACAAAATCCA 540 

CCAAGGAAAG GCTTCTCTTG ACTGCGGGCG TATCTGCAGG GAGGCAAATG ATTGATAACA 600 

GCTATCAAGT TGAQAAACTG GCAAAAGATC TGGATTTCAT CAACCTCCTG TCCTTTGACT 660 

TCCATGGGTC TTGGGAAAAG CCCCTTATCA CTGGCCACAA CAGCCCTCTG AGCAAGGGGT 720 

GGCAGGACAG AGGGCCAAGC TCCTACTACA ATGTGGAATA TGCTGTGGGG TACTGGATAC 780 

ATAAGGGAAT GCCATCAGAG AAGGTGGTCA TGGGCATCCC CACATATGGG CACTCCTTCA 840 

CACTGGCCTC TGCAGAAACC ACCGTGGGGG CCCCTGCCTC TGGCCCTGGA GCTGCTGGAC 900 

CCATCACAGA GTCTTCAGGC TTCCTGGCCT ATTATGAGAT CTGCCAGTTC CTGAAAGGAG 960 

CCAAGATCAC GCGCCTCCAG GATCAGCAGG TTCCCTACGC AGTCAAGGGG AACCAGTGGG 1020 

TGGGCTATGA TGATGTGAAG AGTATGGAGA CCAAGGTTCA ' GTTCTTAAAG AATTTAAACC 1080 

TGGGAGGAGC CATGATCTGG TCTATTGACA TGGATGACTT CACTGGCAAA TCCTGCAACC 1140 

AGGGCCCTTA CCCTCTTGTC CAAGCAGTCA AGAGAAGCCT TGGCTCCTTG TGAAGGATTA 1200 

ACTTACAGAG AAGCAGGCAA GATGACCTTG CTGCCTGGGG CCTGCTCTCT CCGAGGAATT 1260 

CTCATGTGGG ATTCCCCTTG CCAGGCTGGC CTTTGGATCT CTCTTCCAAG CCTTTCCTGA 1320 

CTTCCTCTTA GATCATAGAT TGGACCTGGT TTTGTTTTCC TGCAGCTGTT GACTTGTTGC 1380 
CCTGAAGTAC AATAAAAAAA ATTCATTTTG CTCCAGTA 
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Seq ID NO: 278 Protein sequence 
Protein Accession #: NP 003991.1 



1 11 21 31 41 51 

45 | i i | | | 

MDQKSLWAGV WLLLLQGGS AYKLVCYFTN WSQDRQEPGK FTPENIDPFL CSHLIYSFAS 60 

IENNKVIIKD KSEVMLYQTI NSLKTKNPKL KILLSIGGYL FGSKGFHPMV DSSTSRLEFI 120 

NSIILFLRNH NFDGLDVSWI YPDQKENTHF TVLIHELAEA FQKDFTKSTK ERLLLTAGVS 180 

AGRQMIDNSY QVEKLAKDLD FINLLSFDFH GSWEKPLITG HNSPLSKGWQ DRGPSSYYNV 240 

EYAVGYWIHK GMPSEKWMG IPTYGHSFTL ASABTTVGAP ASGPGAAGPI TESSGFLAYY 300 

EICQFLKGAK ITRLQDQQVP YAVKGNQWVG YDDVKSMETK VQFLKNLNLG GAMIWSIDMD 360 
DFTGKSCNQG PYPLVQAVKR SLGSL 



Seq ID NOi 279 DNA sequence 

Nucleic Acid Accession #i NM_0 15 166.1 

Coding sequence: 116.. 1249 

1 11 21 31 41 51 

I I I ! I I 

TGCTGGAAGT CCCTCACCCA GAGACCAGTG CTCCCAACGG CAGAGCAGCG GGGGAGATAA 60 

AGAACTGGTG ACACGTGGCT GTACATTCAG CACAGCTGTG GTGTCCCCAA GTGCCATGAC 120 

CCAGGAGCCA TTCAGAGAGG AGCTGGCCTA TGACCGGATG CCCACGCTGG AGCGGGGCCG 180 

GCAAGACCCC GCCAGCTATG CCCCAGACGC GAAGCCGAGC GACCTGCAGC TGTCGAAGAG 240 

ACTGCCCCCC TGCTTCAGCC ACAAGACGTG GGTCTTCTCT GTGCTGATGG GGAGCTGCCT 300 

CCTGGTGACC TCGGGGTTTT CGCTGTACCT GGGGAACGTG TTCCCGGCTG AGATGGATTA 360 

CTTGCGCTGT GCTGCAGGCT CTTGCATCCC CTCGGCAATT GTGAGCTTCA CCGTCTCCAG 420 

GAGGAACGCC AATGTGATTC CCAACTTTCA GATATTGTTT GTTTCCACGT TTGCTGTGAC 480 

CACTACGTGT TTAATTTGGT TTGGATGCAA ACTAGTCCTG AACCCATCAG CAATAAACAT 540 

CAACTTCAAC CTCATCCTGC TGCTCCTGCT GGAGCTGCTC ATGGCGGCCA OGGTGATCAT 600 

CGCTGCACGG TCCAGCGAGG AGGACTGCAA GAAAAAGAAG GGCTCCATGT CTGACAGCGC 660 

CAACATTCTG GACGAAGTGC CATTTCCTGC TCGGGTCCTG AAATCTTACT CAGTCGTCGA 720 

GGTAATCGCA GGCATCTCTG CCGTCCTCGG GGGGATCATT GCCCTGAACG TGGATGACTC 780 

AGTTTCAGGC CCACACCTCT CAGTGACGTT CTTTTGGATC CTAGTGGCCT GCTTTCCAAG 840 

TGCCATTGCC AGTCATGTGG CAGCAGAGTG TCCCAGCAAG TGTCTGGTGG AGGTCCTGAT 900 

TGCCATAAGC AGCCTCACGT CTCCGCTGCT GTTCACAGCC TCTGGATATC TGTCATTCAG 960 

CATCATGAGA ATCGTGGAGA TGTTTAAGGA TTACCCGCCA GCCATAAAAC CATCCTACGA 1020 

TGTGCTGCTG CTGCTGCTGC TGCTAGTGCT CCTGCTGCAG GCCGGCCTCA ACACGGGCAC 1080 

CGCCATCCAG TGCGTGCGCT TCAAGGTCAG TGCAAGGCTG CAGGGTGCAT CCTGGGACAC 1140 

CCAGAACGGC CCGCAGGAGC GCCTGGCTGG GGAGGTGGCC AGGAGCCCCC TGAAGGAGTT 1200 

CGACAAGGAG AAAGCCTGGA GAGCCGTCGT GGTGCAAATG GCCCAGTGAC CCCCAGACGC 1260 

GGAAACCGGG TGGCAGCGCC CAGCCTGGCC CCAAGCATGG AAACGCACAA CCCCTAATCG 1320 

CCCTGAGCTA CTGCTTCTAA CACCTCTTTT CCCTTGTGTG AGGGCAAACC AGGCTGCAGG 1380 

TGGGGTTTTC ACTTCCTAGG GTAGTTTAAT TTTAAAATAG GCCAATGTTG GCTAGTCTGT 1440 

GCCTCAGTGA GATCAGTCAG CTCCGAGTGG CTCCCGTGTC GTAACAGCAG GAGCATGGCC 1500 
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GCAACTTOCC 
CTGOCCCCAG 
CTGACCAGAC 
GCTTAGGGTC 
GGTGCACCTG 
TTGGCGCCTC 
GCGACACAGG 
TGCCCTGTCT 
CCCACGTCCT 
CGGGCACTGT 
TCATGCCCAG 
GGGCTGCATC 
GGGAAGCTGA 
GGTCAGAGCC 
TTGTCTTGAT 
GGGGTCTTAG 
GGTGTGAGAA 
TCCTCGTCCC 
GGGCGACTGT 
TTCACAATGG 
CGCCGTGGGG 
GTGGCCTGGG 
CGGCCTGCAG 
GAGTGACTCT 
CAACCATTGA 
CAGGCCCTCT 
CTTTGGCCAC 
CCGGATGCAG 
GCGGGGTCTG 

TTTTTGCTGG 
ATGGGGTTTA 
AAACGCTGAC 



AGGCCGAGGA 
CTCCTCTTCC 



CCCCAGGCTC 
TGTCCTGGCC 
CTGCCGCTGC 
CCCTACTTAG 

CATGCCCAGC 
CTTGGAGTGC 
CAAGGCTTTG 
GCCCCTCCCT 
CAGGAATTCT 
CCTCACTGAG 
GGTACAGCCC 
TAACACTGGC 
GAGGGCGCCT 
GAAACTGGAA 
TTTTGTTTCT 
TGTATTGACT 
GCCATCAAGA 
ACCCTGCCAG 
GGCTGCCAAG 
AGGGGGATGC 
CTGCAGTCCT 
AGCCCCAAAC 
ACCCCACCCT 
CTCCTGGGAA 
GGCAGACCAG 
CTTGCACTTT 
AAAGAACTGT 
TTGCAGACGT 



AGGGCCCCCG 
TGCCTCTCTC 
TGCGTCTCCT 
ACTCAGTCAG 
AGGCTGCCCA 
CGGGAGCCGC 
TCTCCCAGGT 
ACCCAGTCAT 
ATGCCCAGCT 
TTGGAGTGCC 
CACGTGGCCG 
TAAACCGGCT 
CCTGTGTTTG 
CACAGGTTCA 
AAGGCCCCAG 
AGGCGGTCAC 
TTTGGGTAGT 
GGGTCCTCCC 
TGCTGTTACT 
TTCCTTCTGA 
GGTCCCCCAT 
TCAAAGAACC 
GGGGTTTGAG 
ATACCTTCAT 
AGGCAAGACA 
CGGCAGCTCC 
AAGCGCCACC 
GTCTGCAGGC 
TGGGGTGGGA 
AGAGCTCAAG 
CTTATTTTCT 
CTCTTTCAGT 
G 



GCTCGGCCTC 
ATGGCTTGGG 
GCCCACTCTG 
CCCAGTGGGA 
GCGGCTCTTC 
AGGGGCCCCT 
GAAATCCACA 
TCTAGAGGGA 
TGGAGTGCCA 
ACATGGCCGC 
CTGCTGTGAC 
GGACCACGTC 
CTCCCATTAC 
CCCAGCGCCC 
GGCAGCGGTC 
TGCCCACCTG 
GCCATGGCCA 
AGCTGACGGC 
TTGATAATTA 
TCTGAGAAGA 
GGGCTGCTGG 
AGGAGGCCTG 
GACCCTCCCA 
GGTGTTCATG 
CCTTCCCAGC 
GGAAGGACAC 
GTGGGAGAGG 
CACCCCCACC 
GCCACAGGGA 
TTTCAGAGTC 
TATTTTAGAG 
CTACATTTTT 



TTGAGAGCCC 
CTGGAGTGGG 
ACOGGGCTTC 
TCTTACCCAC 
CTGGACAGTG 
CCTCCAGAGC 
CCAGTCCACG 
TCCACCGCCA 
CGTGGCCGCT 
TGCCCACGTC 
AGGCAGTGTT 
TCCCTGGCCC 
TCAAAATGCA 
TCTACCTCCT 
TTCTCCATCT 
CCAGGCTGCA 
GGAGTGGGTT 
AGTAAATCCA 
CTTTCCAGCA 
CATGAACGTT 
AAAGGCCCGG 
GGAGATGCAG 
GGTTTCCCAC 
ACAAGTGGAA 
TGCTCCTAGA 
AGTCAGGGGC 
AGAGGCTGCT 
TCCCTGCAGC 
CGACCCGGGG 
AGAATTAGGC 
CGCTTAAAAA 
GTTTAATACG 



CACCCCTGAA 
CTCTCTGGAC 
CTCCCTCCAC 
TTCCCTGCAA 
AGAGCAGGGC 
CTGGGCGCAA 
CCGGGTCGCC 
CACTGGCCGG 
GCCCACGTCC 
CCGGGCACTG 
CTTGGGGGTG 
CAGTGACOGG 
GGACAGATCA 
AGCAAGTACT 
CCGCTGTTTT 
GGGACAGTTG 
TCCCTGCGTC 
CAGTGAGTTG 
GGTGTTTTCC 
TTCTCTTCAC 
AGATGCATCT 
CTGGATGGGG 
TGCGGAACAG 
TCATTATTTT 
GGGGACAAGC 
CGGGCAAACA 
GTCACTGGTA 
TTTGAGGCTG 
CGGGCTGATG 
ACTTGGAACG 
ATCCGGAAAA 
CTTGAGCAAT 



Seq ID NO: 280 Protein sequence 
Protein Accession np 055981.1 



1 
I 

MTQEPPRBEL 
CLLVTSGFSL 
VTTTCLIWPG 
SANILDEVPF 
PSAIASHVAA 
YDVLLLLLLL 
BFDKEKAWRA 



11 
I 

AYDRMPTLER 
YLGNVFPAEM 
CKLVLNPSAI 
PARVLKSYSV 
ECPSKCLVEV 
VLLLQAGLNT 
WVQMAQ 



21 
I 

GRQDPASYAP 
DYIiRCAAGSC 
NINFNLILLL 
VEVIAGISAV 
LIAISSLTSP 
GTAIQCVRPK 



31 
I 

DAKPSDLQLS 
IPSAIVSFTV 
LLELLMAATV 
LGGIIALNVD 
LLPTASGYLS 
VSARLQGASW 



41 
I 

KRLPPCFSHK 
SRRNANVIPN 
IIAARSSEED 
DSVSGPHLSV 
FSIMRIVEMP 
DTQNGPQERJj 



Seq ID NO: 281 DNA sequence 

Nucleic Acid Accession #: NMJ)0451B.1 

Coding sequence: 43.. 2577 



1 
I 

GCTGAGCCTG 
AACGGCGGCG 
CTGGACCCCG 
GCCCCCAAGC 
CCCCCCAAGC 
CGGCCGCGCG 
CTOGTGCTGT 
TACATCCTGG 
GCCGCAGGCT 
CCGTTCTGTG 
TCCCAGGGCA 
CGGATGATCC 
GCCCACAGCA 
TCGTTCCTGG 
GCACTCTGGT 
ACCTGGAACG 
CTGCCTGCAG 
AAGCACTTTG 
TACGCCACCA 
GTCACCGTGC 
CTCAAGAGTA 
CAGAAGGTCA 
AAGGGGTCCC 
GACAGCCCCA 
GCTTTCCGCA 
GAGGACATTG 
GGCCTCAAAG 
TTCAAGGAGA 
CACCTGGACA 
CGGGGCCCAG 
GAGGACCCCA 



11 
I 

AGCCOGACCC 
TATACCCCGG 
GCGCGCCCGA 
GOGGCAGCAT 
GCAACGCCTT 
GCTGGGCGTT 
CTGTGTTTTC 
AAATCGTGAC 
GCTGCTGCCG 
TGATTGACAT 
ACGTCTTTGC 
GCATGGACCG 
AGGAGCTGGT 
TGTACTTGGC 
GGGGCCTGAT 
GCAGGCTCCT 
GCATCTTGGG 
AGAAGAGGCG 
ACCTCTCGCG 
CCATGTACAG 
AATCTGGACT 
GTTTGAAAGA 
CGCAGGCCCA 
GCAAGGTGCC 
TCAAGGGTGC 
TGGATGACAA 
TCAGCATCAG 
GCCTGCGGCC 
TGCTGTCCCG 
CGATCAOGGA 
GCATGATGGG 



21 

! 

GGGGCGCCTC 
CCCGAGCGGG 
CTCCACCCGG 
CCTCAGCAAA 
CTACCGCAAG 
CATCTACCAC 
CACCATCAAG 
TATCGTGGTG 
GTACCGTGGC 
CATGGTGCTC 
CACATCTGCG 
GCGGGGAGGC 
CACTGCCTGG 
AGAGAAGGGG 
CACGCTGACC 
TGOGGCAACC 
GTCTGGGTTT 
GAACCCGGCA 
CACAGACCTG 
ACTTATCCCC 
CGCTTTCAGG 
TOGTGTCTTC 
GACTGTGAGG 
CAAGAGCTGG 
CGCGTCACGG 
GAGCTGCCCC 
AGCOGTGTGT 
CTACGACGTG 
AATTAAGAGC 
CAAGGACCGC 
ACGGCTCGGG 



31 
I 

CCGCCAGGCA 
GAGAAGAAGC 
GACGGGGCGC 
CCTCGOGCGG 
CTGCAGAATT 
GCCTAOGTGT 
GAGTATGAGA 
TTTGGCGTGQ 
TGGAGGGGGC 
ATCGCCTCCA 
CTCCGGAGCC 
ACCTGGAAGC 
TACATCGGCT 
GAGAACGACC 
ACCATTGGCT 
TTCACCCTCA 
GCCCTGAAGG 
GCAGGCCTGA 
CACTCCACGT 
CCGCTGAACC 
AAGGACCCCC 
TCCAGCCCCC 
CGGTCACCCA 
AGCTTCGGGG 
CAGAACTCAG 
TGCGAGTTTG 
GTCATGCGGT 
ATGGACGTCA 
CTGCAGTCCA 
ACCAAGGGCC 
AAGGTGGAGA 





TGCTGGGCTC 
TCCTTTGTCT 
ACTTTGACAC 
ACGGGGACAA 
TGGGTGTCTC 
TTCAGGAGCA 
TCCAGTCGGC 
GGCAGTACTA 
AGCTGGAGCT 
GGCGGGAGCC 
GAGGGGTGGC 
GCGCCGACCA 
ACOGCAGCCG 
AAGAAGCAAG 
TGACCGAGGA 
TCCTGGTGTC 
TCGAGCAGTA 
GAGTGGACCA 
CGGCCGAGGC 
AGCAGGTCTT 



1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 



51 
I 

TWVFSVLMGS 
FQILFVSTFA 
CKKKKGSMSD 
TFFWILVACP 
KDYPPAIKPS 
AGEVARSPLK 



CGGCTCCGAG 
CGCCGGGAAG 
CGTGCTGGAG 
TTTCTCCTGC 



GCGGATCTGG 
TGCCCGGAAA 
GGCCGCCGGC 
GCAGATTCTG 
TGTGGTCTAT 
CATCCTGGCC 
CTACGCGGAT 
GTACCCCCAG 



GCACAGGCAG 
CTGGAGATTC 
CGAGCGAACG 
GCTGAGGAAC 
GTCTCCAAGC 
TGCCAAGGGG 
GAGCCTCGAG 
GGCACGCCAG 
CCTCCCCGGA 
CCTGACCCCG 
CAAGCGGAAG 
CTCAGCCGGC 
GATCGTGGGG 
GGAGCTGCCC 
GTCCATGGAG 



60 
120 
160 
240 
300 
360 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 



1060 



WO 03/042661 



PCTAJS02/36810 



AAGAAGCTGG ACTTCCTGGT GAATATCTAC ATGCAGCGGA TGGGCATCCC CCCGACAGAG 1920 
ACCGAGGCCT ACTTTGGGGC CAAAGAGCCG GAGCCGGCGC CGCCGTACCA CAGCCCGGAA 1980 
GACAGCCGGG AGCATGTCGA CAGGCACGGC TGCATTGTCA AGATCGTGCG CTCCAGCAGC 2040 
TCCACGGGCC AGAAGAACTT CTCGGCGCCC COGGCCGOGC CCCCTGTCCA GTGTCCGCCC 2100 
5 TCCACCTCCT GGCAGCCACA QAGCCACCCG CGCCAGGGCC ACGGCACCTC CCCCGTGGGG 2160 
GACCACGGCT CCCTGGTGCG CATCCCGCCG CCGCCTGCCC ACGAGCGGTC GCTGTCCGCC 2220 
TACGGCGGGG GCAACCGCGC CAGCATGGAG TTCCTGCGGC AGGAGGACAC CCCGGGCTGC 2280 
AGGCCCCCCG AGGGGACCCT GCGGGACAGC GACACGTCCA TCTCCATCCC GTCCGTGGAC 2340 
CACGAGGAGC TGGAGCGTTC CTTCAGCGGC TTCAGCATCT CCCAGTCCAA GGAGAACCTG 2400 
10 GATGCTCTCA ACAGCTGCTA CGCGGCCGTG GCGCCTTGTG CCAAAGTCAG GCCCTACATT 2460 
GCGGAGGGAG AGTCAGACAC CGACTCCGAC CTCTGTACCC CGTGCGGGCC CCCGCCACGC 2520 
TCGGCCACCG GCGAGGGTCC CTTTGGTGAC GTGGGCTGGG CCGGGCCCAG GAA GTGA GGC 2580 
GGCGCTGGGC CAGTGGACCC GCCCGCGGCC CTCCTCAGCA CGGTGCCTCC GAGGTTTTGA 2640 
GGCGGGAACC CTCTGGGGCC CTTTTCTTAC AGTAACTGAG TGTGGCGGGA AGGGTGGGCC 270O 
15 CTGGAGGGGC CCATGTGGGC TGAAGGATGG GGGCTCCTGG CAGTGACCTT TTACAAAAGT 2760 
TATTTTCCAA CAGGGCACTC CCAGGCCCTG TCGCCATTGA GGTGCCTCCG CTGGGCTGTC 2820 
TCCTCACCCC TCCCTGTGCT GGAGCCTGTC CCAAAAAGGT GCCAACTGGG AGGCC TCGGA 2880 
AGCCACTGTC CAGGCTCCCA CTGCCTGTCT GCTCTGTTCC CAAAGGCAGC GT GTGTG GCC 2940 
TCGGGCCCTG CGGTGGCATG AAGCATCCCT TCTGGTGTGG GCATCGCTAC GTGTTTTGGG 3000 
20 GGCAGCGTTT CACGGCGGTG CCCTTGCTGT CTCCCTTGGG CTGGCTCGAG CCTGGGGTCC 3060 
ATGTCCCTTT GCCGTCCCGT CATGGGGCAG GGAATCCATA GCGGGGCCCA CAGGCAGGGG 3120 
TATGAGTGCG TCCCACCCAA CGCAGCACCA GCCCCGGCCA CCGCTCCCCG TGTCCCCAGT 3180 
TCCGTCTCAG CTACCTGGAC TCCAGGACCC TGGAGAAGGG AGACCTGGCA GTGGAGGGAG 3240 
GCTGTGCTGT GTGTCCCCCT GCAGGTGTGA CCCCGCCTGC TCTTTCCTCC CCCGCCAGGT 3300 
25 GTGGCCCCGC CTGCTCTTTC CTCCCCCACC AGTATGGCCC CACCTGCTCT TTCCTNCCCC 3360 
CCCAAGGTGT GGCCCCACCT GTTCTTTCCT CCCCTGCCGA GGTGTGACCC CACCTGCTCT 3420 
TTCCTCCCTC CCAGTATGGC CCCACCTGCT CTTTCCTCCC CCGAGGTGAG GCCCCGCCTG 3480 
CTCTTTCCTC CCATGGGAGC CGCTGAGGOG TGCGCACCTG GGCACAGGTT GGGGCTCTGC 3540 
AGGATGAGGA AGACAGGCCA ATCCCTTCCC TCCCAGAAGC TGGCCGCCCA GCAGGAGGGA 3600 
30 CTGAGGCCAG ACTCATGTCC AGCAAGGAAC GTGTGGTGTG TCCCCTGGGA AGTCTCTGGG 3660 
CCCTGGGAAG AGGGAAGGTG CACGTCCTGG GATGGTTGCG GGGCCCTGTT TTGGGAGACA 3720 
AAGGGGTAGA GGGTCTGTCT TGNGCCCCCC CAGACTCTAG CCCGAGCAGT GCAGCCACCT 3780 
ACTGCCCCAC CTCAGAGAAG TGCAGCGGGA AGGAGGCTGG AGGTGGTGCG GCGCTGCCTC 3840 
GGGTGTCTGC GTGAATGAGC GTGGCCAAGG ACCAGTGCCA CCTCATGGCA AAGAGCTCCC 3900 
35 GCAGTGTTTG TTAGAGTGCA CATCCTACGT GCCCACTGGC ACACACACGT GCTCACATAC 3960 
ANGTCNGNGT ACAGGNGTAC ACATGCANGC TTGCACACAT GCACACAGAC CACATAGCAC 4020 
ACATGTGCAC TGACCACACC TGTATAGACC ATGCACAGTA CACATACGTG CATACACATG 4080 
CCTGCATACA GGCATACACA TGCACGCTTA CATGTACACG TGCACAGATC ACACACATGC 4140 
ACACACGTGT AGCTCACACA CAGTATACAC ATACACAAGT GCACAGACCA CACACAGCAC 4200 
40 TAACACATGC ACACACAAAG TGCATAGGCC ACACAGCACA TGCACACAGG TGCACAGACC 4260 
ACACAGCACA CACAAGTGCA CAGAGCACAC TGCACACATG CACACACACA CGCGTGCATG 4320 
CACACTCCTC GCACTTCCAG CCTTGGAGCC CTTCTGTCTC TGGTCTTTCT CTTTGACCCT 4380 
GCTGAGTGTA AGCTGCCTGG GGAGGGGCTA CAAGGAGTAA TTGTGGCTTT AGGGGTCGTG 4440 
GTGATGCTGG AATGTCAAGC GCCGTCGTGG GGTATCCGAC TGTCCGGGCT CCTGGTCCGC 4500 
45 AGTGGCAGAG CGCCAGGCAG AGCCAATCAG GGTCTCGTGC TGCCCTTCCC CCCCACAGCC 4560 
TGGCAGCCAT CCAGAGGAGG GGCTCTACCA GATGCCAAGG TGCCCCGGTG TCTGTATGGG 4620 
TGTCCGGTTG GGTCCTGTGT TTGGTCTGCC CTGGAGGTGG CTGGGCCCTC CTGGG ATGGG 4680 
TGGCTCAGCC TCGAATCCCA GGCCCCAGCC CAGGCAGGTG CTGCTGCCTG TTGTGGTTTC 4740 
CTGGCCCAGC TTCTCCTTCT CCCTCTGCAT AAAATCACAG TCCGTGAGTC TTCCAGCTGC 4800 
50 CACCACGGCT GGGACACGCT GGGGGAGGGC TCCTCCCATG CCTCCTGCAC ACAGCCGTCT 4860 
GAGCAGGGCA GGTGCCAACA CCCCCCACCG GAGACACGCT GCCCCTCAGC GATGCCCCTA 4920 
CCTTTTGGGG GGCCTCGTCT CAAGCCCCCC CTTGGAGGCT GAAATCACCC CAGGCACTGT 4980 
GAGGGCTTCT CCAGGGGGAC ACCCTTTGAG CTGTGGGTCT GATCACCCCA AGTCCCGCAC 5040 
ACGGAGGAGA GGCACAGCCA GGGCGTGTGG TTTAATGTTT GCCCCTTCGG GGCTGGAGGT 5100 
55 CTCAGTGTTT CTAGATTCCA GACCCTGCTG CCAGAGAGAC CTGCTGCCGG AGAGAAGGGG 5160 
AGGAGGACTC CAGCTGGGCT CGGTCCCCCA CAGTCAGGGA CCCCCATAAA GGACACCCCC 5220 
TTCTCTCTAG AAAGAGCTGG GCTCTCAGCT ATTTCTAGTT GCTTCCCAGA AGCCGAGGAG 5280 
CAGAAGGAGC TGTGAGAGCT TTGCAGAAAC GCCCTTGTCC CCGCCCTCCT GAGCTATGAA 5340 
TGCCGTACAG AGCAGAGGCT GGGGCATTGG CAAGATCACA GGTTGATGCT GCACAGCCCC 5400 
60 ATTGACACAA ACCCTCAAAG CAGACGTGAG AGGGACGGTT CACAAAGCTT GGACCTGCCQ 5460 
TGGAGGGTGC CCGGCAGACG TGGCGTGAGA GGGACGGCTC ACGAGGCTTG GACCTGCTGT 5520 
GGAGGGTGCC CAGCAGACGT GGTGTGAGAG GAACGGCTCA CGAGACTTGG ACCTGGTGGA 5580 
GGGTGCCCAG CAGACGTGGT GTGAGAGGGA CGGCTCACAG GGCTTGGACC GGAGAGAGAT 5640 
GGCTCATGAG ACTTGGACCT GCCGTGGAGG GTGCCCAGCA GACGTGGTAT GAGAGGGATG 5700 
65 GCTCACGAGG CTTGGACCTG GTGGAGGGTG CCCGGCAGAC GTGTGAGAGG GACGGTTCAC 5760 
AAGGCTTGGA CCTGCCATGG AGGGTGCCCA GCAGACGTGG TGTGAGAGGG ACAGCTCACG 5B20 
AGGCTTGGAC CTGCCGTGGA GGGTGCCCAG CAGGGGGCTG AGCTCTGAGG GGTGGGTGCT 5880 
CAGTGCACGG GTGCCCCCAG TGTCCTCTGA TCCTGTCCGG TGCCTCCCCC AACCCCCACA 5940 
CCCATGCAGA ACTCCCAGGT CACATGCACG TATGTCCAGG GCATGGGGGT GGCGTGAAGA 6000 
70 GGCCTGGTCA GGGCCTTTAG GGGCTGCAGG ACGGAATGGC CACCTGGGGA GCCTGTGTGG 6060 
CTGTGCCGGG CAGCCATCCT GCATTCCCAC CCAGCGCGCA GTCTCCACCT CGGCCCCAGC 6120 
AAAGCGCTAA GCAGCCGGAG AGACAGCCAG GGCGGCTTCC TGAAGGATGT GGGATGGTGG 6180 
ACTCCGGGGT CGAGGGAATA CGCAGGTTCC TGTCCTCCGG GAGACCTAGA GAAGCTGCAC 6240 
ACCCAGGAGC TTTCCATGAC CCGGGAGCAT GAGTGAATGG GGGGTTCCAG TTTGCTGAAC 6300 
75 TTTGCTGTCT TGTAAGGGTG GGGGCTGACG GCCGACCCTG GGAGGAGGTG ACACCGCAGG 6360 
GGGAGGTTGT GGGCAACGGT GGAGGAGGAG AGACGGGAGG GGACCATTTG GGATGGAGGG 6420 
GCCTCTTCAG AGTTTTAAAA GGCGTTTGTG GGGTGGAGTT GAGTGTGCTC TGGGCTTGGA 6480 
CACTTGCCGT GGTGCCCCTG GCTGGCCGAG GAGACTGGCT CTGGCCAGGG CCCCGTCCTG 6540 
AGAGGTCCTC AGCGTCTGAC TCTCGGCCAG GCGCCAGCAA GGAGGGGCCG GTCCCCGGGG 6600 
80 CTACCAGGCA GGCACGTGCA CATCGCCATC GCCACACGCC AACTCCGCCT GGGTTTTACA 6660 
AAGTCGTTGC CTTAATGCAT GTGGACAGGA ACTCCCTGAG GTCGCCCCAT GCCCCCTGGC 6720 
TGTGCCAGGT ACGGACGCCC TGGACCCTGC GAACAGGTGG GGCGGGCGAG GGGCCCAAGG 6780 
GACGGGCTCC AGAGACACGC GCAGGGCAGG AGGGGTCTCA CGGAGGGGTC TCGCACTGAG 6840 
GCGCCCAGAG CTGGTGGTCC CGCTGGACGC CATCCCTCTG CCCGGGATCC ACACGGCCCA 6900 

1061 



WO 03/042661 



PC17US02/36810 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



CGTGTGCCCG 
GTGGCTGCAG 
CCAAAGAACA 
TCCTTGTGCC 
CTCAACTTTG 
TCTAACAGAT 
AGAATGCGCC 
CCGTGGAAGG 
TGTGCACTCA 



CCATGCCCGC 
CTTCCCCATT 
TTTCATAATG 
GTAGTGACCA 
TGCACCGTCC 
TAGCTGAGAT 
AGTGCGGGTT 
TTCTGGGGAT 
GTAAAACGCA 



GCCCCACGCC 
TGCGCCGTTG 
CCTTCAGTAC 
GGCCTTTTTT 
ATTTTCTAGG 
ATTAACTTAC 
TGCGGGGGAC 
CCCTGCTGCC 
GTGATTCAAA 



ATTGCAGTCT 
CCTCTGGCTG 
CGAOGTACAC 
TGGTGGATGT 
GATAGACGCC 
TCACACGGAC 
TTCGGGTGTG 
ACGGGGACGA 
AAAAAAAAAA 



TCCATCCTCT 
TCTGCACTTT 
TTCTGACCAT 
GTTACCCCGC 
CAGGGAATGA 
AGGTTGATGC 
GGGTCCTGCG 
GTTCGGACGC 



GGCCGTGACG 
TGTTCATGCT 
TTTGTATGTG 
ACACTTCAAT 
ACTCTAGTTT 
CAGAGCCGTA 
GCCGCGATGG 
CAGGTGGACC 



6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 



Seq ID NO: 282 Protein sequence 
Protein Accession #: NP_004509.l 



1 
I 

MVQKSRNGGV 
GAGAGKPPKR 
SSEGALYIIiE 
AVLAAGSQGN 
LCLILASFLV 
GVSPFALPAG 
QYYERTVTVP 
GVAAKGKGSP 
EASLPGEDIV 
EQYSAGHLDM 
QVLSMEKKLD 
IVRSSSSTGQ 
ERSLSAYGGG 
QSKENLDALN 
GPRK 



11 



YPGPSGEKKL 
NAFYRKLQNF 
IVTIWFGVE 
VFATSALRSL 
YLAEKGENDH 
ILGSGFALKV 
MYRLIPPLNQ 
QAQTVRRSPS 
DDKSCPCEFV 
LSRIKSLQSR 
FLVNIYMQRM 
KNFSAPPAAP 
NRASMEFLRQ 
SCYAAVAPCA 



21 
I 

KVGFVGLDPG 
LYNVLERPRG 
YFVRIWAAGC 
RPLQILRMIR 
FDTYADALWW 
QEQHRQKHFE 
LELIiRNLKSK 
ADQSLEDSPS 
TEDLTPGLKV 
VDQIVGRGPA 
GIPPTETEAY 
PVQCPPSTSW 
EDTPGCRPPE 
KVRPYIAEGE 



31 
I 

APDSTRDGAL 
WAFIYHAYVF 
CCRYRGWRGR 
MDRRGGTWKL 
GLITLTTIGY 
KRRNPAAGLI 
SGLAFRKDPP 
KVPKSWSPGD 
SIRAVCVMRF 
ITDKDRTKGP 
FGAKEPEPAP 
QPQSHPRQGH 
GTLRDSDTSI 
SDTDSDLCTP 



41 
I 

LIAGSEAPKR 
LLVFSCLVLS 
LKFARKPFCV 
LGSWYAHSK 
GDKYPQTWNG 
QSAWRFYATN 
PEPSPSQKVS 
RSRARQAFRI 
LVSKRKFKBS 
AEAELPEDPS 
PYHSPEDSRE 
GTSPVGDHGS 
SIPSVDHEEL 
CGPPPRSATG 



51 
I 

GSILSKPRAG 
VFSTIKEYEK 
IDIMVLIASI 
ELVTAWYIGF 
RLLAATFTLI 
LSRTDLHSTW 
LKDRVFSSPR 
KGAASRQNSE 
IiRPYDVMDVI 
MMGRLGKVEK 
HVDRHGCIVK 
IiVRIPPPPAH 
ERSPSGFSIS 
EGPFGDVGWA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



Seq ID NO i 2838 DNA sequence 
Nucleic Acid Accession #: AF152496. 
Coding sequence: 1..2391 



1 
I 

ATGGAGGCGG 
CTGGGAGGGT 
AAGGGCTTTT 
GCGAGGGGGG 
ACAGGTGATT 
CCATGCATAC 
CTCCGTATCA 
AAAATCCTAG 
GATGTGGGAA 
CTCACTCGCA 
GATCGGGAGG 
CCTCGGTCTG 
GAATTTGCAC 
ATTGTCACTG 
TTTTTTCATG 
ATGCAACTGG 
GCCAAGGATG 
AACGACAACC 
GGAGAGACTG 
GTGATGTGTT 
TACACCCTAG 
ATCACTATCA 
GTCTCCGACG 
CGCGAGAACA 
GGCACCAACG 



TACGAGGCCC 
TTGAGCAGCG 
GTGCTGTACC 
GAGCCGGGCT 
TGGCTGTCGT 
AATGGCGAAG 
GTGGTGCTGG 
CTCCTGGTGG 
GCCCAGGCCG 
TTCCTCTTTT 
TCGGTGGGTC 
GGCACCGGGA 
ACAAATGAGT 
AGGGTTAGCG 



11 
I 

GAGGAGAGCG 
CTCTGGCTGG 
TAATAGCCAA 
CCCAAGTTGT 
TGCTCCTGAA 
TACATTTTCA 
TAGATGTAAA 
AAAGCACTCT 
GAAACAGCCT 
GTCGTAGGGA 
AGCAGCCGGA 
GGACAGCCCA 
AGCCGCTCTA 
TCTCGGCTTC 
CTTCTGAAGA 
TCAAATATTT 
GCGGAGGCCT 
CACCGGAACT 
TACTGGCTGT 
CCATTGAGAA 
TGTCAGAAGG 
CTGACCTGGG 
TCAATGACAA 
ACAGCCCCGC 
CCCAGGTAAC 
TCTCCATCAA 
TGCAGGOGTT 
AGGCGCTGGT 
GGCTGCAGAA 
ACCTGGTGAC 
ACCAGCTGCT 
TGCGCACCGC 
TCAAGGACAA 
ACGGCTTCTC 
ACTTGCTCAC 
CGGTGCTCCT 
GCTGCTCGGT 
CCCTGTCCCA 
TCAAGTTCCT 
AGGCAAATCC 



21 
I 

ATTTCTTAGA 
GTCCGAGTCA 
CCTAGCAAAG 
GTCCAAAGGG 
TGAGAAATTG 
GATATTACTG 
TGACCATTCT 
GCCAGGAACA 
CCAAAACTAC 
CGGAAGGAAG 
ACTCAGCTTA 
GATAAACATC 
TGAGGTTGCA 
TGACTTAGAT 
AATTCGCAAA 
GAATTTTGAA 
ATCCGGAAAG 
GACCTTGTCT 
TTTCAGTGTT 
CAATCTCCCC 
CGCGCTGGAC 
GACACCCAGG 
CGCCCCCGCC 
CCTGCACATC 
CTACTCGCTG 
GGCGGACAAC 
CGAGTTCCGC 
GCGCGTGCTG 
CGGCTCCGCG 
CAAGGTGGTG 
CAAGGCCACG 
CAGGCTGCTG 
TGGCGAGCCT 
CCAGCCCTAC 
CGTCTACCTG 
GTTCGTGGCG 
GCCCGAGGGC 
GAGCTACCAG 
GAAGCCAATT 
CAGTTTCAGG 



31 
I 

CAAAGGCAAG 
AGACGCTATT 
GATCTGGGAC 
AACAAACAGC 
GACCGGGAGG 
CAAAACCCTT 
CCGGTATTCT 
GTAATTCCTT 
ACTATCACTC 
TACCCGGAAC 
ACGCTCACCG 
CAGGTCTTAG 
GTTCTAGAGA 
ACAGGAAGTT 
ACTTTTCAGC 
GCGATTAATA 
TCTACAGTCA 
TCAGTAAACA 
TCTGATCTAG 
TTCTTCCTGA 
AGAGAGACCA 
CTGAAAACCA 
TTCACCCAAA 
GGCAGTGTCA 
CTGCOGCCCC 
GGCCACCTGT 
GTGGGCGCCA 
GTGCTGGACG 
CCCTGCACCG 
GCGGTGGACG 
GAGCCCGGGC 
AGCGAGCGCG 
CCGCGCTCGG 
CTGCCTCTCC 
GTGGTGGCAT 
GTGCGGCTGT 
CCCTTTCCAG 
TACGAGGTGT 
ATCCCCAACT 
AAGAGCTTTG 



41 
I 

TCTTGCTTCT 
CTGTGGCTGA 
TAAGGGTAGA 
ATTTTCAGCT 
AGCTATGCGG 
TGCAATTCGT 
TTGAAAATGA 
TGGGAAATGC 
CGAATTCCCA 
TAGTACTGGA 
CGCTGGACGG 
ATATAAACGA 
ATACCOCCGT 
TTGGGACAAT 
TAAATCCAAT 
GTTATGAAGT 
TAGTCCAGGT 
GCCCTATTCC 
ACTCTGGAGA 
AACCATCTGT 
GATCCGAGTA 
AGTACAACAT 
TCTCCTACAC 
GCGCCACAGA 
AGGACCCGCA 
TTGCCCTCAG 
CAGACCGTGG 
CCAACGACAA 
AGCTGGTGCC 
GCGACTCGGG 
TGTTCGGCGT 
ACGCGGCCAA 
CCACCGCCAC 
CGGAGGOGGC 
TGGCCTCGGT 
GCAGGAGGAG 
GGCAGATGGT 
GTCTGACTGG 
TCGTTGCTCA 
AATTCAGTTA 



51 
I 

CTTTGTTTTT 
GGAAAAAGAG 
GGAACTGGCC 
CAGTCATCAG 
CCCCACAGAA 
TACAAACGAG 
AATGCATCTG 
TGAGGACTTG 
CTTCCACGTA 
TAAAGCGCTC 
CGGCTCTCCC 
CAATGCACCA 
TAACTCTGTC 
ATCATATGCA 
TACTGGTGAT 
CGACATCGAG 
GGTTGATGTC 
TGAGAACTOG 
CAACGGAAGA 
AGAGAATTTT 
CAACATTACC 
AACCGTGCTG 
CCTGTTCGTC 
CAGAGACTCA 
CCTGCCCCTC 
GTCGCTGGAC 
CTCCCCGGCT 
CTCGCCCTTC 
CCGGGCGGCT 
CCAGAACGCC 
GTGGGCGCAC 
GCACAGGCTG 
GCTGCATGTG 
ACCGGCCCAG 
GTCTTCGCTC 
CAGGGCGGCC 
GGACGTGAGC 
AGGCTCCGGG 
GGGTGCAGAG 
A 



Seq ID NO: 284 Protein sequence 
Protein Accession #: AAD43757.1 

1 11 21 

I I I 

MEAGGERFI*R QRQVLLLFVP 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1360 
1440 
1500 
1560 
1620 
1680 
1740 
1600 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 



31 41 51 

I I i 

RRYSVAEEKE KGFLIANLAK DLGLRVEEIA 

1062 



WO 03/042661 



PCT7US02/36810 



ARQAQWSKG NKQHFQLSHQ TGDLLLNEKL DREELCGPTE PCILHFQILL QNPLQFVTNE 120 

LRIIDVNDHS PVFFENEMHL KILESTLPGT VIPLGNAEDL DVGRNSLQNY TITPNSHFHV 180 

LTRSRRDGRK YPELVLDKAL DREEQPELSL TLTALDGGSP PRSGTAQINI QVLDINDNAP 240 

EFAQPLYEVA VLENTPVNSV IVTVSASDLD TGSFGTISYA FFHASEEIRK TFQLNPITGD 300 

5 MQLVKYLNFE AINSYEVDIE AKDGGGLSGK STVIVQWDV NDNPPELTLS SVNSPIPENS 360 

GETVLAVFSV SDLDSGDNGR VMCSIENNLP FFLKPSVENF YTLVSEGALD RETRSEYNIT 420 

ITITDLGTPR LKTKYNITVL VSDVNDNAPA FTQISYTLFV RENNSPALHI GSVSATDRDS 480 

GTNAQVTYSL LPPQDPHLPL SSLVSINADN GHLFALRSLD YEALQAFEFR VGATDRGSPA 540 

LSSEALVRVL VLDANDNSPF VLYPLQNGSA PCTELVPRAA EPGYLVTKW AVDGDSGQNA 600 

10 WLSYQLLKAT EPGLFGVWAH NGEVRTARLL SERDAAKHRL WLVKDNGEP PRSATATLHV 660 

LLVDGFSQPY LPLPEAAPAQ AQADLLTVYL WALASVSSL FLFSVLLFVA VRLCRRSRAA 720 

SVGRCSVPEG PFPGQMVDVS GTGTLSQSYQ YEVCLTGGSG TNEFKFLKPI IPNFVAQGAE 780 
RVSEANPSFR KSFEFS 

15 Seq ID NO: 285 DNA sequence 

Nucleic Acid Accession #: NM_001794.2 
Coding sequence: 15.. 2765 

1 11 21 31 41 51 

20 i | | | | I 

OGCCGGCGGG GAAGATGACC GCGGGCGCCG GCGTGCTCCT TCTGCTGCTC TCGCTCTCCG 60 

GCGCGCTCCG GGCCCATAAT GAGGATCTTA CAACTAGAGA GACCTGCAAG GCTGGGTTCT 120 

CTGAAGATGA TTACACGGCA TTAATCTCCC AAAATATTCT AGAAGGGGAA AAGCTACTTC 180 

AAGTCAAGTT CAGCAGCTGT GTGGGGACCA AGGGGACACA ATATGAGACC AACAGCATGG 240 

25 ACTTCAAAGT TGGGGCAGAT GGGACAGTCT TCGCCACCCG GGAGCTGCAG GTCCCCTCCG 300 

AGCAGGTGGC GTTCACGGTG ACTGCATGGG ACAGCCAGAC AGCAGAGAAA TGGGACGCCG 360 

TGGTGCGGTT GCTGGTGGCC CAGACCTCGT CCCCGCACTC TGGACACAAG CCGCAGAAAG 420 

GAAAGAAGGT CGTGGCTCTG GACCCCTCTC CGCCTCCGAA GGACACCCTG CTGCCGTGGC 480 

CCCAGCACCA GAACGCCAAC GGGCTGAGGC GGCGCAAACG GGACTGGGTC ATCCCGCCCA 540 

30 TCAACGTGCC CGAGAACTCG CGCGGGCCCT TCCCGCAGCA GCTCGTGAGG ATCCGGTCCG 600 

ACAAAGACAA TGACATCCCC ATCCGGTACA GCATCACGGG AGTGGGCGCC GACCAGCCCC 660 

CCATGGAGGT CTTCAGCATT GACTCCATGT CCGGCCGGAT GTACGTCACA AGGCCCATGG 720 

ACCGGGAGGA GCACGCCTCT TACCACCTCC GAGCCCACGC TGTGGACATG AATGGCAACA 780 

AGGTGGAGAA CCCCATCGAC CTGTACATCT AQGTCATGGA CATGAATGAC AACCGCCCTG 840 

35 AGTTCATCAA CCAGGTCTAC AACGGCTCCG TGGACGAGGG CTCCAAGCCA GGCACCTACG 900 

TGATGACCGT CACGGCCAAC GATGCTGACG ACAGCACCAC GGCCAACGGG ATGGTGCGGT 960 

ACCGGATCGT GACCCAGACC CCACAGAGCC CGTCCCAGAA TATGTTCACC ATCAACAGGG 1020 

AGACTGGAGA TATCGTCACA GTGGCGGCTG GCCTGGACCG AGAGAAAGTT CAGCAGTACA 1080 

CAGTCATCGT TCAGGCCACA GATATGGAAG GAAATCTCAA CTATGGCCTC TCAAACACAG 1140 

40 CCACAGCCAT CATCACGGTG ACAGATGTGA ATGACAACCC GCCAGAATTT ACCGCCAGCA 1200 

CGTTTGCAGG GGAGGTCCCC GAAAACCGCG TGGAGACOGT GGTCGCAAAC CTCACGGTGA 1260 

TGGACCGAGA TCAGCCCCAC TCTCCAAACT GGAATGCCGT TTACCGCATC ATCAGTGGGG 1320 

ATCCATCCGG GCACTTCAGC GTCCGCACAG ACCCCGTAAC CAACGAGGGC ATGGTCACCG 1380 

TGGTGAAGGC AGTCGACTAC GAGCTCAACA GAGCTTTCAT GCTGACAGTG ATGGTGTCCA 1440 

45 ACCAGGCGCC CCTGGCCAGC GGAATCCAGA TGTCCTTCCA GTCCACGGCA GGGGTGACCA 1500 

TCTCCATCAT GGACATCAAC GAGGCTCCCT ACTTCCCCTC AAACCACAAG CTGATCCGCC 1560 

TGGAGGAGGG CGTGCCCCCC GGCACCGTGC TGACCACGTT TTCAGCTGTG GACCCTGACC 1620 

GGTTCATGCA GCAGGCTGTG AGATACTCAA AGCTGTCAGA CCCAGCGAGC TGGCTGCACA 1680 

TCAATGCCAC CAACGGCCAG ATCACCACGG CGGCAGTGCT GGACCGTGAG TCCCTCTACA 1740 

50 CCAAAAACAA CGTCTACGAG GCCACCTTCC TGGCAGCTGA CAATGGGATA CCCCCGGCCA 1800 

GCGGCACCGG GACCCTCCAG ATCTATCTCA TTGACATCAA CGACAACGCC CCTGAGCTGC 1860 

TGCCCAAGGA GGCGCAGATC TGCGAGAAGC CCAACCTGAA CGCCATCAAC ATCACGGCGG 1920 

CCGACGCTGA CGTCGACCCC AACATCGGCC CCTACGTCTT CGAGCTGCCC TTTGTCCCGG 1980 

CGGCCGTGCG GAAGAACTGG ACCATCACCC GCCTGAACGG TGACTATGCC CAACTCAGCT 2040 

55 TGCGCATCCT GTACCTGGAG GCCGGGATGT ATGACGTCCC CATCATCGTC ACAGACTCTG 2100 

GAAACCCTCC CCTGTCCAAC ACGTCCATCA TCAAAGTCAA GGTGTGCCCA TGTGATGACA 2160 

ACGGGGACTG CACCACCATT GGCGCAGTGG CAGCGGCTGG TCTGGGCACC GGTGCCATGG 2220 

TGGCCATCCT CATCTGCATC CTCATCCTGC TGACCAXGGT CCTGCTGTTT GTCATGTGGA 2280 

^ TGAAGCGGCG AGAGAAGGAG CGCCACACGA AGCAGCTGCT CATTGACCCC GAGGACGACG 2340 

60 TCOGCGACAA CATCCTCAAG TATGACGAGG AAGGCGGTGG CGAGGAGGAC CAGGACTACG 2400 

ACCTCAGCCA GCTGCAGCAG CCGGAAGCCA TGGGGCACGT GCCAAGCAAA GCCCCTGGCG 2460 

TGCGTCGCGT GGATGAGCGG CCGGTGGGCG CTGAGCCCCA GTACCCGATC AGGCCCATGG 2520 

TGCCGCACCC AGGCGACATC GGTGACTTCA TCAATGAGGG ACTCCGCGCT GCTGACAACG 2580 

zrr ACCCCACGGC ACCCCCCTAT GACTCCCTGC TGGTCTTCGA CTACGAGGGG AGCGGCTCCA 2640 

65 CCGCAGGCTC CGTCAGCTCC CTGAACTCAT CCAGTTCCGG GGACCAAGAC TACGATTACC 2700 

TCAACGACTG GGGGCCCAGA TTCAAGAAGC TGGCGGACAT GTATGGAGGT GGTGAAGAGG 2760 

ATTGACTGAC CTCGCATCTT CGGACCGAAG TGAGAGCCGT GCTCGGACGC CGGAGGAGCA 2820 

GGACTGAGCA GAGGCGGCCG GTCTTCCCGA CTCCCTGCGG CTGTGTCCTT AGTGCTGTTA 2880 

GGAGGCCCCC CAATCCCCAC GTTGAGCTGT CTAGCATGAG CACCCACCCC CACAGCGCCC 2940 

TGCACCCGGC CGCTGCCCAG CACOGOGCTG GCTGGCACTG AAGGACAGCA AGAGGCACTC 3000 

TGTCTTCACT TGAATTTCCT AGAACAGAAG CACTGTTTTT AAAAAAAAAA AAAAAAAAAG 3060 
AAG 



70 
75 
80 



Seq ID NO: 286 Protein sequence 
Protein Accession #: NP_0017B5.2 

1 11 21 31 41 51 

I I I I I I 

MTAGAGVLLL LLSLSGALRA HNEDLTTRET CKAGFSEDDY TALISQNILE GEKLLQVKFS 60 
SCVGTKGTQY BTNSMDFKVG ADGTVFATRE LQVPSEQVAF TVTAWDSQTA EKWDAWRLL 120 
VAQTSSPHSG HKPQKGKKW ALDPSPPPKD TLLPWPQHQN ANGLRRRKRD WVIPPINVPE 180 
NSRGPFPQQL VRIRSDKDND IPIRYSITGV GADQPPMEVF SIDSMSGRMY VTRPMDREEH 240 
ASYHLRAHAV DMNGNKVENP IDLYIYVIDM NDNRPEFINQ VYNGSVDEGS KPGTYVMTVT 300 
ANDADDSTTA NGMVRYRIVT QTPQSPSQNM FTINSETGDI VTVAAGLDRE KVQQYTVIVQ 360 

1063 



WO 03/042661 



ATDMEGNLNY GLSNTATAII TVTDVNDNPP E FT AST F AGE VPENRVETW ANLTVMDRDQ 420 

PHSPNWNAVY RIISGDPSGH FSVRTDPVTN EGMVTWKAV DYELNRAFML TVMVSNQAPL 480 

ASGIQMSFQS TAGVTISIMD INEAPYFPSN HKLIRLEEGV PPGTVLTTFS AVDPDRFMQQ 540 

AVRYSKLSDP ASWLHINATN GQITTAAVLD RESLYTKNNV YEATFLAADN GIPPASGTGT 600 

LQIYLIDIND NAPELLPKEA QICEKPNLNA INITAADADV DPNIGPYVFE LPFVPAAVRK S60 

NWTITRLNGD YAQLSLRILY LEAGMYDVPI IVTDSGNPPL SNTSIIKVKV CPCDDNGDCT 720 

TIGAVAAAGL GTGAIVAILI CILILLTMVb LFVMWMKRRE KERHTKQLLI DPEDDVRDNI 780 

LKYDEEGGGE EDQDYDLSQL QQPEAMGHVP SKAPGVRRVD ERPVGAEPQY PIRPMVPHPG 840 

DIGDFINEGL RAADNDPTAP PYDSLLVFDY EGSGSTAGSV SSLNSSSSGD QDYDYLNDWG 900 
PRFKKLADMY GGGEED 

Seq ID NO: 287 DNA sequence 

Nucleic Acid Accession fh AF1S2495.1 

Coding sequence: 1..2397 

1 11 21 31 41 51 

I I I I I I 

ATGGAGGCCG GAGAGGGGAA GGAGCGCGTT CCGAAACAAA GGCAAGTCCT GATATTCTTT 60 

GTTTTGCTGG GCATAGCTCA GGCTAGTTGC CAGCCTAGGC ACTATTCAGT GGCCGAGGAA 120 

ACGGAGAGTG GCTCCTTTGT GGCCAATTTG TTAAAAGACC TGGGGCTGGA GATAGGAGAA 180 

CTTGCTGTGA GGGGGGCCAG GGTCGTTTCC AAAGGAAAAA AAATGCATTT GCAGTTCGAT 240 

AGGCAGACCG GGGATTTGTT GTTAAATGAG AAATTGGACC GGGAGGAGCT GTGCGGCCCC 300 

ACAGAGCCCT GTGTCCTACC TTTCCAGGTG TTACTAGAAA ATCCCTTGCA GTTTTTTCAG 360 

GCGGAGCTAC GGATTAGGGA CGTAAATGAT CATTCCCCAG TTTTCCTAGA CAAAGAAATA 420 

CTTTTGAAAA TTCCAGAAAG TATCACTCCT GGAACTACTT TCTTAATAGA ACGTGCCCAG 480 

GACTTGGATG TAGGAACCAA CAGTCTCCAA AATTACACAA TCAGTCCCAA TTTCCACTTT 540 

CATCTTAATT TACAAGACAG TCTCGATGGC ATAATATTAC CACAGCTGGT GCTGAACAGA 600 

GCCCTGGATC GCGAGGAGCA GCCTGAGATC AGGTTAACCC TCACAGCGCT AGATGGCGGG 660 

AGTCCACCCA GGTCCGGCAC GGCCCTGGTA CGGATTGAAG TTGTGGACAT CAATGACAAC 720 

GTCCCAGAGT TTGCAAAGCT GCTCTATGAG GTGCAGATCC CGGAGGACAG CCCCGTTGGA 780 

TCCCAGGTTG CCATCGTCTC TGCCAGGGAT TTAGACATTG GAACTAATGG AGAAATATCT 840 

TATGCATTTT CCCAAGCATC TGAAGACATT CGCAAAACGT TTCGATTAAG TGCAAAATCG 900 

GGAGAACTGC TTTTAAGACA GAAACTGGAT TTCGAATCCA TCCAGACATA CACAGTAAAT 960 

ATTCAGGCGA CAGATGGTGG GGGCCTATCT GGAACTTGTG TGGTATTTGT CCAAGTGATG 1020 

GATTTGAATG ACAATCCTCC GGAACTAACT ATGTCGACAC TTATCAATCA GATCCCAGAA 1080 

AACTTGCAGG ACACCCTCAT TGCTGTATTC AGCGTTTCAG ATCCTGACTC CGGAGACAAC 1140 

GGAAGGATGG TGTGCTCCAT CCAAGATGAT CTTCCTTTTT TCTTGAAACC TTCTGTTGAG 1200 

AACTTTTACA CTCTGGTGAT AAGCACGGCC CTGGACCGGG AGACCAGATC CGAATACAAC 1260 

ATCACCATCA CCGTCACCGA CTTCGGQACA CCCAGGCTGA AAACCGAGCA CAACATAACC 1320 

GTGCTGGTCT CCGACGTCAA TGACAACGCC CCCGCCTTCA CCCAAACCTC CTACACCCTG 1380 

TTCGTCCGCG AGAACAACAG CCCCGCCCTG CACATCGGCA GCGTCAGCGC CACAGACAGA 1440 

GACTCGGGCA CCAACGCCCA GGTCACCTAC TCGCTGCTGC CGCCCCAGGA CCCGCACCTG IS 00 

CCCCTCGCCT CCCTGGTCTC CATCAACGCG GACAACGGCC ACCTGTTCGC TCTCCAGTCG 1560 

CTGGACTACG AGGCCCTGCA GGCGTTCGAG TTCCGCGTGG GCGCCGCAGA CCGCGGCTCC 1620 

CCGGCGTTGA GCAGCGAGGC GCTGGTGCGC GTGCTGGTGC TGGACGCCAA CGACAACTCG 1680 

CCCTTCGTGC TGTACCCGCT GCAGAACGGC TCCGCGCCCT GCACCGAGCT GGTGCCCCGG 1740 

GCGGCCGAGC CGGGCTACCT GGTGACCAAG GTGGTGGCGG TGGACGGCGA CTCGGGCCAG 1800 

AACGCCTGGC TGTCGTACCA GCTGCTCAAG GCCACGGAGC CCGGGCTGTT CGGCGTGTGG 1860 

GCGCACAATG GCGAGGTGCG CACCGCCAGG CTGCTGAGGG AGCGCGACGC TGCCAAGCAG 1920 

AGGCTGGTGG TGCTGGTCAA GGACAATGGC GAGCCTCCGC GCTCGGCCAC CGCCACGCTG 1960 

CACGTGCTCC TGGTGGACGG CTTCTCCCAG CCCTACCTGC TGCTCCCGGA GGCGGCACCG 2040 

GCCCAGGCCC AGGCCGACTT GCTCACCGTC TACCTGGTGG TGGCGTTGGC CTCGGTGTCT 2100 

TCGCTCTTCC TCTTCTCGGT GCTCCTGTTC GTGGCGGTGC GGCTGTGCAG GAGGAGCAGG 2160 

GCGGCCTCGG TGGGTCGCTG CTCGGTGCCC GAGGGCCCCT TTCCAGGGCA GATGGTGGAC 2220 

GTGAGCGGCA CCGGGACCCT GTCCCAGAGC TACCAGTACG AGGTGTGTCT GACTGGAGGC 2280 

TCCGGGACAA ATGAGTTCAA GTTCCTGAAG CCAATTATCC CCAACTTCGT TGCTCAGGGT 2340 
GCAGAGAGGG TTAGCGAGGC AAATCCCAGT TTCAGGAAGA GCTTTGAATT CACTTAA 

Seq ID NO: 288 Protein sequence 
Protein Accession #: AAD43 756.1 

1 11 21 31 41 51 

I I I I I I 

MEAGEGKERV PKQRQVLIFP VLLGIAQASC QPRHYSVAEE TESGSFVANL LKDLGLEIGE 60 

LAVRGARWS KGKKMHLQFD RQTGDLLLNE KLDREELCGP TEPCVLPFQV LLENPLQFFQ 120 

AELRIRDVND HSPVFLDKEI LLKIPESITP GTTFLIERAQ DLDVGTNSLQ NYTISPNFHF 180 

HLNLQDSLDG IILPQLVLNR ALDREEQPEI RLTLTALDGG SPPRSGTALV RIEWDINDN 240 

VPEFAKLLYE VQIPEDSPVG SQVAIVSARD LDIGTNGEIS YAFSQASEDI RKTFRIiSAKS 300 

GELLLRQKLD FESIQTYTVN IQATDGGGLS GTCWFVQVM DLNDNPPELT MSTLINQIPE 360 

NLQDTLIAVF SVSDPDSGDN GRMVCSIQDD LPFFLKPSVE NFYTLVISTA LDRETRSEYN 420 

ITITVTDFGT PRLKTEHNIT VLVSDVNDNA PAFTQTSYTb FVRENNSPAI* HIGSVSATDR 480 

DSGTNAQVTY SLLPPQDPHL PLASLVSINA DNGHLFALQS LDYEALQAFE FRVGAADRGS 540 

PALSSEALVR VLVLDANDNS PFVIiYPIiQNG SAPCTBLVPR AAEPGYLVTK WAVDGDSGQ 600 

NAWLSYQLLK ATEPGLFGVW AHNGEVRTAR LLRERDAAKQ RLWLVKDNG EPPRSATATL 660 

HVLLVDGFSQ PYLLLPBAAP AQAQADLLTV YLWALASVS SLFLFSVLLF VAVRLCRRSR 720 

AASVGRCSVP EGPFPGQMVD VSGTGTLSQS YQYEVCLTGG SGTNEFKFLK PIIPNFVAQG 780 
AERVSEANPS FRKSFEFT 

Seq ID NO: 289 DNA sequence 

Nucleic Acid Accession #: NM_018674.1 

Coding sequence: 3 90.. 2 00 9 

1 11 21 31 41 51 

I I I I I I 
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CGGAGCACAT GCTGAGCGGA GCGGCTGGGQ CTGCGCGGCG TGGCGGAGCA GCGCTCGCTC 60 

CCTCGCTCAC TCGCTOGCTC GCAGGGACAC ACGCAGGGGC TGACAGCTGT GCTGGTGCTG 120 

ATAAGGGAAG CCACAAGGAG ACGATCGAGG AGAGAGACAA GCGGCAGCAG AGGCAGCAGC 180 

GGCAGAGGCA GCACCAGGGC TGCGGAGCTG CTGGGAGTGG GAGTGACTCC CCCACCTCGG 240 

GCCCCCACCC TGTCCCTGTC CTCTTCCCGC TTGCCCTGAG TTTAGAAGAG CAGCCGCTGC 300 

CACCACTGCC ACTCGGGAGG GCACCAGGGC TGCTGGCTAG GGAGGGACAG GGCAGGGAGG 360 

CTCTGGCCAG TCCCAGCAGC CGGGGACAGA TGCCGATCGA GATTGTGTGC AAAATCAAAT 420 

TTGCTGAGGA GGATGCGAAA CCCAAGGAGA AGGAGGCAGG GGATGAGCAG AGCCTCCTCG 480 

GGGCTGTTGC CCCTGGAGCA GCCCCCCGAG ACCTGGCCAC CTTTGCCAGC ACCAGCACCC 540 

TGCATGGACT GGGCCGGGCC TGTGGCCCAG GCCCCCACGG ACTGCGCAGA ACCCTGTGGG 600 

CACTGGCCCT ACTCACCTCG CTGGCTGCCT TCCTGTACCA GGCGGCTGGC CTGGCCCGGG 660 

GCTACCTGAC CCGGCCTCAC CTGGTGGCAA TGGACCCCGC TGCCCCAGCC CCAGTGGCGG 720 

GCTTCCCGGC TGTCACCCTC TGCAATATCA ACCGCTTCOG GCATTCGGCA CTCAGCGATG 780 

CCGACATCTT CCACCTGGCC AATCTGACAG GGCTGCCCCC CAAAGACCGG GATGGGCACC 840 

GTGCGGCTGG CCTGCGCTAC CCAGAGCCTG ACATGGTAGA CATCCTCAAC CGCACTGGCC 900 

ACCAGCTCGC CGACATGCTT AAGAGCTGCA ACTTCAGTGG GCATCACTGC TCCGCCAGCA 960 

ACTTCTCTGT GGTCTATACT CGCTATGGGA AGTGTTACAC CTTCAACGCG GACCCGCGGA 1020 

GCTCGCTGCC CAGCCGGGCA GGGGGCATGG GCAGTGGCCT GGAGATCATG CTGGACATCC 1080 

AGCAGGAGGA GTACCTGCCC ATCTGGAGGG AGACAAATGA GACGTCGTTT GAGGCAGGTA 1140 

TTCGGGTGCA GATCCACAGC CAGGAGGAGC CGCCCTACAT CCACCAGCTG GGGTTCGGGG 1200 

TGTCCCCAGG CTTCCAGACC TTTGTGTCCT GCCAGGAACA GCGGCTGACC TACCTGCCCC 1260 

AGCCCTGGGG CAACTGCCGC GCAGAGAGTG AGCTCAGGGA GCCTGAGCTT CAGGGCTACT 1320 

CGGCCTACAG TGTGTCTGCC TGCCGGCTGC GCTGTGAAAA GGAGGCCGTG CTTCAGCGCT 1380 

GCCACTGCCG GATGGTGCAC ATGCCAGGCA ATGAGACCAT CTGCCCACCA AATATCTACA 1440 

TCGAGTGTGC AGACCACACA CTGGACTCCC TGGGTGGGGG CCCTGAGGGC CCGTGCTTCT 1500 

GCCCCACCCC CTGCAACCTG ACACGCTATG GGAAAGAGAT CTCCATGGTC AGGATCCCCA 1560 

ACAGGGGCTC AGCCCGGTAC CTGGOGAGGA AGTACAACCG CAACGAGACC TACATACGGG 1620 

AGAACTTCCT GGTCCTAGAT GTCTTCTTTG AGGCCCTGAC CTCTGAAGCC ATGGAGCAGC 1680 

GAGCAGCCTA TGGCCTGTCA GCCCTGCTGG GAGACCTCGG GGGACAGATG GGCCTGTTCA 1740 

TTGGGGCCAG CATCCTCACG TTGCTGGAGA TCCTCGACTA CATCTATGAO GTGTCCTGGO 1800 

ATCGACTGAA GCGGGTATGG AGGCGTCCCA AGACCCCCCT GCGGACCTCC ACTGGGGGCA 1860 

TCTCCACTTT GGGGCTTCAG GAGCTGAAGG AACAGAGTCC CTGCCCGAGC CTGGGCCGAG 1920 

CGGAGGGTGG GGGGGTCAGC AGTCTGCTCC CCAATCACCA CCACCCCCAC GGTCCCCCAG 1980 

GAGGTCTCTT TGAAGATTTT GCTTGCTAGG ACGGTGCTGT GACTGAAAGG ACCCAGGAGT 2040 

CTGGGACCCC TCCTGGGATC CCCAGCACAT TCTCCTGCTC CTGGGAGAGG CCTGGGGGCG 2100 

GTGCTCACTG GGAGGGCCAG GACTCAGTTC CTGCTCTCAT CCTCCCCTGC CCTGATGTCA 2160 

GCTGCTTTGC ACAAAGGTCC TTCTTGTCCA CACCCCTTAT CCCCAGGCTG GTGCCCCGGG 2220 

AGGGCTGGAG ACCAGGCCAT GGGCCCTCAC GGAGAGGAAG GGAAGGAAGG AGAGGGAGGG 2280 

GGAGGATAGA GCCCATCCCA GCCGGGGAGG GGGAGCCCTC TGTACATTTG TAAATATTTA 2340 

GGGAAAGCCG GGTGGGGGGA GGGGATACAG ATGTAGAAGG TGGGTAGGGC TACAGGGGTG 2400 

GGTGATTTAG GGACAGCCAG GGTCCCAGCC CCAATGTCAG CAGGATAGGG AGAGCCCCAG 2460 

GACTCAGGAG TGCTGGGCTG GTCCTACTTC CTGCCCCTCT CCAGGCCCAG CTCCCCTCTT 2520 

GGCAGGGGGA GAGGATGGCC CAGCAGGCCT GGCCCAGCTC CCAGTTCCCC CTGCACCAGC 2580 

CCCACCCCTA GAGTCCCTTC TATAGGGAGG GGGCAGGAGA CCTTCCAGAC TTCGGCTGAG 2640 

CTTGGAGGGT GGGAAGGGAG CCTTCTCAGT CCTCTCTCCC TCCAGTCTGA TTTTATAAAG 2700 
TGCTGACGAG 

Seq ID NO: 290 Protein sequence 
Protein Accession #: NP 061144.1 



MPIEIVCKIK 
GPHGLRRTLW 
NRFRHSALSD 
NPSGHHCSAS 
ETNETSFEAG 
ELREPELQGY 
LGGGPEGPCP 
EALTSEAMEQ 
KTPLRTSTGG 



11 

I 

FAEEDAKPKE 
ALALLTSLAA 
ADIFHLANLT 
NFSWYTRYG 
IRVQIHSQEE 
SAYSVSACRL 
CPTPCNLTRY 
RAAYGLSALL 
ISTLGLQELK 



21 
I 

KEAGDEQSLL 
FLYQAAGLAR 
GLPPKDRDGH 
KCYTFNADPR 
PPY1HQLGFG 
RCEKEAVLQR 
GKEISMVRIP 
GDLGGQMGLF 
EQSPCPSLGR 



31 
I 

GAVAPGAAPR 
GYLTRPHLVA 
RAAGLRYPEP 
SSLPSRAGGM 
VSPGFQTPVS 



NRGSARYLAR 
IGASILTLLE 
AEGGGVSSLL 



41 
I 

DLATFASTST 
MDPAAPAPVA 
DMVDILNRTG 
GSGLEIMLDI 
CQEQRLTYLP 
NETICPPNIY 
KYNRNETYIR 
ILDYIYEVSW 



51 
I 

LHGLGRACGP 
GFPAVTLCNI 
HQLADMLKSC 
QQEEYLPIWR 
QFWGNCRAES 
IECADHTLDS 
ENFLVLDVFP 
DRLKRWRRP 
GGLFEDFAC 
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Seq ID NO: 291 DMA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 62.. 895 



1 
I 

CACTGCTCTG 
TATGATCATC 
CAAGGATGGA 
AGAAGCACGG 
TGAAGGCGGC 
TGTCTGTGCT 
GCCCAACTGT 
TGAAAGATGG 
AGATCCAAAG 
CTGCTACTGG 
TGACCTTGAA 
TGATGTCCAT 
TACAGGAAAT 
CCAAATCAAA 
TACTACTTCT 
AAAAAAAGGA 



11 
I 

AGAATTTGTG 
TTAATTTACT 
ATTTTTCATA 
TCTGGCAAAT 
CATCTCGCAA 
GCTGGATGGA 
GGATTTGGAA 
QATGCCTATT 
CAAATTTTTA 
CACATTAGAC 
GATGACCCAG 
GGCTTTGTGG 
GTCATGACCT 
TATGTTGCAA 
ACTGGAAATA 
TGATCAAAAC 



21 
I 

AGCAGCCCCT 
TATTTCTCTT 
ACTCCATATG 
ACAAGCTCAC 
CTTACAAGCA 
TGGCTAAGGG 
AAACTGGCAT 
GCTACAACCC 
AATCTCCAGG 
TCAAGTATGG 
GTTGCTTGGC 
GAAGATACTG 
TGAAGTTTCT 
TGGATCCTGT 
AAAACTTTTT 
ACACAGTGTT 



31 
I 

AACAGGCTGT 
GCTATGGGAA 
GCTTGAACGA 
CTACGCAGAA 
GCTAGAGGCA 
CAGAGTTGGA 
TATTGATTAT 
ACACGCAAAG 
CTTCCCAAAT 
TCAGCGTATT 
TGATTATGTT 
TGGAGATGAG 
AAGTGATGCT 
ATCCAAATCC 
AGCTGGAAGA 
TATGTTGGAA 



41 
1 

TACTTCACTA 
GACACTCAAG 
GCAGCCGGTG 
GCTAAGGCGG 
GCCAGAAAAA 
TACCCCATTG 
GGAATCCGTC 
GAGTGTGGTG 
GAGTACGAAG 
CACCTGAGTT 
GAAATATATG 
CTTCCAGATG 
TCAGTGACAG 
AGTCAAGGAA 
TTTAGCCACT 
TCTTTTGGAA 



51 
I 

CAACTGACGA 
GATGGGGATT 
TGTACCACAG 
TGTGTGAATT 
TTGGATTTCA 
TGAAGCCAGG 
TCAATAGGAG 
GCGTCTTTAC 
ATAACCAAAT 
TTTTAGATTT 
ACAGTTACGA 
ACATCATCAG 
CTGGAGGTTT 
AAAATACAAG 
TATAAAAAAA 
CTCCTTTGAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
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CTCACTGTTA 
TAGGGAAAAT 
ACTGCATAGA 
TTTGTGGTAT 
TCTATGTACA 
TCATTGATTA 
TGTTTTATGC 
ATTGTTGCAA 



TTATTAACAT 
TGGAAAATAT 
AATAACAAGC 
ATGTATATAT 
GTTTTGTATT 
TTCTACAAAA 
ATTATTTAAG 
TAAATATCCT 



TTATTTATTA 
AGGAAACTTT 
GTTAACATTT 
GTACCTATAT 
ATACTTTTTA 
ACATGATTTT 
CCTGTCTCTA 
TGAACACACA 



TTTTTCTAAA 
AAA CG AG AAA 
TCATATTTTT 
GTATTTGCAT 
AATCTTGAAC 
AAACAGCTGT 
TTGTTGGAAT 
AAAAAAAAAA 



TGTGAAAGCA 
ATGAAACCTC 
TTCTTTCAGT 
TTGAAATTTT 
TTTATAAACA 
AAAATATTCT 
TTCAGGTCAT 
AA 



ATACATAATT 
TCATAATCCC 
CATTTTTCTA 
GGAATCCTGC 
TTTTCTGAAA 
ATGATATGAA 
TTTCATAAAT 



Seq ID NO: 292 Protein sequence 
Protein Accession th Eos sequence 



1 11 

I I 
MIILIYLFLL LWEDTQGWGF 
EGGHLATYKQ LEAARKIGFH 
ERWDAYCYNP HAKECGGVFT 
DLEDDPGCLA DYVEIYDSYD 
QIKYVAMDPV SKSSQGKNTS 



21 31 41 51 

till 
KDGIFHNSIW LERAAGVYHR EARSGKYKLT YAEAKAVCEF 
VCAAGWKAKG RVGYPIVKPG PNCGFGKTGI IDYGIRLNRS 
DPKQIFKSPG FPNEYEDNQI CYWHIRLKYG QRIHLSFLDF 
DVHGFVGRYC GDELPDDIIS TGNVMTLKFL SDASVTAGGF 
TTSTGNKNFL AGRFSHL 



Seq ID NO: 293 DNA sequence 

Nucleic Acid Accession #: NM_007115.1 

Coding sequence: 69- .902 



GAATTCGCAC 
CTGACGATAT 
GGGGATTCAA 
ACCACAGAGA 
GTGAATTTGA 
GATTTCATGT 
AGCCAGGGCC 
ATAGGAGTGA 
TCTTTACAGA 
ACCAAATCTG 
TAGATTTTGA 
GTTACGATGA 
TCATCAGTAC 
GAGGTTTCCA 
ATACAAGTAC 
AAAAAAAAAA 
TTGATCTCAC 
TAATTTAGGG 
ATCCCACTGC 
TTGTATTTGT 
CCTGCTCTAT 
TGAAATCATT 
ATGAATGTTT 
TAAATATTGT 



11 
I 

TGCTCTGAGA 
GATCATCTTA 
GGATGGAATT 
AGCACGGTCT 
AGGCGGCCAT 
CTGTGCTGCT 
CAACTGATGA 
AAGATGGGAT 
TCCAAAGCGA 
CTACTGGCAC 
CCTTGAAGAT 
TGTCCATGGC 
AGGAAATGTC 
AATCAAATAT 
TACTTCTACT 
AAGGATGATC 
TGTTATTATT 
AAAATTGGAA 
ATAGAAATAA 
GGTATATGTA 
GTACAGTTTT 
GATTATTCTA 
TATGCATTAT 
TGCAATAAAT 



21 
I 

ATTTGTGAGC 
ATTTACTTAT 
TTTCATAACT 
GGCAAATACA 
CTCGCAACTT 
GGATGGATGG 
TTTGGAAAAA 
GCCTATTGCT 
ATTTTTAAAT 
ATTAGACTCA 
GACCCAGGTT 
TTTGTGGGAA 
ATGACCTTGA 
GTTGCAATGG 
GGAAATAAAA 
AAAACACACA 
AACATTTATT 
AATATAGGAA 
CAAGCGTTAA 
TATATGTACC 
GTATTATACT 
CAAAAACATG 
TTAAGCCTGT 
ATCCTTCGGA 



31 
I 

AGCCCCTAAC 
TTCTCTTGCT 
CCATATGGCT 
AGCTCACCTA 
ACAAGCAGCT 
CTAAGGGCAG 
CTGGCATTAT 
ACAACCCACA 
CTCCAGGCTT 
AGTATGGTCA 
GCTTGGCTGA 
GATACTGTGG 
AGTTTCTAAG 
ATCCTGTATC 
ACTTTTTAGC 
GTGTTTATGT 
TATTATTTTT 
ACTTTAAACG 
CATTTTCATA 
TATATGTATT 
TTTTAAATCT 
ATTTTAAACA 
CTCTATTGTT 
ATTC 



41 
I 

AGGCTGTTAC 
ATGGGAAGAC 
TGAACGAGCA 
CGCAGAAGCT 
AGAGGCAGCC 
AGTTGGATAC 
TGATTATGGA 
CGCAAAGGAG 
CCCAAATGAG 
GCGTATTCAC 
TTATGTTGAA 
AGATGAGCTT 
TGATGCTTCA 
CAAATCCAGT 
TGGAAGATTT 
TGGAATCTTT 
CTAAATGTGA 
AGAAAATGAA 
TTTTTTTCTT 
TGCATTTGAA 
TGAACTTTAT 
GCTGTAAAAT 
GGAATTTCAG 



1020 
1080 
1140 
1200 
1260 
1320 
1380 



51 
I 

TTCACTACAA 
ACTCAAGGAT 
GCCGGTGTGT 
AAGGCGGTGT 
AGAAAAATTG 
CCCATTGTGA 
ATCCGTCTCA 
TGTGGTGGCG 
TACGAAGATA 
CTGAGTTTTT 
ATATATGACA 
CCAGATGACA 
GTGACAGCTG 
CAAGGAAAAA 
AGCCACTTAT 
TGGAACTCCT 
AAGAAATACA 
ACCTCTCATA 
TCAGTCATTT 
ATTTTGGAAT 
GAACATTTTC 
ATTCTATGAT 
GTCATTTTCA 



60 
120 
180 
240 



60 
120 
1B0 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



Seq ID NO: 294 Protein sequence 
Protein Accession #: NP_009046.1 

1 11 21 31 41 51 

I I I I I I 

MIILIYLFLL LWEDTQGWGF KDGIFHNSIW LERAAGVYHR EARSGKYKLT YAEAKAVCEF 
EGGHLATYKQ LEAARKIGFH VCAAGWMAKG RVGYPIVKPG PNXXFGKTGI IDYGIRLNRS 
ERWDAYCYNP HAKECGGVFT DPKRIFKSPG FPNEYEDNQI CYWHIRLKYG QRIHLSFLDF 
DLEDDPGCLA DYVEIYDSYD DVHGFVGRYC GDELPDDIIS TGNVMTLKFL SDASVTAGGF 
QIKYVAMDPV SKSSQGKNTS TTSTGNKNFL AGRFSHL 



60 
120 
180 
240 



Seq ID NO: 295 DNA sequence 
65 Nucleic Acid Accession #: NMJ>01218.2 
Coding sequence: 116.. 1160 

1 11 21 31 41 51 

-7A I I I 1 1 1 

/U GTACTCGCCA CGGCACCCAG GCTGCGCGCA CGCGGTCCCG GTGTGCAGCT GGAGAGCGAG 60 

CGGCCACCGG GAGCCCCCGG CACAGCCCGC GCCCGCCCCG CAGGAGCCCG CGAAGATGCC 120 

CCGGCGCAGC CTGCACGCGG CGGCCGTGCT CCTGCTGGTG ATCTTAAAGG AACAGCCTTC 180 

CAGCCCGGCC CCAGTGAACG GTTCCAAGTG GACTTATTTT GGTCCTGATG GGGAGAATAG 240 

CTGGTCCAAG AAGTACCCGT CGTGTGGGGG CCTGCTGCAG TCCCCCATAG ACCTGCACAG 300 

75 TGACATCCTC CAGTATGACG CCAGCCTCAC GCCCCTCGAG TTCCAAGGCT ACAATCTGTC 360 

TGCCAACAAG CAGTTTCTCC TGACCAACAA TGGCCATTCA GTGAAGCTGA ACCTGCCCTC 420 

GGACATGCAC ATCCAGGGCC TCCAGTCTCG CTACAGTGCC ACGCAGCTGC ACCTGCACTG 480 

GGGGAACCCG AATGACCCGC ACGGCTCTGA GCACACCGTC AGCGGACAGC ACTTCGCCGC 540 

c _ CGAGCTGCAC ATTGTCCATT ATAACTCAGA CCTTTATCCT GACGCCAGCA CTGCCAGCAA 600 

80 CAAGTCAGAA GGCCTCGCTG TCCTGGCTGT TCTCATTGAG ATGGGCTCCT TCAATCCGTC 660 

CTATGACAAG ATCTTCAGTC ACCTTCAACA TGTAAAGTAC AAAGGCCAGG AAGCATTCGT 720 

CCCGGGATTC AACATTGAAG AGCTGCTTCC GGAGAGGACC GCTGAATATT ACCGCTACCG 780 

GGGGTCCCTG ACCACACCCC CTTGCAACCC CACTGTGCTC TGGACAGTTT TCCGAAACCC 840 

CGTGCAAATT TCCCAGGAGC AGCTGCTGGC TTTGGAGACA GCCCTGTACT GCACACACAT 900 



1066 



WO 03/042661 



PCT/US02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



GGACGACCCT 
GAGGCTGGTA 
GGGCATCATC 
GTCCATTTGG 
CAAGCCAGCC 
ACATCCAGGA 
ACCTCAAGGT 
TGGGTGCCTC 
TGCACTCTCA 
GAAGTGGTCT 
AAAACACTTT 
GCTTCTCCAA 
CAACTTCTTT 
CTTTAATCTG 
AGAAAAAAGA 
CCTCTGGATC 
GGTGGGGATG 
TCTGGGAGAG 
TGTAAGGACA 
ATGTTTTATA 
CTGTGTCCTG 
AATAGGCAGA 
CACTTCTGCA 
CTGAAGATGT 
ATCTGTGCAG 
ATACTCCATA 
TTCCTCCCGC 
ACCCCTAGTT 
GACAGCATGA 
CCCTCTGACT 
AGTAAGCCAA 
AAAAAAAAAA 



TCCCCCAGAG 
TACACCTCCT 
CTCTCACTGG 
CTTTTCAGAA 
ACCAAGATGG 
AGGACCTTGC 
GTTCTCTGTA 
CTTGCCTTGG 
GACCAAGCAG 
GAATTCTGGA 
CATCCGGGGC 
GCTTTCAGAC 
CCTCTGGAGA 
CACCTTACAA 
AAACAAGAGA 
ACAGACACTG 
AGAAGGGAGA 
TCTCACTTTG 
GAATAATACA 
CTCTACAGAT 
GAGGAGACAT 
AACAAAATGA 
GGAT CCCTAC 
CAAATCGTGG 
GGGGCTGGCA 
TTCTTCCCCG 
AGGTGAGAAC 
GAGAGCACCT 
AATGTCCTCT 
CTAAGAATTC 
GGTCATGCCA 



AAATGATCAA 
TCTCCCAAGT 
CCCTGGCTGG 
GGAAGAGTAT 
AGACTGAGGC 
TTTGGACCCT 
GCTCAATCTG 
GACCATGGCC 
CAGGAATTCA 
ATCACAAACC 
TTTGCCAGAG 
AAGAATGTGC 
CGGGGCATCT 
CTCGGGGACA 
TATACATTGT 
CACAGACCTT 
GAGAGCTAGA 
GAATCAGAAT 
ATATTAAGTC 
TTTACAAATA 
GAGTTCCGAG 
CTGTAGAACT 
TGCCAACCTG 
TTTAGATCAA 
TCATGTATTT 
AATCACACAG 
CCTGTGGAGA 
TGCAGTTCCC 
TGAAGCATAG 
TCTCTTCTGG 
CTGCACTCTA 



CAACTTCCGG 
GCAAGTCTGT 
CATTCTTGGC 
CAAAAAAGGT 
CCACGCTTGA 
ACACACTTCG 
CAAACATGCC 
ACCCCAGAGC 
AAGCTGCTTG 
AAGCCATGCT 
CGTGCTTTCA 
ACTCTCTGCT 
CCCTCTGATT 
AATGGGGACA 
GATATATTAG 
AGGGAATGGC 
GGGACAAAGA 
TGGAATCACA 
CAAATCCAAC 
ATGAGGCTGT 
ATGACCCAAT 
TATTCTCTGT 
GAATGGAGAC 
ATATATTTCA 
AGGGGCAAGT 
ACAGTTTCTG 
TGAGTCAGTG 
CGAGAACTTT 
CTTTTTAAAT 
AATCGCTTGA 
GCCTGGGTGA 



CAGGTCCAGA 
ACTGCGGCAG 
ATCTGTATTG 
GATAACAAGG 
GGTCCCCGGA 
GCTCTCTGGA 
AGGCCTCAGG 
CATCCGATCG 
CTGTAACTGT 
GGTGGGCCAT 
AGTGTCCTGG 
TAGGTTTTGC 
TCCTTCTGCT 
GGAAGGATCA 
GGACACTTTC 
AGGTTCAAGT 
GAATGAGAAG 
TTCTGTTTAT 
CTCCTGTCAG 
TCCTTGAAAA 
CTGCCTTTGA 
AGGCCAAATT 
TTTTATCTAC 
AGCTATAAAA 
AATAATGGAA 
ACAGGCGCAA 
CCATGACTGA 
CTGATTCACA 
ATCTTTTTCC 
ACCCAGGAGG 
CAGAGCGAGA 



AGTTCGATGA 
GACTGAGTCT 
TGGTGGTGGT 
GAGTCATTTA 
GCTCCCGGGC 
CACTTGCGAC 
GATCCTCTGC 
ATGGATGGGA 
GTGAGATTGT 
TAATGGTTGG 
AAAGTCTGCT 
TTGGGAAACT 
ATGACAAAAC 
AGTTGTAGAG 
ACAGTCCTGT 
TCCACTTCTT 
ACATGGATGA 
CAAGCCATAA 
TGGAGCAGTT 
TGTGTTGTTG 
ATCTGGAGGA 
TCATTTCAGC 
TTCTCTCTCT 
GCAGGAGGTT 
TGCTACTAAG 
CTCCTCCATT 
GAAGGAACCG 
GTCTCATTTT 
TTCTACTCCT 
GGGAGGTTGC 
CTCCATCTCA 



960 
1020 

ioao 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1600 

1740 

1800 

I860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 



Seq ID NO: 296 Protein sequence 
Protein Accession tft NP_001209.1 



1 
I 

MPRRSLHAAA 
HSDILQYDAS 
HWGNPNDPHG 
PSYDKIFSHL 
NPVQISQEQL 
SLGIILSLAL 



11 
I 

VLLUVILKEQ 
LTPLEFQGYN 
SEHTVSGQHP 
QHVKYKGQEA 
LALETALYCT 
AGILGICIW 



21 
1 

PSSPAPVNGS 
LSANKQFLLT 
AAELHIVHYN 
FVPGFNIEEL 
HMDDPSPREM 
WSIWLFRRK 



31 
I 

KWTYFGPDQE 
NNGHSVKLNI* 
SDLYPDASTA 
LPERTAEYYR 
INNFRQVQKF 
SIKKGDNKGV 



41 
I 

NSWSKKYPSC 
PSDMHIQGLQ 
SNKSEGLAVL 
YRGSLTTPPC 
DERLVYTSFS 
IYKPATKMET 



51 
I 

GGLLQSPIDL 
SRYSATQLHli 
AVLIEMGSFN 
NPTVLWTVFR 
QVQVCTAAGL 
EAHA 



60 
120 
180 
240 
300 



Seq ID NO: 297 DNA sequence 

Nucleic Acid Accession fh NM_006632. 

Coding sequence : 377 . . 1582 



ACGCGTCCGC 
GCGGCCGTGC 
GACAAGAAGA 
CTTAACAAGC 
GGCGTGCTTC 
CTAGGAGAGG 
CAAGAACGCA 
ATGTTCTGCT 
ACAAAATGTC 
CCAGCTCAAT 
CCCAAAGAGT 
GTGGGGCCCT 
GGGATGCTTT 
TGTCTTCTAT 
TTATGATGAC 
CTCCTTGAAA 
ATCTCTACCC 
AATGGTTGTA 
TGGACTTCTA 
TCTGGCAGAT 
AATTTTAGGA 
CTATATCACA 
AGGGATTTAT 
ATCAAGAGGA 
TAGTCAGGAC 
GTTAGGACTA 
GAGAAAACTC 
TATTGCATAA 
GTGGCTCTTT 
ATGTAAGATG 



11 
I 

CCACGCGTCC 
TGGAGCACCT 
CCCGCATCAT 
TGCTGGGCGA 
TGCCCCAGAA 
AGAAACACAA 
CAAGATATGC 
CGCTATGGAA 
ATCATGAACA 
GATTCCTCTG 
CTTCCTGCAA 
CGACAAGAAC 
ACTGCCATCC 
ATCTTTGGAG 
CCCTTTTCCT 
CAACAGGTCG 
ATTTGGTCCA 
TACATACCAA 
TCTGCCCTTC 
TTCCTTCTAA 
AGTCTCCCCT 
GCAACTGCCT 
ATCAATGTCT 
TTTTCGAGCA 
CCTGAGTTTG 
CTCTTCTACC 
ACTCGTTTAT 
AATAGAAGGC 
TCAATTATGA 
AATAAAAATT 



21 

I 

GCCCACGCGT 
GACCGCCGAG 
CCTGCGCCAC 
AGTCACCATC 
GACCAAGAGC 
TGGCCACCAA 
AAGTGGATGA 
TAGCCCTOGT 
TCACCATGGT 
AGGTGCTGCC 
AGTCCTCAAT 
GAAGCAGACT 
TCATAGGTGG 
GTGTTGGCTG 
ATCCATGGAT 
GGTCTTCTAA 
TATGTTTAGG 
CTTACATCAG 
CTTTTATTGT 
CCAAAAAGTT 
CTTCAGCACT 
TGCTGACGCT 
TAGATATTGC 
TAGCACCTGT 
GGTGGAGGAA 
TCATATTTGG 
GAAGTTATCC 
TTCCGTGATG 
GATCAGTTCA 
CAAATAAAAT 



31 
I 

CCGGTCGGGG 
ATCCTGGAGC 
CTGTAGCTGG 
GCGCAGGGCG 
CACCACAAGG 
GACAGAGTTG 
GACACTGATC 
CTTACATTTC 
AGCCATGGTC 
TGTTGACTCA 
ACTTGGGGGT 
CTGCAGCATT 
CTTCATTAGT 
TGTCTGCTGC 
AAGCACCTCA 
GCAGCCTCTT 
CTGTTTCAGC 
CTCTGTGTAC 
TGCCTGGGTC 
TAGACTCATC 
CATTGTGTCT 
CTCTTGCGGA 
TCCAAGGTAT 
CATTGTACCC 
TGTCTTCTTC 
AGAAGCAGAT 
CACCTTGGAT 
AAAATACCAG 
TTATTTTATT 
GATAACTAAG 



41 
I 

CCAGAGCGCA 
TGGCTGGCAA 
CCATTOGCAA 
GTGTCCTGCC 
CCAAGGGTGA 
AGTCCCACAG 
CCCAGGAAAG 
TGCAATTTCA 
AACAGCACAA 
TTTGGTGGCC 
CAGTTTGCAA 
GCTTTATCAG 
GAAACCCTTG 
CTTCTCTGGT 
GAAAAAGAAT 
CCCATCAAAG 
CATCAATGGT 
CATGTTAACA 
ATAGGCATGG 
ACTGTGAGGA 
CTGCCTTACC 
TTAAGCACAT 
TCCAGTTTTC 
ACTGTCAGCG 
TTGCTGTTTG 
GTCCAAGAAT 
GGAAAAGTCA 
TGAAAAGATT 
CAGACTTTTT 
AAAAAAAAAA 



51 
I 

GGTGTACCTG 
CCCGGCCCGC 
CGGCGAGGAG 
CAACATTCAG 
AAACCATTCA 
CAAGGGAGAG 
GTCCAAGTTT 
CAACGATAGC 
GCOCTCAATC 
TAAGTAAAGC 
TTTGGGAAAA 
GAATGTTACT 
GGTGGCCCTT 
TTGTTGTGAT 
ACATCATATC 
CTATGCTCAG 
TAGTTAGCAC 
TCAGAGACAA 
TGGGAGGCTA 
AAATTGCCAC 
TCAATTCCGG 
TGTGTCAGTC 
TCATGGGAGC 
GATTTCTTCT 
CCGTTAACCT 
GGGCTAAAGA 
TTAGGCACCG 
TTTTTTTCCT 
TTTGAGAGAA 
AAAAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



Seq ID NO: 298 Protein sequence 
Protein Accession #: NP_006623.1 



1067 



WO 03/042661 



PCT/US02/36810 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 



75 
80 



1 11 21 31 41 51 

I I i I I I 

MQVDETLIPR KGPSLCSARY G IALVLHFCN FTTIAQNVIM NITMVAMVNS TSPQSQLNDS 60 

SEVLPVDSFG GLSKAPKSLP AKSSILGGQF AIWEKWGPPQ ERSRLCSIAL SGMLLGCFTA 120 

ILIGGFISET LGNPFVFYIF GGVGCVCCLL WFWIYDDPF SYPWISTSEK EYIISSLKQQ 180 

VGSSKQPLPI KAMLRSLPIW SICLGCFSHQ WLVSTMWYI PTYISSVYHV NIRDNGLLSA 240 

LPFIVAWVIG MVGGYLADFL LTKKFRLITV RKIATILGSL PSSALIVSLP YLNSGYITAT 300 

ALLTLSCGLS TLCQSGIYIN VLDIAPRYSS FLMGASRGFS SIAPVIVPTV SGFLLSQDPE 360 
FGWRNVFFIiL FAVNLLGLLF YLIFGEADVQ EWAKERKLTR L 

Seq ID NO: 299 DNA sequence 

Nucleic Acid Accession #: NM_003058.1 

Coding sequence: 145.. 1812 

1 11 21 31 41 51 

I I I I 1 I 

GGCCCTGCCC TGAAGGCTGG TCACTTGCAG AGGTAAACTC CCCTCTTTGA CTTCTGGCCA 60 

GGGTTTGTGC TGAGCTGGCT GCAGCCGCTC TCAGCCTCGC TCCGGGCACG TCGGGCAGCC 120 

TCGGGCCCTC CTGCCTGCAG GATCATGCCC ACCACCGTGG ACGATGTCCT GGAGCATGGA 180 

GGGGAGTTTC ACTTTTTCCA GAAGCAAATG TTTTTCCTCT TGGCTCTGCT CTCGGCTACC 240 

TTCGCGCCCA TCTACGTGGG CATCGTCTTC CTGGGCTTCA CCCCTGACCA CCGCTGCCGG 300 

AGCCCCGGAG TGGCCGAGCT GAGTCTGCGC TGCGGCTGGA GTCCTGCAGA GGAACTGAAC 360 

TACACGGTGC CGGGCCCAGG ACCTGCGGGC GAAGCCTCCC CAAGACAGTG TAGGCGCTAC 420 

GAGGTGGACT GGAACCAGAG CACCTTTGAC TGCGTGGACC CCCTGGCCAG CCTGGACACC 480 

AACAGGAGCC GCCTGCCACT GGGCCCCTGC CGGGAOGGCT GGGTGTACGA GACGCCTGGC 540 

TCGTCCATCG TCACCGAGTT TAACCTGGTA TGTGCCAACT CCTGGATGTT GGACCTATTC 600 

CAGTCATCAG TGAATGTAGG ATTCTTTATT GGCTCTATGA GTATCGGCTA CATAGCAGAC 660 

AGGTTTGGCC GTAAGCTCTG CCTCCTAACT ACAGTCCTCA TAAATGCTGC AGCTGGAGTT 720 

CTCATGGCCA TTTCCCCAAC CTATACGTGG ATGTTAATTT TTCGCTTAAT CCAAGGACTG 780 

GTCAGCAAAG CAGGCTGGTT AATAGGCTAC ATCCTGATTA CAGAATTTGT TGGGCGGAGA 840 

TATCGGAGAA CAGTGGGGAT TTTTTACCAA GTTGCCTATA CAGTTGGGCT CCTGGTGCTA 900 

GCTGGGGTGG CTTACGCACT TCCTCACTGG AGGTGGTTGC AGTTCACAGT TGCTCTGCCC 960 

AACTTCTTCT TCTTGCTCTA TTACTGGTGC ATACCTGAGT CTCCCAGGTG GCTGATCTCC 1020 

CAGAATAAGA ATGCTGAAGC CATGAGAATC ATTAAGCACA TCGCAAAGAA AAATGGAAAA 1080 

TCTCTACCCG CCTCCCTTCA GCGCCTGAGA CTTGAAGAGG AAACTGGCAA GAAATTGAAC 1140 

CCTTCATTTC TTGACTTGGT CAGAACTCCT CAGATAAGGA AACATACTAT GATATTGATG 1200 

TACAACTGGT TCACGAGCTC TGTGCTCTAC CAGGGCCTCA TCATGCACAT GGGCCTTGCA 1260 

GGTGACAATA TCTACCTGGA TTTCTTCTAC TCTGCCCTGG TTGAATTCCC AGCTGCCTTC 1320 

ATGATCATCC TCACCATCGA CCGCATCGGA CGCCGTTACC CTTGGGCTGC ATCAAATATG 1380 

GTTGCAGGGG CAGCCTGTCT GGCCTCAGTT TTTATACCTG GTGATCTACA ATGGCTAAAA 1440 

ATTATTATCT CATGCTTGGG AAGAATGGGG ATCACAATGG CCTATGAGAT AGTCTGCCTG 1500 

GTCAATGCTG AGCTGTACCC . CACATTCATT AGGAATCTTG GCGTCCACAT CTGTTCCTCA 1560 

ATGTGTGACA TTGGTGGCAT CATCACGCCA TTCCTGGTCT ACCGGCTCAC TAACATCTGG 1620 

CTTGAGCTCC CGCTGATGGT TTTCGGCGTA CTTGGCTTGG TTGCTGGAGG TCTGGTGCTG 1680 

TTGCTTCCAG AAACTAAAGG GAAAGCTTTG CCTGAGACCA TCGAGGAAGC CGAAAATATG 1740 

CAAAGACCAA GAAAAAATAA AGAAAAGATG ATTTACCTCC AAGTTCAGAA ACTAGACATT 1800 

CCATTGAACT AAGAAGAGAG ACCGTTGCTG CTGTCATGAC CTAGCTTTGA TGGCAGCAAG 1860 

ACCAAAAGTA GAAATCCCTG CACTCATCAC AAAGCCCATA CAACTCAACC AAACTTACCC 1920 

CTGAGCCCTA TCAACCTAGG TCTACAGCCA GTGGAGTCTA TTGTACACTG TGGAAAAATA 1980 

CCCATGGGAC CAGATCCTGC CAAATTCTTC CAGCTCACTT TATTCTCAGC ATTCCTAGGA 2040 

CATTGGACAT TGGTTTTCTG GAGGGTTTTT TTTCCGATCT TTGTATTTTT TTAAATTTGA 2100 

TTCTTTTCTT TGCAATGCTA GCAACCAGAA TACATAGGGG AACTGTGGGC TAGGCAAANA 2160 

AAATAGAAAA AGTGTGAAAA ACAGTAAAGT TGGGAGAGGA GCATCTATTT TCTTAAAGAA 2220 
ATAAAACACC NAAAACAAAA AAAAAAAAAA AAAAAAA 

Seq ID NO: 300 Protein sequence 
Protein Accession #: NP 003049.1 



1 11 21 31 41 51 

I I I I I I 

MPTTVDDVLE HGGEFHFFQK QMFFLLALLS ATFAPIYVGI VFLGFTPDHR CRSPGVAELS 60 

LRCGWSPAEE LNYTVPGPGP AGEASPRQCR RYEVDWNQST FDCVDPLASL DTNRSRLPLG 120 

PCRDGWVYET PGSSIVTEFN LVCANSWMLD LFQSSVNVGF PIGSMSIGYI ADRFGRKLCL 180 

05 LTTVLINAAA GVLMAISPTY TWMLIFRLIQ GLVSKAGWLI GYILITEFVG RRYRRTVGIF 240 

' YQVAYTVGLL VLAGVAYALP HWRWLQFTVA LPNFFFLLYY WCIPESPRWL ISQNKNAEAM 300 

RIIKHIAKKN GKSLPASLQR LRLEEETGKK LNPSFLDLVR TPQIRKHTMI LMYNWFTSSV 360 

LYQGLIMHMG LAGDNIYLDF FYSALVEFPA AFMIILTIDR IGRRYPWAAS NMVAGAACLA 420 

SVFIPGDLQW LKIIISCLGR MGITMAYEIV CLVNAELYPT FIRNLGVHIC SSMCDIGGII 480 

70 TPFLVYRLTN IWLELPLMVF GVLGLVAGGL VLLLPETKGK ALPETIEEAE NMQRPRKNKE 540 
KMIYLQVQKL DIPI2J 



Seq ID NO: 301 DNA sequence 

Nucleic Acid Accession #: NM_012206.1 

Coding sequence - 52 . .1131 

1 11 21 31 41 51 

I I I I I I 

GTTACCCAGC ATTGTGAGTG ACAGAGCCTG GATCTGAACG CTGATCCCAT AATGCATCCT 60 
CAAGTGGTCA TCTTAAGCCT CATCCTACAT CTGGCAGATT CTGTAGCTGG TTCTGTAAAG 120 
GTTGGTGGAG AGGCAGGTCC ATCTGTCACA CTACCCTGCC ACTACAGTGG AGCTGTCACA 180 
TCAATGTGCT GGAATAGAGG CTCATGTTCT CTATTCACAT GCCAAAATGG CATTGTCTGG 240 
ACCAATGGAA CCCACGTCAC CTATCGGAAG GACACACGCT ATAAGCTATT GGGGGACCTT 300 
TCAAGAAGGG ATGTCTCTTT GACCATAGAA AATACAGCTG TGTCTGACAG TGGCGTATAT 360 



1068 



WO 03/042661 



PCT/US02/36810 
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TGTTGCOGTG 
ATTGTGCCAC 
GTTCGAACGA 
ACAATGAGCA 
AGCGTTCCAA 
TCTACCTTTG 
CCATCTTCAC 
GAACCCACCA 
TCTTCAGATG 
ACGGCCAATA 
GCTCTTTTGG 
AGTGTTTCAT 
CAAGCAGAAG 
GTGCTCTTTG 
ATGTCTTTTA 
CAGTGATACT 
AATGTAGTCC 
TCTCTCAAAC 



TTGAGCACCG 
CCAAGGTCAC 
GCACCACTGT 
TTCCAACGAC 
CGACAACGAG 
TTCCTCCAAT 
CTCAGCCAGC 
GCTCACCATT 
GCCTTTGGAA 
CCACTAAAGG 
GTGTCATCAT 
TTAGCAGCCT 
ACAATATCTA 
AGAGTTTACG 
GACTCCAAGA 
GGGTAGAGTA 
TAATTTGTTT 
ATGAACACTT 



TGGGTGGTTC 
GACTACTCCA 
TCCAACGACA 
AAOGACTGTT 
CATTCCAACA 
GCCTTTGCCC 
AGAAACCCAC 
GTACTCTTAC 
TAACAATCAA 
AATCTATGCT 
TGCCAAAAAG 
TCAAATTAAA 
CATTGAGAAT 
CCCATGACTG 
CAATTTTTCT 
ACTCTCCCAC 
TGCTAAAACT 
TAGAATTGTA 



AATGACATGA 
ATTGTCACAA 
ACGACTGTTC 
CCGACGACAA 
ACAACAAGTG 
AGGCAGAACC 
CCTACGACAC 
ACAACAGATG 
ACTCAACTGT 
GGAGTCTGTA 
TATTTCTTCA 
GCTTTGCAAA 
AGTCTTTATG 
CAGAAGACTG 
GTTTCAGTTT 
TCCAAACTGT 
GGCTCAATCC 
TGTTCTCTTT 



AAATCACCGT 
CTGTTCCAAC 
CAACGACAAC 
TGACTGTTTC 
TTCCAGTGAC 
ATGAACCAGT 
TGCAGGGAGC 
GGAATGACAC 
TCCTAGAACA 
TTTCTGTCTT 
AAAAGGAGGT 
ATGCAGTTGA 
CCACGGACTA 
AACAGGTATC 
CATCTGGCAT 
GTATAGTCAA 
TTCTGATCAT 
AGACCCCATA 



ATCATTGGAG 
CGTCAOGACT 
TGTTCCAACA 
AACGACAACG 
AACAACGGTC 
AGCCACTTCA 
AATAAGGAGA 
CGTGACAGAG 
TAGTCTACTG 
GGTGCTTCTT 
TCAACAACTA 
AAAGGAAGTC 
AGACCCAGTG 
AGCACATCAG 
TCCAACATGT 
CCTCATCATT 
TGCAGAGTTT 
AATCCTGTAT 



420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



Seq ID NO: 302 Protein sequence 
Protein Accession #: NP_03633B.l 



1 
I 

MHPQWHjSL 
IVWTNGTHVT 
SLEIVPPKVT 
TTTSVPTTTS 
IRREPTSSPL 
VLLALLGVII 



11 
I 

ILHLADSVAG 
YRKDTRYKL.L 
TTPIVTTVPT 
IPTTTSVPVT 
YSYTTDGNDT 
AKKYFPKKEV 



21 
I 

SVKVGGEAGP 
GDLSRRDVSL 
VTTVRTSTTV 
TTVSTFVPPM 
VTESSDGLWN 
QQLSVSFSSL 



31 
I 

SVTLPCHYSG 
TIENTAVSDS 
PTTTTVPTTT 
PLPRQNHEPV 
NNQTQLFLEH 
QIKALQNAVB 



41 
I 

AVTSMCWNRG 
GVYCCRVEHR 
VPTTMSIPTT 
ATSPSSPQPA 
SLLTANTTKG 
KEVQAEDNIY 



51 
I 

SCSLFTCQNG 
GWFNDMKITV 
TTVPTTMTVS 
ETHPTTLQGA 
IYAGVCISVL 
IENSLYATD 



60 
120 
180 
240 
300 



Seq ID NO: 303 dna sequence 

Nucleic Acid Accession #: NM_001044. 

Coding sequence: 129.. 1991 



1 11 21 31 41 51 

I I I I I I 

ACCGCTCCGG AGCGGGAGGG GAGGCTTCGC GGAAOGCTCT CGGCGCCAGG ACTCGCGTGC 60 

AAAGCCCAGG CCCGGGCGGC CAGACCAAGA GGGAAGAAGC ACAGAATTCC TCAACTCCCA 120 

GTGTGCCCAT GAGTAAGAGC AAATGCTCCG TGGGACTCAT GTCTTCCGTG GTGGCCCCGG 180 

CTAAGGAGCC CAATGCCGTG GGCCCGAAGG AGGTGGAGCT CATCCTTGTC AAGGAGCAGA 240 

ACGGAGTGCA GCTCACCAGC TCCACCCTCA CCAACCCGCG GCAGAGCCCC GT GGAGG CCC 300 

AGGATCGGGA GACCTGGGGC AAGAAGATCG ACTTTCTCCT GTCCGTCATT GGCTTTGCTG 360 

TGGACCTGGC CAACGTCTGG CGGTTCCCCT ACCTGTGCTA CAAAAATGGT GGCGGTGCCT 420 

TCCTGGTCCC CTACCTGCTC TTCATGGTCA TTGCTGGGAT GCCACTTTTC TACATGGAGC 480 

TGGCCCTCGG CCAGTTCAAC AGGGAAGGGG CCGCTGGTGT CTGGAAGATC TGCCCCATAC 540 

TGAAAGGTGT GGGCTTCAOG GTCATCCTCA TCTCACTGTA TGTCGGCTTC TTCTACAACG 600 

TCATCATCGC CTGGGCGCTG CACTATCTCT TCTCCTCCTT CACCACGGAG CTCCCCTGGA 660 

TCCACTGCAA CAACTCCTGG AACAGCCCCA ACTGCTOGGA TGCCCATCCT GGTGACTCCA 720 

GTGGAGACAG CTCGGGCCTC AACGACACTT TTGGGACCAC ACCTGCTGCC GAGTACTTTG 780 

AACGTGGCGT GCTGCACCTC CACCAGAGCC ATGGCATCGA CGACCTGGGG CCTCOGCGGT 840 

GGCAGCTCAC AGCCTGCCTG GTGCTGGTCA TCGTGCTGCT CTACTTCAGC CTCTGGAAGG 900 

GCGTGAAGAC CTCAGGGAAG GTGGTATGGA TCACAGCCAC CATGCCATAC GTGGTCCTCA 960 

CTGCCCTGCT CCTGCGTGGG GTCACCCTCC CTGGAGCCAT AGACGGCATC AGAGCATACC 1020 

TGAGCGTTGA CTTCTACCGG CTCTGOGAGG CGTCTGTTTG GATTGACGCG GCCACCCAGG 1080 

TGTGCTTCTC CCTGGGCGTG GGGTTCGGGG TGCTGATCGC CTTCTCCAGC TACAACAAGT 1140 

TCACCAACAA CTGCTACAGG GACGCGATTG TCACCACCTC CATCAACTCC CTGAOGAGCT 1200 

TCTCCTCCGG CTTCGTOGTC TTCTCCTTCC TGGGGTACAT GGCACAGAAG CACAGTGTGC 1260 

CCATCGGGGA CGTGGCCAAG GACGGGCCAG GGCTGATCTT CATCATCTAC CCGGAAGCCA 1320 

TCGCCACGCT CCCTCTGTCC TCAGCCTGGG CCGTGGTCTT C3TCATCATG CTGCTCACCC 1380 

TGGGTATCGA CAGOGCCATG GGTGGTATGG AGTCAGTGAT CACCGGGCTC ATCGATGAGT 1440 

TCCAGCTGCT GCACAGACAC CGTGAGCTCT TCACGCTCTT CATCGTCCTG GCGACCTTCC 1500 

TCCTGTCCCT GTTCTGCGTC ACC3VACGGTG GCATCTAOGT CTTCACGCTC CTGGACCATT 1560 

TTGCAGCCGG CACGTCCATC CTCTTTGGAG TGCTCATCGA AGCCATCGGA GTGGCCTGGT 1620 

TCTATGGTGT TGGGCAGTTC AGCGACGACA TCCAGCAGAT GACOGGGCAG CGGCCCAGCC 1680 

TGTACTGGOG GCTGTGCTGG AAGCTGGTCA GCCCCTGCTT TCTCCTGTTC GTGGTCGTGG 1740 

TCAGCATTGT GACCTTCAGA CCCCCCCACT ACGGAGCCTA CATCTTCCCC GACTGGGCCA 1800 

ACGCGCTGGG CTGGGTCATC GCCACATCCT CCATGGCCAT GGTGCCCATC TATGCGGCCT 1860 

ACAAGTTCTG CAGCCTGCCT GGGTCCTTTC GAGAGAAACT GGCCTACGCC ATTGCACCCG 1920 

AGAAGGACCG TGAGCTGGTG GACAGAGGGG AGGTGCGCCA GTTCACGCTC CGCCACTGGC 1980 

TCAAGGTGTA GAGGGAGCAG AGACGAAGAC CCCAGGAAGT CATCCTGCAA TGGGAGAGAC 2040 

ACGAACAAAC CAAGGAAATC TAAGTTTCGA GAGAAAGGAG GGCAACTTCT ACTCTTCAAC '2100 

CTCTACTGAA AACACAAACA ACAAAGCAGA AGACTCCTCT CTTCTGACTG TTTACACCTT 2160 

TCOGTGCCGG GAGCGCACCT CGCCGTGTCT TGTGTTGCTG TAATAACGAC GTAGATCTGT 2220 

GCAGCGAGGT CCACCCCGTT GTTGTCCCTG CAGGGCAGAA AAACGTCTAA CTTCATGCTG 2280 

TCTGTGTGAG GCTCCCTCCC TCCCTGCTCC CTGCTCCCGG CTCTGAGGCT GCCCCAGGGG 2340 

CACTGTGTTC TCAGGCGGGG ATCACGATCC TTGTAGACGC ACCTGCTGAG AATCCCCGTG 2400 

CTCACAGTAG CTTCCTAGAC CATTTACTTT GCCCATATTA AAAAGCCAAG TGTCCTGCTT 2460 

GGTTTAGCTG TGCAGAAGGT GAAATGGAGG AAACCACAAA TTCATGCAAA GTCCTTTCCC 2520 

GATGCGTGGC TCCCAGCAGA GGCCGTAAAT TGAGCGTTCA GTTGACACAT TGCACACACA 2580 

GTCTGTTCAG AGGCATTGGA GGATGGGGGT CCTGGTATGT CTCACCAGGA AATTCTGTTT 2640 

ATGTTCTTGC AGCAGAGAGA AATAAAACTC CTTGAAACCA GCTCAGGCTA CTGCCACTCA 2700 

GGCAGCCTGT GGGTCCTTGT GGTGTAGGGA ACGGCCTGAG AGGAGCGTGT CCTATCCCCG 2760 

GACGCATGCA GGGCCCCCAC AGGAGCGTGT CCTATCCCCG GACGCATGCA GGGCCCCCAC 2820 
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AGGAGCATGT CCTATCCCTG GACGCATGCA GGGCCCCCAC AGGAGCGTGT ACTACCCCAG 2880 

AACGCATGCA GGGCCCCCAC AGGAGCGTGT ACTACCCCAG GACGCATGCA GGGCCCCCAC 2940 

TGGAGCGTGT ACTACCCCAG GACGCATGCA GGGCCCCCAC AGGAGCGTGT CCTATCCCCG 3000 

GACCGGACGC ATGCAGGGCC CCCACAGGAG CGTGTACTAC CCCAGGACGC ATGCAGGGCC 3060 

CCCACAGGAG CGTGTACTAC CCCAGGATGC ATGCAGGGCC CCCACAGGAG CGTGTACTAC 3120 

CCCAGGACGC ATGCAGGGCC CCCATGCAGG CAGCCTGCAG ACCAACACTC TGCCTGGCCT 3180 

TGAGCCGTGA CCTCCAGGAA GGGACCCCAC TGGAATTTTA TTTCTCTCAG GTGCGTGCCA 3240 

CATCAATAAC AACAGTTTTT ATGTTTGCGA ATGGCTTTTT AAAATCATAT TTACCTGTGA 3300 

ATCAAAACAA ATTCAAGAAT GCAGTATCCG CGAGCCTGCT TGCTGATATT GCAGTTTTTG 3360 

TTTACAAGAA TAATTAGCAA TACTGAGTGA AGGATGTTGG CCAAAAGCTG CTTTCCATGG 3420 

CACACTGCCC TCTGCCACTG ACAGGAAAGT GGATGCCATA GTTTGAATTC ATGCCTCAAG 3480 

TCGGTGGGCC TGCCTACGTG CTGCCCGAGG GCAGGGGCCG TGCAGGGCCA GTCATGGCTG 3540 

TCCCCTGCAA GTGGACGTGG GCTCCAGGGA CTGGAGTGTA ATGCTCGGTG GGAGCCGTCA 3600 

GCCTGTGAAC TGCCAGGCAG CTGCAGTTAG CACAGAGGAT GGCTTCCCCA TTGCCTTCTG 3660 

GGGAGGGACA CAGAGGACGG CTTCCCCATC GCCTTCTGGC CGCTGCAGTC AGCACAGAGA 3720 

GCGGCTTCCC CATTGCCTTC TGGGGAGGGA CACAGAGGAC AGTTTCCCCA TCGCCTTCTG 3780 

GTTGTTGAAG ACAGCACAGA GAGCGGCTTC CCCATCGCCT TCTGGGGAGG GGCTCCGTGT 3840 

AGCAACCCAG GTGTTGTCCG TGTCTGTTGA CCAATCTCTA TTCAGCATCG TGTGGGTCCC 3900 
TAAGCACAAT AAAAGACATC CACAATGGAA AAAAAAAAAG GAATTC 

Seq ID NO: 304 Protein sequence 
Protein Accession fl: NP_001035.1 

1 11 21 31 41 51 

I I I I I I 

MSKSKCSVGI* MSSWAPAKE PNAVGPKEVE LILVKEQNGV QLTSSTLTNP RQSPVEAQDR 60 

ETWGKKIDFL LSVIGFAVDL ANVWRPPYLC YKNGGGAFLV PYLLFMVTAG MPLFYMELAL 120 

GQFNREGAAG VWKICPILKG VGFTVILISL YVGFFYNVII AWALHYLFSS PTTELPWIHC 180 

NNSWNSPNCS DAHPGDSSGD SSGLNDTFGT TPAAEYPERG VLHLHQSHGI DDLGPPRWQL 240 

TACLVLVIVL LYFSLWKGVK TSGKWWITA TMPYWLTAL LLRGVTLPGA IDGIRAYLSV 300 

DPYRLCEASV WIDAATQVCF SLGVGFGVLI AFSSYNKFTN NCYRDAIVTT SINSLTSFSS 360 

GFWFSFLGY MAQKHSVPIG DVAKDGPGLI FIIYPEAIAT LPLSSAWAW FPIMLLTLGI 420 

DSAMGGMESV ITGLIDEFQL LHRHRELFTL FIVLATFLLS LFCVTNGGIY VFTLLDHFAA 480 

GTSILFGVLI EAIGVAWFYG VGQFSDDIQQ MTGQRPSLYW RLCWKLVSPC FLLFWWSI 540 

VTFRPPHYGA YIFPDWANAL GWVIATSSMA MVPIYAAYKF CSLPGSFREK LAYAIAPEKD 600 
RELVDRGEVR QFTLRHWLKV 

Seq ID NO: 305 DNA sequence 

Nucleic Acid Accession #: NM_001216.1 

Coding sequence: 43.. 1422 

1 11 21 31 41 51 

I I I I I I 

GCCCGTACAC ACCGTGTGCT GGGACACCCC ACAGTCAGCC GCATGGCTCC CCTGTGCCCC 60 

AGCCCCTGGC TCCCTCTGTT GATCCCGGCC CCTGCTCCAG GCCTCACTGT GCAACTGCTG 120 

CTGTCACTGC TGCTTCTGAT GCCTGTCCAT CCCCAGAGGT TGCCCCGGAT GCAGGAGGAT 180 

TCCCCCTTGG GAGGAGGCTC TTCTGGGGAA GATGACCCAC TGGGCGAGGA GGATCTGCCC 240 

AGTGAAGAGG ATTCACCCAG AGAGGAGGAT CCACCCGGAG AGGAGGATCT ACCTGGAGAG 300 

GAGGATCTAC CTGGAGAGGA GGATCTACCT GAAGTTAAGC CTAAATCAGA AGAAGAGGGC 360 

TCCCTGAAGT TAGAGGATCT ACCTACTGTT GAGGCTCCTG GAGATCCTCA AGAACCCCAG 420 

AATAATGCCC ACAGGGACAA AGAAGGGGAT GACCAGAGTC ATTGGCGCTA TGGAGGCGAC 480 

CCGCCCTGGC CCCGGGTGTC CCCAGCCTGC GCGGGCCGCT TCCAGTCCCC GGTGGATATC 540 

CGCCCCCAGC TCGCCGCCTT CTGCCCGGCC CTGCGCCCCC TGGAACTCCT GGGCTTCCAG 600 

CTCCCGCCGC TCCCAGAACT GCGCCTGCGC AACAATGGCC ACAGTGTGCA ACTGACCCTG 660 

CCTCCTGGGC TAGAGATGGC TCTGGGTCCC GGGCGGGAGT ACCGGGCTCT GCAGCTGCAT 720 

CTGCACTGGG GGGCTGCAGG TCGTCCGGGC TCGGAGCACA CTGTGGAAGG CCACCGTTTC 780 

CCTGCCGAGA TCCACGTGGT TCACCTCAGC ACCGCCTTTG CCAGAGTTGA CGAGGCCTTG 840 

GGGCGCCCGG GAGGCCTGGC CGTGTTGGCC GCCTTTCTGG AGGAGGGCCC GGAAGAAAAC 900 

AGTGCCTATG AGCAGTTGCT GTCTCGCTTG GAAGAAATCG CTGAGGAAGG CTCAGAGACT 960 

CAGGTCCCAG GACTGGACAT ATCTGCACTC CTGCCCTCTG ACTTCAGCCG CTACTTCCAA 1020 

TATGAGGGGT CTCTGACTAC ACCGCCCTGT GCCCAGGGTG TCATCTGGAC TGTGTTTAAC 1080 

CAGACAGTGA TGCTGAGTGC TAAGCAGCTC CACACCCTCT CTGACACCCT GTGGGGACCT 1140 

GGTGACTCTC GGCTACAGCT GAACTTCCGA GCGACGCAGC CTTTGAATGG GCGAGTGATT 1200 

GAGGCCTCCT TCCCTGCTGG AGTGGACAGC AGTCCTCGGG CTGCTGAGCC AGTCCAGCTG 1260 

AATTCCTGCC TGGCTGCTGG TGACATCCTA GCCCTGGTTT TTGGCCTCCT TTTTGCTGTC 1320 

ACCAGCGTCG CGTTCCTTGT GCAGATGAGA AGGCAGCACA GAAGGGGAAC CAAAGGGGGT 1380 

GTGAGCTACC GCCCAGCAGA GGTAGCCGAG ACTGGAGCCT AGAGGCTGGA TCTTGGAGAA 1440 

TGTGAGAAGC CAGCCAGAGG CATCTGAGGG GGAGCCGGTA ACTGTCCTGT CCTGCTCATT 1500 
ATGCCACTTC CTTTTAACTG CCAAGAAATT TTTTAAAATA AATATTTATA AT 

Seq ID NO: 306 Protein sequence 
Protein Accession #: NP__001207.1 

1 11 21 31 41 51 

! I I I I I 

MAPLCPSPWL PLLIPAPAPG LTVQLLLSLL LLMPVHPQRL PRMQEDSPLG GGSSGEDDPL 60 
GEEDLPSEED SPREEDPPGE EDLPGEEDLP GEEDLPEVKP KSEEEGSLKL EDLPTVEAPG 120 
DPQEPQNNAH RDKEGDDQSH WRYGGDPPWP RVSPACAGRF QSPVDIRPQL AAPCPALRPL 180 
ELLGFQLPPL PELRLRNNGH SVQLTLPPGL EMALGPGREY RALQLHLHWG AAGRPGSEHT 240 
VEGHRFPAEI HWHLSTAFA RVDEALGRPG GLAVLAAFLE EGPEENSAYE QLLSRLEEIA 300 
EEGSETQVPG LDISALLPSD FSRYFQYEGS LTTPPCAQGV IWTVFNQTVM LSAKQLHTLS 360 
DTLWGPGDSR LQLNFRATQP LNGRVIEASF PAGVDSSPRA AEPVQLNSCL AAGDILALVF 420 
GLLFAVTSVA FLVQMRRQHR RGTKGGVSYR PAEVAETGA 
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Seq ID NO: 307 DNA sequence 

Nucleic Acid Accession NM_003039.1 

Coding sequence: 76.. 1581 

1 11 21 31 41 51 

I I I I I I 

CTTCTCTCTC CATTCAGTGC ACGCGTTACT TTGGCTAAAA GGAGGTGAGC GGCACTCTGC 60 

CCTTCCAGAG CAAGCATGGA GCAACAGGAT CAGAGCATGA AGGAAGGGAG GCTGACGCTT 120 

GTGCTTGCCC TGGCAACCCT GATAGCTGCC TTTGGGTCAT CCTTCCAGTA TGGGTACAAC 180 

GTGGCTGCTG TCAACTCCCC AGCACTGCTC ATGCAACAAT TTTACAATGA GACTTACTAT 240 

GGTAGGACCG GTGAATTCAT GGAAGACTTC CCCTTGACGT TGCTGTGGTC TGTAACCGTG 300 

TCCATGTTTC CATTTGGAGG GTTTATCGGA TCCCTCCTGG TCGGCCCCTT GGTGAATAAA 360 

TTTGGCAGAA AAGGGGCCTT GCTGTTCAAC AACATATTTT CTATCGTGCC TGCGATCTTA 420 

ATGGGATGCA GCAGAGTCGC CACATCATTT GAGCTTATCA TTATTTCCAG ACTTTTGGTG 480 

GGAATATGTG CAGGTGTATC TTCCAACGTG GTCCCCATGT ACTTAGGGGA GCTGGCCCCT 540 

AAAAACCTGC GGGGGGCTCT CGGGGTGGTG CCCCAGCTCT TCATCACTGT TGGCATCCTT 60 0 

GTGGCCCAGA TCTTTGGTCT TCGGAATCTC CTTGCAAACG TAGATGGCTG GCCGATCCTG 660 

CTGGGGCTGA CCGGGGTCCC CGCGGCGCTG CAGCTCCTTC TGCTGCCCTT CTTCCCCGAG 720 

AGCCCCAGGT ACCTGCTGAT TCAGAAGAAA GACGAAGCGG CCGCCAAGAA AGCCCTACAG 780 

ACGCTGCGCG GCTGGGACTC TGTGGACAGG GAGGTGGCCG AGATCCGGCA GGAGGATGAG 840 

GCAGAGAAGG CCGCGGGCTT CATCTCCGTG CTGAAGCTGT TCCGGATGCG CTCGCTGCGC 900 

TGGCAGCTGC TGTCCATCAT CGTCCTCATG GGCGGCCAGC AGCTGTCGGG CGTCAACGCT 960 

ATCTACTACT ACGCGGACCA GATCTACCTG AGCGCCGGCG TGCCGGAGGA GCACGTGCAG 1020 

TACGTGACGG CCGGCACCGG GGCCGTGAAC GTGGTCATGA CCTTCTGCGC CGTGTTCGTG 1080 

GTGGAGCTCC TGGGTCGGAG GCTGCTGCTG CTGCTGGGCT TCTCCATCTG CCTCATAGCC 1140 

TGCTGCGTGC TCACTGCAGC TCTGGCACTG CAGGACACAG TGTCCTGGAT GCCATACATC 1200 

AGCATCGTCT GTGTCATCTC CTACGTCATA GGACATGCCC TCGGGCCCAG TCCCATACCC 1260 

GCGCTGCTCA TCACTGAGAT CTTCCTGCAG TCCTCTCGGC CATCTGCCTT CATGGTGGGG 1320 

GGCAGTGTGC ACTGGCTCTC CAACTTCACC GTGGGCTTGA TCTTCCCGTT CATCCAGGAG 1380 

GGCCTCGGCC CGTACAGCTT CATTGTCTTC GCCGTGATCT GCCTCCTCAC CACCATCTAC 1440 

ATCTTCTTGA TTGTCCCGGA GACCAAGGCC AAGACGTTCA TAGAGATCAA CCAGATTTTC 1500 

ACCAAGATGA ATAAGGTGTC TGAAGTGTAC CCGGAAAAGG AGGAACTGAA AGAGCTTCCA 1560 

CCTGTCACTT CGGAACAGTG ACTCTGGAGA GGAAGCCAGT GGAGCTGGTC TGCCAGGGGC 1620 

TTCCCACTTT GGCTTATTTT TCTGACTTCT AGCTGTCTGT GAATATCCAG AAATAAAACA 1680 

ACTCTGATGT GGAATGCAGT CCTCATCTCC AGCCTCCCCA CCCCAGTGGG AACTGTGCAA 1740 

AGGGCTGCCT TGCTGTTCTT GAAGCTGGGC TGTCTCTCTC CATGTTGGCC TGTCACCAGA 1800 

CCCGAGTCAA TTAAACAGCT GGTCCTCCAC TTTGCTGGTT CAGCCTTCGT GTGGCTCCTG 1860 

GTAACGTGGC TCCACCTTGA TGGGTCAACC TTTGTGTGGC TCCTGGTAAC ATAACAACAA 1920 

CAGTTACTAT AGTGGTGAGA TGGAAGGAAT CAAATTTTGC CAGAGAAACT AACTCGGTGG 1980 

CCCCAACAGG TCTTCCGGGG CCATGGGCAT TTGTTTAGAG CCAAATTCAT CCTCTTACCA 2040 

GATCCTTTTC CAGAAATACC TGTCTAGGAA GGTGTGATGT CAGAAACAAT GACATCCAGA 2100 

AAGCTGAGGA ACAGGTTCCT GTGGAGACAC TGAGTCAGAA TTCTTCATCC AAATTATTTT 2160 
GTTAGTGGAA AATGGAATTG CTTCTGTGTA GTCAATAAAA TGAACCTGAT CACTTTTC 

Seq ID NO: 308 Protein sequence 
Protein Accession #: NP_003030.1 

1 11 21 31 41 51 

I I I t I I 

MEQQDQSMKE GRLTLVLAIiA TLIAAFGSSF QYGYNVAAVN SPALLMQQFY NETYYGRTGE 60 
FMEDFPLTLL WSVTVSMPPF GGPIGSLLVG PLVNKFGRKG ALLFNNIFSI VPAILMGCSR 120 
VATSFELIII SRLLVGICAG VSSNWPMYL GKLAPKNLRG ALGWPQLFI TVGILVAQIF 180 
GLRNLIiANVD GWPILLGLTG VPAALQLLLL PFFPBSPRYL LIQKKDEAAA KKALQTLRGW 240 
DSVDREVAEI RQEDEAEKAA GFISVLKLFR MRSLRWQLLS IIVLMGGQQL SGVNAIYYYA 300 
DQIYLSAGVP EEHVQYVTAG TGAVNWMTF CAVFWELLG RRLLLLLGFS ICLIACCVLT 360 
AALALQDTVS WMPYISIVCV ISYVIGHALG PSPIPALLIT EIFLQSSRPS AFMVGGSVHW 420 
LSNFTVGIiIF PFIQEGLGPY SFIVFAVICL LTTIYIFI.IV PETKAKTFIE INQIFTKMNK 480 
VSEVYPEKEE LKELPPVTSE Q 

Seq ID NO: 309 DNA sequence 

Nucleic Acid Accession #: NM_001252.1 

Coding sequence : 138.. 719 

1 11 21 31 41 51 

I 1 I I I 1 

GGCTGGTCCC CTGACAGGTT GAAGCAAGTA GACGCCCAGG AGCCCCGGGA GGGGGCTGCA 60 

GTTTCCTTCC TTCCTTCTCG GCAGCGCTCC GCGCCCCCAT CGCCCCTCCT GCGCTAGCGG 120 

AGGTGATCGC CGCGGCGATG CCGGAGGAGG GTTCGGGCTG CTCGGTGCGG CGCAGGCCCT 180 

ATGGGTGCGT CCTGCGGGCT GCTTTGGTCC CATTGGTCGC GGGCTTGGTG ATCTGCCTCG 240 

TGGTGTGCAT CCAGCGCTTC GCACAGGCTC AGCAGCAGCT GCCGCTCGAG TCACTTGGGT 300 

GGGAOGTAGC TGAGCTGCAG CTGAATCACA CAGGACCTCA GCAGGACCCC AGGCTATACT 360 

GGCAGGGGGG CCCAGCACTG GGCCGCTCCT TCCTGCATGG ACCAGAGCTG GACAAGGGGC 420 

AGCTACGTAT CCATCGTGAT GGCATCTACA TGGTACACAT CCAGGTGACG CTGGCCATCT 480 

GCTCCTCCAC GAOGGCCTCC AGGCACCACC CCACCACCCT GGCCGTGGGA ATCTGCTCTC 540 

CCGCCTCCCG TAGCATCAGC CTGCTGCGTC TCAGCTTCCA CCAAGGTTGT ACCATTGCCT 600 

CCCAGCGCCT GACGCCCCTG GCCCGAGGGG ACACACTCTG CACCAACCTC ACTGGGACAC 660 

TTTTGCCTTC CCGAAACACT GATGAGACCT TCTTTGGAGT GCAGTGGGTG CGCCCCTGAC 720 

CACTGCTGCT GATTAGGGTT TTTTAAATTT TATTTTATTT TATTTAAGTT CAAGAGAAAA 780 

AGTGTACACA CAGGGGCCAC CCGGGGTTGG GGTGGGAGTG TGGTGGGGGG TAGTGGTGGC 840 
AGGACAAGAG AAGGCATTGA GCTTTTTCTT TCATTTTCCT ATTAAAAA 

Seq ID NOx 310 Protein sequence 
Protein Accession #: NP_001243.1 
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1 11 21 31 41 SI 

I I I I I I 

MPEEGSGCSV RRRPYGCVLR AALVPLVAGL VICLWCIQR PAQAQQQLPL ESLGWDVAEL 60 

QLNHTGPQQD PRLYWQGGPA LGRSPLHGPE LDKGQLRIHR DGIYMVHIQV TLAICSSTTA 120 

SRHHPTTLAV GICSPASRSI SLLRLSFHQG CTIASQRLTP LARGDTLCTN LTGTLLPSRN 180 
TDETPFGVQW VRP 

Seq ID NO: 311 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..3978 

1 11 21 31 41 51 

I I I I I I 

ATGGTGGGTG AAGGACCCTA CCTTATCTCA GATCTGGACC AGOGAGGCCG GCGGAGATCC 60 

TTTGCAGAAA GATATGACCC CAGCCTGAAG ACCATGATCC CAGTGCGACC CTGTGCAAGG 120 

TTAGCACCCA ACCCGGTGGA TGATGCCGGG CTACTCTCCT TCGCCACATT TTCCTGGCTC 180 

ACGCOGGTGA TGGTGAAAGG CTACCGGCAA AGGCTGACCG TAGACACCCT GCCCCCATTG 240 

TCGACATATG ACTCATCTGA CACCAATGCC AAAAGATTTC GAGTCCTTTG GGATGAAGAG 300 

GTAGCAAGGG TGGGTCCTGA GAAGGCCTCT CTGAGCCACG TGGTGTGGAA ATTCCAGAGG 360 

ACACGCGTGT TGATGGACAT CGTGGCCAAC ATCCTGTGCA TCATCATGGC AGCCATAGGG 420 

CCGACAGTTC TCATTCACCA AATCCTCCAG CAGACTGAGA GGACCTCTGG GAAAGTCTGG 480 

GTTGGCATTG GACTGTGCAT AGCCCTTTTT GCCACCGAGT TTACCAAAGT CTTCTTTTGG 540 

GCCCTTGCCT GGGCCATCAA CTACCGCACG GCCATCCGGT TGAAGGTGGC GCTCTCCACC 600 

TTGGTTTTTG AAAACCTAGT GTCCTTCAAG ACATTGACCC ACATCTCTGT TGGCGAGGTG 660 

CTCAATATAC TGTCAAGTGA TAGCTATTCT TTGTTTGAAG CTGCCTTGTT TTGTCCTTTG 720 

CCAGCCACCA TCCCGATCCT AATGGTCTTT TGTGCGGCGT ACGCCTTTTT CATTCTGGGG 780 

CCCACAGCTC TCATCGGGAT ATCAGTGTAT GTCATATTCA TACCCGTCCA GATGTTTATG B40 

GCCAAGCTCA ATTCAGCTTT CCGAAGGTCA GCAATTTTGG TGACAGACAA GCGAGTTCAG 900 

ACAATGAATG AGTTTCTGAC CTGCATCAGG CTGATCAAAA TGTATGCCTG GGAGAAATCT 960 

TTTACCAACA CTATCCAAGA TATAAGAAGG AGGGAAAGAA AATTACTGGA AAAAGCTGGA 1020 

TTTGTCCAAA GTGGAAACTC TGCCCTGGCC CCCATCGTGT CCACCATAGC CATCGTGCTG 1080 

ACATTATCCT GCCACATCCT CCTGAGACGC AAACTCACCG CACCCGTGGC ATTTAGTGTG 1140 

ATTGCCATGT TTAATGTAAT GAAGTTTTCC ATTGCAATCT TGCCCTTCTC CATCAAAGCA 1200 

ATGGCTGAAG CGAATGTCTC TCTAAGGAGA ATGAAGAAAA TTCTCATAGA TAAAAGCCCC 1260 

CCATCTTACA TCACCCAACC AGAAGACCCA GATACTGTCT TGCTTTTAGC AAATGCCACC 1320 

TTGACATGGG AGCATGAAGC CAGCAGGAAA AGTACCCCAA AGAAATTGCA GAACCAGAAA 1380 

AGGCATTTAT GCAAGAAACA GAGGTCAGAG GCATACAGTG AGAGGAGTCC ACCAGCCAAG 1440 

GGAGCCACTG GCCCAGAGGA GCAAAGTGAC AGCCTCAAAT CGGTTCTGCA CAGCATAAGC 1500 

TTTGTGGTGA GAAAGTTATG TCGTTATCCC GAAGCCCAGC TCCTGGCTTG GAGGTGGCCA 1560 

GCAGTGTTTG TTGGGAGAAT CATCAGAGGA TACAGGCCTC ATGGATTTTC TGCTAAAGAC 1620 

AAGGATGAAT CTAGAAGGCT TCTTACTTGG CCCCAAGAAG TGGATAGGAC TCAAAGGGCA 1680 

GCCAAATACC TGGGGAAGAT CTTGGGAATA TGTGGGAATG TGGGAAGTGG AAAGAGCTCC 1740 

CTCCTTGCAG CTCTCCTAGG ACAGATGCAG CTGCAGAAAG GGGTGGTGGC AGTCAATGGA 1800 

ACTTTGGCCT ACGTTTCACA GCAGGCATGG ATCTTTCATG GAAATGTGAG AGAAAACATA I860 

CTCTTTGGAG AAAAGTATGA TCACCAAAGG TATCAGCACA CAGTCCGCGT CTGTGGCCTC 1920 

CAGAAGGACC TGAGCAACCT CCCCTATGGA GACCTGACTG AGATTGGGGA GCGGGGCCTC 1980 

AACCTCTCTG GGGGGCAGAG GCAGAGGATT AGCCTGGCCC GCGCTGTCTA CTCCGACCGT 2040 

CAGCTCTACC TGCTGGACGA CCCCCTGTCG GCCGTGGACG CCCACGTGGG GAAGCACGTC 2100 

TTTGAGGAGT GCATTAAGAA GACGCTCAGG GGAAAGACAG TCGTCCTGGT GACCCACCAG 2160 

CTACAGTTCT TAGAGTCTTG TGATGAAGTT ATTTTATTAG AAGATGGAGA GATTTGTGAA 2220 

AAGGGAACCC ACAAGGAGTT AATGGAGGAG AGAGGGCGCT ATGCAAAACT GATTCACAAC 2280 

CTGCGAGGAT TGCAGTTCAA GGATCCTGAA CACCTTTACA ATGCAGCAAT GGTGGAAGCC 2340 

TTCAAGGAGA GCCCTGCTGA GAGAGAGGAA GATGCTGGTA TAATCGGGTA CCTCCTTTCT 2400 

CTCTTCACTG TGTTCCTCTT CCTCCTGATG ATTGGCAGCG CTGCCTTCAG CAACTGGTGG 2460 

CTGGOTCTCT GGTTGGACAA GGGCTCACGG ATGACCTGTG GGCCCCAGGG CAACAGGACC 2520 

ATGTGTGAGG TCGGCGCGGT GCTGGCAGAC ATCGGTCAGC ATGTGTACCA GTGGGTGTAC 2580 

ACTGCAAGCA TGGTGTTCAT GCTGGTGTTT GGCGTCACCA AAGGCTTCGT CTTCACCAAG 2640 

ACCACACTGA TGGCATCCTC CTCTCTGCAT GACACGGTGT TTGATAAGAT CTTAAAGAGC 2700 

CCAATGAGTT TCTTTGACAC GACTCCCACT GGCAGGCTAA TGAACCGTTT TTCCAAGGAT 2760 

ATGGACGAGC TGGATGTGAG GCTGCCGTTT CACGCAGAGA ACTTTCTGCA GCAGTTTTTT 2820 

ATGGTGGTGT TTATTCTCGT GATCTTGGCT GCTGTGTTTC CTGCTGTCCT TTTAGTCGTG 2880 

GCCAGCCTTG CTGTAGGCTT CTTCATTCTG TTACGCATTT TCCACAGAGG AGTCCAGGAG 2940 

CTCAAGAAGG TGGAGAATGT CAGCCGGTCA CCCTGGTTCA CCCACATCAC CTCCTCCATG 3000 

CAGGGCCTGG GCATCATTCA CGCCTATGGC AAGAAGGAGA GCTGCATCAC CTATACTTCA 3060 

TCCAAAGGCC TGTCATTGTC ATACATCATC CAGCTGAGCG GACTGCTCCA AGTGTGTGTG 3120 

CGAACGGGAA CAGAGACGCA AGCCAAATTC ACCTCCGTGG AGCTGCTCAG GGAATACATT 3180 

TCGACCTGTG TTCCTGAATG CACTCATCCC CTCAAAGTGG GGACCTGTCC CAAGGACTGG 3240 

CCCAGCTGTG GGGAGATCAC CTTCAGAGAC TATCAGATGA GATACAGAGA CAACACCCCC 3300 

CTTGTTCTCG ACAGCCTGAA CTTGAACATA CAAAGTGGGC AGACAGTCGG GATTGTTGGA 3360 

AGAACAGGTT CCGGAAAGTC ATCGTTAGGA ATGGCTTTGT TTCGTCTGGT GGAGCCAGCC 3420 

AGTGGCACAA TCTTTATTGA TGAGGTGGAT ATCTGCATTC TCAGCTTGGA AGACCTCAGA 3480 

ACCAAGCTGA CTGTGATCCC ACAGGATCCT GTCCTGTTTG TAGGTACAGT AAGGTACAAC 3540 

TTGGATCCCT TTGAGAGTCA CACCGATGAG ATGCTCTGGC AGGTTCTGGA GAGAACATTC 3600 

ATGAGAGACA CAATAATGAA ACTCCCAGAA AAATTACAGG CAGAAGTCAC AGAAAATGGA 3660 

GAAAACTTCT CAGTAGGGGA ACGTCAGCTG CTTTGTGTGG CCCGAGCTCT TCTCCGTAAT 3720 

TCAAAGATCA TTCTCCTTGA TGAAGCCACC GCCTCTATGG ACTCCAAGAC TGACACCCTG 3780 

GTTCAGAACA CCATCAAAGA TGCCTTCAAG GGCTGCACTG TGCTGACCAT CGCCCACCGC 3840 

CTCAACACAG TTCTCAACTG CGATCACGTC CTGGTTATGG AAAATGGGAA GGTGATTGAG 3900 

TTTGACAAGC CTGAAGTCCT TGCAGAGAAG CCAGATTCTG CATTTGCGAT GTTACTAGCA 3960 
GCAGAAGTCA GATTGTAG 

Seq ID NO: 312 Protein sequence 
Protein Accession 8: Eos sequence 
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MVGEGPYLIS DLDQRGRRRS FAERYDPSLK TMIPVRPCAR LAPNPVDDAG LLSFATFSWL 
TPVMVKGYRQ RLTVDTLPPL STYDSSDTNA KRFRVLWDEE VARVGPEKAS LSHWWKFQR 
TRVLMDIVAN ILCIIMAAIG PTVLIHQILQ QTERTSGKVW VGIGLCIALP ATEFTKVFFW 
ALAWAINYRT AIRLKVALST LVFENLVSFK TLTHISVGEV LNILSSDSYS LFEAALFCPL 
PATIPILMVF CAAYAFFILG PTALIGISVY VIFIPVQMFM AKLNSAFRRS AILVTDKRVQ 
TMNEFLTCIR LIKMYAWEKS FTNTIQDIRR RERKLLEKAG FVQSGNSALA PIVSTIAIVL 
TLSCHILLRR KLTAPVAFSV IAMFNVMKFS IAILPFSIKA MAEANVSLRR MKKILIDKSP 
PSYITQPEDP DTVLLLANAT LTWEHEASRK STPKKLGNQK RHLCKKQRSE AYSERSPPAK 
GATGPBEQSD SLKSVLHSIS PWRKLCRYP EAQLLAWRWP AVFVGRI IRG YRPHGFSAKD 
KDESRRLLTW PQEVDRTQRA AKYLGKILGI CGNVGSGKSS LLAALLGQMQ LQKGWAVNG 
TLAYVSQQAW IFHGNVRENI LFGEKYDHQR YQHTVRVCGL QKDLSNLPYG DLTEIGERGL 
NLSGGQRQRI SLARAVYSDR QLYLLDDPLS AVDAHVGKHV FEECIKKTLR GKTWLVTHQ 
LQFLESCDEV ILLEDGBICE KGTHKELMEE RGRYAKLIHN LRGLQFKDPE HLYNAAMVEA 
FKESPAEREE DAGIIGYLLS LFTVFLFLLM IGSAAFSNWW LGLWLDKGSR MTCGPQGNRT 
MCEVGAVLAD IGQHVYQWVY TASMVFMLVF GVTKGFVFTK TTLMASSSLH DTVFDKILKS 
PMSFFDTTPT GRLMNRFSKD MDELDVRLPF HAENFLQQFF MWFILVILA AVFPAVLLW 
ASLAVGFFIL LRIFHRGVQE LKKVENVSRS PWFTHITSSM QGLGIIHAYG KKESCITYTS 
SKGLSLSYII QLSGLLQVCV RTGTETQAKF TSVELLREYI STCVPECTHP LKVGTCPKDW 
PSCGEITFRD YQMRYRDNTP LVLDSLNLNI QSGQTVGIVG RTGSGKSSLG MALFRLVEPA 
SGTIFIDEVD ICILSLEDLR TKLTVIPQDP VLFVGTVRYN LDPFESHTDE MLWQVLERTF 
MRDTIMKLPE KLQAEVTENG ENFSVGERQL LCVARALLRN SKI ILLDEAT ASMDSKTDTL 
VQNTIKDAFK GCTVLTIAHR LNTVLNCDHV LVMENGKVTE FDKPEVLAEK PDSAFAMLLA 
AEVRL 

Seq ID NO: 313 DNA sequence 
Nucleic Acid Accession #i Z31560 
Coding sequence: 1-966 

1 11 21 31 41 51 

I I I I I I 

CACAGCGCCC GCATGTACAA CATGATGGAG ACGGAGCTGA AGCCGCCGGG CCCGCAGCAA 60 

ACTTCGGGGG GCGGCGGCGG CAACTCCACC GCGGCGGCGG CCGGCGGCAA CCAGAAAAAC 120 

AGCCCGGACC GCGTCAAGCG GCCCATGAAT GCCTTCATGG TGTGGTCCCG CGGGCAGCGG 180 

OGCAAGATGG CCCAGGAGAA CCCCAAGATG CACAACTCGG AGATCAGCAA GCGCCTGGGC 240 

GCCGAGTGGA AACTTTTGTC GGAGACGGAG AAGCGGCCGT TCATCGACGA GGCTAAGCGG 300 

CTGCGAGCGC TGCACATGAA GGAGCACCCG GATTATAAAT ACCGGCCCCG GCGGAAAACC 360 

AAGACGCTCA TGAAGAAGGA TAAGTACACG CTGCCCGGCG GGCTGCTGGC CCCCGGCGGC 420 

AATAGCATGG OGAGCGGGGT CGGGGTGGGC GCCGGCCTGG GCGCGGGCGT GAACCAGCGC 480 

ATGGACAGTT ACGCGCACAT GAACGGCTGG AGCAACGGCA GCTACAGCAT GATGCAGGAC 540 

CAGCTGGGCT ACCCGCAGCA CCCGGGCCTC AATGCGCACG GCGCAGCGCA GATGCAGCCC 600 

ATGCACCGCT ACGACGTGAG CGCCCTGCAG TACAACTCCA TGACCAGCTC GCAGACCTAC 660 

ATGAACGGCT CGCCCACCTA CAGCATGTCC TACTCGCAGC AGGGCACCCC TGGCATGGCT 720 

CTTGGCTCCA TGGGTTCGGT GGTCAAGTCC GAGGCCAGCT CX^GCCCCCC TGTGGTTACC 780 

TCTTCCTCCC ACTCCAGGGC GCCCTGCCAG GCCGGGGACC TCOGGGACAT GATCAGCATG 840 

TATCTCCCCG GCGCCGAGGT GCCGGAACCC GCOGCCCCCA GCAGACTTCA CATGTCCCAG 900 

CACTACCAGA GCGGCCCGGT GCCCGGCACG GCCATTAACG GCACACTGCC CCTCTCACAC 960 

ATGTGAGGGC CGGACAGCGA ACPGGAGGGG GGAGAAATTT TCAAAGAAAA ACGAGGGAAA 1020 

TGGGAGGGGT GCAAAAGAGG AGAGTAAGAA ACAGCATGGA GAAAACCCGG TACGCTCAAA 1080 
AAAAA 

Seq ID NO: 314 Protein sequence 
Protein Accession #t CAA83435 

1 11 21 31 41 51 

! I I I 1 I 

HSARMYNMME TELKPPGPQQ TSGGGGGNST AAAAGGNQKN SPDRVKRPMN AFMVWSRGQR 60 

RKMAQENPKM HNSEISKRLG AEWKLLSETE KRPFIDEAKR LRALEMKEHP DYKYRPRRKT 120 

KTLMKKDKYT LPGGLIAPGG NSMASGVGVG AGLGAGVNQR MDSYAHMNGW SNGSYSMMQD 180 

QLGYPQHPGL NAHGAAQMQP MHRYDVSALQ YNSMTSSQTY MNGSPTYSMS YSQQGTPGMA 240 

LGSKGSWKS EASSSPPWT SSSHSRAPOQ AGDLRDMISM YLPGAEVPEP AAPSRLHMSQ 300 
HYQSGPVPGT AINGTLPLSH M 

Seq ID NO: 315 DNA sequence 
Nucleic Acid Accession fl: U91618 
Coding sequence: 29.. 541 

1 11 21 31 41 51 

I I I I I I 

CGGACTTGGC TTGTTAGAAG GCTGAAAGAT GATGGCAGGA ATGAAAATCC AGCTTGTATG 60 

CATGCTACTC CTGGCTTTCA GCTCCTGGAG TCTGTGCTCA GATTCAGAAG AGGAAATGAA 120 

AGCATTAGAA GCAGATTTCT TGACCAATAT GCATACATCA AAGATTAGTA AAGCACATGT 180 

TCCCTCTTGG AAGATGACTC TGCTAAATGT TTGCAGTCTT GTAAATAATT TGAACAGCCC 240 

AGCTGAGGAA ACAGGAGAAG TTCATGAAGA GGAGCTTGTT GCAAGAAGGA AACTTCCTAC 300 

TGCTTTAGAT GGCTTTAGCT TGGAAGCAAT GTTGACAATA TACCAGCTCC ACAAAATCTG 360 

TCACAGCAGG GCTTTTCAAC ACTGGGAGTT AATCCAGGAA GATATTCTTG ATACTGGAAA 420 

TGACAAAAAT GGAAAGGAAG AAGTCATAAA GAGAAAAATT CCTTATATTC TGAAACGGCA 480 

GCTGTATGAG AATAAACCCA GAAGACCCTA CATACTCAAA AGAGATTCTT ACTATTACTG S40 

AGAGAATAAA TCATTTATTT ACATGTGATT GTGATTCATC ATCCCTTAAT TAAATATCAA 600 

ATTATATTTG TGTGAAAATG TGACAAACAC ACTTATCTGT CTCTTCTACA ATTGTGGTTT 660 

ATTGAATGTG TTTTTCTGCA CTAATAGAAA TTAGACTAAG TGTTTTCAAA TAAATCTAAA 720 
TCTTCAAAAA AAAAAAAAAA AAATGGGGCC GCAATT 
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Seq ID NO: 316 Protein sequence 
Protein Accession ft: AABS0564 

1 11 21 31 41 51 

| | | I I I 

MMAGMKIQLV CMLMJVFSSW SLCSDSEEEM KALEADFLTN MHTSKISKAH VPSWKMTLLN 

VCSLVNNLNS PAEETGEVHE EELVARRKLP TALDGFSLEA MLTIYQLHKI CHSRAFQHWB 

LIQEDILDTG NDKNGKEEVI KRKIPYILKR QLYENKPRRP YILKRDSYYY 

Seq ID NO: 317 DNA sequence 

Nucleic Acid Accession ft: NM__006536.2 

Coding sequence: 109. .2940 



60 
120 



1 11 21 31 

till 
ACCTAAAACC TTGCAAGTTC AGGAAGAAAC CATCTGCATC 
ATGTATGCAG CAGGCTCAGT GTGAGTGAAC TGGAGGCTTC 
AGCATTGCAG GTCCTATTTG CAACCTGAAG TTTGTGACTC 
GAACTCCCAT TCCTGGGAGC TGGAGTACAG CTTCAAGACA 
ATTGCAATTA ATCCTCAGGT ACCTGAGAAT CAGAACCTCA 
ATAACTGAAG CTTCATTTTA CCTATTTAAT GCTACCAAGA 
ATAAAGATTT TAATACCTGC CACATGGAAA GCTAATAATA 
TCATATGAAA AGGCAAATGT CATAGTGACT GACTGGTATG 
TACACCCTAC AATACAGAGG GTGTGGAAAA GAGGGAAAAT 
TTCCTACTGA ATGATAACTT AACAGCTGGC TACGGATCAC 
GAATGGGCCC ACCTCCGTTG GGGTGTGTTC GATGAGTATA 
ATAAATGGGC AAAATCAAAT TAAAGTGACA AGGTGTTCAT 
GTGTCTGAAA AAGGTCCTTG CCCCCAAGAA AACTGTATTA 
GGATGCACCT TTATCTACAA TAGCACCCAA AATGCAACTG 
AGTTTATCTT CTGTGGTTGA ATTTTGTAAT GCAAGTACCC 
CTACAGAACC AGATGTGCAG CCTCAGAAGT GCATGGGATG 
TTTCACCACA GCTTTCCCAT GAATGGGACT GAGCTTCCAC 
GTACAGGCTG GTGACAAAGT GGTCTGTTTA GTGCTGGATG 
GCTGACAGAC TCCTTCAACT ACAACAAGCC GCAGAATTTT 
ATTCATACCT TCGTGGGCAT TGCCAGTTTC GACAGCAAAG 
CACCAAATTA ACAGCAATGA TGATCGAAAG TTGCTGGTTT 
TCAGCTAAAA CAGACATCAG CATTTGTTCA GGGCTTAAGA 
AAACTGAATG GAAAAGCTTA TGGCTCTGTG ATGATATTAG 
CTTCTTGGCA ATTGCTTACC CACTGTGCTC AGCAGTGGTT 
CTGGGTTCAT CTGCAGCCCC AAATCTGGAG GAATTATCAC 
TTCTTTGTTC CAGATATATC AAACTCCAAT AGCATGATTG 
TCTGGAACTG GAGACATTTT CCAGCAACAT ATTCAGCTTG 
AAACCTCACC ATCAATTGAA AAACACAGTG ACTGTGGATA 
ATGTTTCTAG TTACGTGGCA GGCCAGTGGT CCTCCTGAGA 
GGACGAAAAT ACTACACAAA TAATTTTATC ACCAATCTAA 
TGGATTCCAG GAACAGCTAA GCCTGGGCAC TGGACTTACA 
TCTCTGCAAG CCCTGAAAGT GACAGTGACC TCTCGCGCCT 
GCCACTGTGG AAGCCTTTGT GGAAAGAGAC AGCCTCCATT 
TATGOCAATG TGAAACAGGG ATTTTATCCC ATTCTTAATG 
GAGCCAGAGA CTGGAGATCC TGTTACGCTG AGAC TCCTT G 
GTTATAAAAA ATGATGGAAT TTACTCGAGG TATTTTTTCT 
TATAGCTTGA AAGTGCATGT CAATCACTCT CCCAGCATAA 
CCAGGGAGTC ATGCTATGTA TGTACCAGGT TACACAGCAA 
GCTCCAAGGA AATCAGTAGG CAGAAATGAG GAGGAGCGAA 
AGCTCAGGAG GCTCCTTTTC AGTGCTGGGA GTTCCAGCTG 
CCACCATGCA AAATTATTGA CCTGGAAGCT GTAAAAGTAG 
TGGACAGCAC CTGGAGAAGA CTTTGATCAG GGCCAGGCTA 
AGTAAAAGTC TACAGAATAT CCAAGATGAC TTTAACAATG 
AAGCGAAATC CTCAGCAAGC TGGCATCAGG GAGATATTTA 
ACGAATGGAC CTGAACATCA GCCAAATGGA GAAACACATG 
GCAATAOGAG CAATGGATAG GAACTCCTTA CAGTCTGCTG 
CCTCTGTTTA TTCCCCCCAA TTCTGATCCT GTACCTGCCA 
GGAGTTTTAA CAGCAATGGG TTTGATAGGA ATCATTTGCC 
CATACTTTAA GCAGGAAAAA GAGAG CAGAC AAGAAAGAGA 
ATAAATATCC AAAGTGTCTT CCTTCTTAGA TATAAGACCC 
CATACTAACA AAGTCAAATT AACATCAAAA CTGTAT TAAA 
ATACAGATAA QATTTTTACA TGGTAGATCA ACAATTCTTT 
CCTTACACTT TGGCTATGAA CAAATAATAA AAATTATTCT 
GCAAAGGGAA GGGTAAAGTC GGACCAGTGT CAAGGAAAGT 
AATAGCCCCA AGCAGAGAAA AGGAGGGTAG GTCTGCATTA 
TCATTTAGTT ACTTTGATTA ATTTTTCTTT TCTCCTTATC 
TTTACATGAA GATCATGCTA TATTTTATAT ATGTAGCCCC 
CTTGCTATTT TGTTATATAT ATTTCAGATG ACATCTCCCT 
TTTCACTGTA AGAGGTAACC TTTAACAATA TGGGTATTAC 
TTTATGACAA AGGTCTATTG AATTTATTTG TNTGTAAGTT 
TTTCTAAGTT TATTGCCTTG GGTTATTATG GAATGATAGT 
TACCTAGGAA A 

Seq ID NO: 318 Protein sequence 
Protein Accession #: NP_006527.1 



41 51 

I I 
CATATTGAAA ACCTGACACA 
TCTACAACAT GACCCAAAGG 
TCCTGGTTGC CTTAAGTTCA 
ATGGGTATAA TGGATTGCTC 
TCTCAAACAT TAAGGAAATG 
GAAGAGTATT TTTCAGAAAT 
ACAGCAAAAT AAAACAAGAA 
GGGCACATGG AGATGATCCA 
ACATTCATTT CACACCTAAT 
GAGGCCGAGT GTTTGTCCAT 
ACAATGACAA ACCTTTCTAC 
CTGACAT CAC AGGCATTTTT 
TTAGTAAGCT TTTTAAAGAA 
CATCAATAAT GTTCATGCAA 
ACAACCAAGA AGCACCAAAC 
TAATCACAGA CTCTGCTGAC 
CTCCTCCCAC ATTCTCGCTT 
TGTCCAGCAA GATGGCAGAG 
ATTTGATGCA GATTGTTGAA 
GAGAGATCAG AGCCCAGCTA 
CATATCTGCC CACCACTGTA 
AAGGATTTGA GGTGGTTGAA 
TGACCAGCGG AGATGATAAG 
CAACAATTCA CTCCATTGCC 
GTCTTACAGG AGGTTTAAAG 
ATGCTTTCAG TAGAATTTCC 
AAAGTACAGG TGAAAATGTC 
ATACTGTGGG CAACGACACT 
TTATATTATT TGATCCTGAT 
CTTTTCGGAC AGCTAGTCTT 
CCCTGAACAA TACCCATCAT 
CCAACTCAGC TGTGCCCCCA 
TTCCTCATCC TGTGATGATT 
CCACTGTCAC TGCCACAGTT 
ATGATGGAGC AGGTGCTGAT 
CCTTTGCTGC AAATGGTAGA 
GCACCCCAGC CCACTCTATT 
ACGGTAATAT TCAGATGAAT 
AGTGGGGCTT TAGCCGAGTC 
GCCCCCACCC TGATGTGTTT 
AAGAGGAATT GACCCTATCT 
CAAGCTATGA AATAAGAATG 
CTATTTTAGT AAATACATCA 
CGTTCTCACC CCAGATTTCC 
AAAGCCACAG AATTTATGTT 
TATCTAACAT TGCCCAGGCG 
GAGATTATCT TATATTGAAA 
TTATTATAGT TGTGACACAT 
ATGGAACAAA ATT ATT AT AA 
ATGGCCTTCG ACTACAAAAA 
ATGCATTGAG TTTTTGTACA 
TTGGGGGTAG ATTAGAAAAC 
TTAAAGTAAT GTCTTTAAAG 
TTGTTTTATT GAGGTGGAAA 
TAACTGTCTG TGTGAAGCAA 
TGTGCAGTAC AGGTTGCTTG 
TAATGCAAAG CTCTTTACCT 
GCTAATGCTC AGAGATCTTT 
CTTTGTCTCT TCATACCGGT 
TCTACTCCCA TCAAAGCAGC 
TATAGCCCCN TATAATGCCT 
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MTQRSIAGPI CNLKFVTLLV ALSSELPFLG AGVQLQDNGY NGLLIAINPQ VPENQNLISN 
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IKEMITEASF 
GDDPYTLQYR 
KPFYINGQNQ 
MFMQSLSSW 
TFSLVQAGDK 
RAQLHQINSN 
GDDKLLGNCL 
SRISSGTGDI 
FDPDGRKYYT 
AVPPATVEAF 
AGADVIKNDG 
IQMNAPRKSV 
LTLSWTAPGE 
PQISTNGPEH 
LILKGVLTAM 



YLFNATKRRV 
GCGKEGKYIH 
IKVTRCSSDI 
EFCNASTHNQ 
WCLVLDVSS 
DDRKLLVSYL 
PTVLSSGST1 
FQQHIQLEST 
NNFITNLTFR 
VERDSLHFPH 
IYSRYFFSFA 



DPDQGQATSY 
QPNGETKESH 
GLIGIICLII 



FFRNIKILIP 
FTPNFLLNDN 
TGIFVCEKGP 
EAPNLQNQMC 
KMAEADRLLQ 
PTTVSAKTDI 
HSIALGSSAA 
GENVKPHHQL 
TASLWIPGTA 
PVMIYANVKQ 
ANGRYSLKVH 
FSRVSSGGSF 
EIRMSKSLQN 
RIYVAIRAMD 
WTHHTLSRK 



ATWKANNNSK 
LTAGYGSRGR 
CPQENCIISK 
SLRSAWDVIT 
LOQAAKFYLM 
SICSGLKKGF 
PNLEELSRLT 
KNTVTVDNTV 
KPGHWTYTLN 
GFYPILNATV 
VNHSPSISTP 
SVLGVPAGPH 
IQDDFNNAIL 
RNSLQSAVSN 
KRADKKEKGT 



IKQESYEKAN 
VFVHEHAHLR 
LFKEGCTFIY 
DSADFHHSFP 
QIVEIHTFVG 
EWEKLNGKA 
GGLKFFVPDI 
GNDTMFLVTW 
NTHHSLQALK 
TATVEPETGD 
AHSIPGSHAM 
PDVFPPCKI I 
VNTSKRNPQQ 
IAQAPLFIPP 
KLL 



VIVTDWYGAH 
WGVFDEYNND 
NSTQNATASI 
KNGTELPPPP 
IASFDSKGEI 
YGSVMILVTS 
SNSNSMIDAF 
QASGPPEIIL 
VTVTSRASNS 
PVTLRLLDDG 
YVPGYTANGK 
DLEAVKVEEE 
AGIREIFTFS 
NSDPVPARDY 
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900 



Seq ID NO: 319 DNA sequence 

Nucleic Acid Accession #t NM_000228.1 

Coding sequence : 82 . . 3 600 

1 11 21 31 41 51 

| | I I I I 

GCTTTCAGGC GATCTGGAGA AAGAACGGCA GAACACACAG CAA GGAAA GG TCCTTTCTGG 
GGATCACCCC ATTGGCTGAA GATGAGACCA TTCTTCCTCT TGTGTTTTGC CCTGCCTGGC 
CTCCTGCATG CCCAACAAGC CTGCTCCCGT GGGGCCTGCT ATCCACCTGT TGGGGACCTG 
CTTGTTGGGA GGACCCGGTT TCTCCGAGCT TCATCTACCT GTGGACTGAC CAAGCCTGAG 
ACCTACTGCA CCCAGTATGG CGAGTGGCAG ATGAAATGCT GCAAGTGTGA CTCCAGGCAG 
CCTCACAACT ACTACAGTCA CCGAGTAGAG AATGTGGCTT CATCCTCCGG CCCCATGCGC 
TGGTGGCAGT CCCAGAATGA TGTGAACCCT GTCTCTCTGC AGCTGGACCT GGACAGGAGA 
TTCCAGCTTC AAGAAGTCAT GATGGAGTTC CAGGGGCCCA TGCCCGCCGG CATGCTGATT 
GAGCGCTCCT CAGACTTCGG TAAGACCTGG CGAGTGTACC AGTACCTGGC TGCCGACTGC 
ACCTCCACCT TCCCTCGGGT CCGCCAGGGT CGGCCTCAGA GCTGGCAGGA TGTTCGGTGC 
CAGTCCCTGC CTCAGAGGCC TAATGCACGC CTAAATGGGG GGAAGGTCCA ACTTAACCTT 
ATGGATTTAG TGTCTGGGAT TCCAGCAACT CAAAGTCAAA AAATTCAAGA GGTGGGGGAG 
ATCACAAACT TGAGAGTCAA TTTCACCAGG CTGGCCCCTG TGCCCCAAAG GGGCTACCAC 
CCTCCCAGCG CCTACTATGC TGTGTCCCAG CTCCGTCTGC AGGGGAGCTG CTTCTGTCAC 
GGCCATGCTG ATCGCTGCGC ACCCAAGCCT GGGGCCTCTG CAGGCCCCTC CACCGCTGTG 
CAGGTCCACG ATGTCTGTGT CTGCCAGCAC AACACTGCCG GCCCAAATTG TGAGCGCTGT 
GCACCCTTCT ACAACAACCG GCCCTGGAGA CCGGCGGAGG GCCAGGACGC CCATGAATGC 
CAAAGGTGCG ACTGCAATGG GCACTCAGAG ACATGTCACT TTGACCCCGC TGTGTTTGCC 
GCCAGCCAGG GGGCATATGG AGGTGTGTGT GACAATTGCC GGGACCACAC CGAAGGCAAG 
AACTGTGAGC GGTGTCAGCT GCACTATTTC CGGAACCGGC GCCCGGGAGC TTCCATTCAG 
GAGACCTGCA TCTCCTGCGA GTGTGATCCG GATGGGGCAG TGCCAGGGGC TOCCTGTGAC 
CCAGTGACCG GGCAGTGTGT GTGCAAGGAG CATGTGCAGG GAGAGCGCTG TGACCTATGC 
AAGCCGGGCT TCACTGGACT CACCTACGCC AACCCGCAGG GCTGOCACCG CTGTGACTGC 
AACATCCTGG GGTCCCGGAG GGACATGCCG TGTGACGAGG AGAGTGGGCG CTGCCTTTGT 
CTGCCCAACG TGGTGGGTCC CAAATGTGAC CAGTGTGCTC CCTACCACTG GAAGCTGGCC 
AGTGGCCAGG GCTGTGAACC GTGTGCCTGC GACCCGCACA ACTCCCCTCA GCCCACAGTG 
CAACCAGTTC ACAGGGCAGT GCCCTGTCGG GAAGGCTTTG GTGGCCTGAT GTGCAGCGCT 
GCAGCCATCC GCCAGTGTCC AGACCGGACC TATGGAGACG TGGCCACAGQ ATGCCGAGCC 
TGTGACTGTG ATTTCCGGGG AACAGAGGGC CCGGGCTGCG ACAAGGCATC AGGCCGCTGC 
CTCTGCCGCC CTGGCTTGAC CGGGCCCCGC TGTGACCAGT GCCAGCGAGG CTACTGCAAT 
CGCTACCCGG TGTGCGTGGC CTGCCACCCT TGCTTCCAGA CCTATGATGC GGACCTCCGG 
GAGCAGGCCC TGCGCTTTGG TAGACTCCGC AATGCCACCG CCAGCCTGTG GTCAGGGCCT 
GGGCTGGAGG ACCGTGGCCT GGCCTCCCGG ATCCTAGATG CAAAGAGTAA GATTGAGCAG 
ATCCGAGCAG TTCTCAGCAG CCCCGCAGTC ACAGAGCAGG AGGTGGCTCA GGTGGCCAGT 
GCCATCCTCT CCCTCAGGCG AACTCTCCAG GGCCTGCAGC TGGATCTGCC CCTGGAGGAG 
GAGACGTTGT CCCTTCCGAG AGACCTGGAG AGTCTTGACA GAAGCTTCAA TGGTCTCCTT 
ACTATCTATC AGAGGAAGAG GGAGCAGTTT GAAAAAATAA GCAGTGCTGA TCCTTCAGGA 
GCCTTCCGGA TGCTGAGCAC AGCCTACGAG CAGTCAGCCC AGGCTGCTCA GCAGGTCTCC 
GACAGCTCGC GCCTTTTGGA CCAGCTCAGG GACAGCCGGA GAGAGGCAGA GAGGCTGGTG 
CGGCAGGCGG GAGGAGGAGG AGGCACCGGC AGCCCCAAGC TTGTGGCCCT GAGGCTGGAG 
ATGTCTTCGT TGCCTGACCT GACACCCACC TTCAACAAGC TCTGTGGCAA CTCCAGGCAG 
ATGGCTTGCA CCCCAATATC ATGCCCTGGT GAGCTATGTC CCCAAGACAA TGGCACAGCC 
TGTGGCTCCC GCTGCAGGGG TGTCCTTCCC AGGGCCGGTG GGGCCTTCTT GATGGOGGGQ 
CAGGTGGCTG AGCAGCTGCG GGGCTTCAAT GCCCAGCTCC AGCGGACCAG GCAGATGATT 
AGGGCAGCCG AGGAATCTGC CTCACAGATT CAATCCAGTG CCCAGCGCTT GGAGACCCAG 
GTGAGCGCCA GCCGCTCCCA GATGGAGGAA GATGTCAGAC GCACACGGCT CCTAATCCAG 
CAGGTCCGGG ACTTCCTAAC AGACCCCGAC ACTGATGCAG CCACTATCCA GGAGGTCAGC 
GAGGCCGTGC TGGCCCTGTG GCTGCCCACA GACTCAGCTA CTGTTCTGCA GAAGATGAAT 
GAGATCCAGG CCATTGCAGC CAGGCTCCCC AACGTGGACT TGGTGCTGTC CCAGACCAAG 
CAGGACATTG CGCGTGCCCG CCGGTTGCAG GCTGAGGCTG AGGAAGCCAG GAGCCGAGCC 
CATGCAGTGG AGGGCCAGGT GGAAGATGTG GTTGGGAACC TGCGGCAGGG GACAGTGGCA 
CTGCAGGAAG CTCAGGACAC CATGCAAGGC ACCAGCCGCT CCCTTCGGCT TATCCAGGAC 
AGGGTTGCTG AGGTTCAGCA GGTACTGCGG CCAGCAGAAA AGCTGGTGAC AAGCATGACC 
AAGCAGCTGG GTGACTTCTG GACACGGATG GAGGAGCTCC GCCACCAAGC CCGGCAGCAG 
GGGGCAGAGG CAGTCCAGGC CCAGCAGCTT GCGGAAGGTG CCAGCGAGCA GGCATTGAGT 
GCCCAAGAGG GATTTGAGAG AATAAAACAA AAGTATGCTG AGTTGAAGGA CCGGTTGGGT 
CAGAGTTCCA TGCTGGGTGA GCAGGGTGCC CGGATCCAGA GTGTGAAGAC AGAGGCAGAG 
GAGCTGTTTG GGGAGACCAT GGAGATGATG GACAGGATGA AAGACATGGA GTTGGAGCTG 
CTGCGGGGCA GCCAGGCCAT CATGCTGCGC TCGGCGGACC TGACAGGACT GGAGAAGCGT 
GTGGAGCAGA TCCGTGACCA CATCAATGGG CGCGTGCTCT ACTA TGCC AC CTGCAAGTGA 
TGCTACAGCT TCCAGCCCGT TGCCCCACTC ATC TGCCG CC TTTGCTTTTG GTTGGGGGCA 
GATTGGGTTG GAATGCTTTC CATCTCCAGG AGACTTTCAT GCAGCCTAAA GTACAGCCTG 
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GACCACCCCT GGTGTGTAGC TAGTAAGATT ACCCTGAGCT GCAGCTGAGC CTGAGCCAAT 
GGGACAGTTA CACTTGACAG ACAAAGATGG TGGAGATTGG CATGCCATTG AAACTAAGAG 
CTCTCAAGTC AAGGAAGCTG GGCTGGGCAG TATCCCCCGC CTTTAGTTCT CCACTGGGGA 
GGAATCCTGG ACCAAGCACA AAAACTTAAC AAAAGTGATG TAAAAATGAA AAGCCAAATA 
AAAATCTTTG G 
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Seq ID NO: 320 Protein sequence 
Protein Accession 8: NP 000219.1 



1 
I 

MRPFFLLCFA 
EWQMKCCKCD 
MEFQGPMPAG 
NARLNGGKVQ 
VSQLRLQGSC 
PWRPAEGQDA 
HYFRNRRPGA 
TYANPQGCHR 
CACDPHNSPQ 
TEGPGCDKAS 
RLRNATASLW 
TLQGLQLDLP 
AYEQSAQAAQ 
TPTFNKLCGN 
GFNAQLQRTR 
DPDTDAATIQ 
RLQAEAEEAR 
VLRPAEKLVT 
IKQKYAELKD 
MLRSADLTGL 



11 
I 

LPGLLHAQQA 
SRQPHNYYSH 
MLIERSSDFG 
LNLMDLVSGI 
FCHGHADRCA 
HECQRCDCNG 
SIQETCISCE 
CDCNILGSRR 
PTVQPVHRAV 
GRCLCRPGLT 
SGPGLEDRGL 
LEEETLSLPR 
QVSDSSRLLD 
SRQMACTPIS 
QMIRAAEESA 
EVSEAVLALW 
SRAHAVEGQV 
SMTKQLGDFVJ 
RLGQSSMLGE 
EKRVEQ1RDH 



21 
I 

CSRGACYPPV 
RVENVASSSG 
KTWRVYQYLA 
PATQSQKIQE 
PKPGASAGPS 
HSETCHFDPA 
CDPDGAVPGA 



PCREGFGGLM 
GPRCDQCQRG 
ASRILDAKSK 
DLESLDRSFN 
QLRDSRREAE 
CPGELCPQDN 
SQIQSSAQRL 
LPTDSATVLQ 
EDWGNLRQG 
TRMEELRHQA 
QGARIQSVKT 
INGRVLYYAT 



31 
I 

GDLLVGRTRF 
PMRWWQSQND 
ADCTSTFPRV 
VGEITNLRVN 
TAVQVHDVCV 
VFAASQGAYG 
PCDPVTGQCV 
CLCLPNWGP 
CSAAAIRQCP 
YCNRYPVCVA 
IEQIRAVLSS 
GLLTMYQRKR 
RLVRQAGGGG 
GTACGSRCRG 
ETQVSASRSQ 
KMNEIQAIAA 
TVALQEAQDT 
RQQGAEAVQA 
EAEELFGETM 
CK 



41 
I 

LRASSTCGLT 
VNPVSLQLDL 
RQGRPQSWQD 
FTRLAPVPQR 
CQHNTAGPNC 
GVCDNCRDHT 
CKEHVQGERC 
KCDQCAPYHW 
DRTYGDVATG 
CHPCFQTYDA 
PAVTEQEVAQ 
EQFEKISSAD 
GTGSPKLVAL 
VLPRAGGAFL 
MBEDVRRTRL 
RLPNVDLVLS 
MQGTSRSLRL 
QQLAEGASEQ 



51 
I 

KPETYCTQYG 
DRRFQLQEVM 
VRCQSLPQRP 
GYHPPSAYYA 
ERCAPFYNNR 
EGKNCERCQL 
DLCKPGFTGL 
KLASGQGCEP 
CRACDCDFRG 
DLREQALRFG 
VASAILSLRR 
PSGAFRMLST 
RliEMSSLPDL 
MAGQVAEQLR 
LIQQVRDFLT 
QTKQDIARAR 
IQDRVAEVQQ 
ALSAQEGFER 
LELLRGSQAI 
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Seq ID NO: 321 DNA sequence 

Nucleic Acid Accession ft: NM__ 001944.1 

Coding sequence: 84.. 3083 

1 11 21 31 41 51 

I 1 I I 1 I 

TTTTCTTAGA CATTAACTGC AGACGGCTGG CAGGATAGAA GCAGCGGCTC ACTTGGACTT 60 

TTTCACCAGG GAAATCAGAG ACAATGATGG GGCTCTTCCC CAGAACTACA GGGGCTCTGG 120 

CCATCTTCGT GGTGGTCATA TTGGTTCATG GAGAATTGCG AATAGAGACT AAAGGTCAAT 180 

ATGATGAAGA AGAGATGACT ATGCAACAAG CTAAAAGAAG GCAAAAACGT GAATGGGTGA 240 

AATTTGCCAA ACCCTGCAGA GAAGGAGAAG ATAACTCAAA AAGAAACCCA ATTGCCAAGA 300 

TTACTTCAGA TTACCAAGCA ACCCAGAAAA TCACCTACCG AATCTCTGGA GTGGGAATCG 360 

ATCAGCCGCC TTTTGGAATC TTTGTTGTTG ACAAAAACAC TGGAGATATT AACATAACAG 420 

CTATAGTCGA CCGGGAGGAA ACTCCAAGCT TCCTGATCAC ATGTCGGGCT CTAAATGCCC 480 

AAGGACTAGA TGTAGAGAAA CCACTTATAC TAACGGTTAA AATTTTGGAT ATTAATGATA 540 

ATCCTCCAGT ATTTTCACAA CAAATTTTCA TGGGTGAAAT TGAAGAAAAT AGTGCCTCAA 600 

ACTCACTGGT GATGATACTA AATGCCACAG ATGCAGATGA ACCAAACCAC TTGAATTCTA 660 

AAATTGCCTT CAAAATTGTC TCTCAGGAAC CAGCAGGCAC ACCCATGTTC CTCCTAAGCA 720 

GAAACACTGG GGAAGTCCGT ACTTTGACCA ATTCTCTTGA CCGAGAGCAA GCTAGCAGCT 780 

ATCGTCTGGT TGTGAGTGGT GCAGACAAAG ATGGAGAAGG ACTATCAACT CAATGTGAAT 840 

GTAATATTAA AGTGAAAGAT GTCAACGATA ACTTCCCAAT GTTTAGAGAC TCTCAGTATT 900 

CAGCACGTAT TGAAGAAAAT ATTTTAAGTT CTGAATTACT TCGATTTCAA GTAACAGATT 960 

TGGATGAAGA GTACACAGAT AATTGGCTTG CAGTATATTT CTTTACCTCT GGGAATGAAG 1020 

GAAATTGGTT TGAAATACAA ACTGATCCTA GAACTAATGA AGGCATCCTG AAAGTGGTGA 1080 

AGGCTCTAGA TTATGAACAA CTACAAAGCG TGAAACTTAG TATTGCTGTC AAAAACAAAG 1140 

CTGAATTTCA CCAATCAGTT ATCTCTCGAT ACCGAGTTCA GTCAACCCCA GTCACAATTC 1200 

AGGTAATAAA TGTAAGAGAA GGAATTGCAT TCCGTCCTGC TTCCAAGACA TTTACTGTGC 1260 

AAAAAGGCAT AAGTAGCAAA AAATTGGTGG ATTATATCCT GGGAACATAT CAAGCCATCG 1320 

ATGAGGACAC TAACAAAGCT GCCTCAAATG TCAAATATGT CATGGGACGT AACGATGGTG 1380 

GATACCTAAT GATTGATTCA AAAACTGCTG AAATCAAATT TGTCAAAAAT ATGAACCGAG 1440 

ATTCTACTTT CATAGTTAAC AAAACAATCA CAGCTGAGGT TCTGGCCATA GATGAATACA 1500 

CGGGTAAAAC TTCTACAGGC ACGGTATATG TTAGAGTACC CGATTTCAAT GACAATTGTC 1560 

CAACAGCTGT CCTCGAAAAA GATGCAGTTT GCAGTTCTTC ACCTTCCGTG GTTGTCTCCG 1620 

CTAGAACACT GAATAATAGA TACACTGGCC CCTATACATT TGCACTGGAA GATCAACCTG 1680 

TAAAGTTGCC TGCCGTATGG AGTATCACAA CCCTCAATGC TACCTCGGCC CTCCTCAGAG 1740 

CCCAGGAACA GATACCTCCT GGAGTATACC ACATCTCCCT GGTACTTACA GACAGTCAGA 1800 

ACAATCGGTG TGAGATGCCA CGCAGCTTGA CACTGGAAGT CTGTCAGTGT GACAACAGGG 1860 

GCATCTGTGG AACTTCTTAC CCAACCACAA GCCCTGGGAC CAGGTATGGC AGGCCGCACT 1920 

CAGGGAGGCT GGGGCCTGCC GCCATCGGCC TGCTGCTCCT TGGTCTCCTG CTGCTGCTGT 1980 

TGGCCCCCCT TCTGCTGTTG ACCTGTGACT GTGGGGCAGG TTCTACTGGG GGAGTGACAG 2040 

GTGGTTTTAT CCCAGTTCCT GATGGCTCAG AAGGAACAAT TCATCAGTGG GGAATTGAAG 2100 

GAGCCCATCC TGAAGACAAG GAAATCACAA ATATTTGTGT GCCTCCTGTA ACAGCCAATG 2160 

GAGCCGATTT CATGGAAAGT TCTGAAGTTT GTACAAATAC GTATGCCAGA GGCACAGCGG 2220 

TGGAAGGCAC TTCAGGAATG GAAATGACCA CTAAGCTTGG AGCAGCCACT GAATCTGGAG 2280 

GTGCTGCAGG CTTTGCAACA GGGACAGTGT CAGGAGCTGC TTCAGGATTC GGAGCAGCCA 2340 

CTGGAGTTGG CATCTGTTCC TCAGGGCAGT CTGGAACCAT GAGAACAAGG CATTCCACTG 2400 

GAGGAACCAA TAAGGACTAC GCTGATGGGG CGATAAGCAT GAATTTTCTG GACTCCTACT 2460 

TTTCTCAGAA AGCATTTGCC TGTGCGGAGG AAGACGATGG CCAGGAAGCA AATGACTGCT 2520 

TGTTGATCTA TGATAATGAA GGCGCAGATG CCACTGGTTC TCCTGTGGGC TCCGTGGGTT 2580 

GTTGCAGTTT TATTGCTGAT GACCTGGATG ACAGCTTCTT GGACTCACTT GGACCCAAAT 2640 

TTAAAAAACT TGCAGAGATA AGCCTTGGTG TTGATGGTGA AGGCAAAGAA GTTCAGCCAC 2700 

CCTCTAAAGA CAGCGGTTAT GGGATTGAAT CCTGTGGCCA TCCCATAGAA GTCCAGCAGA 2760 
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CAGGATTTGT TAAGTGCCAG ACTTTGTCAG GAAGTCAAGG AGCTTCTGCT TTGTCCGCCT 2B20 

CTGGGTCTGT CCAGCCAGCT GTTTCCATCC CTGACCCTCT GCAGCATGGT AA CTATT TAG 2880 

TAACGGAGAC TTACTCGGCT TCTGGTTCCC TCGTGCAACC TTCCACTGCA GGCTTTGATC 2940 

CACTTCTCAC ACAAAATGTG ATAGTGACAG AAAGGGTGAT CTGTCCCATT TCCAGTGTTC 3000 

5 CTGGCAACCT AGCTGGCCCA ACGCAGCTAC GAGGGTCACA TACTATGCTC TGTACAGAGG 3060 

ATCCTTGCTC CCGTCTAATA TGACCAGAAT GAGCTGGAAT ACCACACTGA CCAAATCTGG 3120 

ATCTTTGGAC TAAAGTATTC AAAATAGCAT AGCAAAGCTC ACTGTATTGG GCTAATAATT 3180 

TGGCACTTAT TAGCTTCTCT CATAAACTGA TCACGATTAT AAATTAAATG TTTGGGTTCA 3240 

TACCCCAAAA GCAATATGTT GTCACTCCTA ATTCTCAAGT ACTATTCAAA TTGTAGTAAA 3300 
10 TCTTAAAGTT TTTCAAAACC CTAAAATCAT ATTCGC 



65 



Seq ID NO: 322 Protein sequence 
Protein Accession #: NP_001935.1 

15 1 11 21 31 41 51 

I | | I I I 

MMGLFPRTTG ALAIFVWIL VHGELRIETK GQYDEEEMTM QQAKRRQKRE WVKFAKPCRE 60 

GEDNSKRNPI AKITSDYQAT QKITYRISGV GIDQPPFGIF WDKNTGDIN ITAIVDREET 120 

PSFLITCRAI* NAQGLDVEKP LILTVKIU3I NDNPPVFSQQ IFMGEIEENS ASNSLVMILN 180 

20 ATDADEPNHL NSKIAFKIVS QEPAGTPMFL LSRNTGEVRT LTNSLDREQA SSYRLWSGA 240 

DKDGEGLSTQ CECNIKVKDV NDNFPMFRDS QYSARIEENI LSSEIiLRFQV TDLDEEYTDN 300 

WLAVYFFTSG NEGNWFEIQT DPRTNEGILK WKALDYEQli QSVKLSIAVK NKAEFHQSVI 360 

SRYRVQSTPV TIQVINVREG IAFRPASKTF TVQKGISSKK LVDYILGTYQ AIDEDTNKAA 420 

SNVKYVMGRN DGGYLMIDSK TAEIKFVKNM NRDSTFIVNK TITAEVLAID EYTGKTSTGT 480 

25 VYVRVPDFND NCPTAVLEKD AVCSSSPSW VSARTLNNRY TGPYTFALED QPVKLPAVWS 540 

ITTUJATSAL LRAQEQIPPG VYHISLVLTD SQNNRCEMPR SLTLEVCQCD NRGICGTSYP 600 

TTSPGTRYGR PHSGRLGPAA IGLLLLGLLL LLLAPLLLLT CDCGAGSTGG VTGGFIPVPD 660 

GSEGTIHQWG IEGAHPEDKE ITNICVPPVT ANGADFMESS EVCTNTYARG TAVEGTSGME 720 

MTTKLGAATE SGGAAGFATG TVSGAASGFG AATGVGICSS GQSGTMRTRH STGGTNKDYA 780 

30 DGAISMNFLD SYFSQKAFAC AEEDDGQEAN DCU.IYDNEG ADATGSPVGS VGCCSFIADD 840 

LDDSFLDSLG PKFKKLAEIS LGVDGEGKEV QPPSKDSGYG IESCGHPIEV QQTGFVKOQT 900 

LSGSQGASAL SASGSVQPAV SIPDPLQHGN YLVTETYSAS GSLVQPSTAG FDPLLTQNVI 960 
VTERVICPIS SVPGNLAGPT QLRGSHTMLC TEDPCSRLI 

35 Seq ID NO: 323 DNA sequence 

Nucleic Acid Accession #: XM_058069.2 
Coding sequence t 1 . . 1413 

1 11 21 31 41 51 

40 | 1 I I I 1 

ATGAAGTTTC TTCTAATACT GCTCCTGCAG GCCACTGCTT CTGGAGCTCT TCCCCTGAAC 60 

AGCTCTACAA GCCTGGAAAA AAATAATGTG CTATTTGGTG AAAGATACTT AGAAAAATTT 120 

TATGGCCTTG AGATAAACAA ACTTCCAGTG ACAAAAATGA AATATAGTGG AAACTTAATG 180 

AAGGAAAAAA TCCAAGAAAT GCAGCACTTC TTGGGTCTGA AAGTGACCGG GCAACTGGAC 240 

45 ACATCTACCC TGGAGATGAT GCACGCACCT CGATGTGGAG TCCCCGATGT CCATCATTTC 300 

AGGGAAATGC CAGGGGGGCC CGTATGGAGG AAACATTATA TCACCTACAG AATCAATAAT 360 

TACACACCTG ACATGAACCG TGAGGATGTT GACTACGCAA TCCGGAAAGC TTTCCAAGTA 420 

TGGAGTAATG TTACCCCCTT GAAATTCAGC AAGATTAACA CAGGCATGGC TGACATTTTG 480 

GTGGTTTTTG CCCGTGGAGC TCATGGAGAC TTCCATGCTT TTGATGGCAA AGGTGGAATC 540 

50 CTAGCCCATG CTTTTGGACC TGGATCTGGC ATTGGAGGGG ATGCACATTT CGATGAGGAC 600 

GAATTCTGGA CTACACATTC AGGAGGCACA AACTTGTTCC TCACTGCTGT TCACGAGATT 660 

GGCCATTCCT TAGGTCTTGG CCATTCTAGT GATCCAAAGG CCGTAATGTT CCCCACCTAC 720 

AAATATGTTG ACATCAACAC ATTTCGCCTC TCTGCTGATG ACATACGTGG CATTCAGTCC 780 

CTGTATGGAG ACCCAAAAGA GAACCAACGC TTGCCAAATC CTGACAATTC AGAACCAGCT 840 

55 CTCTGTGACC CCAATTTGAG TTTTGATGCT GTCACTACCG TGGGAAATAA GATCTTTTTC 900 

TTCAAAGAGA GGTTCTTCTG GCTGAAGGTT TCTGAGAGAC CAAAGACCAG TGTTAATTTA 960 

ATTTCTTCCT TATGGCCAAC CTTGCCATCT GGCATTGAAG CTGCTTATGA AATTGAAGCC 1020 

AGAAATCAAG TTTTTCTTTT TAAAGATGAC AAATACTGGT TAATTAGCAA TTTAAGACCA 1080 

GAGCCAAATT ATCCCAAGAG CATACATTCT TTTGGTTTTC CTAACTTTGT GAAAAAAATT 1140 

60 GATGCAGCTG TTTTTAACCC ACGTTTTTAT AGGACCTACT TCTTTGTAGA TAACCAGTAT 1200 

TGGAGGTATG ATGAAAGGAG ACAGATGATG GACCCTGGTT ATCCCAAACT GATTACCAAG 1260 

AACTTCCAAG GAATCGGGCC TAAAATTGAT GCAGTCTTCT ACTCTAAAAA CAAATACTAC 1320 

TATTTCTTCC AAGGATCTAA CCAATTTGAA TATGACTTCC TACTCCAACG TATCACCAAA 1380 
ACACTGAAAA GCAATAGCTG GTTTGGTTGT TGA 



Seq ID NO: 324 Protein sequence 
Protein Accession #: P39900 



1 11 21 31 41 51 

70 | | | | I I 

MKFLLILLLQ ATASGALPLN SSTSLEKNNV LFGERYLEKF YGbEINKLPV TKMKYSGNLM 60 

KEKIQEMQHF LGLKVTGQLD TSTLEMMHAP RCGVPDVHHF REMPGGPVWR KHYITYRINN 120 

YTPDMNREDV DYAIRKAFQV WSNVTPLKFS KINTGMADIL WFARGAHGD FHAFDGKGGI 180 

LAHAFGPGSG IGGDAHFDED EFWTTHSGGT NLFLTAVHEI GHSLGLGHSS DPKAVMFPTY 240 

75 KYVDINTFRI* SADDIRGIQS LYGDPKENQR LPNPDNSEPA LCDPNLSFDA VTTVGNKIFF 300 

FKDRFFWIiKV SERPKTSVNL ISSLWPTLPS GIEAAYEIEA RNQVFLFKDD KYWLISNLRP 360 

EPNYPKSIHS FGFPNFVKKI DAAVFNPRFY RTYFFVDNQY WRYDERRQMM DPGYPKLITK 420 
NFQGIGPKID AVFYSKNKYY YFFQGSNQFE YDFLLQRITK TLKSNSWFGC 

80 Seq ID NO: 325 DNA sequence 

Nucleic Acid Accession #: NM_024423.1 
Coding sequence: 64.. 2590 

1 11 21 31 41 51 
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| I I I I I 

GGCAGGTCTC GCTCTOGGCA CCCTCCCGGC GCCCGCGTTC TCCTGGCCCT GCCCGGCATC 
CCGATGGCCG CCGCTGGGCC CCGGCGCTCC GTGCGCGGAG CCGTCTGCCT GCATCTGCTG 
CTGACCCTCG TGATCTTCAG TCGTGATGGT GAAGCCTGCA AAAAGGTGAT ACTTAATGTA 
5 CCTTCTAAAC TAGAGGCAGA CAAAATAATT GGCAGAGTTA ATTTGGAAGA GTGCTTCAGG 
TCTGCAGACC TCATCCGGTC AAGTGATCCT GATTTCAGAG TTCTAAATGA TGGGTCAGTG 
TACACAGCCA GGGCTGTTGC GCTGTCTGAT AAGAAAAGAT CATTTACCAT ATGGCTTTCT 
GACAAAAGGA AACAGACACA GAAAGAGGTT ACTGTGCTGC TAGAACATCA GAAGAAGGTA 
TCGAAGACAA GACACACTAG AGAAACTGTT CTCAGGCGTG CCAAGAGGAG ATGGGCACCT 
10 ATTCCTTGCT CTATGCAAGA GAATTCCTTG GGCCCTTTCC CATTGTTTCT TCAACAAGTT 
GAATCTGATG CAGCACAGAA CTATACTGTC TTCTACTCAA TAAGTGGACG TGGAGTTGAT 
AAAGAACCTT TAAATTTGTT TTATATAGAA AGAGACACTG GAAATCTATT TTGCACTCGG 
CCTGTGGATC GTGAAGAATA TGATGTTTTT GATTTGATTG CTTATGCGTC AACTGCAGAT 
GGATATTCAG CAGATCTGCC CCTCCCACTA CCCATCAGGG TAGAGGATGA AAATGACAAC 
15 CACCCTGTTT TCACAGAAGC AATTTATAAT TTTGAAGTTT TGGAAAGTAG TAGACCTGGT 
ACTACAGTGG GGGTGGTTTG TGCCACAGAC AGAGATGAAC CGGACACAAT GCATACGCGC 
CTGAAATACA GCATTTTGCA GCAGACACCA AGGTCACCTG GGCTCTTTTC TGTGCATCCC 
AGCACAGGCG TAATCACCAC AGTCTCTCAT TATTTGGACA GAGAGGTTGT AGACAAGTAC 
TCATTGATAA TGAAAGTACA AGACATGGAT GGCCAGTTTT TTGGATTGAT AGGCACATCA 
20 ACTTGTATCA TAACAGTAAC AGATTCAAAT GATAATGCAC CCACTTTCAG ACAAAATGCT 
TATGAAGCAT TTGTAGAGGA AAATGCATTC AATGTGGAAA TCTTACGAAT ACCTATAGAA 
GATAAGGATT TAATTAACAC TGCCAATTGG AGAGTCAATT TTACCATTTT AAAGGGAAAT 
GAAAATGGAC ATTTCAAAAT CAGCACAGAC AAAGAAACTA ATGAAGGTGT TCTTTCTGTT 
GTAAAGCCAC TGAATTATGA AGAAAACCGT CAAGTGAACC TGGAAATTGG AGTAAACAAT 
25 GAAGCGCCAT TTGCTAGAGA TATTCCCAGA GTGACAGCCT TGAACAGAGC CTTGGTTACA 
GTTCATGTGA GGGATCTGGA TGAGGGGCCT GAATGCACTC CTGCAGCCCA ATATGTGCGG 
ATTAAAGAAA ACTTAGCAGT GGGGTCAAAG ATCAAOGGCT ATAAGGCATA TGACCCCGAA 
AATAGAAATG GCAATGGTTT AAGGTACAAA AAATTGCATG ATCCTAAAGG TTGGATCACC 
ATTGATGAAA TTTCAGGGTC AATCATAACT TCCAAAATCC TGGATAGGGA GGTTGAAACT 
30 CCCAAAAATG AGTTGTATAA TATTACAGTC CTGGCAATAG ACAAAGATGA TAGATCATGT 
ACTGGAACAC TTGCTGTGAA CATTGAAGAT GTAAATGATA ATCCACCAGA AATACTTCAA 
GAATATGTAG TCATTTGCAA ACCAAAAATG GGGTATACCG ACATTTTAGC TGTTGATCCT 
GATGAACCTG TCCATGGAGC TCCATTTTAT TTCAGTTTGC CCAAT ACTT C TCCAGAAATC 
AGTAGACTGT GGAGCCTCAC CAAAGTTAAT GATACAGCTG CCCGTCTTTC ATATCAGAAA 
35 AATGCTGGAT TTCAAGAATA TACCATTCCT ATTACTGTAA AAGACAGGGC CGGCCAAGCT 
GCAACAAAAT TATTGAGAGT TAATCTGTGT GAATGTACTC ATCCAACTCA GTGTCGTGCG 
ACTTCAAGGA GTACAGGAGT AATACTTGGA AAATGGGCAA TC CTTGCAAT ATTACTGGGT 
ATAGCACTGC TCTTTTCTGT ATTGCTAACT TTAGTATGTG GAGTTTTTGG TGCAACTAAA 
GGGAAACGTT TTCCTGAAGA TTTAGCACAG CAAAACTTAA TTATATCAAA CACAGAAGCA 
40 CCTGGAGACG ATAGAGTGTG CTCTGOCAAT GGATTTATGA CCCAAACTAC CAACAACTCT 
AGCCAAGGTT TTTGTGGTAC TATGGGATCA GGAATGAAAA ATGGAGGGCA GGAAACCATT 
GAAATGATGA AAGGAGGAAA CCAGACCTTG GAATCCTGCC GGGGGGCTGG GCATCATCAT 
ACCCTGGACT CCTGCAGGGG AGGACACACG GAGGTGGACA ACTGCAGATA CACTTACTCG 
GAGTGGCACA GTTTTACTCA ACCCOGTCTC GGTGAAGAAT CCATTAGAGG ACACACTGGT 
45 TAAAAATTAA ACATAAAAGA AATTGCATOG ATGTAATCAG AATGAAGACC GCATGCCATC 
CCAAGATTAT GTCCTCACTT ATAACTATGA GGGAAGAGGA TCTCCAGCTG GTTCTGTGGG 
CTGCTGCAGT GAAAAGCAGG AAGAAGATGG CCTTGACTTT TTAAATAATT TGGAACCCAA 
ATTTATTACA TTAGCAGAAG CATGCACAAA GAGATAATGT CACAGTGCTA CAATTAGGTC 
TTTGTCAGAC ATTCTGGAGG TTTCCAAAAA TAATATTGTA AAGTTCAATT TCA ACATGT A 
50 TGTATATGAT GATTTTTTTC TCAATTTTGA ATTATGCTAC TCACCAATTT ATATTTTTAA 
AGCCAGTTGT TGCTTATCTT TTCCAAAAAG TGAAAAATGT TAAAACAGAC AACTGGTAAA 
TCTCAAACTC CAGCACTGGA ATTAAGGTCT CTAAAGCATC TGCTCTTTTT TTTTTTTA03 
GATATTTTAG TAATAAATAT GCTGGATAAA TATTAGTCCA ACAATAGCTA AGTTATGCTA 
ATATCACATT ATTATGTATT CACTTTAAGT GATAGTTTAA AAAATAAACA AGAAATATTG 
55 AGTATCACTA TGTGAAGAAA GTTTTGGAAA AGAAACAATG AAGACTGAAT TAAATTAAAA 
ATGTTGCAGC TCATAAAGAA TTGGGACTCA CCCCTACTGC ACTACCAAAT TCATTTGACT 
TTGGAGGCAA AATGTGTTGA AGTGCCCTAT GAAGTAGCAA TTTTCTATAG GAATATAGTT 
GGAAATAAAT GTGTGTGTGT ATATTATTAT TAATCAATGC AATAT TTAAA ATGAAATGAG 
AACAAAGAGG AAAATGGTAA AAACTTGAAA TGAGGCTGGG GTATAGTTTG TCCTACAATA 
60 GAAAAAAGAG AGAGCTTCCT AGGCCTGGGC TCTTAAATGC TGCATTATAA CTGAGTCTAT 
GAGGAAATAG TTCCTGTCCA ATTTGTGTAA TTTGTTTAAA ATTGTAAATA A ATTAA ACTT 
TTCTGGTTTC TGTGGGAAGG AAATAGGGAA TCCAATGGAA CAGTAGCTTT GCTTTGCAGT 
CTGTTTCAAG ATTTCTGCAT CCACAAGTTA GTAGCAAACT GGGG AATACT CGCTGCAGCT 
GGGGTTCCCT GCTTTTTGGT AGCAAGGGTC CAGAGATGAG GTGTTTTTTT CGGQGAGCTA 
65 ATAACAAAAA CATTTTAAAA CTTACCTTTA CTGAAGTTAA ATCCTCTATT GCTGTTTCTA 
TTCTCTCTTA TAGTGACCAA CATCTTTTTA ATTTAGATCC AAATAACCAT GTCCTCCTAG 
AGTTTAGAGG CTAGAGGGAG CTGAGGGGAG GATCTTACTG AAAGCACCCT GGGGAGATTG 
ATTGTCCTTA AACCTAAGCC CCACAAACTT GACACCTGAT CAGGTCTGGG AGCTACAAAA 
^ TTTCATTTTT CTCCTCACTG OCCTTCTTCT GAGTGGCATT GGCCTGAATC AAGGAAAGCC 

70 AGGCCTTGTG GGCCCCCTTC TTTOGGCTTT CTGCTAAAGC AACACCTCCA GCAGAGATTC 
CCTTAAGTGA CTCCAGGTTT TCCACCATCC TTCAGCGTGA ATTAATTTTT AATCAGTTTG 
CTTTCTOCAG AGAAATTTTA AAATAATAGA AGAAATAGAA ATTTTGAATG TATAAAAGAA 
AAAGATCAAG TTGTCATTTT AGAACAGAGG GAACTTTGGG AGAAAGCAGC CCAAGTAGGT 
TATTTGTACA GTCAGAGGGC AACAGGAAGA TGCAGGCCTT CAAGGGCAAG GAGAGGCCAC 
75 AAGGAATATG GGTGGGAGTA AAAGCAACAT CGTCTGCTTC ATACTTTTTC CTAGGCTTGG 
CACTGCCTTT TCCTTTCTCA GGCCAATGGC AACTGCCATT TGAGTCCGGT GAGGGATCAG 
CCAACCTCTT CTCTATGGCT CACCTTATTT GGAGTGAGAA ATCAAGGAGA CAGAGCTGAC 
TGCATGATGA GTCTGAAGGC ATTTGCAGGA TGAGCCTGAA CTGGTTGTGC AGAACAAACA 
AGGCATTCAT GGGAATTGTT GTATTCCTTC TGCAGCCCTC CTTCTGGGCA CTAAGAAGGT 
80 CTATGAATTA AATGCCTATC TAAAATTCTG ATTTATTCCT ACATTTTCTG TTTTCTAATT 
TGACCCTAAA ATCTATGTGT TTTAGACTTA GACTTTTTAT TGCCCCCCCC CCCTTTTTTT 
TTGAGACGGA GTCTCGCTCT GACGCACAGG CTGGAGTGCA GTGGCTCCGA TCTCTGCTCA 
CTGAAAGCTC CGCCTCCCGG GTTCATGCCA TTCTCCTGCC TCAGCCTCCT GAGTAGCTGG 
GACTACAGGC GCCCACCACC ACGCCCGGCT AATTTTTTGT ATTTTTAATA GAGACGGGGT 

1078 
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TTCACTGTGT TAGCCAGGAT GGTCTCGATC TCCTGACCTC 
TCCCAAAGTG CTGGGATTAC AGGCATGACC CACCOCTCCC 
GTCGTCTTCT TTTAATGTAA TCATTTTGAA CATGTGTGAA 
TCAATCTTQA AATACTCAAC CAAAAGACAG TCGAGAAGCC 
GCACAAAATA TTGGTCTGAG AATGGAATTC TCTGTAAGCC 
TGTAACCAGA AGCCAGTTTT ATCTAACGGC TACTGAAACA 
CCCACTCACC GATCAAAACC TGCTACCTCC CCAAGACTTT 
TCAAAGAGCA ACCAGTATCA CTTCCCTGTT TATAAAACCT 
TGAACATGCT GAAAACCACC TGGTCTGCAT GTATGCCCGA 
AAATGAAAAT TTAATTTTAG GGATTCATTT CTATATTTTC 
TCCTTATATG TGTAAGGTGA AATTTATGGT ATTTGAGTGT 
AGCTTTCATT TTTCCCCCAG TGAATGATTT AGAATTTTTT 
TTTTCTTACT TTTATAAGGA AGCAGCTGTC TAAAATGCAG 
TTTAAACAGA GTTTTAGTAT TGCTATTAAA AGAAGTTACT 
CTGCTTAAAA TAAGCAAAAA TTGGATGCAT AAAGTAATAT 
AATAAAACAA TATTAACTTG GCTGCTTAAA ATAAGCAAAA 
TTTACAGATG TGGGGAGATG TAATAAAACA ATATTAACTT 
TTTAGAGATT AAATAATTCT AAGATGATCA CTTTGCAAAA 
GAAATAGAAA TACTCAATTA TGTCTTTGTT GTATTAATGG 
TCATTATCAA ATTGTCGACA TCATTAATAT ATATTGTAAT 
TTGAAGCACA GCTTTACAGA TGAGTATCTA TGATACATAT 
GTATTAAAAG TATTAGAAGG TGGTTATAAT TGCAGAGTAT 
ACAGGGGTTT TACTTTGAGG ACCAGTGTAG TCAAGGGAAA 
CAGGCAATAT TGCAGTCTTG ATTCTGCCAC TTACAGGATA 
GACAAGATGA TCCAACCATA AAGGTGCTCT GTGCTTCACA 
GGAGTGTGCT CCCCTACAAA CGTTAAGACT GATCATTTCA 
AAAGCCTTAC ATTTTAATAT AGGTTGAACC AAAATTTCAA 
ACCATTATTT TTGTGTATGT CTTCAAGAAT GTTCATTGGA 
ATACCGGATA CATTTCACGT GTCCTTCAGT ATTGATTTGG 
GTTGAGAAGC ATGGACACTA GAGCCAGAAT GCTTGGATAT 
ACTTCTGTGT GACCTTTGAA AGGCTACTTA TTTCCTCTCT 
ATGAACAATG CCAGCCTCAT GGGGTTGTTG AATGATTAAA 
ACATAGAACA CTGCCTGCAC ATAGTAAAAG AATTATAAGT 
ATGTAGTTGG ATATACTACC GAACAATATC TAATCTCTTT 
CATATATATA ATCCCGAAAC ATG 

Seq ID NO: 326 Protein sequence 
Protein Accession ft: NP 077741.1 



GTGATCCGCC 
GGCCTTGTTT 
AGTTGATCAT 
AGGGGGAGAA 
TAGTTGCTGA 
CCCACTGTGT 
ACTAGTGCCG 
CTAACCATCT 
ATTTGTAATT 
ACATATGTAG 
GCAAGAAAAT 
ATGTAAATAT 

TTGCTTTTAA 
TTACAGATGT 
ATTGGATGCA 
GGTTTCTTGT 
TTATGCTTAT 
GGAATATTTT 
GTTGGGAAGA 
GTATAATAAA 
TCCATGAATA 
ACATGAGTTA 
GATAATGCCT 
GTGAATCTTT 
AAAATCTATT 
TTCCAGTAAC 
TTTTTGTTTG 
TTGAATATTG 
GAATCCTGGA 
TAGCTTTCTC 
TTAGTTAATA 
GTGAGGTAGT 
TTAGGGAAAT 



TGCCTCGGCC 
TCCGTTTAAA 
ACGAATTGGA 
AGAACTCAGG 
AATTTCCTGC 
TTTGCTCACT 
ATAAACTTTC 
CTTTGTTCTT 
CTTTTCTCTC 
TATTATTATT 
ATATTTTTAA 
ACAGAATGTT 
TTTGCAATGT 
AGAAACTTGG 
GGGGAGATGT 
TAAAGTAATA 
TTTTGCTGTA 
GGCTGGCATG 
GGACAATGTT 
GATCACTATT 
TTTTGATCGG 
GTACACTGAC 
AAAAGAAAAG 
GAACTTTAAT 
TCCCCATGCA 
AGCTATATCA 
TTCTATTGTA 
TAATAGTAAA 
GGTCATAATG 
TCTGTCACTT 
ATTAAAATCA 
TACCTAAAGT 
TGGTAAAATT 
AAAGTTTGTG 



5040 
S100 
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1 
I 

MAAAGPRRSV 
ADLIRSSDPD 
KTRHTRETVL 
EPLNLFYIBR 
PVFTEAIYNF 
TGVITTVSHY 
EAFVEENAFN 
KPLNYEENRQ 
KENLAVGSKI 
KNELYNITVL 
EPVHGAPFYF 
TKLLRVNLCE 
KRFPEDLAQQ 
KMKGGNQTLE 



11 
I 

RGAVCLHLLL 
FRVLNDGSVY 
RRAKRRWAPI 
DTGNLFCTRP 
EVLESSRPGT 
LDREWDKYS 
VEILRIPIED 
VNLEIGVNNE 
NGYKAYDPEN 
AIDKDDRSCT 
SLPNTSPEIS 
CTHPTQCRAT 
NLIISNTEAP 
SCRGAGHHHT 



21 
I 

TLVIFSRDGE 
TARAVALSDK 
PCSMQENSLG 
VDREEYDVFD 
TVGWCATDR 
LIMKVQDMDG 
KDLINTANWR 
APFARDZPRV 
RNGNGLRYKK 
GTLAVNIEDV 
RLWSLTKVND 
SRSTGVILGK 
GDDRVCSANG 
LDSCRGGHTE 



31 
I 

ACKKVILNVP 
KRSFTIWLSD 
PFPLFLQQVE 
LIAYASTADG 
DEPDTMHTRL 
QFFGLIGTST 
VNPTILKGNE 
TALNRALVTV 
LHDPKGWITI 
NDNPPEIXQE 
TAARLSYQKN 
WAILAILLGI 
FMTQTTNNSS 
VDNCRYTYSE 



41 
I 

SKLEADKIIG 
KRKQTQKEVT 
SDAAQNYTVF 
YSADLPLPLP 
KYSILQQTPR 
CIITVTDSND 
NGHFKISTDK 
RVRDLDEGPE 
DBISGSIITS 
YWICKPKMG 
AGFQEYTIPI 
ALLFSVLLTL 
QGFCGTMGSG 
WHSFTQPRLG 



51 
I 

RVNLEECFRS 
VLLEHQKKVS 
YSISGRGVDK 
IRVEDENDNH 
SPGLFSVHPS 
NAPTFRQNAY 
ETNEGVLSW 
CTPAAQYVRI 
KILDREVETP 
YTDILAVDPD 
TVKDRAGQAA 
VCGVFGATKG 
MKNGGQETIE 
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GGCAGGTCTC 
CCGATGGCCG 
CTGACCCTCG 
CCTTCTAAAC 
TCTGCAGACC 
TACACAGCCA 
GACAAAAGGA 
TCGAAGACAA 
ATTCCTTGCT 
GAATCTGATG 
AAAGAACCTT 
CCTGTGGATC 
GGATATTCAG 
CACCCTGTTT 
ACTACAGTGG 
CTGAAATACA 
AGCACAGGCG 
TCATTGATAA 
ACTTGTATCA 
TATGAAGCAT 
GATAAGGATT 
GAAAATGGAC 




GCATTTTGCA 
TAATCACCAC 
TGAAAGTACA 
TAACAGTAAC 
TTGTAGAGGA 
TAATTAACAC 
ATTTCAAAAT 



21 
CC 

CCGGCGCTCC 
TCGTGATGGT 
CAAAATAATT 
AAGTGATCCT 
GCTGTCTGAT 
GAAAGAGGTT 
AGAAACTGTT 
GAATTCCTTG 
CTATACTGTC 
TTATATAGAA 
TGATGTTTTT 
CCTCCCACTA 
AATTTATAAT 
TGCCACAGAC 
GCAGACACCA 
AGTCTCTCAT 
AGACATGGAT 
AGATTCAAAT 
AAATGCATTC 
TGCCAATTGG 
CAGCACAGAC 



31 
I 

GCCCGCGTTC 
GTGCGCGGAG 
GAAGCCTGCA 
GGCAGAGTTA 
GATTTCAGAG 
AAGAAAAGAT 
ACTGTGCTGC 
CTCAGGCGTG 
GGCCCTTTCC 
TTCTACTCAA 
AGAGACACTG 
GATTTGATTG 
CCCATCAGGG 
TTTGAAGTTT 
AGAGATGAAC 
AGGTCACCTG 
TATTTGGACA 
GGCCAGTTTT 
GATAATGCAC 
AATGTGGAAA 
AGAGTCAATT 
AAAGAAACTA 



41 
I 

CCGTCTGCCT 
AAAAGGTGAT 
ATTTGGAAGA 
TTCTAAATGA 
CATTTACCAT 
TAGAACATCA 
CCAAGAGGAG 
CATTGTTTCT 
TAAGTGGACG 
GAAATCTATT 
CTTATGCGTC 
TAGAGGATGA 
TGGAAAGTAG 
CGGACACAAT 
GGCTCTTTTC 
GAGAGGTTGT 
TTGGATTGAT 
CCACTTTCAG 
TCTTACGAAT 
TTACCATTTT 
ATGAAGGTGT 
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51 
I 

GCCCGGCATC 
GCATCTGCTG 
ACTTAATGTA 
GTGCTTCAGG 
TGGGTGAGTG 
ATGGCTTTCT 
GAAGAAGGTA 
ATGGGCACCT 
TCAACAAGTT 
TGGAGTTGAT 
TTGCACTCGG 
AACTGCAGAT 
AAATGACAAC 
TAGACCTGGT 
GCATACGCGC 
TGTGCATCCC 
AGACAAGTAC 
AGGCACATCA 
ACAAAATGCT 
ACCTATAGAA 
AAAGGGAAAT 
TCTTTCTGTT 
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GTAAAGCCAC TGAATTATGA AGAAAACCGT CAAGTGAACC TGGAAATTGG AGTAAACAAT 1380 
GAAGCGCCAT TTGCTAGAGA TATTCCCAGA GTGACAGCCT TGAACAGAGC CTTGGTTACA 1440 
GTTCATGTGA GGGATCTGGA TGAGGGGCCT GAATGCACTC CTGCAGCCCA ATATGTGCGG 1500 
ATTAAAGAAA ACTTAGCAGT GGGGTCAAAG ATCAACGGCT ATAAGGCATA TGACCCCGAA 1560 
AATAGAAATG GCAATGGTTT AAGGTACAAA AAATTGCATG ATCCTAAAGG T TGGA TCACC 1620 
ATTGATGAAA TTTCAGGGTC AATCATAACT TCCAAAATCC TGGATAGGGA GGTTGAAACT 1680 
CCCAAAAATG AGTTGTATAA TATTACAGTC CTGGCAATAG ACAAAGATGA TAGATCATGT 1740 
ACTGGAACAC TTGCTGTGAA CATTGAAGAT GTAAATGATA ATCCACCAGA AATACTTCAA 1800 
GAATATGTAG TCATTTGCAA ACCAAAAATG GGGTATACCG ACATTTTAGC TGTTGATCCT 1B60 
GATGAACCTG TCCATGGAGC TCCATTTTAT TTCAGTTTGC CCAATACTTC TCCAGAAATC 1920 
AGTAGACTGT GGAGCCTCAC CAAAGTTAAT GATACAGCTG CCCGTCTTTC ATATCAGAAA 1980 
AATGCTGGAT TTCAAGAATA TACCATTCCT ATTACTGTAA AAGACAGGGC CGGCCAAGCT 2040 
GCAACAAAAT TATTGAGAGT TAATCTGTGT GAATGTACTC ATCCAACTCA GTGTCGTGCG 2100 
ACTTCAAGGA GTACAGGAGT AATACTTGGA AAATGGGCAA TCCTTGCAAT ATTACTGGGT 2160 
ATAGCACTGC TCTTTTCTGT ATTGCTAACT TTAGTATGTG GAGTTTTTGG TGCAACTAAA 2220 
GGGAAACGTT TTCCTGAAGA TTTAGCACAG CAAAACTTAA TTATATCAAA CACAGAAGCA 2280 
CCTGGAGACG ATAGAGTGTG CTCTGCCAAT GGATTTATGA CCCAAACTAC CAACAACTCT 2340 
AGCCAAGGTT TTTGTGGTAC TATGGGATCA GGAATGAAAA ATGGAGGGCA GGAAACCATT 2400 
GAAATGATGA AAGGAGGAAA CCAGACCTTG GAATCCTGCC GGGGGGCTGG GCATCATCAT 2460 
ACCCTGGACT CCTGCAGGGG AGGACACACG GAGGTGGACA ACTGCAGATA CACTTACTCG 2S20 
GAGTGGCACA GTTTTACTCA ACCCCGTCTC GGTGAAAAAT TGCATCGATG TAATCAGAAT 2580 
GAAGACCGCA TGCCATCCCA AGATTATGTC CTCACTTATA ACTATGAGGG AAGAGGATCT 2640 
CCAGCTGGTT CTGTGGGCTG CTGCAGTGAA AAGCAGGAAG AAGATGGCCT TGACTTTTTA 2700 
AATAATTTGG AACCCAAATT TATTACATTA GCAGAAGCAT GCACAAAGAG ATAATGTCAC 2760 
AGTGCTACAA TTAGGTCTTT GTCAGACATT CTGGAGGTTT CCAAAAATAA TATTGTAAAG 2820 
TTCAATTTCA ACATGTATGT ATATGATGAT TTTTTTCTCA ATTTTGAATT ATGCTACTCA 2880 
CCAATTTATA TTTTTAAAGC CAGTTGTTGC TTATCTTTTC CAAAAAGTGA AAAATGTTAA 2940 
AACAGACAAC TGGTAAATCT CAAACTCCAG CACTGGAATT AAGGTCTCTA AAGCATCTGC 3000 
TCTTTTTTTT TTTTACGGAT ATTTTAGTAA TAAATATGCT GGATAAATAT TAGTCCAACA 3060 
ATAGCTAAGT TATGCTAATA TCACATTATT ATGTATTCAC TTTAAGTGAT AGTTTAAAAA 3120 
ATAAACAAGA AATATTGAGT ATCACTATGT GAAGAAAGTT TTGGAAAAGA AACAATGAAG 3180 
ACTGAATTAA ATTAAAAATG TTGCAGCTCA TAAAGAATTG GGACTCACCC CTACTGCACT 3240 
ACCAAATTCA TTTGACTTTG GAGGCAAAAT GTGTTGAAGT GCCCTATGAA GTAGCAATTT 3300 
TCTATAGGAA TATAGTTGGA AATAAATGTG TGTGTGTATA TTATTATTAA TCAATGCAAT 3360 
ATTTAAAATG AAATGAGAAC AAAGAGGAAA ATGGTAAAAA CTTGAAATGA GGCTGGGGTA 3420 
TAGTTTGTCC TACAATAGAA AAAAGAGAGA GCTTCCTAGG CCTGGGCTCT TAAATGCTGC 3480 
ATTATAACTG AGTCTATGAG GAAATAGTTC CTGTCCAATT TGTGTAATTT GTTTAAAATT 3540 
GTAAATAAAT TAAACTTTTC TGGTTTCTGT GGGAAGGAAA TAGGGAATCC AATGGAACAG 3600 
TAGCTTTGCT TTGCAGTCTG TTTCAAGATT TCTGCATCCA CAAGTTAGTA GCAAACTGGG 3660 
GAATACTCGC TGCAGCTGGG GTTCCCTGCT TTTTGGTAGC AAGGGTCCAG AGATGAGGTG 3720 
TTTTTTTCGG GGAGCTAATA ACAAAAACAT TTTAAAACTT ACCTTTACTG AAGTTAAATC 3780 
CTCTATTGCT GTTTCTATTC TCTCTTATAG TGACCAACAT CTTTTTAATT TAGATCCAAA 3840 
TAACCATGTC CTCCTAGAGT TTAGAGGCTA GAGGGAGCTG AGGGGAGGAT CTTACTGAAA 3900 
GCACCCTGGG GAGATTGATT GTCCTTAAAC CTAAGCCCCA CAAACTTGAC ACCTGATCAG 3960 
GTCTGGGAGC TACAAAATTT CATTTTTCTC CTCACTGCCC TTCTTCTGAG TGGCATTGGC 4020 
CTGAATCAAG GAAAGCCAGG CCTTGTGGGC CCCCTTCTTT CGGCTTTCTG CTAAAGCAAC 4080 
ACCTCCAGCA GAGATTCCCT TAAGTGACTC CAGGTTTTCC ACCATCCTTC AGCGTGAATT 4140 
AATTTTTAAT CAGTTTGCTT TCTCCAGAGA AATTTTAAAA TAATAGAAGA AATAGAAATT 4200 
TTGAATGTAT AAAAGAAAAA GATCAAGTTG TCATTTTAGA ACAGAGGGAA CTTTGGGAGA 4260 
AAGCAGCCCA AGTAGGTTAT TTGTACAGTC AGAGGGCAAC AGGAAGATGC AGGCCTTCAA 4320 
GGGCAAGGAG AGGCCACAAG GAATATGGGT GGGAGTAAAA GCAACATCGT CTGCTTCATA 4380 
CTTTTTCCTA GGCTTGGCAC TGCCTTTTCC TTTCTCAGGC CAAT GGCAA C TGCCATTTGA 4440 
GTCCGGTGAG GGATCAGCCA ACCTCTTCTC TATGGCTCAC CTTATTTGGA GTGAGAAATC 4500 
AAGGAGACAG AGCTGACTGC ATGATGAGTC TGAAGGCATT TGCAGGATGA GCCTGAACTG 4560 
GTTGTGCAGA ACAAACAAGG CATTCATGGG AATTGTTGTA TTCCTTCTGC AGCCCTCCTT 4620 
CTGGGCACTA AGAAGGTCTA TGAATTAAAT GCCTATCTAA AATTCTGATT TATTC CTACA 4680 
TTTTCTGTTT TCTAATTTGA CCCTAAAATC TATGTGTTTT AGACTTAGAC TTTTTATTGC 4740 
CCCCCCOCCC TTTTTTTTTG AGACGGAGTC TCGCTCTGAC GCACAGGCTG GAGTGCAGTG 4800 
GCTCCGATCT CTGCTCACTG AAAGCTCCGC CTCCCGGGTT CATGCCATTC TCCTGCCTCA 4860 
GCCTCCTGAG TAGCTGGGAC TACAGGCGCC CACCACCACG CCOGGCTAAT TTTTTGTATT 4920 
TTTAATAGAG ACGGGGTTTC ACTGTGTTAG CCAGGATGGT CTCGATCTCC TGACCTCGTG 4980 
ATCCGCCTGC CTCGGCCTCC CAAAGTGCTG GGATTACAGG CATGACCCAC CGCTCCCGGC 5040 
CTTGTTTTCC GTTTAAAGTC GTCTTCTTTT AATGTAATCA TTTTGAACAT GTGTGAAAGT 5100 
TGATCATACG AATTGGATCA ATCTTGAAAT ACTCAACCAA AAGACAGTCG AGAAGCCAGG 5160 
GGGAGAAAGA ACTCAGGGCA CAAAATATTG GTCTGAGAAT GGAATTCTCT GTAAGCCTAG 5220 
TTGCTGAAAT TTCCTGCTGT AACCAGAAGC CAGTTTTATC TAACGGCTAC TGAAA CACC C 5280 
ACTGTGTTTT GCTCACTCCC TCACTCACCG ATCAAAACCT GCTACCTCCC CAAGACTTTA 5340 
CTAGTGCCGA TAAACTTTCT CAAAGAGCAA CCAGTATCAC TTCCCTGTTT ATAAAACCTC 5400 
TAACCATGTC TTTGTTCTTT GAACATGCTG AAAACCACCT GGTCTGCATG TATGCCCGAA 5460 
TTTGTAATTC TTTTCTCTCA AATGAAAATT TAATTTTAGG GATTCATTTC TATATTTTCA 5520 
CATATGTAGT ATTATTATTT CCTTATATGT GTAAGGTGAA ATTTATGGTA TTTGAGTGTG 5580 
CAAGAAAATA TATTTTTAAA GCTTTCATTT TTCCCCCAGT GAATGATTTA GAATTTTTTA 5640 
TGTAAATATA CAGAATGTTT TTTCTTACTT TTATAAGGAA GCAGCTGTCT AAAATGCAGT 5700 
GGGGTTTGTT TTGCAATGTT TTAAACAGAG TTTTAGTATT GCTATTAAAA GAAGTTACTT 5760 
TGCTTTTAAA GAAACTTGGC TGCTTAAAAT AAGCAAAAAT TGGATGCATA AAGTAATATT 5820 
TACAGATGTG GGGAGATGTA ATAAAACAAT ATTAACTTGG TTTCTTGTTT TTGCTGTATT 5880 
TAGAGATTAA ATAATTCTAA GATGATCACT TTGCAAAATT ATGCTTATGG CTGGCATGGA 5940 
AATAGAAATA CTCAATTATG TCTTTGTTGT ATTAATGGGG AATATTTTGG ACAATGTTTC 6000 
ATTATCAAAT TGTCGACATC ATTAATATAT ATTGTAATGT TGGGAAGAGA TCACTATTTT 6060 
GAAGCACAGC TTTACAGATG AGTATCTATG ATACATATGT ATAATAAATT TTGATCGGGT 6120 
ATTAAAAGTA TTAGAAGGTG GTTATAATTG CAGAGTATTC CATGAATAGT ACACTGACAC 61 BO 
AGGGGTTTTA CTTTGAGGAC CAGTGTAGTC AAGGGAAAAC ATGAGTTAAA AAGAAAAGCA 6240 
GGCAATATTG CAGTCTTGAT TCTGCCACTT ACAGGATAGA TAATGCCTGA ACTTTAATGA 6300 
CAAGATGATC CAACCATAAA GGTGCTCTGT GCTTCACAGT GAATCTTTTC CCCATGCAGG 6360 
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AGTGTGCTCC CCTACAAACG TTAAGACTGA TCATTTCAAA AATCTATTAQ CTATATCAAA 6420 

AGCCTTACAT TTTAATATAG GTTGAACCAA AATTTCAATT CCAGTAACTT CTATTGTAAC 6480 

CATTATTTTT GTGTATGTCT TCAAGAATGT TCATTGGATT TTTGTTTGTA ATAGTAAAAT 6540 

ACCGGATACA TTTCACGTGT CCTTCAGTAT TGATTTGGTT GAATATTGGG TCATAATGGT 6600 

5 TGAGAAGCAT GGACACTAGA GCCAGAATGC TTGGATATGA ATCCTGGATC TGTCACTTAC 6660 

TTCTGTGTGA CCTTTGAAAG GCTACTTATT TCCTCTCTTA GCTTTCTCAT TAAAATCAAT 6720 

GAACAATGCC AGCCTCATGG GGTTGTTGAA TGATTAAATT AGTTAATATA CCTAAAGTAC 6780 

ATAGAACACT GCCTGCACAT AGTAAAAGAA TTATAAGTGT GAGGTAGTTG GTAAAATTAT 6840 

GTAGTTGGAT ATACTACCGA ACAATATCTA ATCTCTTTTT AGGGAAATAA AGTTTGTGCA 6900 
10 TATATATAAT CCCGAAACAT G 

Seq ID NO: 328 Protein sequence 
Protein Accession fl: NP_Q01932.1 

15 1 11 21 31 41 51 

I I ! I I I 

MAAAGPRRSV RGAVCLHLLL TLVIFSRDGE ACKKVILNVP SKLEADKIIG RVNLEECFRS 60 

ADLIRSSDPD FRVLNDGSVY TARAVALSDK KRSFTIWLSD KRKQTQKEVT VLLEKQKKVS 120 

KTRHTRETVL RRAKRRWAPI PCSMQENSLG PFPLPLQQVE SDAAQNYTVF YSISGRGVDK 180 

20 EPLNLFYIER DTGNLPCTRP VDREEYDVFD LIAYASTADG YSADLPLPLP IRVEDENDNH 240 

PVFTEAIYNF EVLESSRPGT TVGWCATDR DEPDTMHTRL KYSILQQTPR SPGLFSVHPS 30O 

TGVITTVSHY LDREWDKYS LIMKVQDMDG QFFGLIGTST CIITVTDSND NAPTFRQNAY 360 

EAFVEENAFN VEILRIPIED KDLINTANWR VNFTILKGNE NGHFKISTDK ETNEGVLSW 420 

KPLNYEENRQ VNLEIGVNNE APFARDIPRV TALNRALVTV HVRDLDEGPE CTPAAQYVRI 480 

25 KENLAVGSKI NGYKAYDPEN RNGNGLRYKX IiHDPKGWITI DEISGSIITS KILDREVETP 540 

KNELYNITVL AIDKDDRSCT GTLAVNIEDV KDNPPEILQE YWICKPKMG YTDILAVDPD 600 

EPVHGAPFYP SLPNTSPEIS RLWSLTKVND TAARLSYQKN AGFQEYTIPI TVKDRAGQAA 660 

TKLLRVNLCE CTHPTQCRAT SRSTGVIIX3K HAILAILLGI ALLFSVLLTL VCGVFGATKG 720 

KRFPEDLAQQ NLIISNTEAP GDDRVCSAKG FMTQTTNNSS QGFCGTMGSG MKNGGQETIE 780 

30 MMKGGNQTLE SCRGAGHHHT LDSCRGGHTE VDNCRYTYSE WHSFTQPRLG EKLHRCNQNE 840 
DRMPSQDYVL TYNYEGRGSP AGSVGCCSEK QEEDGLDFLN NLEPKFITLA EACTKR 



Seq ID NO: 329 DNA sequence 
35 Nucleic Acid Accession #: NMJJ16583.2 
Coding sequence : 72 . . 842 

1 11 21 31 |1 51 

40 . GGAGTGGGGG AGAGAGAGGA GACCAGGACA GCTGCTGAGA CCTCTAAGAA GTCCAGATAC 
TAAGAGCAAA GATGTTTCAA ACTGGGGGCC TCATTGTCTT CTACGGGCTG TTAGCCCAGA 
CCATGGCCCA GTTTGGAGGC CTGCCCGTGC CCCTGGACCA GACCCTGCCC TTGAATGTGA 
ATCCAGCCCT GCCCTTGAGT CCCACAGGTC TTGCAGGAAG CTTGACAAAT GCCCTCAGCA 
ATGGCCTGCT GTCTGGGGGC CTGTTGGGCA TTCTGGAAAA CCTTCCGCTC CTGGACATCC 

45 TGAAGCCTGG AGGAGGTACT TCTGGTGGCC TCCTTGGGGG ACTGCTTGGA AAAGTGACGT 
CAGTGATTCC TGGCCTGAAC AACATCATTG ACATAAAGGT CACTGACCCC CAGCTGCTGG 
AACTTGGCCT TGTGCAGAGC CCTGATGGCC ACCGTCTCTA TGTCACCATC CCTCTCGGCA 
TAAAGCTCCA AGTGAATACG CCCCTGGTCG GTGCAAGTCT GTTGAGGCTG GCTGTGAAGC 
TGGACATCAC TGCAGAAATC TTAGCTGTGA GAGATAAGCA GGAGAGGATC CACCTGGTCC 

50 TTGGTGACTG CACCCATTCC CCTGGAAGCC TGCAAATTTC TCTGCTTGAT GGACTTGGCC 
CCCTCCCCAT TCAAGGTCTT CTGGACAGCC TCACAGGGAT CTTGAATAAA GTCCTGCCTG 
AGTTGGTTCA GGGCAACGTG TGCCCTCTGG TCAATGAGGT TCTCAGAGGC TTGGACATCA 
CCCTGGTGCA TGACATTGTT AACATGCTGA TCCACGGACT ACAGTTTGTC ATCAAGGTCT 
AAGCCTTCCA GGAAGGGGCT GGCCTCTGCT GAGCTGCTTC CCAGTGCTCA CAGATGGCTG 

55 GCCCATGTGC TGGAAGATGA CACAGTTGCC TTCTCTCCGA GGAACCTGCC CCCTCTCCTT 
TCCCACCAGG CGTGTGTAAC ATCCCATGTG CCTCACCTAA TAAAATGGCT CTTCTTCTGC 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAA 

Seq ID NO: 330 Protein sequence 
60 Protein Accession #: NP_057667.l 

1 11 21 31 41 51 

I I I I i I 

MFQTGGLIVF YGLLAQTMAQ FGGLPVPLDQ TLPLNVNPAL PLSPTGLAGS LTNALSNGLL 
65 SGGLLGILEN LPLLDILKPG GGTSGGLLGG LLGKVTSVIP GLNNIIDIKV TDPQLLELGL 
VQSPDGHRLY VTIPLGIKLQ VNTPLVGASL LRIAVKLDIT AEILAVRDKQ ERIHLVLGDC 
THSPGSLQIS LLDGLGPLPI QGLLDSLTGI LNKVLPELVQ GNVCPLVNEV LRGLDITLVH 
DIVNMLIHGL QFVIKV 

70 

Seq ID NOs 331 DNA sequence 

Nucleic Acid Accession tt: NM_004363.1 

Coding sequence: 115.. 2223 

75 1 11 21 31 41 51 

I I I I I I 

CTCAGGGCAG AGGGAGGAAG GACAGCAGAC CAGACAGTCA CAGCAGCCTT GACAAAACGT 
TCCTGGAACT CAAGCTCTTC TCCACAGAGG AGGACAGAGC AGACAGCAGA GACCATGGAG 
_ TCTCCCTCGG CCCCTCCCCA CAGATGGTGC ATCCCCTGGC AGAGGCTCCT GCTCACAGCC 

80 TCACTTCTAA CCTTCTGGAA CCCGCCCACC ACTGCCAAGC TCACTATTGA ATCCACGCCG 
TTCAATGTCG CAGAGGGGAA GGAGGTGCTT CTACTTGTCC ACAATCTGCC CCAGCATCTT 
TTTGGCTACA GCTGGTACAA AGGTGAAAGA GTGGATGGCA ACCGTCAAAT TATAGGATAT 
GTAATAGGAA CTCAACAAGC TACCCCAGGG CCCGCATACA GTGGTCGAGA GATAATATAC 
CCCAATGCAT CCCTGCTGAT CCAGAACATC ATCCAGAATG ACACAGGATT CTACACCCTA 
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CACGTCATAA AGTCAGATCT TGTGAATGAA GAAGCAACTG GCCAGTTCCG GGTATACCCG S40 

GAGCTGCCCA AGCCCTCCAT CTCCAGCAAC AACTCCAAAC CCGTGGAGGA CAAGGATGCT 600 

GTGGCCTTCA CCTGTGAACC TGAGACTCAG GACGCAACCT ACCTGTGGTG GGTAAACAAT 660 

CAGAGCCTCC CGGTCAGTCC CAGGCTGCAG CTGTCCAATG GCAACAGGAC CCTCACTCTA 720 

5 TTCAATGTCA CAAGAAATGA CACAGCAAGC TACAAATGTG AAACCCAGAA CCCAGTGAGT 780 

GCCAGGCGCA GTGATTCAGT CATCCTGAAT GTCCTCTATG GCCCGGATGC CCCCACCATT 840 

TCCCCTCTAA ACACATCTTA CAGATCAGGG GAAAATCTGA ACCTCTCCTG CCACGCAGCC 900 

TCTAACCCAC CTGCACAGTA CTCTTGGTTT GTCAATGGGA CTTTCCAGCA ATCCACCCAA 960 

GAGCTCTTTA TCCCCAACAT CACTGTGAAT AATAGTGGAT CCTATACGTG CCAAGCCCAT 1020 

10 AACTCAGACA CTGGCCTCAA TAGGACCACA GTCACGACGA TCACAGTCTA TGCAGAGCCA 1080 

CCCAAACCCT TCATCACCAG CAACAACTCC AACCCCGTGG AGGATGAGGA TGCTGTAGCC 1140 

TTAACCTGTG AACCTGAGAT TCAGAACACA ACCTACCTGT GGTGGGTAAA TAATCAGAGC 1200 

CTCCCGGTCA GTCCCAGGCT GCAGCTGTCC AATGACAACA GGACCCTCAC TCTACTCAGT 1260 

GTCACAAGGA ATGATGTAGG ACCCTATGAG TGTGGAATCC AGAACGAATT AAGTGTTGAC 1320 

15 CACAGCGACC CAGTCATCCT GAATGTCCTC TATGGCCCAG ACGACCCCAC CATTTCCCCC 1380 

TCATACACCT ATTACCGTCC AGGGGTGAAC CTCAGCCTCT CCTGCCATGC AGCCTCTAAC 1440 

CCACCTGCAC AGTATTCTTG GCTGATTGAT GGGAACATCC AGCAACACAC ACAAGAGCTC 1500 

TTTATCTCCA ACATCACTGA GAAGAACAGC GGACTCTATA CCTGCCAGGC CAATAACTCA 1560 

GCCAGTGGCC ACAGCAGGAC TACAGTCAAG ACAATCACAG TCTCTGCGGA GCTGCCCAAG 1620 

20 CCCTCCATCT CCAGCAACAA CTCCAAACCC GTGGAGGACA AGGATGCTGT GGCCTTCACC 1680 

TGTGAACCTG AGGCTCAGAA CACAACCTAC CTGTGGTGGG TAAATGGTCA GAGCCTCCCA 1740 

GTCAGTCCCA GGCTGCAGCT GTCCAATGGC AACAGGACCC TCACTCTATT CAATGTCACA 1800 

AGAAATGACG CAAGAGCCTA TGTATGTGGA ATCCAGAACT CAGTGAGTGC AAACCGCAGT 1860 

GACCCAGTCA CCCTGGATGT CCTCTATGGG CCGGACACCC CCATCATTTC CCCCCCAGAC 1920 

25 TCGTCTTACC TTTCGGGAGC GAACCTCAAC CTCTCCTGCC ACTCGGCCTC TAACCCATCC 1980 

CCGCAGTATT CTTGGCGTAT CAATGGGATA CCGCAGCAAC ACACACAAGT TCTCTTTATC 2040 

GCCAAAATCA CGCCAAATAA TAACGGGACC TATGCCTGTT TTGTCTCTAA CTTGGCTACT 2100 

GGCCGCAATA ATTCCATAGT CAAGAGCATC ACAGTCTCTG CATCTGGAAC TTCTCCTGGT 2160 

CTCTCAGCTG GGGCCACTGT CGGCATCATG ATTGGAGTGC TGGTTGGGGT TGCTCTGATA 2220 

30 TAGCAGCCCT GGTGTAGTTT CTTCATTTCA GGAAGACTGA CAGTTGTTTT GCTTCTTCCT 2280 

TAAAGCATTT GCAACAGCTA CAGTCTAAAA TTGCTTCTTT ACCAAGGATA TTTACAGAAA 2340 

AGACTCTGAC CAGAGATCGA GACCATCCTA GCCAACATCG TGAAACCCCA TCTCTACTAA 2400 

AAATACAAAA ATGAGCTGGG CTTGGTGGCG CGCACCTGTA GTCCCAGTTA CTCGGGAGGC 2460 

TGAGGCAGGA GAATCGCTTG AACCCGGGAG GTGGAGATTG CAGTGAGCCC AGATCGCACC 2520 

35 ACTGCACTCC AGTCTGGCAA CAGAGCAAGA CTCCATCTCA AAAAGAAAAQ AAAAGAAGAC 2580 

TCTGACCTGT ACTCTTGAAT ACAAGTTTCT GATACCACTG CACTGTCTGA GAATTTCCAA 2640 

AACTTTAATG AACTAACTGA CAGCTTCATG AAACTGTCCA CCAAGATCAA GCAGAGAAAA 2700 

TAATTAATTT CATGGGACTA AATGAACTAA TGAGGATTGC TGATTCTTTA AATGTCTTGT 2760 

TTCCCAGATT TCAGGAAACT TTTTTTCTTT TAAGCTATCC ACTCTTACAG CAATTTGATA 2820 

40 AAATATACTT TTGTGAACAA AAATTGAGAC ATTTACATTT TCTCCCTATG TGGTCGCTCC 2880 

AGACTTGGGA AACTATTCAT GAATATTTAT ATTGTATGGT AATATAGTTA TTGCACAAGT 2940 
TCAATAAAAA TCTGCTCTTT GTATAACAGA AAAA 

Seq ID NO: 332 Protein sequence 
45 Protein Accession #: NP_004354.l 

1 11 21 31 41 51 

I I I I I I 

MESPSAPPHR WCIPWQRLLIj TASLLTFNNP PTTAKLTIES TPFNVAEGKE VLLLVHNLPQ 60 

50 HLFGYSWYKG ERVDGNRQII GYVIGTQQAT PGPAYSGREI IYPNASLLIQ NIIQNDTGFY 120 

TLHVIKSDLV NEEATGQFKV YPELPKPSIS SNNSKPVEDK DAVAFTCEPE TQDATYLWWV 180 

NNQSLPVSPR LQLSNGNRTL TLFNVTRNDT ASYKCETQNP VSARRSDSVI LNVLYGPDAP 240 

TISPLNTSYR SGENLNLSCH AASNPPAQYS WFVNGTFQQS TQELFIPNIT VNNSGSYTOQ 300 

AHNSDTGLNR TTVTTITVYA EPPKPFITSN KSNPVEDEDA VALTCEPEIQ NTTYLWWVNN 360 

55 QSLPVSPRLQ LSNDNRTLTL LSVTRNDVGP YECGIQNELS VDHSDPVILN VLYGPDDPTI 420 

SPSYTYYRPG VNLSLSCHAA SNPPAQYSWL IDGNIQQHTQ ELFISNITEK NSGLYTCQAN 480 

NSASGHSRTT VKTITVSAEL PKPSISSNNS KPVEDKDAVA PTCEPEAQNT TYLWWVNGQS 540 

LPVSPRLQLS NGNRTLTLFN VTRNDARAYV CGIQNSVSAN RSDPVTLDVL YGPDTPIISP 600 

PDSSYLSGAN LNLSCHSASN PSPQYSWRIN GIPQQHTQVL, FIAKITPNNN GTYACFVSNL 660 
60 ATGRNNSIVK SITVSASGTS PGLSAGATVG IMIGVLVGVA LI 

Seq ID NO: 333 DNA sequence 
Nucleic Acid Accession #: NMJ>06952.1 
£t Coding sequence: 11.. 7 93 

65 

1 11 21 31 41 51 

I I I I I I 

AATCCCGACA ATGGCGAAAG ACAACTCAAC TGTTCGTTGC TTCCAGGGCC TGCT GATT TT 60 

TGGAAATGTG ATTATTGGTT GTTGCGGCAT TGCCCTGACT GCGGAGTGCA TCTTCTTTGT 120 

70 ATCTGACCAA CACAGCCTCT ACCCACTGCT TGAAGCCACC GACAACGATG ACATCTATGG 180 

GGCTGCCTGG ATCGGCATAT TTGTGGGCAT CTGCCTCTTC TGCCTGTCTG TTCTAGGCAT 240 

TGTAGGCATC ATGAAGTCCA GCAGGAAAAT TCTTCTGGCG TATTTCATTC TGATGTTTAT 300 

AGTATATGCC TTTGAAGTGG CATCTTGTAT CACAGCAGCA ACACAACGAG ACTTTTTCAC 360 

ACCCAACCTC TTCCTGAAGC AGATGCTAGA GAGGTACCAA AACAACAGCC CTCCAAACAA 420 

75 TGATGACCAG TGGAAAAACA ATGGAGTCAC CAAAACCTGG GACAGGCTCA TGCTCCAGGA 480 

CAATTGCTGT GGCGTAAATG GTCCATCAGA CTGGCAAAAA TACACATCTG CCTTCCGGAC 540 

TGAGAATAAT GATGCTGACT ATCCCTGGCC TCGTCAATGC TGTGTTATGA ACAATCTTAA 600 

AGAACCTCTC AACCTGGAGG CTTGTAAACT AGGCGTGCCT GGTTTTTATC ACAATCAGGG 660 

CTGCTATGAA CTGATCTCTG GTCCAATGAA CCGACACGCC TGGGGGGTTG CCTGGTTTGG 720 

80 ATTTGCCATT CTCTGCTGGA CTTTTTGGGT TCTCCTGGGT ACCATGTTCT ACTGGAGCAG 780 
AATTGAATAT TAAGAA 

Seq ID NO: 334 Protein sequence 
Protein Accession ft: NP_008B83.1 
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1 11 21 31 41 SI 

I I I 1 I I 

MAKDNSTVRC FQGLLIFGNV IIGCCGIALT AECIFFVSDQ HSLYPLLEAT DNDDIYGAAW 

IGIFVGICLP CLSVLGIVGI MKSSRKILLA YPILMFIVYA FEVASCITAA TQRDFFTPNL 

FLKQMLERYO NNSPPNNDDQ WKNNGVTKTW DRLMLQDNCC GVNGPSDWQK YTSAFRTENN 

DADYPWPRQC CVMNNLKEPL NLEACKLGVP GFYHNQGCYB LISGPMNRHA WGVAWFGFAI 
LCWTFWVLLG TMPYWSRIEY 

Seq ID NO: 335 DNA sequence 

Nucleic Acid Accession #: NM_002638.l 

Coding sequence : 120 . . 473 



60 
120 
180 
240 



1 
I 

CAATACAGCT 
GCTGGACTGC 
TGAGGGCCAG 
AGGCAGCTGT 
TCAATGGACA 
CGCAAGAGCC 
TCCGGTGCGC 
TCAAGAAGTG 
CGGTCCTTGC 
TGCTGCCCTT 
GAGCTGCCTC 



11 
I 

AAGGAATTAT 
ATAAAGATTG 
CAGCTTCTTG 
CACGGGAGTT 
AGATCCCGTT 
AGTCAAAGGT 
CATGTTGAAT 
CTGTGAAGGC 
TGCACCTGTG 
CCCCTTCCCA 
TCTCATCCAC 




CCTGTTAAAG 
AAAGGACAAG 
CCAGTCTCCA 
CCCCCTAACC 
TCTTGCGGGA 
CCGTCCCCAG 
CACTGTCCAT 
TTTCCAATAA 



31 
I 

TACCACAGAC 
AGCTCTTAGC 
TGTTCCTCAT 
GTCAAGACAC 
TTTCAGTTAA 
CTAAGCCTGG 
GCTGCTTGAA 
TGGCCTGTTT 
AGCTACAGGC 
TCTTCCTCCC 
A 



41 
I 

CCGCCCTGGA 
CAAACACCTT 
CGCTGGGACG 
TGTCAAAGGC 
AGGTCAAGAT 
CTCCTGCCCC 
AGATACTGAC 
CGTTCCCCAG 
CCCATCTGGT 
ATTCAGGATG 



51 
I 

gccaggccaa 
cctgacacca" 
ctggttctag 
cgtgttccat 
aaagtcaaag 

ATTATCTTGA 
TGCCCAGGAA 
TGAAGGGAGC 
CCTAAGTCCC 
CCCACGGCTG 



Seq ID NO: 336 Protein sequence 
Protein Accession ft: NP_002629.1 

1 11 21 31 41 51 

I 1 I I 1 I 

MRASSFLIW VFLIAGTLVL EAAVTGVPVK GQDTVKGRVP FNGQDPVKGQ VSVKGQDKVK 
AQEPVKGPVS TKPGSCPIIL IRCAMLNPPN RCLKDTDCPG IKKCCEGSCG MACFVPQ 

Seq ID NO: 337 DNA sequence 

Nucleic Acid Accession #i NM_001793.2 

Coding sequence: 71.. 2560 



I 



AAAGGGGCAA 
CTCTGCAGCC 
CTGGCTGCAG 
CTTGGAGGCG 
CTGCCCTGGG 
TGGCGAGACA 
ATCCAAACGT 
TGAAAATGGC 
AGACACCAAG 
CTTCGCTGTA 
GATTGCCAAG 
CCCCATGAAC 
GGACACCTTC 
GACAGCCACG 
CCATAGCCAA 
CACCATCAGC 
CATCCAGGCC 
GATCCTTGAT 
GCCTGAGAAT 
CAACTCACCA 
TACCATCACC 
TTTTGAGGCC 
GCTGAAGCTC 
ACCTGTGTTT 
GCCTGTGTGT 
CATCCTGAGA 
TGTGGGCACC 
GGTCTTGGCC 
ACTGATTGAT 
CCAAAGCCCT 
CCCTTTCCAG 
GGAAGGTGAC 
GCACCTTTCT 
GTGCGACTGC 
CCCTGTGCTG 
GAGAAAGAAG 
CGTCTTCTAC 
GCTCCACCGA 
CATCATCCCG 
TATAATTGAG 
CTTGGTGTTC 
CTCCGCCTCC 
GAAGCTGGCA 



11 
I 

GAGCTGAGCG 
ATGGGGCTCC 
TGCGCGGCCT 
GGAGGCGCGG 
CAAGAGCCAG 
GTCCAGGAAA 
ATCTTACGAA 
AAGGGTCCCT 
ATTTTCTACA 
GAGAAGGAGA 
TATGAGCTCT 
ATCTCCATCA 
CGAGGGAGTG 
GATGAGGATG 
GAACCAAAGG 
GTCATCTCCA 
ACAGACATGG 
GCCAATGACA 
GCAGTGGGCC 
GCGTGGCGTG 
ACCCACCCTQ 
AAAAACCAGC 
CCAACCTCCA 
GTCCCACCCT 
GTCTACACTG 
GACCCAGCAG 
CTCGACCGTG 
ATGGACAATG 
GTCAATGACC 
GTGCGCCAGG 
GCCCAGCTCA 
ACAGTGGTCT 
CTGTCTGACC 
CATGGCCATG 
GGGGCTGTCC 
CGGAAGATCA 
TATGGCGAAG 
GGTCTGGAGG 
ACACCCATGT 
AACCTGAAGG 
GACTATGAGG 
GACCAAGACC 
GACATGTACG 



21 
I 

GAACACCGGC 
CTCGTGGACC 
CCGAGCCGTG 
AGCAGGAGCC 
CTCTGTTTAG 
GAAGGTCACT 
GACACAAGAG 
TCCCCCAGAG 
GCATCACGGG 
CAGGCTGGTT 
TTGGCCACGC 
TCGTGACCGA 
TCTTAGAGGG 
ATGCCATCTA 
ACCCACACGA 
GTGGCCTGGA 
ATGGGGACGG 
ATGCTCCCAT 
ATGAGGTGCA 
CCACCTACCT 
AGAGCAACCA 
ACACCCTGTA 
CAGCCACCAT 
CCAAAGTCGT 
CAGAAGACCC 
GGTGGCTAGC 
AGGATGAGCA 
GAAGCCCTCC 
ATGGCCCAGT 
TGCTGAACAT 
CAGATGACTC 
TGTCCCTGAA 
ATGGCAACAA 
TCGAAACCTG 
TGGCTCTGCT 
AGGAGCCCCT 
AGGGGGGTGG 
CCAGGCCGGA 
ACCGTCCTCG 
CGGCTAACAC 
GCAGCGGCTC 
AAGATTACGA 
GTGGCGGGGA 



31 
I 

CCGCCGTCGC 
TCTCGCGTCT 
CCGGGCGGTC 
CGGCCAGGCG 
CACTGATAAT 
GAAGGAAAGG 
AGATTGGGTG 
ACTGAATCAG 
GCCGGGGGCA 
GTTGTTGAAT 
TGTGTCAGAG 
CCAGAATGAC 
AGTCCTACCA 
CACCTACAAT 
CCTCATGTTC 
CCGGGAAAAA 
CTCCACCACC 
GTTTGACCCC 
GAGGCTGACG 
TATCATGGGC 
GGGCATCCTG 
OGTTGAAGTG 
AGTGGTCCAC 
TGAGGTCCAG 
TGACAAGGAG 
CATGGACCCA 
GTTTGTGAGG 
CACCACTGGC 
CCCTGAGCCC 
CACGGACAAG 
AGACATCTAC 
GAAGTTCCTG 
AGAGCAGCTG 
CCCTGGACCC 
GTTCCTCCTG 
CCTACTCCCA 
CGAAGAGGAC 
GGTGGTTCTC 
GCCAGCCAAC 
AGACCCCACA 
CGACGCCGCG 
TTATCTGAAC 
GGACGACTAG 



41 
I 

GGCAGCTGCT 
CTCCTCCTTC 
TTCAGGGAGG 
CTGGGGAAAG 
GATGACTTCA 
AATCCATTGA 
GTTGCTCCAA 
CTCAAGTCTA 
GACAGCCCCC 
AAGCCACTGG 
AATGGTGCCT 
CACAAGCCCA 
GGTACTTCTG 
GGGGTGGTTG 
ACCATTCACC 
GTCCCTGAGT 
ACGGCAGTGG 
CAGAAGTACG 
GTCACTGATC 
GGTGACGAOG 
ACAACCAGGA 
ACCAACGAGG 
GTGGAGGATG 
GAGGGCATCC 
AATCAAAAGA 
GACAGTGGGC 
AACAACATCT 
ACGGGAACCC 
CGTCAGATCA 
GACCTGTCTC 
TGGACGGCAG 
AAGCAGGATA 
ACGGTGATCA 
TGGAAGGGAG 
CTGGTGCTGC 
GAAGATGACA 
CAGGACTATG 
CGCAATGACG 
CCAGATGAAA 
GCCCCGCCCT 
TCCCTGAGCT 
GAGTGGGGCA 
GCGGCCTGCC 



51 
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TCACCCCTCT 
TCCAGGTTTG 
CTGAAGTGAC 
TATTCATGGG 
CTGTGCGGAA 
AGATCTTCCC 
TATCTGTCCC 
ATAAAGATAG 
CTGAGGGTGT 
ACCGGGAGGA 
CAGTGGAGGA 
AGTTTACCCA 
TGATGCAGGT 
CTTACTCCAT 
GGAGCACAGG 
ACACACTGAC 
CAGTAGTGGA 
AGGCCCATGT 
TGGACGCCCC 
GGGACCATTT 
AGGGTTTGGA 
CCCCTTTTGT 
TGAATGAGGC 
CCACTGGGGA 
TCAGCTACCG 
AGGTCACAGC 
ATGAAGTCAT 
TTCTGCTAAC 
CCATCTGCAA 
CCCACACCTC 
AGGTCAACGA 
CATATGACGT 
GGGCCACTGT 
GTTTCATCCT 
TTTTGTTGGT 
CCCGTGACAA 
ACATCACCCA 
TGGCACCAAC 
TCGGCAACTT 
ACGACACCCT 
CCCTCACCTC 
GCCGCTTCAA 
TGCAGGGCTG 



60 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
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1860 
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1980 
2040 
2100 
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2280 
2340 
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2460 
2520 
2580 
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GGGACCAAAC GTCAGGCCAC AGA6CATCTC CAAGGGGTCT CAGTTCCCCC TTCAGCTGAG 2640 

GACTTCGGAG CTTGTCAGGA AGTGGCCGTA GCAACTTGGC GGAGACAGGC TATGAGTCTG 2700 

ACGTTAGAGT GGTTGCTTCC TTAGCCTTTC AGGATGGAGG AATGTGGGCA GTTTGACTTC 2760 

AGCACTGAAA ACCTCTCCAC CTGGGCCAGG GTTGCCTCAG AGGCCAAGTT TCCAGAAGCC 2820 

TCTTACCTGC CGTAAAATGC TCAACCCTGT GTCCTGGGCC TGGGCCTGCT GTGACTGACC 2880 

TACAGTGGAC TTTCTCTCTG GAATGGAACC TTCTTAGGCC TCCTGGTGCA ACTTAATTTT 2940 

TTTTTTTAAT GCTATCTTCA AAACGTTAGA GAAAGTTCTT CAAAAGTGCA GCCCAGAGCT 30 00 

GCTGGGCCCA CTGGCCGTCC TGCATTTCTG GTTTCCAGAC CCCAATCCCT CCCATTCGGA 3060 

TGGATCTCTG CGTTTTTATA CTGAGTGTGC CTAGGTTGCC CCTTATTTTT TATTTTCCCT 3120 

GTTGCGTTGC TATAGATGAA GGGTGAGGAC AATCGTGTAT ATGTACTAGA ACTTTTTTAT 3180 
TAAAGAAACT TTTCCCAGAA AAAAA 



Seq ID NO: 338 Protein sequence 
Protein Accession #: NP_001784.2 

1 11 21 31 41 51 

1 I I I I I 

MGLPRGPLAS LLLLQVCWLQ CAASEPCRAV FREAEVTLEA GGAEQEPGQA LGKVFMGCPG 60 

QEPALFSTDN DDFTVRNGET VQERRSLKER NPLKIPPSKR ILRRHKRDWV VAPISVPENG 120 

KGPFPQRLNQ LKSNKDRDTK IFYSITGPGA DSPPEGVFAV EKETGWLLLN KPLDREEIAK 180 

YELFGHAVSE NGASVEDPMN ISIIVTDQND HKPKFTQDTF RGSVLEGVLP GTSVMQVTAT 240 

DEDDAIYTYN GWAYSIHSQ EPKDPHDLMF TIHRSTGTIS VISSGLDREK VPEYTLTIQA 300 

TDMDGDGSTT TAVAWEILD ANDNAPMFDP QKYEAHVPEN AVGHEVQRLT VTDLDAPNSP 360 

AWRATYLIMG GDDGDHFTIT THPESNQGIL TTRKGLDFEA KNQHTLYVEV TNEAPFVLKL 420 

PTSTATIWH VEDVNEAPVF VPPSKWEVQ EGIPTGEPVC VYTAEDPDKE NQKISYRILR 480 

DPAGWLAMDP DSGQVTAVGT LDREDEQFVR NNIYEVMVLA MDNGSPPTTG TGTLLLTLID 540 

VNDHGPVPEP RQITICNQSP VRQVLNITDK DLSPHTSPFQ AQLTDDSDIY WTAEVNEEGD 600 

TWLSLKKFL KQDTYDVHLS LSDHGNKEQL TVIRATVCDC HGHVETCPGP WKGGFILPVL 660 

GAVLALLFLL LVLLLLVRKK RKIKEPLLLP EDDTRDNVFY YGEEGGGEED QDYDITQLHR 720 

GLEARPEWL RNDVAPTIIP TPMYRPRPAN PDEIGNFIIE NLKAANTDPT APPYDTLLVF 780 
DYEGSGSDAA SLSSLTSSAS DQDQDYDYLN EWGSRFKKLA DMYGGGEDD 

Seq ID NO: 339 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence : 1 . . 672 

1 11 21 31 41 51 

| | I i ) I 

ATGAGGCTCC AAAGACCCCG ACAGGCCCCG GCGGGTGGGA GGCGCGCGCC CCGGGGCGGG 60 
CGGGGCTCCC CCTACCGGCC AGACCCGGGG AGAGGCGCGC GGAGGCTGCG AAGGTTCCAG 120 
AAGGGCGGGG AGGGGGCGCC GCGGGCTGAC CCTCCCTGGG CACCGCTGGG GACGATGGCG 180 
CTGCTCGCCT TGCTGCTGGT CGTGGCCCTA CCGCGGGTGT GGACAGACGC CAACCTGACT 240 
GCGAGACAAC GAGATCCAGA GGACTCCCAG CGAACGGACG AGGGTGACAA TAGAGTGTGG 300 
TGTCATGTTT GTGAGAGAGA AAACACTTTC GAGTGCCAGA ACCCAAGGAG GTGCAAATGG 360 
ACAGAGCCAT ACTGCGTTAT AGOGGCCGTG AAAATATTTC CACGTTTTTT CATGGTTGCG 420 
AAGCAGTGCT CCGCTGGTTG TGCAGOGATG GAGAGACCCA AGCCAGAGGA GAAGCGGTTT 480 
CTCCTGGAAG AGCCCATGCC CTTCTTTTAC CTCAAGTGTT GTAAAATTCG CTACTGCAAT 540 
TTAGAGGGGC CACCTATCAA CTCATCAGTG TTCAAAGAAT ATGCTGGGAG CATGGGTGAG 600 
AGCTGTGGTG GGCTGTGGCT GGCCATCCTC CTGCTGCTGG CCTCCATTGC AGCCGGCCTC 660 
AGCCTGTCTT GA 

Seq ID NO: 340 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I | I I ! I 

MRLQRPRQAP AGGRRAPRGG RGSPYRPDPG RGARRLRRFQ KGGEGAPRAD PPWAPLGTMA 60 
LLALLLWAIi PRVWTDANLT ARQRDPEDSQ RTDEGDNRVW CHVCERENTF ECQNPRRCKW 120 
TEPYCVIAAV KIFPRFFMVA KQCSAGCAAM ERPKPEEKRF LLEEPMPFFY IiKCCKIRYCN 180 
LEGPPINSSV FKEYAGSMGE SCGGLWLAII* LLLASIAAGL SLS 



Seq ID NO: 341 DNA sequence 

Nucleic Acid Accession #: XM_035292.2 

Coding sequence: 53.. 1576 

1 11 21 31 41 51 

I | I I I I 

GCTCGCTGGG CCGCGGCTCC CGGGTGTCCC AGGCCCGGCC GGTGCGCAGA GCATGGCGGG 60 

TGCGGGCCCG AAGCGGOGCG CGCTAGCGGC GCCGGCGGCC GAGGAGAAGG AAGAGGCGCG 120 

GGAGAAGATG CTGGCCGCCA AGAGCGCGGA CGGCTCGGCG CCGGCAGGCG AGGGCGAGGG 180 

CGTGACCCTG CAGCGGAACA TCACGCTGCT CAACGGCGTG GCCATCATCG TGGGGACCAT 240 

TATCGGCTCG GGCATCTTCG TGACGCCCAC GGGCGTGCTC AAGGAGGCAG GCTCGCOGGG 300 

GCTGGCGCTG GTGGTGTGGG CCGCGTGCGG CGTCTTCTCC ATCGTGGGCG CGCTCTGCTA 360 

CGCGGAGCTC GGCACCACCA TCTCCAAATC GGGCGGCGAC TACGCCTACA TGCTGGAGGT 420 

CTACGGCTCG CTGCCCGCCT TCCTCAAGCT CTGGATCGAG CTGCTCATCA TCCGGCCTTC 480 

ATCGCAGTAC ATCGTGGCCC TGGTCTTCGC CACCTACCTG CTCAAGCCGC TCTTCCCCAC 540 

CTGCCCGGTG CCCGAGGAGG CAGCCAAGCT CGTGGCCTGC CTCTGCGTGC TGCTGCTCAC 600 

GGCCGTGAAC TGCTACAGCG TGAAGGCCGC CACCCGGGTC CAGGATGCCT TTGCCGCCGC 660 

CAAGCTCCTG GCCCTGGCCC TGATCATCCT GCTGGGCTTC GTCCAGATCG GAAAGGGTGA 720 

TGTGTCCAAT CTAGATCCCA ACTTCTCATT TGAAGGCACC AAACTGGATG TGGGGAACAT 780 

TGTGCTGGCA TTATACAGCG GCCTCTTTGC CTATGGAGGA TGGAATTACT TGAATTTOGT 840 

CACAGAGGAA ATGATCAACC • CCTACAGAAA CCTGCCCCTG GCCATCATCA TCTCCCTGCC 900 

CATCGTGACG CTGGTGTACG TGCTGACCAA CCTGGCCTAC TTCACCACCC TGTCCACCGA 960 

GCAGATGCTG TCGTCCGAGG CCGTGGCCGT GGACTTCGGG AACTATCACC TGGGCGTCAT 1020 
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GTCCTGGATC ATCCCCGTCT TCGTGGGCCT GTCCTGCTTC GGCTCCGTCA ATGGGTCCCT 
GTTCACATCC TCCAGGCTCT TCTTCGTGGG GTCCCGGGAA GGCCACCTGC CCTCCATCCT 
CTCCATGATC CACCCACAGC TCCTCACCCC CGTGCCGTCC CTCGTGTTCA CGTGTGTGAT 
GACGCTGCTC TACGCCTTCT CCAAGGACAT CTTCTCCGTC ATCAACTTCT TCAGCTTCTT 
CAACTGGCTC TGCGTGGCCC TGGCCATCAT CGGCATGATC TGGCTGCGCC ACAGAAAGCC 
TGAGCTTGAG CGGCCCATCA AGGTGAACCT GGCCCTGCCT GTGTTCTTCA TCCTGGCCTG 
CCTCTTCCTG ATCGCCGTCT CCTTCTGGAA GACACCCGTG GAGTGTGGCA TCGGCTTCAC 
CATCATCCTC AGCGGGCTGC CCGTCTACTT CTTCGGGGTC TGGTGGAAAA ACAAGCCCAA 
GTGGCTCCTC CAGGGCATCT TCTCCACGAC CGTCCTGTGT CAGAAGCTCA TGCAGGTGGT 
CCCCCAGGAG ACATAGCCAG GAGGCCGAGT GGCTGCCGGA GGAGCATGC 

Seq ID NO: 342 Protein sequence 
Protein Accession XP_035292.2 

1 11 21 31 41 51 

I I 1 I I I 

MAGAGPKRRA LAAPAAEEKE EAREKMLAAK SADGSAPAGE GEGVTLQRNI TLLNGVAI IV 60 

GTIIGSGIFV TPTGVLKEAG SPGLALWWA ACGVFSIVGA LCYAELGTTI SKSGGDYAYM 120 

LEVYGSLPAF LKLWIELLII RPSSQYIVAL VFATYLLKPL FPTCPVPEEA AKLVACLCVL 180 

LLTAVNCYSV KAATRVQDAF AAAKLLALAL IILLGFVQIG KGDVSNLDPN FSFEGTKIiDV 240 

GNIVLALYSG LFAYGGWNYL NFVTEEMINP YRNLPLAIII SLPIVTLVYV LTNLAYFTTL 300 

STEQMLSSEA VAVDFGNYHL GVMSWIIPVF VGLSCFGSVN GSLFTSSRLF FVGSREGHLP 360 

SILSMIHPQL LTPVPSLVFT CVMTLLYAFS KDIFSVINFF SFFNWLCVAL AIIGMIWLRH 420 

RKPELERPIK VNLALPVFFI LACLFLIAVS FWKTPVECGI GFTIILSGLP VYFFGVWWKN 480 
KPKWLLQGIF STTVLCQKLM QWPQET 

Seq ID NO: 343 DNA sequence 

Nucleic Acid Accession #: NM_00526B.l 

Coding sequence: 168.. 989 

1 11 21 31 41 51 

III 111 

TAAAAAGCAA AAGAATTCGC GGCCGCGTCG ACACGGGCTT CCCCGAAAAC CTTCCCCGCT 60 

TCTGGATATG AAATTCAAGC TGCTTGCTGA GTCCTATTGC CGGCTGCTGG GAGCCAGGAG 120 

AGCCCTGAGG AGTAGTCACT CAGTAGCAGC TGACGCGTGG GTCCACCATG AACTGGAGTA 180 

TCTTTGAGGG ACTCCTGAGT GGGGTCAACA AGTACTCCAC AGCCTTTGGG CGCATCTGGC 240 

TGTCTCTGGT CTTCATCTTC CGCGTGCTGG TGTACCTGGT GACGGCCGAG CGTGTGTGGA 300 

GTGATGACCA CAAGGACTTC GACTGCAATA CTCGCCAGCC CGGCTGCTCC AACGTCTGCT 360 

TTGATGAGTT CTTCCCTGTG TCCCATGTGC GCCTCTGGGC CCTGCAGCTT ATCCTGGTGA 420 

CATGCCCCTC ACTGCTCGTG GTCATGCAOG TGGCCTACCG GGAGGTTCAG GAGAAGAGGC 480 

ACCGAGAAGC CCATGGGGAG AACAGTGGGC GCCTCTACCT GAACCCCGGC AAGAAGCGGG 540 

GTGGGCTCTG GTGGACATAT GTCTGCAGCC TAGTGTTCAA GGCGAGCGTG GACATCGCCT 600 

TTCTCTATGT GTTCCACTCA TTCTACCCCA AATATATCCT CCCTCCTGTG GTCAAGTGCC 660 

ACGCAGATCC ATGTCCCAAT ATAGTGGACT GCTTCATCTC CAAGCCCTCA GAGAAGAACA 720 

TTTTCACCCT CTTCATGGTG GCCACAGCTG CCATCTGCAT CCTGCTCAAC CTCGTGGAGC 780 

TCATCTACCT GGTGAGCAAG AGATGCCACG AGTGCCTGGC AGCAAGGAAA GCTCAA GCCA 840 

TGTGCACAGG TCATCACCCC CACGGTACCA CCTCTTCCTG CAAACAAGAC GACCTCCTTT 900 

CGGGTGACCT CATCTTTCTG GGCTCAGACA GTCATCCTCC TCTCTTACCA GACCGCCCCC 960 

GAGACCATGT GAAGAAAACC ATCTTGTGAG GGGCTGCCTG GACTGGTCTG GCAGGTTGGG 1020 

CCTGGATGGG GAGGCTCTAG CATCTCTCAT AGGTGCAACC TGAGAGTGGG GGAGCTAAGC 1080 

CATGAGGTAG GGGCAGGCAA GAGAGAGGAT TCAGACGCTC TGGGAGCCAG TTCCTAGTCC 1140 

TCAACTCCAG CCACCTGCCC CAGCTCGACG GCACTGGGCC AGTTCCCCCT CTGCTCTGCA 1200 
GCTCGGTTTC CTTTTCTAGA ATGGAAATAG TGAGGGCCAA TGC 

Seq ID NO: 344 Protein sequence 
Protein Accession ft* NP_005259.1 

1 11 21 31 41 51 

I I I I I I 

MNWSIFEGLL SGVNKYSTAF GRIWI*SLVFI FRVLVYLVTA ERVWSDDHKD FDCNTRQPGC 60 
SNVCFDEFFP VSHVRLWALQ LILVTCPSLL WMHVAYREV QEKRHREAHG ENSGRLYLNP 120 
GKKRGGLWWT YVCSLVFKAS VDIAFLYVFH SFYPKYILPP WKCHADPCP NIVDCFISKP 180 
SEKNIFTLFM VATAAICILL NLVELIYLVS KRCHECLAAR KAQAMCTGHH PHGTTSSCKQ 240 
DDLLSGDLIP LGSDSHPPLL PDRPRDHVKK TIL 

Seq ID NO: 345 DNA sequence 

Nucleic Acid Accession #: NM_002391.1 

Coding sequence: 26.. 457 

1 11 21 31 41 51 

I I I I I I _ c 

CGGGCGAAGC AGCGCGGGCA GCGAGATGCA GCACCGAGGC TTCCTCCTCC TCACCCTCCT 60 

CGCCCTGCTG GCGCTCACCT COGCGGTCGC CAAAAAGAAA GATAAGGTGA AGAAGGGCGG 120 

CCCGGGGAGC GAGTGCGCTG AGTGGGCCTG GGGGCCCTGC ACCCCCAGCA GCAAGGATTG 180 

CGGCGTGGGT TTCCGCGAGG GCACCTGCGG GGCCCAGACC CAGCGCATCC GGTGCAGGGT 240 

GCCCTGCAAC TGGAAGAAGG AGTTT GG AGC CGACTGCAAG TACAAGTTTG AGAACTGGGG 300 

TGCGTGTGAT GGGGGCACAG GCACCAAAGT CCGCCAAGGC ACCCTGAAGA AGGCGCGCTA 360 

CAATGCTCAG TGCCAGGAGA CCATCCGCGT CACCAAGCCC TGCACCCCCA AGACCAAAGC 420 

AAAGGCCAAA GCCAAGAAAG GGAAGGGAAA GGACTAGACG CCAAGCCTGG ATGCCAAGGA 480 

GCCCCTGGTG TCACATGGGG CCTGGCCACG CCCTCCCTCT CCCAGGCCCG AGATGTGACC 540 

CACCAGTGCC TTCTGTCTGC TCGTTAGCTT TAATCAATCA TGCCCTGCCT TGTCCCTCTC 600 

ACTCCCCAGC CCCACCCCTA AGTGCCCAAA GTGGGGAGGG ACAAGGGATT CTGGGAAGCT 660 

TGAGCCTCCC CCAAAGCAAT GTGAGTCCCA GAGCCCGCTT TTGTTCTTCC CCACAATTCC 720 

ATTACTAAGA AACACATCAA ATAAACTGAC TTTTTCCCCC CAATAAAAGC TCTTCTTTTT 780 
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TAATAT 

Seq ID NO: 346 Protein sequence 1 
Protein Accession ft: NP_002382.1 

1 11 21 31 41 51 

I I I I 1 I 

MQHRGFLLLT LLALLALTSA VAKKKDKVKK GGPGSECAEW AWGPCTPSSK DCGVGFREGT 60 
CGAQTQRIRC RVPCNWKKEF GADCKYKFEN WGACDGGTGT KVRQGTLKKA RYNAQCQETI 120 
RVTKPCTPKT KAKAKAKKGK, GKD 

Seq ID NO: 347 DNA sequence 

Nucleic Acid Accession 8: NM_006783.1 

Coding sequence: 1..786 

1 11 21 31 41 51 

I I I I I I 

ATGGATTGGG GGACGCTGCA CACTTTCATC GGGGGTGTCA ACAAACACTC CACCAGCATC 60 
GGGAAGGTGT GGATCACAGT CATCTTTATT TTCCGAGTCA TGATCCTAGT GGTGGCTGCC 120 
CAGGAAGTGT GGGGTGACGA GCAAGAGGAC TTCGTCTGCA ACACACTGCA ACCGGGATGC 180 
AAAAATGTGT GCTATGACCA CTTTTTCCCG GTGTCCCACA TCCGGCTGTG GGCCCTCCAG 240 
CTGATCTTCG TCTCCACCCC AGCGCTGCTG GTGGCCATGC ATGTGGCCTA CTACAGGCAC 300 
GAAACCACTC GCAAGTTCAG GCGAGGAGAG AAGAGGAATG ATTTCAAAGA CATAGAGGAC 360 
ATTAAAAAGC ACAAGGTTCG GATAGAGGGG TCGCTGTGGT GGACGTACAC CAGCAGCATC 420 
TTTTTCCGAA TCATCTTTGA AGCAGCCTTT ATGTATGTGT TTTACTTCCT TTACAATGGG 480 
TACCACCTGC CCTGGGTGTT GAAATGTGGG ATTGACCCCT GCCCCAACCT TGTTGACTGC 54 0 
TTTATTTCTA GGCCAACAGA GAAGACCGTG TTTACCATTT TTATGATTTC TGCGTCTGTG 600 
ATTTGCATGC TGCTTAACGT GGCAGAGTTG TGCTACCTGC TGCTGAAAGT GTGTTTTAGG 660 
AGATCAAAGA GAGCACAGAC GCAAAAAAAT CACCCCAATC ATGCCCTAAA GG AGAG TAAG 720 
CAGAATGAAA TGAATGAGCT GATTTCAGAT AGTGGTCAAA ATGCAATCAC AGGTTTCCCA 780 
AGCTAA 

Seq ID NO: 348 Protein sequence 
Protein Accession ft: NP_006774.1 

1 11 21 31 41 51 

I I I I I I 

MDWGTLHTFI GGVNKHSTSI GKVWITVIFI FRVMILWAA QEVWGDEQED FVCNTLQPGC 60 
KNVCYDHPFP VSHIRLWALQ LIFVSTPALL VAMHVAYYRH ETTRKFRRGE KRNDFKDIED 120 
IKKHKVRIEG SLWWTVTSSI FFRIIFEAAP MYVFYFLYNG YHLPWVLKCG IDPCPNLVDC 180 
FISRPTKKTV FTIFMISASV ICMLLNVAEL CYLLLKVCFR RSKRAQTQKN HPNHALKESK 240 
QNEMNELISD SGQNAITGFP S 

Seq ID NO: 349 DNA sequence 

Nucleic Acid Accession ft: NM_002571.1 

Coding sequence: 99.. 587 

1 11 21 31 41 51 

I I I I I I 

CATCCCTCTG GCTCCAGAGC TCAGAGCCAC CCACAGCCGC AGCCATGCTG TGCCTCCTGC 60 

TCACCCTGGG CGTGGCCCTG GTCTGTGGTG TCCCGGCCAT GGACATCCCC CAGACCAAGC 120 

AGGACCTGGA GCTCCCAAAG TTGGCAGGGA COTGGCACTC CATGGCCATG GCGACCAACA 180 

ACATCTCCCT CATGGOGACA CTGAAGGCCC CTCTGAGGGT CCACATCACC TCACTGTTGC 240 

CCACCCCCGA GGACAACCTG GAGATCGTTC TGCACAGATG GGAGAACAAC AGCTGTGTTG 300 

AGAAGAAGGT CCTTGGAGAG AAGACTGGGA ATCCAAAGAA GTTCAA GATC AACTATACGG 360 

TGGCGAACGA GGCCACGCTG CTCGATACTG ACTACGACAA TTTCCTGTTT CTCTGCCTAC 420 

AGGACACCAC CACCCCCATC CAGAGCATGA TGTGCCAGTA CCTGGCCAGA GTCCTGGTGG 480 

AGGACGATGA GATCATGCAG GGATTCATCA GGGCTTTCAG GCCCCTGCCC AGGCACCTAT 540 

GGTACTTGCT GGACTTGAAA CAGATGGAAG AGCCGTGCCG TTTCTAGCTC ACCTCCGCCT 600 

CCAGGAAGAC CAGACTCCCA CCCTTCCACA CCTCCAGAGC AGTGG GACT T CCTCCTGCCC 660 

TTTCAAAGAA TAACCACAGC TCAGAAGACG ATGACGTGGT CATCTGTGTC GCCATCCCCT 720 

TCCTGCTGCA CACCTGCACC ATTGCCATGG GGAGGCTGCT CCCTGGGGGC AGAGTCTCTG 780 
GCAGAGGTTA TTAATAAACC CTTGGAGCAT G 

Seq ID NO: 350 Protein sequence 
Protein Accession #» NP_002562.l 

1 11 21 31 41 51 

] I I 1 I I 

MDIPQTKQDL ELPKLAGTWH SMAMATNNIS LMATLKAPLR VHITSIiLPTP EDNLEIVLHR 60 
WENNSCVEKK VLGEKTGNPK KFKINYTVAN EATLLDTDYD NFLFLCLQDT TTPIQSMMCQ 120 
YLARVLVEDD EIMQGFIRAF RPLPRHLWYL LDLKQMEEPC RF 

Seq ID NO: 351 DNA sequence 

Nucleic Acid Accession #: NM_006500.1 

Coding sequence: 27.. 1967 

1 11 21 31 41 51 

I I I 1 I I 

ACTTGCGTCT CGCCCTCCGG CCAAGCATGG GGCTTCCCAG GCTGGTCTGC GCCTTCTTGC 60 
TCGCCGCCTG CTGCTGCTGT CCTCGCGTCG CGGGTGTGCC CGGAGAGGCT GAGCAGCCTG 120 
CGCCTGAGCT GGTGGAGGTG GAAGTGGGCA GCACAGCCCT TCTGAAGTGC GGCCTCTCCC 180 
AGTCCCAAGG CAACCTCAGC CATGTCGACT GGTTTTCTGT CCACAAGGAG AAGCGGACGC 240 
TCATCTTCCG TGTGCGCCAG GGCCAGGGCC AGAGCGAACC TGGGGAGTAC GAGCAGCGGC 300 
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TCAGCCTCCA GGACAGAGGG GCTACTCTGG CCCTGACTCA AGTCACCCCC CAAGACGAGC 360 
GCATCTTCTT GTGCCAGGGC AAGCGCCCTC GGTCCCAGGA GTACCGCATC CAGCTCCGCG 420 
TCTACAAAGC TCCGGAGGAG CCAAACATCC AGGTCAACCC CCTGGGCATC CCTGTGAACA 480 
GTAAGGAGCC TGAGGAGGTC GCTACCTGTG TAGGGAGGAA CGGGTACCCC ATTCCTCAAG 540 
5 TCATCTGGTA CAAGAATGGC CGGCCTCTGA AGGAGGAGAA GAACCGGGTC CACATTCAGT 600 
CGTCCCAGAC TGTGGAGTCG AGTGGTTTGT ACACCTTGCA GAGTATTCTG AAGGCACAGC 660 
TGGTTAAAGA AGACAAAGAT GCCCAGTTTT ACTGTGAGCT CAACTACCGG CTGCCCAGTG 720 
GGAACCACAT GAAGGAGTCC AGGGAAGTCA CCGTCCCTGT TTTCTACCCG ACAGAAAAAG 780 
TGTGGCTGGA AGTGGAGCCC GTGGGAATGC TGAAGGAAGG GGACCGCGTG GAAATCAGGT 840 
10 GTTTGGCTGA TGGCAACCCT CCACCACACT TCAGCATCAG CAAGCAGAAC CCCAGCACCA 900 
GGGAGGCAGA GGAAGAGACA ACCAACGACA ACGGGGTCCT GGTGCTGGAG CCTGCCCGGA 960 
AGGAACACAG TGGGCGCTAT GAATGTCAGG CCTGGAACTT GGACACCATG ATATCGCTGC 1020 
TGAGTGAACC ACAGGAACTA CTGGTGAACT ATGTGTCTGA CGTCCGAGTG AGTCCCGCAG 1080 
CCCCTGAGAG ACAGGAAGGC AGCAGCCTCA CCCTGACCTG TGAGGCAGAG AGTAGCCAGG 1140 
15 ACCTCGAGTT CCAGTGGCTG AGAGAAGAGA CAGACCAGGT GCTGGAAAGG GGGCCTGTGC 1200 
TTCAGTTGCA TGACCTGAAA CGGGAGGCAG GAGGCGGCTA TCGCTGOGTG GCGTCTGTGC 1260 
CCAGCATACC CGGCCTGAAC CGCACACAGC TGGTCAAGCT GGCCATTTTT GGCCCCCCTT 1320 
GGATGGCATT CAAGGAGAGG AAGGTGTGGG TGAAAGAGAA TATGGTGTTG AATCTGTCTT 1380 
GTGAAGCGTC AGGGCACCCC CGGCCCACCA TCTCCTGGAA CGTCAACGGC ACGGCAAGTG 1440 
20 AACAAGACCA AGATCCACAG CGAGTCCTGA GCACCCTGAA TGTCCTCGTG ACCCCGGAGC 1500 
TGTTGGAGAC AGGTGTTGAA TGCACGGCCT CCAACGACCT GGGCAAAAAC ACCAGCATCC 1560 
TCTTCCTGGA GCTGGTCAAT TTAACCACCC TCACACCAGA CTCCAACACA ACCACTGGCC 1620 
TCAGCACTTC CACTGCCAGT CCTCATACCA GAGCCAACAG CACCTCCACA GAGAGAAAGC 1680 
TGCCGGAGCC GGAGAGCCGG GGCGTGGTCA TCGTGGCTGT GATTGTGTGC ATCCTGGTCC 1740 
25 TGGCGGTGCT GGGCGCTGTC CTCTATTTCC TCTATAAGAA GGGCAAGCTG CCGTGCAGGC 1800 
GCTCAGGGAA GCAGGAGATC ACGCTGCCCC CGTCTCGTAA GACCGAACTT GTAGTTGAAG 1860 
TTAAGTCAGA TAAGCTCCCA GAAGAGATGG GCCTCCTGCA GGGCAGCAGC GGTGACAAGA 1920 
GGGCTCCGGG AGACCAGGGA GAGAAATACA TCGATCTGAG GCATTAGCCC CGAATCACTT 1980 
CAGCTCCCTT CCCTGCCTGG ACCATTCCCA GCTCCCTGCT CACTCTTCTC TCAGCCAAAG 2040 
30 CCTCCAAAGG GACTAGAGAG AAGCCTCCTG CTCCCCTCAC CTGCACACCC CCTTTCAGAG 2100 
GGCCACTGGG TTAGGACCTG AGGACCTCAC TTGGCCCTGC AAGCCGCTTT TCAGGGACCA 2160 
GTCCACCACC ATCTCCTCCA CGTTGAGTGA AGCTCATCCC AAGCAAGGAG CCCCAGTCTC 2220 
CCGAGCGGGT AGGAGAGTTT CTTGCAGAAC GTGTTTTTTC TTTACACACA TTATGGCTGT 2280 
AAATACCTGG CTCCTGCCAG CAGCTGAGCT GGGTAGCCTC TCTGAGCTGG TTTCCTGCCC 2340 
35 CAAAGGCTGG CTTCCACCAT CCAGGTGCAC CACTGAAGTG AGGACACACC GGAGCCAGGC 2400 
GCCTGCTCAT GTTGAAGTGC GCTGTTCACA CCCGCTCCGG AGAGCACCCC AGCGGCATCC 2460 
AGAAGCAGCT GCAGTGTTGC TGCCACCACC CTCCTGCTCG CCTCTTCAAA GTCTCCTGTG 2520 
ACATTTTTTC TTTGGTCAGA AGCCAGGAAC TGGTGTCATT CCTTAAAAGA TACGTGCCGG 2580 
GGCCAGGTGT GGTGGCTCAC GCCTGTAATC CCAGCACTTT GGGAGGCCGA GGCGGGCGGA 2640 
40 TCACAAAGTC AGGACGAGAC CATCCTGGCT AACACGGTGA AACCCTGTCT CTACTAAAAA 2700 
TACAAAAAAA AATTAGCTAG GCGTAGTGGT TGGCACCTAT AGTCCCAGCT ACTCGGAAGG 2760 
CTGAAGCAGG AGAATGGTAT GAATCCAGGA GGTGGAGCTT GCAGTGAGCC GAGACCGTGC 2820 
CACTGCACTC CAGCCTGGGC AACACAGCGA GACTCCGTCT CGAGGAAAAA AAAAGAAAAG 2880 
ACGCGTACCT GCGGTGAGGA AGCTGGGCGC TGTTTTCGAG TTCAGGTGAA TTAGCCTCAA 2940 
45 TCCCCGTGTT CACTTGCTCC CATAGCCCTC TTGATGGATC ACGTAAAACT GAAAGGCAGC 3000 
GGGGAGCAGA CAAAGATGAG GTCTACACTG TCCTTCATGG GGATTAAAGC TATGGTTATA 3060 
TTAGCACCAA ACTTCTACAA ACCAAGCTCA GGGCCCCAAC CCTAGAAGGG CCCAAATGAG 3120 
AGAATGGTAC TTAGGGATGG AAAACGGGGC CTGGCTAGAG CTTCGGGTGT GTGTGTCTGT 3180 
CTGTGTGTAT GCATACATAT GTGTGTATAT ATGGTTTTGT CAGGTGTGTA AATTTGCAAA 3240 
50 TTGTTTCCTT TATATATGTA TGTATATATA TATATGAAAA TATATATATA TATGAAAAAT 3300 
AAAGCTTAAT TGTCCCAGAA AATCATACAT TGCTTTTTTA TTCTACATGG GTACCACAGG 3360 
AACCTGGGGG CCTGTGAAAC TACAACCAAA AGGCACACAA AACCGTTTCC AGTTGGCAGC 3420 
AGAGATCAGG GGTTACCTCT GCTTCTGAGC AAATGGCTCA AGCTCTACCA GAGCAGACAG 3480 
CTACCCTACT TTTCAGCAGC AAAACGTCCC GTATGACGCA GCACGAAGGG CCTGGCAGGC 3540 
55 TGTTAGCAGG AGCTATGTCC CTTCCTATCG TTTCCGTCCA CTT 

Seq ID NO: 352 Protein sequence 
Protein AcceGsion #: NP__006491.1 

60 1 11 21 31 41 51 

! I I I I I 

GLPRLVCAFL LAACCCCPRV AGVPGBAEQP APELVEVEVG STALLKCGLS QSQGNLSHVD 60 

WPSVHKEKRT LIFRVRQGQG QSEPGEYEQR LSLQDRGATL ALTQVTPQDE RIFLCQGKRP 120 

RSQEYRIQLR VYKAPBEPNI QVNPLGIPVN SKEPEEVATC VGRNGYPIPQ VIWYKNGRPL 180 

65 KEEKNRVHIQ SSQTVKSSGL YTLQSILKAQ LVKEDKDAQF YCELNYRLPS GNHMKESREV 240 

TVPVFYPTEK VWLEVEPVGM LKEGDRVEIR CLADGNPPPH FSISKQNPST REAEEETTND 300 

NGVLVLEPAR KEHSGRYECQ AWNUDTMISL LSEPQELLVN YVSDVRVSPA APERQEGSSL 360 

TLTCEAESSQ DLEFQWLREE TDQVLERGPV LQLHDLKREA GGGYRCVASV PSIPGLNRTQ 420 

LVKLAIFGPP WMAFKERKVW VKENMVLNLS CEASGHPRPT ISWNV NGTA S EQDQDPQRVL 480 

70 STLNVLVTPE LLBTGVECTA SNDLGKNTSI LFLELVNLTT LTPDSNTTTG LSTSTASPHT 540 

RANSTSTERK LPEPESRGW IVAVIVCILV LAVLGAVLYF LYKKGKLPCR RSGKQBITLP 600 
PSRKTELWB VKSDKLPEEM GLLQGSSGDK RAPGDQGEKY IDLRH 

Seq ID NO: 353 DNA sequence 
75 Nucleic Acid Accession #: NM_0031B3.3 
Coding sequence: 165.. 2 63 9 

1 ii 21 31 41 51 

80 TCGAGCCTGG CGGTAGAATC TTCCCAGTAG GCGGCGCGGG AGGGAAAAGA GGATTGAGGG 60 

GCTAGGCCGG GCGGATCCCG TCCTCCCCCG ATGTGAGCAG TTTTCCGAAA CCCCGTCAGG 120 

CGAAGGCTGC CCAGAGAGGT GGAGTCGGTA GCGGGGCCGG GAACATGAGG CAGTCTCTCC 180 

TATTCCTGAC CAGCGTGGTT CCTTTCGTGC TGGCGCCGCG ACCTCCGGAT GACCCGGGCT 240 

TCGGCCCCCA CCAGAGACTC GAGAAGCTTG ATTCTTTGCT CTCAGACTAC GATATTCTCT 300 
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CTTTATCTAA 
TAGAAACACT 
GTACTGAACG 
AGTACACTGC 
GGGTTCTAGC 
AATATAACAT 
TTTATAAATC 
ATTTAAAAGT 
AAGAGCTTGT 
AATTATTGGT 
CAACTACAAA 
CATGGGATAA 
AGTCTCCACA 
ATGAAGAAAA 
CTGAGGAAGC 
GAACTCTTGG 
CAAAGGCTTA 
GCACAAAGAA 
AATTGGGACA 
ATGAGGACCA 
ACAATAAGAT 
CCCAGGAGTG 
GAGAAGAGTG 
GCACGTTGAA 
AGTTTGAGAC 
CCTACTGCAC 
TTTGCTTGGA 
AGCAGCTGGA 
ACCTTTCTGG 
AAGGAAAGCC 
AGGATGTAAT 
AGTTTTTAGC 
CTTTCAGCAT 
CTCTGTTTCA 
TTATCAAACC 
CTTCGGOGCC 
CCAGCACAGA 
GCACAGCTGC 
CTGCCTCCTT 
TTAGTTCTCA 
TCAATCACAG 
TTTGAACTTC 
TAAGGTGAAA 
TTGACCTGTG 
TGTAAATTAA 
ATACACTGTA 
ATAGTTTTTT 
GTATACATTG 
CAGGGAAAAT 
TAAGACTAAT 
TTGAATTTAT 
TTTTATAAGT 
CTATTGGCTG 
CGCCACTACA 



TATCCAGCAG 
ACTAACTTTT 
TTTTTCACAA 
AAAATGGCAG 
CCACATAAGA 
AGAGCCACTT 
TGAAGATATC 
GGATAATGAA 
TCATCGAGTG 
GGTAGCAGAT 
TTACTTAATA 
TGCAGGTTTT 
AGAGGTAAAA 
GGATGCTTGG 
ATCTAAAGTT 
ATTAGCTTAT 
TTATAGCCCA 
TTATGGTAAA 
TAATTTTGGA 
GGGAGGGAAA 
GTTTTCAAAC 
TTTTCAAGAA 
TGATCCTGGC 
GGAAGGTGTC 
TGCCCAGAAG 
AGGTAATAGC 
TCTTGGCAAG 
GTCCTGTGCA 
CCGCTGTGTG 
CTGTACAGTA 
TGAACGATTT 
AGACAACATC 
TCTTGTCCAT 
CCCCAGTAAC 
CTTTCCTGCG 
AGCAGCTCCA 
CTCCCATATG 
CAAGTCATTT 
TAAACTGCAG 
GCTCTTCTGA 
CTTGTATTTT 
CTGCAGGTAA 
GTGAATCTAC 
GTGCAAAAGC 
TCTTTTATAT 
ATGAACCGCT 
TTTTTTTTTX 
TATCTAAATT 
ATATATCTAA 
GCTCAGAACC 
GGAATCTACC 
ATCTTCATGT 
GGAGTGGTGG 
CTCCAGCCTG 



CATTCGGTAA 
TCAGCTTTGA 
AATTTCAAGG 
GACTTCTTCA 
GATGATGATG 
TGGAGATTTG 
AAGAATGTTT 
GAGTTGCTCC 
AAAAGAAGAG 
CATCGCTTCT 
GAGCTAATTG 
AAAGGCTATG 
CCTGGTGAAA 
GATGTGAAGA 
TGCTTGGCAC 
GTTGGCTCTC 
GTTGGGAAGA 
ACCATCCTTA 
GCAGAACATG 
TATGTCATGT 
TGCAGTAAAC 
CGCAGCAATA 
ATCATGTATC 
CAGTGCAGTG 
AAGTGCCAGG 
AGTGAGTGCC 
TGTAAGGATG 
TGTAATGAAA 
CCCTATGTCG 
GGATTTTGTG 
TGGGATTTCA 
GTTGGGTCTG 
TGTGTGGATA 
GTCGAAATGC 
CCCCAGACTC 
AAACTGGACC 
GACGAGGATG 
GAGGATCTCA 
CGTCAGAATC 
CTTAAGTGTG 
GTGAAGACTG 
ACAGTTCTTG 
TTATTTTGAG 
AGAAAATACA 
TGATAACAGC 
GAATATGAAG 
TTTTTTTTGC 
GTGGGTCTAT 
ATTTAGAAAT 
TAACCACTAC 
AACTGTTTAG 
ATCCCTGTTA 
CTCATGCCTG 
GGTGACAGAG 



GAAAAAGAGA 
AAAGGCATTT 
TCGTGGTGGT 
CTGGACACGT 
TTATAATCAG 
TTAATGATAC 
CACGTTTGCA 
CAAAAGGGTT 
CTGACCCAGA 
ACAGATACAT 
ACAGAGTTGA 
GAATACAGAT 
AGCACTACAA 
TGTTGCTAGA 
ACCTTTTCAC 
CCAGAGCAAA 
AAAATATCTA 
CAAAGGAAGC 
ATCCGGATGG 
ATCCCATAGC 
AATCAATCTA 
AAGTTTGTGG 
TGAACAACGA 
ACAGGAACAG 
AGGCGATTAA 
CGCCTCCAGG 
GGAAATGCAT 
CTGACAACTC 
ATGCTGAACA 
ACATGAATGG 
TTGACCAGCT 
TCCTGGTTTT 
AGAAATTGGA 
TGAGCAGCAT 
CAGGCCGCCT 
ACCAGAGAAT 
GGTTTGAGAA 
CGGACCATCC 
GTGTTAACAG 
CAAAATATTT 
GGAAGTGACT 
TGTGGTTTGG 
GCTTTCAGGT 
GCTGGATTGG 
ACTGACTAGG 
CATTTGGCAT 
CTTCAACTAA 
TTCTAGTTAT 
CATTTGGGTT 
CTTACAGTGA 
GGCCCTGATT 
CTGATAGGGA 
TAATCCCAGC 
TGAGATCTGC 



TCTACAGACT 
TAAATTATAC 
GGATGGTAAA 
GGTTGGTGAG 
AATCAACACA 
CAAAGACAAA 
GTCTCCAAAA 
AGTAGACAGA 
TCCCATGAAG 
GGGCAGAGGG 
TGACATCTAT 
AGAGCAGATT 
CATGGCAAAA 
GCAATTTAGC 
ATACCAAGAT 
CAGCCATGGA 
TTTGAATAGT 
TGACCTGGTT 
TCTAGCAGAA 
TGTGAGTGGC 
TAAGACCATT 
GAACTCGAGG 
CACCTGCTGC 
TCCTTGCTGT 
TGCTACTTGC 
AAATGCTGAA 
CCCTTTCTGC 
CTGCAAGGTG 
AAAGAACTTA 
CAAATGTGAG 
GAGCATCAAT 
CTCCTTGATA 
TAAACAGTAT 
GGATTCTGCA 
GCAGCCTGCC 
GGACACCATC 
GGACCCCTTC 
GGTCGCCAGA 
CAAAGAAACA 
TTATAGATTT 
TAGCAGATGC 
CCCTTCTCCT 
TTTAGTTTTT 
GTTATGAATA 
GAAATGATCA 
TTATTTGTGA 
AAACAAAGGA 
TACCCAGAGT 
AATATGGCTC 
GGGCTATACA 
TGCTGGGCAG 
TACATGTCTT 
ACTTGGAGAG 
CTC 




GGTTTGACGA 
ACAACTCATG 
TGTGCCCCGA 
GATCACGAGA 
GAAAGTAAGG 
GTGGATGAAG 
AACAGCGACT 
AAAAACTGTC 
AAAGGCGTGT 
AATGACACTG 
GAGAGGGAAC 
TGCTGCAGGG 
TTTTTGAGGA 
AAACGAGTAC 
ACTTTTGGAA 
TTTTGGATTC 
GAATCTCTGT 
TCGGTTCGCA 
CCTGTGATCC 
CAGGAAGACC 
CCAAATAGCA 
AGTGAAAAGG 
GAGTGCTAAT 
GACCTACAAA 
TGGTCATGTG 
TTTGAAAAGG 
AAAATATCTT 
TTTAOGTTTT 

g t t ttt t tt t 

GAAAAGTGGA 
GATAAATTTA 
TTTTATGTAG 
TTCATAATTC 
TGGTAGCCAG 
XTTTTCTGTA 
AGAAAATTCA 
GCTGAGGTTG 



360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 



Seq ID NO: 354 Protein sequence 
Protein Accession #: NP_J)03174.2 



1 
I 

MRQSLLFLTS 
QTSTHVETLL 
GEPDSRVLAH 
PKVCGYLKVD 
RGEESTTTNY 
AKSYPNEEKD 
HGGVCPKAYY 
AECAPNKDQG 
SRVDEGEECD 
TCKGVSYCTG 
KVCCRDLSGR 
XNTFGKFLAD 
SASVRIIKPF 
PPPNSSTAAK 



11 
I 

WPFVIAPRP 
TPSALKRHFK 
IRDDDVIIRI 
NEELLPKGLV 
LIELIDRVDD 
AWDVKMLLEQ 
SPVGKKNIYI, 
GKYVMYPIAV 
PGIMYLNNDT 
NSSECPPPGN 
CVPYVDAEQK 
NIVGSVLVFS 
PAPQTPGRLQ 
SFEDLTDHPV 



21 
I 

PDDPGFGPHQ 
LYLTSSTERP 
NTDGAEYNIE 
DREPPEELVH 
IYRNTSWDNA 
PSPDIAEEA8 
NSGLTSTKNY 



CCNSDCTLKE 
AENDTVCLDL 
NLFLRKGKPC 
LIFVJIPFSIL 
PAPVIPSAPA 
ARSEKAASFK 



31 
I 

RLEKLDSLLS 
SQNFKWWD 
PLWRFVNDTK 
KVKRRADPDP 
GFKGYGIQIE 
KVCLAHLPTY 
GKTILTKEAD 
SNCSKQSIYK 
GVQCSDRNSP 
GKCKDGKCIP 
TVGFCDMNGK 
VHCVDKKLDK 
APKLDHQRMD 
LQRQNRVNSK 



DYDILSLSNI 
GKNESBYTAK 
DKRMLVYKSE 
MKNTCKLLW 
QIRILKSPQE 
QDFDMGTLGL 
LVTTHELGHN 
TIESKAQECF 
CCKNCQFETA 
FCEREQQLES 
CEKRVQDVIE 
QYESLSLPHP 
TIQEDPSTDS 
ETEC 



51 
I 

QQBSVRKRDL 
WQDFPTGHW 
DIKNVSRLQS 
ADHRFYRVMG 
VKPGEKHYNM 
AYVGSPRANS 
FGAEHDPDGL 
QERSNKVCGN 
QKKCQEAINA 
CACNETDNSC 
RFWDPIDQLS 
SNVEMLSSMD 
HMDEDGFEKD 



Seq ID NO: 355 DNA sequence 

Nucleic Acid Accession #; NM_021832.1 

Coding sequence: 164.. 2248 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



41 



51 



1 11 21 31 

I 1 I I I I 

TCGAGCCTGG CGGTAGAATC TTCCCAGTAG GCGGOGCGGG AGGAAAAGAG GATTGAGGGG 
CTAGGCCGGG CGGATCCCGT CCTCCCCCGA TGTGAGCAGT TTTCCGAAAC CCCGTCAGGC 
GAAGGCTGCC CAGAGAGGTG GAGTCGGTAG CGGGGCCGGG AACATGAGGC AGTCTCTCCT 

1088 



60 
120 
180 



WO 03/042661 



PCT/US02/36810 



ATTCCTGACC AGCGTGGTTC CTTTCGTGCT GGCGCCGCGA CCTCCGGATG ACCCGGGCTT 240 

CGGCCCCCAC CAGAGACTCG AGAAGCTTGA TTCTTTGCTC TCAGACTACG ATATTCTCTC 300 

TTTATCTAAT ATCCAGCAGC ATTCGGTAAG AAAAAGAGAT CTACAGACTT CAACACATGT 360 

AGAAACACTA CTAACTTTTT CAGCTTTGAA AAGGCATTTT AAATTATACC TGACATCAAG 420 

5 TACTGAACGT TTTTCACAAA ATTTCAAGGT CGTGGTGGTG GATGGTAAAA ACGAAAGCGA 480 

GTACACTGTA AAATGGCAGG ACTTCTTCAC TGGACACGTG GTTGGTGAGC CTGACTCTAG 540 

GGTTCTAGCC CACATAAGAG ATGATGATGT TATAATCAGA ATCAACACAG ATGGGGCCGA 600 

ATATAACATA GAGCCACTTT GGAGATTTGT TAATGATACC AAAGACAAAA GAATGTTAGT 660 

TTATAAATCT GAAGATATCA AGAATGTTTC ACGTTTGCAG TCTCCAAAAG TGTGTGGTTA 720 

10 TTTAAAAGTG GATAATGAAG AGTTGCTCCC AAAAGGGTTA GTAGACAGAG AACCACCTGA 780 

AGAGCTTGTT CATCGAGTGA AAAGAAGAGC TGACCCAGAT CCCATGAAGA ACACGTGTAA 840 

ATTATTGGTG GTAGCAGATC ATCGCTTCTA CAGATACATG GGCAGAGGGG AAGAGAGTAC 900 

AACTACAAAT TACTTAATAG AGCTAATTGA CAGAGTTGAT GACATCTATC GGAACACTTC 960 

ATGGGATAAT GCAGGTTTTA AAGGCTATGG AATACAGATA GAGCAGATTC GCATTCTCAA 1020 

15 GTCTCCACAA GAGGTAAAAC CTGGTGAAAA GCACTACAAC ATGGCAAAAA GTTACCCAAA 1080 

TGAAGAAAAG GATGCTTGGG ATGTGAAGAT GTTGCTAGAG CAATTTAGCT TTGATATAGC 1140 

TGAGGAAGCA TCTAAAGTTT GCTTGGCACA CCTTTTCACA TACCAAGATT TTGATATGGG 1200 

AACTCTTGGA TTAGCTTATG TTGGCTCTCC CAGAGCAAAC AGCCATGGAG GTGTTTGTCC 1260 

AAAGGCTTAT TATAGCCCAG TTGGGAAGAA AAATATCTAT TTGAATAGTG GTTTGACGAG 1320 

20 CACAAAGAAT TATGGTAAAA CCATCCTTAC AAAGGAAGCT GACCTGGTTA CAACTCATGA 1380 

ATTGGGACAT AATTTTGGAG CAGAACATGA TCCGGATGGT CTAGCAGAAT GTGCCCCGAA 1440 

TGAGGACCAG GGAGGGAAAT ATGTCATGTA TCCCATAGCT GTGAGTGGCG ATCACGAGAA 1S00 

CAATAAGATG TTTTCAAACT GCAGTAAACA ATCAATCTAT AAGACCATTG AAAGTAAGGC 1560 

CCAGGAGTGT TTTCAAGAAC GCAGCAATAA AGTTTGTGGG AACTCGAGGG TGGATGAAGG 1620 

25 AGAAGAGTGT GATCCTGGCA TCATGTATCT GAACAACGAC ACCTGCTGCA ACAGCGACTG 1680 

CACGTTGAAG GAAGGTGTCC AGTGCAGTGA CAGGAACAGT CCTTGCTGTA AAAACTGTCA 1740 

GTTTGAGACT GCCCAGAAGA AGTGCCAGGA GGCGATTAAT GCTACTTGCA AAGGCGTGTC 1800 

CTACTGCACA GGTAATAGCA GTGAGTGCCC GCCTCCAGGA A ATGC TGAAG ATGACACTGT 1860 

TTGCTTGGAT CTTGGCAAGT GTAAGGATGG GAAATGCATC CCTTTCTGCG AGAGGGAACA 1920 

30 GCAGCTGGAG TCCTGTGCAT GTAATGAAAC TGACAACTCC TGCAAGGTGT GCTGCAGGGA 1980 

CCTTTCCGGC CGCTGTGTGC CCTATGTCGA TGCTGAACAA AAGAACTTAT TTTTGAGGAA 2040 

AGGAAAGCCC TGTACAGTAG GATTTTGTGA CATGAATGGC AAATGTGAGA AACGAGTACA 2100 

GGATGTAATT GAACGATTTT GGGATTTCAT TGACCAGCTG AGCATCAATA CTTTTGGAAA 2160 

GTTTTTAGCA GACAACATCG TTGGGTCTGT CCTGGTTTTC TCCTTGATAT TTTGGATTCC 2220 

35 TTTCAGCATT CTTGTCCATT GTGTGTAACG TCGAAATGCT GAGCAGCATG GATTCTGCAT 2280 

CGGTTCGCAT TATCAAACCC TTTCCTGCGC CCCAGACTCC AGGCCGCCTG CAGCCTGCCC 2340 

CTGTGATCCC TTCGGCGCCA GCAGCTCCAA AACTGGACCA CCAGAGAATG GACA CCAT CC 2400 

AGGAAGACCC CAGCACAGAC TCACATATGG ACGAGGATGG GTTTGAGAAG GACCCCTTCC 2460 

CAAATAGCAG CACAGCTCCC AAGTCATTTG AGGATCTCAC GGACCATCCG GTCACCAGAA 2S20 

40 GTGAAAAGGC TGCCTCCTTT AAACTGCAGC GTCAGAATCG TGTTGACAGC AAAGAAACAG 2580 

AGTGCTAATT TAGTTCTCAG CTCTTCTGAC TTAAGTGTGC AAAATATTTT TATAGATTTG 2640 

ACCTACAATC AATCACAGCT TATATTTTGT GAAGACTGGG AAGTGACTTA GCAGA TGCTG 2700 

GTCATGTGTT TGAACTTCCT GCAGGTAAAC AGTTCTTGTG TGGTTTGGCC CTTCTCCTTT 2760 

TGAAAAGGTA AGGTGAAGGT GAATCTAGCT TATTTTGAGG CTTTCAGGTT TTAGTTTTTA 2820 

45 AAATATCTTT TGACCTGTGG TGCAAAAGCA GAAAATACAG CTGGATTGGG TTATGAGTAT 2880 

TTACGTTTTT GTAAATTAAT CTTTTATATT GATAACAGGC ACTGACTAGG GAAATGATCA 2940 

GTTTTTTTTT ATACACTGTA ATGAACCGCT GAATATGAAG CATTTGGCAT TTATTTGTGA 3000 

GAAAAGTGGA ATAGTTTTTT TTTTTTTTTT TTTTTTTTGC CTTCAACTAA AAACAAAGGA 3060 
GATAAATTTA GTATACATTG TATCTAAATT GTGGGTCTAT TTCTAGTTAT TACCCAGAGT 3120 
50 TTTTATGTAG CAGGGAAAAT ATATATCTAA ATTTAGAAAT CATTTGGGTT AATATGGCTC 3180 
TTCATAATTC TAAGACTAAT GCTCAGAACC TAACCACTAC CTTACAGTGA GGGCTATACA 3240 
TGGTAGCCAG TTGAATTTAT GGAATCTACC AACTGTTTAG GGCCCTGATT TGCTGGGCAG 3300 
TTTTTCTGTA TTTTATAAGT AT CTTCATGT ATCCCTGTTA CTGATAGGGA TACATGTCTT 3360 
AGAAAATTCA CTATTGGCTG GGAGTGGTGG CTCATGCCTG TAATCCCAGC ACTTGGAGAG 3420 
55 3421 GCTGAGGTTG CGCCACIACA CTCCAGCCTG GGTGACAGAG TGAGATCTGC CTC 



Seq ID NO: 356 Protein sequence 
Protein Accession #: NP_068604.1 



60 
65 
70 
75 
80 



i 
I 

MRQSLLFLTS 
QTSTHVETLL 
GEPDSRVLAH 
PKVCGYLKVD 
RGEBSTTTNY 
AKSYPNEEKD 
HGGVCPKAYY 
AECAPNEDQG 
SRVDEGEECD 
TCKGVSYCTG 
KVCCRDLSGR 
INTFGKFLAD 



11 
I 

WPFVLAPRP 
TFSALKRHFK 
IRDDDVIIRI 
NEELLPKGLV 
LIELIDRVDD 
AWDVKMLL.EQ 
SPVGKKNIYL 
GKYVMYPIAV 
PGIMYLNNDT 
NSSECPPPGN 
CVPYVDAEQK 
NIVGSVLVPS 



21 
I 

PDDPGPGPHO 
LYLTSSTERF 
NTDGAEYNIE 
DREPPEELVH 
IYRNTSWDNA 
FSFDIAEEAS 
NSGLTSTKNY 



CCNSDCTLKE 
AEDDTVCLDL 
NLPLRKGKPC 
LIFWIPPSIL 



31 
I 

RLEKLDSLLS 
SQNFKVWVD 
PLWRFVNDTK 
RVKRRADPDP 
GFKGYGIQIE 
KVCLAHLPTY 
GKTILTKEAD 
SNCSKQSIYK 
GVQCSDRNSP 
GKCKDGKCIP 
TVGFCDMNGK 
VHCV 



41 
I 

DYDILSLSNI 
GKNESEYTVK 
DKRMLVYKSE 
MKNTCKLLW 
QIRILKSPQE 
QDFDMGTLGL 
LVTTHELGHN 
TIESKAQECF 
CCKNOQFETA 
FCEREQQLES 
CEKRVQDVIE 



Seq ID NO: 357 DNA sequence 

Nucleic Acid Accession #: NM_004994. 

Coding sequence: 20. .2143 



51 
1 

QQHSVRKRDL 
WQDPFTGHW 
DIKNVSRLQS 
ADHRFYRYMG 
VKPGEKHYNM 
AYVGSPRANS 
FGAEHDPDGL 
QERSNKVCGN 
QKKCQEAINA 
CACNETDNSC 
RFWDFIDQLS 



51 



1 11 21 31 41 

| ) 

AGACACCTCT GCCCTCACCA TGAGCCTCTG GCAGCCCCTG GTCCTGGTGC TCCTGGTGCT 
GGGCTGCTGC TTTGCTGCCC CCAGACAGCG CCAGTCCACC CTTGTGCTCT TCCCTGGAGA 
CCTGAGAACC AATCTCACCG ACAGGCAGCT GGCAGAGGAA TACCTGTACC GCTATGGTTA 
CACTCGGGTG GCAGAGATGC GTGGAGAGTC GAAATCTCTG GGGCCTGCGC TGCTGCTTCT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



60 
120 
180 
240 



1089 



WO 03/042661 



PCTAJS02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



CCAGAAGCAA 
GCGAACCCCA 
CAAGTGGCAC 
GGCGGTGATT 
CACCTTCACT 
GCACGGAGAC 
TGGCCCCGGC 
GGGCGTCGTG 
CATCTTCGAG 
CTGGTGCAGT 
GAGACTCTAC 
CCAAGGCCAA 
CGCCACCACC 
CTCGACGGTG 
GGGTAAGGAG 
TACCACCTCG 
TTTGTTCCTC 
GCCGGAGGCG 
CGACGTGAAT 
AACCACCACC 
TGTCCACCCC 
AGGTCCCCCC 
TGCCTGCAAC 
CAAGGATGGG 
CCTTATCGCC 
GCTCTCCAAG 
GGTGCTGGGC 
CGGGGCCCTC 
GTTCGACGTG 
CCCCGGGGTG 
CCAGGACCGC 
GGGCTACGTG 
GCAGTGCCAT 
CAAACTGGTA 
TCACCTTTGT 



CTGTCCCTGC 
CGGTGCGGGG 
CACCACAACA 
GACGACGCCT 
OGCGTGTACA 
GGGTATCCCT 
ATTCAGGGAG 
GTTCCAACTC 
GGCCGCTCCT 
ACCACGGCCA 
ACCCGGGACG 
TCCTACTCCG 
GCCAACTACG 
ATGGGGGGCA 
TACTCGACCT 
AACTTTGACA 
GTGGCGGCGC 
CTCATGTACC 
GGCATCCGGC 
ACACCGCAGC 
TCAGAGCGCC 
ACTGCTGGCC 
GTGAACATCT 
AAGTACTGGC 
GACAAGTGGC 
AAGCTTTTCT 
CCGAGGCGTC 
CGGAGTGGCA 
AAGGCGCAGA 
CCTTTGGACA 
TTCTACTGGC 
ACCTATGACA 
GTAAATCCCC 
TTCTGTTCTG 
TTTTTGTT GG 



CCGAGACCGG 
TCCCAGACCT 
TCACCTATTG 
TTGCCCGCGC 
GCCGGGACGC 
TCGACGGGAA 
ACGCCCATTT 
GGTTTGGAAA 
ACTCTGCCTG 
ACTACGACAC 
GCAATGCTGA 
CCTGCACCAC 
ACCGGGACAA 
ACTCGGCGGG 
GTACCAGCGA 
GCGACAAGAA 
ATGAGTTCGG 
CTATGTACCG 
ACCTCTATGG 
CCACGGCTCC 
CCACAGCTGG 
CTTCTACGGC 
TCGACGCCAT 
GATTCTCTGA 
CCGCGCTGCC 
TCTTCTCTGG 
TGGACAAGCT 
GGGGGAAGAT 
TGGTGGATCC 
CGCACGACGT 
GCGTGAGTTC 
TCCTGCAGTG 
ACTGGGACCA 
GAGGAAAGGG 
AGTGTTTCTA 



TGAGCTGGAT 
GGGCAGATTC 
GATCCAAAAC 
CTTCGCACTG 
AGACATCGTC 
GGACGGGCTC 
CGACGATGAC 
CGCAGATGGC 
CACCACCGAC 
CGACGACCGG 
TGGGAAACCC 
GGACGGTCGC 
GCTCTTCGGC 
GGAGCTGTGC 
GGGCCGCGGA 
GTGGGGCTTC 
CCACGCGCTG 
CTTCACTGAG 
TCCTCGCCCT 
CCCGACGGTC 
CCCCACAGGT 
CACTACTGTG 
CGCGGAGATT 
GGGCAGGGGG 
CCGCAAGCTG 
GCGCCAGGTG 
GGGCCTGGGA 
GCTGCTGTTC 
CCGGAGCGCC 
CTTCCAGTAC 
CCGGAGTGAG 
CCCTGAGGAC 
ACCCTGGGGA 
AGGAGTGGAG 
ATAAACTTGG 



AGCGCCACGC 
CAAACCTTTG 
TACTCGGAAG 
TGGAGCGCGG 
ATCCAGTTTG 
CTGGCACACG 
GAGTTGTGGT 
GCGGCCTGCC 
GGTCGCTCCG 
TTTGGCTTCT 
TGCCAGTTTC 
TCCGACGGCT 
TTCTGCCCGA 
GTCTTCCCCT 
GATGGGCGCC 
TGCCCGGACC 
GGCTTAGATC 
GGGCCCCCCT 
GAACCTGAGC 
TGCCCCACCG 
CCCCCCTCAG 
CCTTTGAGTC 
GGGAACCAGC 
AGCCGGCCGC 
GACTCGGTCT 
TGGGTGTACA 
GCCGACGTGG 
AGCGGGCGGC 
AGCGAGGTGG 
CGAGAGAAAG 
TTGAACCAGG 
TAGGGCTCCC 
AGGAGCCAGT 
GTGGGCTGGG 
ATTCTCTAAC 



TGAAGGCCAT 
AGGGCGACCT 
ACTTGCCGCG 
TGACGCCGCT 
GTGTCGCGGA 
CCTTTCCTCC 
CCCTGGGCAA 
ACTTCCCCTT 
ACGGCTTGCC 
GCCCCAGCGA 
CATTCATCTT 
ACCGCTGGTG 
CCCGAGCTGA 
TCACTTTCCT 
TCTGGTGCGC 
AAGGATACAG 
ATTCCTCAGT 
TGCATAAGGA 
CACGGCCTCC 
GACCCCCCAC 
CTGGCCCCAC 
CGGTGGACGA 
TGTATTTGTT 
AGGGCCCCTT 
TTGAGGAGCC 
CAGGCGCGTC 
CCCAGGTGAC 
GCCTCTGGAG 
ACCGGATGTT 
CCTATTTCTG 
TGGACCAAGT 
GTCCTGCTTT 
TTGCCGGATA 
CCCTCTCTTC 
CTTT 



300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1060 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1660 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 



Seq ID NO: 358 Protein sequence 
Protein Accession 8: NP_004985.1 



I 

MSLWQPLVLV 
RGESKSLGPA 
ITYWIQNYSE 
PDGKDGLLAH 
YSACTTDGRS 
ACTTDGRSDG 



PMYRFTEGPP 
PTAGPTGPPS 
RFSEGRGSRP 
LDKLGLGADV 
THDVFQYREK 



11 
I 

LLVLGCCFAA 
LLUjQKQLSL 
DLPRAVIDDA 
AFPPGPGIQG 
DGLPWCSTTA 
YRWCATTANY 
LWCATTSNFD 
LHKDDVNGIR 
AGPTGPPTAG 
QGPFLIADKW 
AQVTGALRSG 
AYFCQDRFYW 



21 
I 

PRQRQSTLVL 
PETGELDSAT 
FARAFALW S A 
DAHFDDDELW 
NYDTDDRFGF 
DRDKLFGFCP 
SDKKWGFCPD 
HLYGPRPEPE 
PSTATTVPLS 
PALPRKLDSV 
RGKMLLFSGR 
RVSSRSELNQ 



31 
I 

FPGDLRTNLT 
LKAMRTPRCG 
VTPLTFTRVY 
SLGKGWVPT 
CPSERLYTRD 
TRADSTVMGG 
QGYSLFLVAA 
PRPPTTTTPQ 
PVDDACNVNI 
FEEPLSKKLF 
RLWRFDVKAQ 
VDQVGYVTYD 



41 

1 

DRQLAEEYLY 
VPDLGRFQTF 
SRDADIVIQF 
RFGNADGAAC 
GNADGKPCQF 
NSAGELCVFP 
HEFGHALGLD 
PTAPPTVCPT 
FDAIAEIGNQ 
FFSGRQVWVY 
MVDPRSASEV 
ILQCPED 



51 
I 

RYGYTRVAEM 
EGDLKWHHHN 
GVAEHGDGYP 
HFPFIFEGRS 
PFIFQGQSYS 
FTFLGKEYST 
HSSVPEALMY 
GPPTVHPSER 
LYIjFKDGKYW 
TGASVLGPRR 
DRMFPGVPLD 



Seq ID NO: 359 DNA sequence 

Nucleic Acid Accession #: NM_000213.1 

Coding sequence: 127.. 53 85 



60 
120 
1B0 
240 
300 
360 
420 
480 
540 
600 
660 



1 
I 

CGCCCGCGCG 
CCCCGAGGTA 
AAGAGGATGG 
AGCGTCAGCC 
ACGGAGTGTG 
CGGCGCTGCA 
GTCATGGAGA 
AGCCAGATGT 
GAGCTGGAGG 
TCCAACTCCA 
GTCCTGAGCC 
AGCGTCCCGC 
CCCCCCTTCT 
AAACTGCAGG 
ATCCTGCAGA 
CTGGTCTTCT 
GGCATCATGA 
TACAGGACAC 
ATCATCCCCA 
TATTTCCCTG 
CTGGAGGAGG 
CGAGGCCTTC 
CACATCCGGC 
GATGGGAOGC 



11 
I 

CTGCAGCCCC 
GGTCCAGGAC 
CAGGGCCACG 
TCTCTGGGAC 
TCCGTGTGGA 
ACACCCAGGC 
GCAGCTTCCA 
CCCCCCAAGG 
TGTTTGAGCC 
TGTCCGATGA 
AGCTCACCAG 
AGACX3GACAT 
CCTTCAAGAA 
GAGAGCGGAT 
CAGCTGTGTG 
CCACCGAGTC 
GCCGCAACGA 
AGGACTACCC 
TCTTTGCTGT 
TCTCCTCACT 
CCTTCAATCG 
GGACAGAGGT 
GGGGGGAAGT 
ACGTGTGCCA 



21 
I 

ATCTCCTAGC 
GGGCGCACAG 
CCCCAGCCCA 
CTTGGCAAAC 
TAAGGACTGC 
GGAGCTGCTG 
AATCACAGAG 
CCTGCGGGTC 
ACTGGAGAGC 
TCTGGACAAC 
CGACTACACT 
GAGGCCTGAG 
CGTCATCAGC 
CTCAGGCAAC 
CACGAGGGAC 
AGCCTTCCAC 
TGAACGGTGC 
GTCGGTGCCC 
CACCAACTAC 
GGGGGTGCTG 
GATCCGCTCC 
CACCTCCAAG 
GGGTATATAC 
GCTGCCGGAG 



31 
I 

GGCAGCCCAG 
CAGCAGCCGA 
TGGGCCAGGC 
CGCTGCAAGA 
GCCTACTGCA 
GCCGCGGGCT 
GAGACCCAGA 
CGTCTGCGGC 
CCCGTGGACC 
CTCAAGAAGA 
ATTGGATTTG 
AAGCTGAAGG 
CTGACAGAAG 
CTGGATGCTC 
ATTGGCTGGC 
TATGAGGCTG 
CACCTGGACA 
ACCCTGGTGC 
TCCTATAGCT 
CAGGAGGACT 
AACCTGGACA 
ATGTTCCAGA 
CAGGTGCAGC 
GACCAGAAGG 



41 

i 

GCGCGGAGGG 
GGCTGGCCGG 
TGCTCCTGGC 
AGGCCCCAGT 
CAGACGAGAT 
GCCAGCGGGA 
TTGACACCAC 
CCGGTGAGGA 
TGTACATCCT 
TGGGGCAGAA 
GCAAGTTTGT 
AGCCCTGGCC 
ATGTGGATGA 
CTGAGGGCGG 
GCCCGGACAG 
ATGGCGCCAA 
CCACGGGCAC 
GCCTGCTCGC 
ACTACGAGAA 
CGTCCAACAT 
TCCGGGCCCT 
AGACGAGGAC 
TGCGGGCCCT 
GCAACATCCA 



51 
i 

AGCGAGTCCG 
GAGAGGGAGG 
AGCCTTGATC 
GAAGAGCTGC 
GTTCAGGGAC 
GAGCATCGTG 
CCTGCGGCGC 
GCGGCATTTT 
CATGGACTTC 
CCTGGCTCGG 
GGACAAAGTC 
CAACAGTGAC 
GTTCCGGAAT 
CTTCGATGCC 
CACCCACCTG 
CGTGCTGGCT 
CTACACCCAG 
CAAGCACAAC 
GCTTCACACC 
CGTGGAGCTG 
AGACAGCCCC 
TGGGTCCTTT 
TGAGCACGTG 
TCTGAAACCT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 



1090 



WO 03/042661 



PCT/US02/36810 



tccttctccg acggcctcaa gatggacgcg ggcatcatct gtgatgtgtg CACCTGCGAG 1SOO 

CTGCAAAAAG AGGTGCGGTC AGCTCGCTGC AGCTTCAACG GAGACTTCGT GTGCGGACAG 1560 
TGTGTGTGCA GCGAGGGCTG GAGTGGCCAG ACCTGCAACT GCTCCACCGG CTCTCTGAGT 1620 
GACATTCAGC CCTGCCTGCG GGAGGGCGAG GACAAGCCGT GCTCCGGCCG TGGGGAGTGC 1680 
5 CAGTGCGGGC ACTGTGTGTG CTACGGCGAA GGCCGCTACG AGGGTCAGTT CTGCGAGTAT 1740 
GACAACTTCC AGTGTCCCCG CACTTCCGGG TTCCTCTGCA ATGACCGAGG ACGCTGCTCC 1800 
ATGGGCCAGT GTGTGTGTGA GCCTGGTTGG ACAGGCCCAA GCTGTGACTG TCCCCTCAGC 1860 
AATGCCACCT GCATCGACAG CAATGGGGGC ATCTGTAATG GACGTGGCCA CTGTGAGTGT 1920 
GGCCGCTGCC ACTGCCACCA GCAGTCGCTC TACACGGACA CCATCTGOGA GATCAACTAC 1980 
10 TCGGCGATCC ACCCGGGCCT CTGCGAGGAC CTACGCTCCT GCGTGCAGTG CCAGGCGTGG 2040 
GGCACCGGCG AGAAGAAGGG GCGCACGTGT GAGGAATGCA ACTTCAAGGT CAAGATGGTG 2100 
GACGAGCTTA AGAGAGCCGA GGAGGTGGTG GTGCGCTGCT CCTTCCGGGA CGAGGATGAC 2160 
GACTGCACCT ACAGCTACAC CATGGAAGGT GACGGCGCCC CXGGGCCCAA CAGCACTGTC 2220 
CTGGTGCACA AGAAGAAGGA CTGCCCTCCG GGCTCCTTCT GGTGGCTCAT CCCCCTGCTC 2280 
15 CTCCTCCTCC TGCCGCTCCT GGCCCTGCTA CTGCTGCTAT GCTGGAAGTA CTGTGCCTGC 2340 
TGCAAGGCCT GCCTGGCACT TCTCCCGTGC TGCAACCGAG GTCACATGGT GGGCTTTAAG 2400 
GAAGACCACT ACATGCTGCG GGAGAACCTG ATGGCCTCTG ACCACTTGGA CACGCCCATG 2460 
CTGCGCAGCG GGAACCTCAA GGGCCGTGAC GTGGTCCGCT GGAAGGTCAC CAACAACATG 2520 
CAGCGGCCTG GCTTTGCCAC TCATGCCGCC AGCATCAACC CCACAGAGCT GGTGCCCTAC 2580 
20 GGGCTGTCCT TGCGCCTGGC CCGCCTTTGC ACCGAGAACC TGCTGAAGCC TGACACTCGG 2640 
GAGTGCGCCC AGCTGCGCCA GGAGGTGGAG GAGAACCTGA ACGAGGTCTA CAGGCAGATC 2700 
TCCGGTGTAC ACAAGCTCCA GCAGACCAAG TTCCGGCAGC AGCCCAATGC CGGGAAAAAG 2760 
CAAGACCACA CCATTGTGGA CACAGTGCTG ATGGCGCCCC GCTCGGCCAA GCCGGCCCTG 2820 
CTGAAGCTTA CAGAGAAGCA GGTGGAACAG AGGGCCTTCC ACGACCTCAA GGTGGCCCCC 2880 
25 GGCTACTACA CCCTCACTGC AGACCAGGAC GCCCGGGGCA TGGTGGAGTT CCAGGAGGGC 2940 
GTGGAGCTGG TGGACGTACG GGTGCCCCTC TTTATCCGGC CTGAGGATGA CGACGAGAAG 3000 
CAGCTGCTGG TGGAGGCCAT CGACGTGCCC GCAGGCACTG CCACCCTCGG CCGCCGCCTG 30 SO 
GTAAACATCA CCATCATCAA GGAGCAAGCC AGAGACGTGG TGTCCTTTGA GCAGCCTGAG 3120 
TTCTCGGTCA GCCGCGGGGA CCAGGTGGCC CGCATCCCTG TCATCCGGCG TGTCCTGGAC 3180 
30 GGCGGGAAGT CCCAGGTCTC CTACCGCACA CAGGATGGCA CCGCGCAGGG CAACCGGGAC 3240 
TACATCCCCG TGGAGGGTGA GCTGCTGTTC CAGCCTGGGG AGGCCTGGAA AGAGCTGCAG 3300 
GTGAAGCTCC TGGAGCTGCA AGAAGTTGAC TCCCTCCTGC GGGGCCGCCA GGTCCGCCGT 3360 
TTCCACGTCC AGCTCAGCAA CCCTAAGTTT GGGGCCCACC TGGGCCAGCC CCACTCCACC 3420 
ACCATCATCA TCAGGGACCC AGATGAACTG GACCGGAGCT TCACGAGTCA GATGTTGTCA 3480 
35 TCACAGCCAC CCCCTCACGG CGACCTGGGC GCCCCGCAGA ACCCCAATGC TAAGGCCGCT 3540 
GGGTCCAGGA AGATCCATTT CAACTGGCTG CCCCCTTCTG GCAAGCCAAT GGGGTACAGG 3600 
GTAAAGTACT GGATTCAGGG TGACTCCGAA TCCGAAGCCC ACCTGCTCGA CAGCAAGGTG 3660 
CCCTCAGTGG AGCTCACCAA CCTGTACCCG TATTGCGACT ATGAGATGAA GGTGTGCGCC 3720 
TACGGGGCTC AGGGCGAGGG ACCCTACAGC TCCCTGGTGT CCTGCCGCAC CCACCAGGAA 3780 
40 GTGCCCAGCG AGCCAGGGCG TCTGGCCTTC AATGTCGTCT CCTCCACGGT GACCCAGCTG 3840 
AGCTGGGCTG AGCCGGCTGA GACCAACGGT GAGATCACAG CCTACGAGGT CTGCTATGGC 390O 
CTGGTCAACG ATGACAACCG ACCTATTGGG CCCATGAAGA AAGTGCTGGT TGACAACCCT 3960 
AAGAACCGGA TGCTGCTTAT TGAGAACCTT CGGGAGTCCC AGCCCTACCG CTACACGGTG 4020 
AAGGCGCGCA ACGGGGCCGG CTGGGGGCCT GAGCGGGAGG CCATCATCAA CCTGGCCACC 4080 
45 CAGCCCAAGA GGCCCATGTC CATCCCCATC ATCCCTGACA TCCCTATCGT GGACGCCCAG 4140 
AGCGGGGAGG ACTACGACAG CTTCCTTATG TACAGCGATG ACGTTCTACG CTCTCCATCG 4200 
GGCAGCCAGA GGCCCAGCGT CTCCGATGAC ACTGAGCACC TGGTGAATGG CCGGATGGAC 4260 
TTTGCCTTCC CGGGCAGCAC CAACTCCCTG CACAGGATGA CCACGACCAG TGCTGCTGCC 4320 
TATGGCACCC ACCTGAGCCC ACACGTGCCC CACCGCGTGC TAAGCACATC CTCCACCCTC 4380 
50 ACACGGGACT ACAACTCACT GACCCGCTCA GAACACTCAC ACTCGACCAC ACTGCCGAGG 4440 
GACTACTCCA CCCTCACCTC CGTCTCCTCC CACGACTCTC GCCTGACTGC TGGTGTGCCC 4500 
GACACGCCCA CCCGCCTGGT GTTCTCTGCC CTGGGGCCCA CATCTCTCAG AGTGAGCTGG 4560 
CAGGAGCCGC GGTGCGAGCG GCCGCTGCAG GGCTACAGTG TGGAGTACCA GCTGCTGAAC 4620 
GGCGGTGAGC TGCATCGGCT CAACATCCCC AACCCTGCCC AGACCTCGGT GGTGGTGGAA 4680 
55 GACCTCCTGC CCAACCACTC CTACGTGTTC CGCGTGCGGG CCCAGAGCCA GGAAGGCTGG 4740 
GGCCGAGAGC GTGAGGGTGT CATCACCATT GAATCCCAGG TGCACCCGCA GAGCCCACTG 4800 
TGTCCCCTGC CAGGCTCCGC CTTCACTTTG AGCACTCCCA GTGCCCCAGG CCCGCTGGTG 4860 
TTCACTGCCC TGAGCCCAGA CTCGCTGCAG CTGAGCTGGG AGCGGCCACG GAGGCCCAAT 4920 
GGGGATATCG TCGGCTACCT GGTGACCTGT GAGATGGCCC AAGGAGGAGG GCCAGCCACC 4980 
60 GCATTCCGGG TGGATGGAGA CAGCCCCGAG AGCCGGCTGA CCGTGCCGGG CCTCAGCGAG 5040 
AACGTGCCCT ACAAGTTCAA GGTGCAGGCC AGGACCACTG AGGGCTTCGG GCCAGAGCGC 5100 
GAGGGCATCA TCACCATAGA GTCCCAGGAT GGAGGACCCT TCCCGCAGCT GGGCAGCCGT 5160 
GCCGGGCTCT TCCAGCACCC GCTGCAAAGC GAGTACAGCA GCATCACCAC CACCCACACC 5220 
AGCGCCACCG AGCCCTTCCT AGTGGATGGG CCGACCCTGG GGGCCCAGCA CCTGGAGGCA 5280 
65 GGCGGCTCCC TCACCCGGCA TGTGACCCAG GAGTTTGTGA GCCGGACACT GACCACCAGC 5340 
GGAACCCTTA GCACCCACAT GGACCAACAG TTCTTCCAAA CTTGACCGCA CCCTGCCCCA 5400 
CCCCCGCCAT GTCCCACTAG GCGTCCTCCC GACTCCTCTC CCGGAGCCTC CTCAGCTACT 5460 
CCATCCTTGC ACCCCTGGGG GCCCAGCCCA CCCGCATGCA CAGAGCAGGG GCTAGGTGTC 5520 
TCCTGGGAGG CATGAAGGGG GCAAGGTCCG TCCTCTGTGG GCCCAAACCT ATTTGTAACC 5580 
70 AAAGAGCTGG GAGCAGCACA AGGACCCAGC CTTTGTTCTG CACTTAATAA ATGGTTTTGC 5640 
TACTG 



Seq ID NO: 360 Protein sequence 
^ Protein Accession #: NP_000204.1 

1 11 21 31 41 51 

I I I I I I 

MAGPRPSPWA RLLLAALISV SI*SGTIiANRC KKAPVKSCTE CVRVDKDCAY CTDEMFRDRR 60 

CNTQAELLAA GCQRESIWM ESSFQ1TEET QIDTTLRRSQ MSPQGLRVRL RPGEERHFEL 120 

80 EVFEPLESPV DLYILMDFSN SMSDDLDNLK KMGQNLARVL SQLTSDYTIG FGKFVDKVSV 180 

PQTDMRPEKL KEPWPNSDPP FSFKNVISLT EDVDEFRNKL QGERISGNLD APEGGFDAIL 240 

QTAVCTRDIG WRPDSTHLLV FSTESAFHYE ADGANVLAGI MSRNDERCHL DTTGTYTQYR 300 

TQDYPSVPTL VRLLAKHNII PIFAVTNYSY SYYEKLHTYF PVSSLGVLQE DSSNIVELLE 360 

EAFNRIRSNL DIRALDSPRG LRTEVTSKMF QKTRTGSFHI RRGEVGIYQV QLRALEHVDG 420 



1091 



WO 03/042661 



THVCQLPEDQ KGNIHLKPSF SDGLKMDAGI ICDVCTCELQ KEVRSARCSF NQDFVCGQCV 
CSEGWSGQTC NCSTGSLSDI QPCLREGEDK PCSGRGECQC GHCVCYGEGR YEGQFCEYDN 
FQCPRTSGFL CNDRGRCSMG QCVCEPGWTG PSCDCPLSNA TCIDSNGGIC NGRGHCECGR 
CHCHQOSLYT DTICEINYSA IHPGLCEDLR SCVQCQAWGT GEKKGRTCEE CNFKVKMVDE 
LKRAEEWVR CSFRDEDDDC TYSYTMEGDG APGPNSTVLV HKKKDCPPGS FWWLIPLLLI* 
LLPUALLLL LCWKYCACCK ACLALLPCCN RGHMVGFKED HYMLRENLMA SDHLDTPMLR 
SGNLKGRDW RWKVTNNMQR PGFATHAASI NPTELVPYGL SLRLARLCTE NLLKPDTREC 
AQLRQEVEEN LNEVYRQISG VHKLQQTKFR QQPNAGKKQD HTIVDTVLMA PRSAKPALLK 
LTEKQVEQRA FHDLKVAPGY YTLTADQDAR GMVEFQEGVE LVDVRVPLFI RPEDDDEKQL 
LVEAIDVPAG TATLGRRLVN ITIIKEQARD WSFEQPEFS VSRGDQVARI PVIRRVLDGG 
KSQVSYRTQD GTAQGNRDYI PVEGELLFQP GEAWKELQVK LLELQEVDSL LRGRQVRRFH 
VQLSNPKFGA HLGQPHSTTI IIRDPDELDR SFTSQMLSSQ PPPHGDLGAP QNPNAKAAGS 
RKIHFNWLPP SGKPMGYRVK YWIQGDSESE AHLLDSKVPS VELTNLYPYC DYEMKVCAYG 
AQGEGPYSSL VSCRTHQEVP SEPGRLAFNV VSSTVTQLSW AEPAETNGEI TAYEVCYGLV 
NDDNRPIGPM KKVLVDNPKN RMLLIENLRE SQPYRYTVKA RNGAGWGPER EAIXNLATQP 
KRPMSIPIIP DIPIYDAQSG EDYDSFLMYS DDVLRSPSGS QRPSVSDDTE HLVNGRMDFA 
FPGSTNSLHR MTTTSAAAYG THLSPHVPHR VLSTSSTLTR DYNSLTRSEH SHSTTLPRDY 
STLTSVSSHD SRLTAGVPDT PTRLVFSALG PTSLRVSWQE PRCERPLQGY SVEYQLLNGG 
ELHRLNIPNP AQTSWVEDL LPNHSYVFRV RAQSQEGWGR EREGVITIES QVHPQSPLCP 
LPGSAFTLST PSAPGPLVFT ALSPDSLQI*S WERPRRPKGD IVGYLVTCEM AQGGGPATAF 
RVDGDSPESR LTVPGLSENV PYKFKVQART TEGFGPEREG IITIESQDGG PFPQI/3SRAG 
LPQHPLQSEY SSITTTHTSA TEPFLVDGPT LGAQHLEAGG SLTRHVTQEF VSRTLTTSGT 
LSTHMDQQFF QT 

Seq ID NO: 361 DNA sequence 
Nucleic Acid Accession #: NM_013332.1 
Coding sequence : 1 . . 63 

1 11 21 31 41 51 

i I 1 I I I 

GCACGAGGGC GCTTTTGTCT CCGGTGAGTT TTGTGGCGGG AAGCTTCTGC GCTGGTGCTT 60 
AGTAACCGAC TTTCCTCCGG ACTCCTGCAC GACCTGCTCC TACAGCCGGC GA TCCA CTCC 120 
CGGCTGTTCC CCCGGAGGGT CCAGAGGCCT TTCAGAAGGA GAAGGCAGCT CTGTTTCTCT 180 
GCAGAGGAGT AGGGTCCTTT CAGCCATGAA GCATGTGTTG AACCTCTACC TGTTAGGTGT 240 
GGTACTGACC CTACTCTCCA TCTTCGTTAG AGTGATGGAG TCCCTAGAAG GCTTACTAGA 300 
GAGCCCATCG CCTGGGACCT CCTGGACCAC CAGAAGCCAA CTAGCCAACA CAGAGCCCAC 360 
CAAGGGCCTT CCAGACCATC CATCCAGAAG CATGTGATAA GACCTCCTTC CATACTGGCC 420 
ATATTTTGGA ACACTGACCT AGACATGTCC AGATGGGAGT CCCATTCCTA GCAGACAAGC 480 
TGAGCACCGT TGTAACCAGA GAACTATTAC TAGGCCTTGA AGAACCTGTC TAACTGGATG 540 
CTCATTGCCT GGGCAAGGCC TGTTTAGGCC GGTTGCGGTG GCTCATGCCT GTAATCCTAG 600 
CACTTTGGGA GGCTGAGGTG GGTGGATCAC CTGAGGTCAG GAGTTCGAGA CCAGCCTCGC 660 
CAACATGGCG AAACCCCATC TCTACTAAAA ATACAAAAGT TAGCTGGGTG TGGTGGCAGA 720 
GGCCTGTAAT CCCAGTTCCT TGGGAGGCTG AGGCGGGAGA ATTGCTTGAA CCCGGGGACG 780 
GAGGTTGCAG TGAACCGAGA TCGCACTGCT GTACCCAGCC TGGGCCACAG TGCAAGACTC 840 
CATCTCAAAA AAAAAAAGAA AAGAAAAAGC CTGTTTAATG CACAGGTGTG AGTGGATTGC 900 
TTATGGCTAT GAGATAGGTT GATCTCGCCC TTACCCCGGG GTCTGGTGTA TGCTGTGCTT 960 
TCCTCAGCAG TATGGCTCTG ACATCTCTTA GATGTCCCAA CTTCAGCTGT TGGGAGATGG 1020 
TGATATTTTC AACCCTACTT CCTAAACATC TGTCTGGGGT TCCTTTAGTC TTGAATGTCT 1080 
TATGCTCAAT TATTTGGTGT TGAGCCTCTC TTCCACAAGA GCTCCTCCAT GTTTGGATAG 1140 
CAGTTGAAGA GGTTGTGTGG GTGGGCTGTT GGGAGTGAGG ATGGAGTGTT CAGTGCCCAT 1200 
TTCTCATTTT ACATTTTAAA GTCGTTCCTC CAACATAGTG TGTATTGGTC TGAAGGGGGT 1260 
GGTGGGATGC CAAAGCCTGC TCAAGTTATG GACATTGTGG CCACCATGTG GCTTAAATGA 1320 
TTTTTTCTAA CTAATAAAGT GGAATATATA TTTCAAAAAA AAAAAAAAAA AA 

Seq ID NO: 362 Protein sequence 
Protein Accession fh NP_037464.1 

1 11 21 31 41 51 

| I | I I I 

MKHVLNLYLL GWLTIiIiSIF VRVMESLEGL IiESPSPGTSW TTRSQLANTE PTKGLPDHPS 60 
RSM 

Seq ID NO: 363 DNA sequence 

Nucleic Acid Accession #: NM__023915.1 

Coding sequence: 250.. 1326 

1 ll 21 31 41 51 

I 1 I I I I 

GGCACGAGGG TTTCGTTTTC ATGCTTTACC AGAAAATCCA CTTCCCTGCC GACCTTAGTT 60 
TCAAAGCTTA TTCTTAATTA GAGACAAGAA ACCTGTTTCA ACTTGAAGAC ACCGTATGAG 120 
GTGAATGGAC AGCCAGCCAC CACAATGAAA GAAATCAAAC CAGGAATAAC CTATGCTGAA 180 
CCCACGCCTC AATCGTCCCC AAGTGTTTCC TGACACGCAT CTTTGCTTAC AGTGCATCAC 240 
AACTGAAGAA TGGGGTTCAA CTTGACGCTT GCAAAATTAC CAAATAACGA GCTGCACGGC 300 
CAAGAGAGTC ACAATTCAGG CAACAGGAGC GACGGGCCAG GAAAGAACAC CACCCTTCAC 360 
AATGAATTTG ACACAATTGT CTTGCCGGTG CTTTATCTCA TTATATTTGT GGCAAGCATC 420 
TTGCTGAATG GTTTAGCAGT GTGGATCTTC TTCCACATTA GGAATAAAAC CAGCTTCATA 480 
TTCTATCTCA AAAACATAGT GGTTGCAGAC CTCATAATGA CGCTGACATT TCCATTTCGA 540 
ATAGTCCATG ATGCAGGATT TGGACCTTGG TACTTCAAGT TTATTCTCTG CAGATACACT 600 
TCAGTTTTGT TTTATGCAAA CATGTATACT TCCATCGTGT TCCTTGGGCT GATAAG CATT 660 
GATCGCTATC TGAAGGTGGT CAAGCCATTT GGGGACTCTC GGATGTACAG CATAACCTTC 720 
ACGAAGGTTT TATCTGTTTG TGTTTGGGTG ATCATGGCTG TTTTGTCTTT GCCAAACATC 780 
ATCCTGACAA ATGGTCAGCC AACAGAGGAC AATATCCATG ACTGCTCAAA ACTTAAAAGT 840 
CCTTTGGGGG TCAAATGGCA TACGGCAGTC ACCTATGTGA ACAGCTGCTT GTTTGTGGCC 900 
GTGCTGGTGA TTCTGATCGG ATGTTACATA GCCATATCCA GGTACATCCA CAAATCCAGC 960 



1092 



WO 03/042661 



PCT/US02/36810 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



AGGCAATTCA 
GTGGCTGTGT 
AGTCACTTAG 
ATTACACTTT 
TGTAGGTCAT 
ATCAGATCAC 
GTGTAGGCCT 
TTCATTATCC 



TAAGTCAGTC 
TTTTTACCTG 
ACAGGCTTTT 
TCTTGTCTGC 
TTTCAAGAAG 
TGCAAAGTGT 
TTTATTGTTT 
TTAAAAAAAA 



AAGCCGAAAG 
CTTTCTACCA 
AGATGAATCT 
GTGTAATGTT 
GCTGTTCAAA 
GAGAAGATGG 
GTTGGAATCG 
AA 



CGAAAACATA 
TATCACTTGT 
GCACAAAAAA 
TGCCTGGATC 
AAATCAAATA 
GAAGTTCGCA 
ATATGTACAA 



ACCAGAGCAT 
GCAGAATTCC 
TCCTATATTA 
CAATAATTTA 
TCAGAACCAG 
TATATTATGA 
AGTGTAAATA 



CAGGGTTGTT 1020 

TTTTACTTTT 1 0 80 

CTGCAAAGAA 1140 

CTTTTTCATG 1200 

GAGTGAAAGC 1260 

TTACACTGAT 1320 

AATGTTTCTT 1380 



Seq ID NO: 364 Protein sequence 
Protein Accession ft: NP 076404 



1 11 

I I 
MGFNLTLAJOi PNNELHGQES 
GLAVWIFFHI RNKTSFIFYL 
FYANMYTSIV FLGLISIDRY 
NGQPTEDNIH DCSKLKSPLG 
ISQSSRKRKH NQSIRVWAV 
FLSACNVCLD PIIYFFMCRS 



21 31 

I 1 
HNSGNRSDGP GKNTTLHNEF 
KNIWADLIM TLTFPFRIVH 
LKWKPFGDS RMYSITFTKV 
VKWHTAVTYV NSCLFVAVLV 
FFTCFLPYHL CRIPFTFSHL 
FSRRLFKKSN IRTRSESIRS 



41 
1 

DTIVLPVLYL 
DAGFGPWYFK 
LSVCVWVIMA 
ILIGCYIAIS 
DRLLDESAQK 
LQSVRRSEVR 



51 
I 

IIFVASILLN 
FILCRYTSVL 
VLSLPNIILT 
RYIHKSSRQF 
ILYYCKEITL 
IYYDYTDV 



60 
120 
180 
240 
300 



Seq ID NOi 365 DNA sequence 

Nucleic Acid Accession #: NM_0DS365.1 

Coding sequences 1..948 



ATGTCTCTCG 
GAGGACTTGG 
TCCTCTGACA 
CCTCAGGGAG 
GAGGGCTCCA 
GAGTTCATGT 
CACAAATATC 
AATTACAAGC 
TTTGGCACTG 
CTTGGCCTCT 
CTGATCATTG 
TGGGAAGCGT 
CCCAGGAAGC 
CCCGGCAGTG 
AGCTATGAGA 
CCATCCCTTT 



11 
I 

AGCAGAGGAG 
GCCTGATGGG 
GCAAGGAGGA 
GCGCTTCCTC 
GCAGTCAAGA 
TCCAAGAAGC 
GAGTCAAGGA 
GCTACTTTCC 
ATGTGAAGGA 
CGTGCGATAG 
TCCTGGGTGT 
TGAGTGTGAT 
TGCTCACCCA 
ATCCTGCGCA 
AGGTCATAAA 
ATGAAGAGGT 



21 
I 

TCCGCACTGC 
TGCACAGGAA 
GGAGGTGTCT 
CTCCATTTCC 
AGAGGAAGAG 
ACTGAAATTG 
GCCGGTCACA 
TGTGATCTTC 
GGTGGACCCC 
CATGCTGGGT 
GATCCTAACC 
GGGGGTGTAT 
AGATTGGGTG 
CTACGAGTTC 
TTATTTGGTC 
TTTGGGAGAG 



31 



41 



AAGCCTGATG 
CCCACAGGCG 
GCTGCTGGGT 
GTCTACTACA 
CCAAGCTCCT 
AAGGTGGCTG 
AAGGCAGAAA 
GGCAAAGCCT 
GCCGGCCACT 
GATGGTCATA 
AAAGACAACT 
GTTGGGAAGG 
CAGGAAAACT 
CTGTGGGGTT 
ATGCTCAATG 
GAGCAAGAGG 



AAGACCTTGA 
AGGAGGAGGA 
CATCAAGTCC 
CTTTATGGAG 
CGGTOGACCC 
AGTTGGTTCA 
TGCTGGAGAG 
CCGAGTTCAT 
CCTACATCCT 
GCATGCCCAA 
GCGCCCCTGA 
AGCACATGTT 
ACCTGGAGTA 
CCAAGGCCCA 
CAAGAGAGCC 
GAGTCTGA 




AGAGGTTATC 
CTACGGGGAG 
CCGGCAGGTG 
CGCTGAAACC 
CATCTGCTAC 



Seq ID NO: 366 Protein sequence 
Protein Accession #: NP 005356.1 



51 



1 11 21 31 41 

I 1 I I I I 

MSLEQRSPHC KPDEDLEAQG EDLGLMGAQB PTGEEEETTS SSDSKEEEVS AAGSSSPPQS 
PQGGASSSIS VYYTLWSQFD EGSSSQEEEE PSSSVDPAQL EFMFQEALKIj KVAELVHPLL 
HKYRVKEPVT KAEMLESVIK NYKRYFPVIF GKASEFMQVI FGTDVKEVDP AGHSYILVTA 
LGLSCDSMLG DGHSMPKAAL LIIVLGVILT KDNCAPEEVI WEALSVMGVY VGKEHMFYGE 
PRKLLTQDWV QENYLEYRQV PGSDPAHYEF LWGSKAHAET SYEKVINYLV MLNAREPICY 
PSLYEEVLGE EQEGV 

Seq ID NO: 367 DNA sequence 
Nucleic Acid Accession #: NMJJ14400 
Coding sequence: 8 6.. 112 6 



1 
I 

GGTTACTCAT 
GACGCCAAGG 
GATCTGGACT 
GTGCTACAGC 
GAAGTGCGCG 
CGGACAATTC 
OGGCCTGGAT 
CTGCAACGCC 
ATACCCGCCC 
GGGTACATCG 
CTTCGACGGC 
CTGTGTCCAG 

CCCTCGAATC 
CACATCTGTC 
GCCAGCGCCA 
GGAGCCCAGG 
TCCTGCAAAA 
ATTGGCAGCC 
AAATTTCCCT 
CCCACCACTG 
CTTCTGCTGC 



11 

! 

CCTGGGCTCA 
GAGCAGGACG 
GGAGGCTGGC 
TGCGTGCAGA 
CCGGGCGTGG 
TOGCTGGCAG 
CTTCACGGGC 
AAGCTCAACC 
AACGGCGTGG 
CCGCCGGTCG 
AACGTCACCT 
GATGAATTCT 
TGCCAGGGGT 
CCACCCCTTG 
ACCACTTCTA 
ACCAGTCAGA 
TTGACTGGAG 
GGGGGGCCCC 
CTTCTGTTGG 
CTCACCTACT 
GACTGGGCTG 
GCTGGTTTGC 



21 
I 

GGTAAGAGGG 
GAGCCATGGA 
TGCTGCTGCT 
AAGCAGATGA 
ACGTCTGCAC 
TGCSGGGTTG 
TTCTGGCGTT 
TCACCTCGCG 
AGTGCTACAG 
TGAGCTGCTA 
TGACGGCAGC 
GCACTCGGGA 
CCCGCTGTAA 
TCCGGCTGCC 
CCTCGGCCCC 
CTCCGAGACA 
GCGCCGCTGG 
AGCAGCCCCA 
CC6TGGCTGC 
TCTCTGGCCC 
GCCCAGCCCC 
GGCTTTGGGA 



31 
I 

CCCGAGCTCG 
CCCCGCCAGG 
GCTGCTTCGC 
CGGATGCTCC 
CGAGGCCGTG 
CGGTTCGGGA 
CATCCAGCTG 
GGCGCTOGAC 
CTGTGTGGGC 
CAACGCCAGC 
TAATGTGACT 
TGGAGTAACA 
CTCTGACCTC 
CCCTCCAGAG 
AGTGAGACCC 
GGGAGTAGAA 
CCACCAGGAC 
TAATAAAGGC 
TGGTGTCCTA 
TGGGTACCCC 
TGTTTTTCCA 
AATAAAATAC 



41 
I 

GAGGCGGCAC 
AAAGCAGGTG 
GGAGGAGCGC 
CCGAACAAGA 
GGGGCGGTGG 
CTCCCCGGCA 
CAGCAATGCG 
CCGGCAGGTA 
CTGAGCCGGG 
GATCATGTCT 
GTGTCCTTGC 
GGCCCAGGGT 
CGCAACAAGA 
CCCACGACTG 
ACATCCACCA 
CACGAGGCCT 
CGCAGCAATT 
TGTGTGGCTC 
CTGTGAGCTT 
TCTTCTCATC 
ACATTCCCCA 
CGTTGTATAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



60 
120 
180 
240 
300 



51 
I 

ACCCAGGGGG 
CCCAGGCCAT 
AGGCCCTGGA 
TGAAGACAGT 
AGACCATCCA 
AGAATGACCG 
CTCAGGATCG 
ATGAGAGTGC 
AGGCGTGCCA 
ACAAGGGCTG 
CTGTCCGGGG 
TCACGCTCAG 
CCTACTTCTC 
TGGCCTCAAC 
CCAAACCCAT 
CCCGGGATGA 
CAGGGCAGTA 
CCACAGCTGG 
CTCCACCTGG 
ACTTCCTGTT 
GTATCCCCAG 
ATTCTGGCAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
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GGGTGTTCTA GCTTTTTGAG GACAGCTCCT GTATCCTTCT CATCCTTGTC TCTCOGCTTG 1380 

TCCTCTTGTG ATGTTAGGAC AGAGTGAGAG AAGTCAGCTG TCACGGGGAA GGTGAGAGAG 1440 

AGGATGCTAA GCTTCCTACT CACTTTCTCC TAGCCAGCCT GGACTTTGGA GCGTGGGGTG 1500 

GGTGGGACAA TGGCTCCCCA CTCTAAGCAC TGCCTCCCCT ACTCCCCGCA TCTTTGGGGA 1560 

ATCGGTTCCC CATATGTCTT CCTTACTAGA CTGTGAGCTC CTCGAGGGCA GGGACCGTGC 1620 

CTTATGTCTG TGTGTGATCA GTTTCTGGCA CATAAATGCC TCAATAAAGA TTTAATTACT 1680 
TTGTATAGTG AAAAAAAA 

Seq ID NO: 368 Protein sequence 
Protein Accession U : NP_055215 

1 11 21 31 41 51 

i ] I I I I 

MDPARKAGAQ AMIWTAGWLL LLLLRGGAQA LECYSCVQKA DDGCSPNKMK TVKCAPGVDV 60 
CTBAVGAVET IHGQFSLAVX GCGSGLPGKN DRGLDLHGLL AFIQLQQCAQ DRCNAKLNLT 120 
SRALDPAGNE SAYPPNGVEC YSCVGLSREA COGTSPPWS CYNASDHVYK GCFDGNVTLT 180 
AANVTVSLPV RGCVQDEFCT RDGVTGPGFT LSGSCCQGSR CNSDLRNKTY FSPRIPPLVR 240 
LPPPEPTTVA STTSVTTSTS APVRPTSTTK PMPAPTSQTP RQGVEHEASR DEEPRLTGGA 300 
AGHQDRSNSG QYPAKGGPQQ PHNKGCVAPT AGLAALLLAV AAGVLL 



Seq ID NO: 369 DNA sequence 
Nucleic Acid Accession #: NM_005329.1 
Coding sequence: 1..1662 

1 11 21 31 41 51 

I I I I I I 

ATGCCGGTGC AGCTGACGAC AGCCCTGCGT GTGGTGGGCA CCAGCCTGTT TGCCCTGGCA 60 

GTGCTGGGTG GCATCCTGGC AGCCTATGTG ACGGG CTACC AGTTCATCCA CACGGAAAAG 120 

CACTACCTGT CCTTCGGCCT GTACGGCGCC ATCCTGGGCC TGCACCTGCT CATTCAGAGC 1B0 

CTTTTTGCCT TCCTGGAGCA CCGGCGCATG CGACGTGCCG GCCAGGCCCT GAAGCTGCCC 240 

TCCCCGCGGC GGGGCTCGGT GGCACTGTGC ATTGCCGCAT ACCAGGAGGA CCCTGACTAC 300 

TTGCGCAAGT GCCTGCGCTC GGCCCAGCGC ATCTCCTTCC CTGACCTCAA GGTGGTCATG 360 

GTGGTGGATG GCAACCGCCA GGAGGACGCC TACATGCTGG ACATCTTCCA CGAGGTGCTG 420. 

GGCGGCACCG AGCAGGCCGG CTTCTTTGTG TGGCGCAGCA ACTTCCATGA GGCAGGCGAG 480 

GGTGAGACGG AGGCCAGCCT GCAGGAGGGC ATGGACCGTG TGCGGGATGT GGTGCGGGCC 540 

AGCACCTTCT CGTGCATCAT GCAGAAGTGG GGAGGCAAGC GCGAGGTCAT GTACACGGCC 600 

TTCAAGGCCC TCGGCGATTC GGTGGACTAC ATCCAGGTGT GCGACTCTGA CACTGTGCTG 660 

GATCCAGCCT GCACCATCGA GATGCTTCGA GTCCTGGAGG AGGATCCCCA AGTAGGGGGA 720 

GTCGGGGGAG ATGTCCAGAT CCTCAACAAG TACGACTCAT GGATTTCCTT CCTGAGCAGC 780 

GTGCGGTACT GGATGGCCTT CAACGTGGAG CGGGCCTGCC AGTCCTACTT TGGCTGTGTG 840 

CAGTGTATTA GTGGGCCCTT GGGCATGTAC CGCAACAGCC TCCTCCAGCA GTTCCTGGAG 900 

GACTGGTACC ATCAGAAGTT CCTAGGCAGC AAGTGCAGCT TCGGGGATGA CCGGCACCTC 960 

ACCAACCGAG TCCTGAGCCT TGGCTACCGA ACTAAGTATA CCGCGCGCTC CAAGTGCCTC 1020 

ACAGAGACCC CCACTAAGTA CCTCCGGTGG CTCAACCAGC AAACCCGCTG GAGCAAGTCT 1080 

TACTTCCGGG AGTGGCTCTA CAACTCTCTG TGGTTCCATA AGCACCACCT CTGGATGACC 1140 

TACGAGTCAG TGGTCACGGG TTTCTTCCCC TTCTTCCTCA TTGCCACGGT TATACAGCTT 1200 

TTCTACCGGG GCCGCATCTG GAACATTCTC CTCTTCCTGC TGACGGTGCA GCTGGTGGGC 1260 

ATTATCAAGG CCACCTACGC CTGCTTCCTT CGGGGCAATG CAGAGATGAT CTTCATGTCC 1320 

CTCTACTCCC TCCTCTATAT GTCCAGCCTT CTGCCGGCCA AGATCTTTGC CATTG CTACC 1380 

ATCAACAAAT CTGGCTGGGG CACCTCTGGC CGAAAAACCA TTGTGGTGAA CTTCATTGGC 1440 

CTCATTCCTG TGTCCATCTG GGTGGCAGTT CTCCTGGAGG GGCTGGCCTA CACAGCTTAT 1500 

TGCCAGGACC TGTTCAGTGA GACAGAGCTA GCCTTCCTTG TCTCTGGGGC TATACTGTAT 1560 

GGCTGCTACT GGGTGGCCCT CCTCATGCTA TATCTGGCCA TCATCGCCCG GCGATGTGGG 1620 
AAGAAGCCGG AGCAGTACAG CTTGGCTTTT GCTGAGGTGT GA i 



Seq ID NO: 370 Protein sequence 
Protein Accession #: NP_0 053 20.1 

1 11 21 31 41 51 

| I I 1 1 I 

MPVQLTTALR WGTSLFALA VLGGILAAYV TGYQFIHTEK HYLSFGLYGA ILGLHLLIQS 60 
LFAFLEHRRM RRAGQALKLP SPRRGSVALC IAAYQEDPDY LRKCLRSAQR ISFPDLKWM 120 
WDGNRQEDA YMLDIFHEVL GGTEQAGFFV WRSNFHEAGE GETEASLQEG MDRVRDWRA 180 
STFSCIMQKW GGKREVMYTA FKALGD9VDY IQVCDSDTVL DPACTIEMLR VLEEDPQVGG 240 
VGGDVQILNK YDSWISFLSS VRYWMAFNVE RACQSYFGCV QCISGPLGMY RNSLLQQFLE 300 
DWYHQKFXGS KCSFGDDRHL TNRVLSLGYR TKYTARSKCL TETPTKYLRW LNQQTRWSKS 360 
YFREWLYNSL WFHKHHLWMT YESWTGFFP FFLIATVIQL FYRGRIWNII* LFLLTVQLVG 420 
I IKATYACFL RGNAEMIFMS LYSLLYMSSL LPAKIFAIAT INKSGWGTSG RKTIWNFIG 4 B0 
LIPVSIWVAV LLEGLAYTAY CQDLFSETEL AFLVSGAILY GCYWVALLML YLAIIARRCG 540 
KKPEQYSLAF AEV 

Seq ID NO: 371 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 148-7095 

1 11 21 31 41 51 

I I I I I I 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 
CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 
CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 
CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 
CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 
AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 
CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 
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AACACATTCA TTCATAACAC TGGGAAAACA QTGGAAATTA ATCTCACTAA TGACTACCGT 480 
GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 
AAATGCAATA TGTCATCTQA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 
GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 
5 GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 
GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 
TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTAC ATTTAC 840 
AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 
ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 
10 TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 
TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 
AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 
TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 
CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 
15 GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 
TGCACTAATG GCTTATAXGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 13 BO 
AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 
GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 
AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 
20 ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 
AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 
ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 
GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 
AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 
25 AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 
GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 
GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 
GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 
GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 
30 AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 
TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 
CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 
TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 
GTATACAATG GTGAGACACC TCTTCAACCT TCCTACAGTA GTGAAGTCTT TCCTCTAGTC 2460 
35 ACCCCTTTGT TGCTTGACAA TCAGATCCTC AACACTACCC CTGCTGCTTC AAGTAGTGAT 2520 
TCGGCCTTGC ATGCTACGCC TGTATTTCCC AGTGTCGATG TGTCATTTGA ATCCATCCTG 2580 
TCTTCCTATG ATGGTGCACC TTTGCTTCCA TTTTCCTCTG CTTCCTTCAG TAGTGAATTG 2640 
TTTCGCCATC TGCATACAGT TTCTCAAATC CTTCCACAAG TTACTTCAGC TACCGAGAGT 2700 
GATAAGGTGC CCTTGCATGC TTCTCTGCCA GTGGCTGGGG GTGATTTGCT ATTAGAGCCC 2760 
40 AGCCTTGCTC AGTATTCTGA TGTGCTGTCC ACTACTCATG CTGCTTCAGA GACGCTGGAA 2820 
TTTGGTAGTG AATCTGGTGT TCTTTATAAA ACGCTTATGT TTTCTCAAGT TGAACCACCC 2880 
AGCAGTGATG CCATGATGCA TGCACGTTCT TCAGGGCCTG AACCTTCTTA TGCCTTGTCT 2940 
GATAATGAGG GCTCCCAACA CATCTTCACT GTTTCTTACA GTTCTGCAAT ACCTGTGCAT 3000 
GATTCTGTGG GTGTAACTTA TCAGGGTTCC TTATTTAGCG GCCCTAGCCA TATACCAATA 3060 
45 CCTAAGTCTT CGTTAATAAC CCCAACTGCA TCATTACTGC AGCCTACTCA TGCCCTCTCT 3120 
GGTGATGGGG AATGGTCTGG AGCCTCTTCT GATAGTGAAT TTCTTTTACC TGACACAGAT 3180 
GGGCTGACAG CCCTTAACAT TTCTTCACCT GTTTCTGTAG CTGAATTTAC ATATACAACA 3240 
TCTGTGTTTG GTGATGATAA TAAGGCGCTT TCTAAAAGTG AAATAATATA TGGAAATGAG 3300 
ACTGAACTGC AAATTCCTTC TTTCAATGAG ATGGTTTACC CTTCTGAAAG CACAGTCATG 3360 
50 CCCAACATGT ATGATAATGT AAATAAGTTG AATGCGTCTT TACAAGAAAC CTCTGTTTCC 3420 
ATTTCTAGCA CCAAGGGCAT GTTTCCAGGG TCCCTTGCTC ATACCACCAC TAAGGTTTTT 3480 
GATCATGAGA TTAGTCAAGT TCCAGAAAAT AACTTTTCAG TTCAACCTAC ACATACTGTC 3540 
TCTCAAGCAT CTGGTGACAC TTCGCTTAAA CCTGTGCTTA GTGCAAACTC AGAGCCAGCA 3600 
TCCTCTGACC CTGCTTCTAG TGAAATGTTA TCTCCTTCAA CTCAGCTCTT ATTTTATGAG 3660 
55 ACCTCAGCTT CTTTTAGTAC TGAAGTATTG CTACAACCTT CCTTTCAGGC TTCTGATGTT 3720 
GACACCTTGC TTAAAACTGT TCTTCCAGCT GTGCCCAGTG ATCCAATATT GGTTGAAACC 3780 
CCCAAAGTTG ATAAAATTAG TTCTACAATG TTGCATCTCA TTGTATCAAA TTCTGCTTCA 3840 
AGTGAAAACA TGCTGCACTC TACATCTGTA CCAGTTTTTG ATGTGTCGCC TACTTCTCAT 3900 
ATGCACTCTG CTTCACTTCA AGGTTTGACC ATTTCCTATG CAAGTGAGAA ATATGAACCA 3960 
60 GTTTTGTTAA AAAGTGAAAG TTCCCACCAA GTGGTACCTT CTTTGTACAG TAATGATGAG 4020 
TTGTTCCAAA CGGCCAATTT GGAGATTAAC CAGGCCCATC CCCCAAAAGG AAGGCATGTA 4080 
TTTGCTACAC CTGTTTTATC AATTGATGAA CCATTAAATA CACTAATAAA TAAGCTTATA 4140 
CATTCCGATG AAATTTTAAC CTCCACCAAA AGTTCTGTTA CTGGTAAGGT ATTTGCTGGT 4200 
ATTCCAACAG TTGCTTCTGA TACATTTGTA TCTACTGATC ATTCTGTTCC TATAGGAAAT 4260 
65 GGGCATGTTG CCATTACAGC TGTTTCTCCC CACAGAGATG GTTCTGTAAC CTCAACAAAG 4320 
TTGCTGTTTC CTTCTAAGGC AACTTCTGAG CTGAGTCATA GTGCCAAATC TGATGCCGGT 4380 
TTAGTGGGTG GTGGTGAAGA TGGTGACACT GATGATGATG GTGATGATGA TGATGATGAC 4440 
AGAGGTAGTG ATGGCTTATC CATTCATAAG TGTATGTCAT GCTCATCCTA TAGAGAATCA 4500 
CAGGAAAAGG TAATGAATGA TTCAGACACC CACGAAAACA GTCTTATGGA TCAGAATAAT 4560 
70 CCAATCTCAT ACTCACTATC TGAGAATTCT GAAGAAGATA ATAGAGTCAC AAGTGTATCC 4620 
TCAGACAGTC AAACTGGTAT GGACAGAAGT CCTGGTAAAT CACCATCAGC AAATGGGCTA 4680 
TCCCAAAAGC ACAATGATGG AAAAGAGGAA AATGACATTC AGACTGGTAG TGCTCTGCTT 4740 
CCTCTCAGCC CTGAATCTAA AGCATGGGCA GTTCTGACAA GTGATGAAGA AAGTGGATCA 4800 
GGGCAAGGTA CCTCAGATAG CCTTAATGAG AATGAGACTT CCACAGATTT CAGTTTTGCA 4860 
75 GACACTAATG AAAAAGATGC TGATGGGATC CTGGCAGCAG GTGACTCAGA AATAACTCCT 4920 
GGATTCCCAC AGTCCCCAAC ATCATCTGTT ACTAGCGAGA ACTCAGAAGT GTTCCACGTT 4980 
TCAGAGGCAG AGGCCAGTAA TAGTAGCCAX GAGTCTCGTA TTGGTCTAGC TGAGG GGTTQ 5040 
GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 5100 
CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 5160 
80 TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 5220 
ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 5280 
CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGACACTGA AAGAGTTTTA CCAGGAAGTG S340 
CAGAGCTGTA CTGTTGACTT AGGTATTACA GCAGACAGCT CCAACCACCC AGACAACAAG 5400 
CACAAGAATC GATACATAAA TATCGTTGCC TATGATCATA GCAGGGTTAA GCTAGCACAG 5460 
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CTTGCTGAAA AGGATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC 5520 
AACAGACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA AATCCACAGC TGAAGATTTC 5580 
TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTGGAG 5640 
AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCC3ATG GGAGTGAGGA GTACGGGAAC 5700 
TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 5760 
CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA GACCCAGTGG ACGTGTGGTC 5820 
ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 5880 
CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 5940 
CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG TGCTAGACAG TATGTTGCAG 6000 
CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT TAAAACACAT COGT TCACA A 6060 
AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 6120 
GCCATACTTA GTAAAGAAAC TGAGGTGCTG GACAGTCATA TTCATGCCTA TGTTAATGCA 6180 
CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA AACAATTCCA GCTCCTGAGC 6240 
CAGTCAAATA TACAGCAGAG TGACTATTCT GCAGCCCTAA AGCAATGCAA CAGGGAAAAG 6300 
AATCGAACTT CTTCTATCAT CCCTGTGGAA AGATCAAGGG TTGGCATTTC ATCCCTGAGT 6360 
GGAGAAGGCA CAGACTACAT CAATGCCTCC TATATCATGG GCTATTACCA GAGCAATGAA 6420 
TTCATCATTA CCCAGCACCC TCTCCTTCAT ACCATCAAGG ATTTCTGGAG GATGATATGG 6480 
GACCATAATG CCCAACTGGT GGTTATGATT CCTGATGGCC AAAACATGGC AGAAGATGAA 6540 
TTTGTTTACT GGCCAAATAA AGATGAGCCT ATAAATTGTG AGAGCTTTAA GGTCACTCTT 660O 
ATGGCTGAAG AACACAAATG TCTATCTAAT GAGGAAAAAC T TATAA TTCA GGACTTTATC 6660 
TTAGAAGCTA CACAGGATGA TTATGTACTT GAAGTGAGGC ACTTTCAGTG TCCTAAATGG 6720 
CCAAATCCAG AT AG CCCCAT TAGTAAAACT TTTGAACTTA TAAGTGTTAT AAAAGAAGAA 6780 
GCTGCCAATA GGGATGGGCC TATGATTGTT CATGATGAGC ATGGAGGAGT GACGGCAGGA 6840 
ACTTTCTGTG CTCTGACAAC CCTTATGCAC CAACTAGAAA AAGAAAATTC CGTGGATGTT 6900 
TACCAGGTAG CCAAGATGAT CAATCTGATG AGGCCAGGAG TCTTTGCTGA CATTGAGCAG 6960 
TATCAGTTTC TCTACAAAGT GATCCTCAGC CTTGTGAGCA CAAGGCAGGA AGAGAATCCA 7020 
TCCACCTCTC TGGACAGTAA TGGTGCAGCA TTGCCTGATG GAAATATAGC TGAGAGCTTA 70 BO 
GAGTCTTTAG TTTAACACAG AAAGGGGTGG GGGGACTCAC ATCTGAGCAT TGTTTTCCTC 7140 
TTCCTAAAAT TAGGCAGGAA AATCAGTCTA GTTCTGTTAT CTGTTGATTT CCCATCACCT 72 00 
GACAGTAACT TTCATGACAT AGGATTCTGC CGCCAAATTT ATATCATTAA CAATGTGTGC 7260 
CTTTTTGCAA GACTTGTAAT TTACTTATTA TGTTTGAACT A AAATG ATTG AATTTTACAG 7320 
TATTTCTAAG AATGGAATTG TGGTATTTTT TTCTGTATTG ATTTTAACAG AAAATTTCAA 7380 
TTTATAGAGG TTAGGAATTC CAAACTACAG AAAATGTTTG TTTTTAGTGT CAAATTTTTA 7440 
GCTGTATTTG TAGCAATTAT CAGGTTTGCT AGAAATATAA CTTTTAATAC AGTAGCCTGT 7500 
AAATAAAACA CTCTTCCATA TGATATTCAA CATTTTACAA CTGCAGTATT CACCTAAAGT 7560 
AGAAATAATC TGTTACTTAT TGTAAATACT GCCCTAGTGT CTCCATGGAC CAAATTTATA 7620 
TTTATAATTG TAGATTTTTA TATTTTACTA CTGAGTCAAG TTTT CTAGTT CTGTGTAATT 7680 
GTTTAGTTTA ATGACGTAGT TCATTAGCTG GTCTTACTCT ACCAG TTTTC TGACATTGTA 7740 
TTGTGTTACC TAAGTCATTA ACTTTGTTTC AGCATGTAAT TTTAACTTTT GTGGAAAATA 7800 
GAAATACCTT CATTTTGAAA GAAGTTTTTA TGAGAATAAC ACCTTACCAA ACATTGTTCA 7860 
AATGGTTTTT ATCCAAGGAA TTGCAAAAAT AAATATAAAT ATTGCCATTA AAAAAAAAAA 7920 
AAAAAAAAAA AAAAAAAAAA AAAA 

Seq ID NO: 372 Protein sequence; 

Protein Accession #: built from XP_031379 

1 11 21 31 41 51 

I I I I I I 

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDL TQVNVNIiKKL KFQGWDKTSL ENTFIHNTGK TVBINLTUDY RVSGGVSEMV 120 

FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180 

ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240 

TDTVDWIVFK DTVSISBSQb AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300 

TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRWYD TMIEKFAVLY QQLDGEDQTK 360 

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420 

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 

RSPTRGSBFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540 

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHY ST FAY FP 720 

TEVTPHAFTP SSRQQDLVST VNWYSQTTQ PVYNGETPLQ PSYSSEVFPL VTPLLLDNQI 780 

LNTTPAASSS DSALHATPVF PSVDVSFESI LSSYDGAPLL PFSSASFSSE LFRHLHTVSQ 840 

ILPQVTSATE SDKVPLHASL PVAGGDLLLE PSLAQYSDVL STTHAASETL EFGSESGVLY 900 

KTLMFSQVEP PSSDAMMHAR SSGPEPSYAL SDNEGSQHIF TVSYSSAIPV HDSVGVTYQG 960 

SLFSGPSHIP IPKSSLITPT ASLLQPTHAL SGDGEWSGAS SDSEFLLPDT DGLTALNISS 1020 

PVSVAEFTYT TSVFGDDNKA LSKSEIIYGN ETBLQIPSFN EKVYPSESTV MPNMYDNVNK 1080 

LNASLQETSV SISSTKGMFP GSLAHTTTKV FDHEISQVPE NNFSVQPTHT VSQASGDTSL 1140 

KPVLSANSEP ASSDPASSEM LSPSTQLLFY ETSASFSTEV LLQPSFQASD VDTLLKTVLP 1200 

AVPSDPILVE TPKVDKISST MLHLIVSNSA SSENMLHSTS VPVFDVSPTS HMHSASLQGL 1260 

TISYASEKYE PVLLKSESSH QWPSLYSND BLFQTANLEI NQAHPPKGRH VFATPVLSID 1320 

EPLNTLINKL IHSDEILTST KSSVTGKVFA GIPTVASDTF VSTDHSVPIG NGHVAITAVS 1380 

PHRDGSVTST KLLFP SKATS ELSHSAKSDA GLVGGGEDGD TDDDGDDDDD DRGSDGLSIH 1440 

KCMSCSSYRE SQEKVMNDSD THENSLMDQN NPISYSLSEN SEEDNRVTSV SSDSQTGMDR 1500 

SPGKSPSANG LSQKHNDGKE ENDIQTGSAL LPLSPESKAW AVLTSDEESG SGQGTSDSI^ IS 60 

ENETSTDFSF ADTNEKDADG ILAAGDSEIT PGFPQSPTSS VTSENSBVFH VSEAEASNSS 1620 

HESRIGLAEG LESEKKAVIP LVIVSALTFI CLWLVGILI YWRKCFQTAH FYLEDSTSPR 1680 

VISTPPTPIF PISDDVGAIP IKHFPKHVAD LHASSGFTEE FETLKEFYQE VQSCTVDLGI 1740 

TADSSNHPDN KHKNRYINIV AYDHSRVKLA QLAEKDGKLT DYINANYVDG YNRPKAYIAA 1800 

QGPLKSTAED FWRMIWEHNV EVIVMITNLV EKGRRKCDQY WPABGSEEYG NFLVTQKSVQ 1860 

VLAYYTVRNF TLRNTKIKKG SQKGRPSGRV VTQYHYTQWP DMGVPEYSLP VLTFVRKAAY 1920 

AKRHAVGPW VHCSAGVGRT GTYIVLDSML QQIQHEGTVN IFGFLKHIRS QRNYLVQTEE 1980 

QYVFIHDTLV EAILSKETEV LDSHIHAYVN ALLIPGPAGK TKLEKQFQLL, SQSNIQQSDY 2040 

SAALKQCNRE KNRTSSIIPV ERSRVGISSL SGEGTDYINA SYIMGYYQSN EFIITQHPLL 2100 

HTIKDFWRMI WDHNAQLWM IPDGQNMAED EFVYWPNKDE PINCESFKVT LMAEEHKCLS 2160 
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NEEKJbl IQDF I LEATQDDYV LEVRHFQCPK WPNPDSPISK TFELISVIKE EAANRDGPMI 2220 
VHDEHGGVTA GTFCALTTLM HQliEKENSVD VYQVAKMINL MRPGVFADIE QYQFLYKVIL 2280 
SliVSTRQEEN PSTSLDSNGA ALPDGNIAES LESLV 

Seq ID NO: 373 DMA sequence 

Nucleic Acid Accession ft: built £rom NM_0O28Sl 
Coding sequence: 148-4518 

1 ll 21 31 41 51 

! ] I I I I 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA SO 
CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 
CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 
CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 
CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 
AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 
CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 
AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 
GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 
AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 
GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 
GGAAAAGGGA AGTTAAGAGC TTT AT CCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 
GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 
TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840' 
AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 
ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 
TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 
TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 
AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 
TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 
CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 
GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 
TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 
AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 
GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 
AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 
ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 
AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 
ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 
GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 
AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 
AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 
GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 
GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 
GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 
GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 
AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 
TCCTTTTCTQ CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 
CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 
TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACOCAACOG 2400 
GTATACAATG CAGAGGCCAG TAATAGTAGC CATGAGTCTC GTATTGGTCT AG CTGAGG GQ 2460 
TTGGAATCCG AGAAGAAGGC AGTTATACCC CTTGTGATCG TGTCAGCCCT GACTTTTATC 2520 
TGTCTAGTGG TTCTTGTGGG TATTCTCATC TACTGGAGGA AATGCTTCCA GACTGCACAC 2580 
TTTTACTTAG AGGACAGTAC ATCCCCTAGA GTTATATCCA CACCTCCAAC ACCTATCTTT 2640 
CCAATTTCAG ATGATGTCGG AGCAATTCCA ATAAAGCACT TTCCAAAGCA TGTTGCAGAT 2700 
TTACATGCAA GTAGTGGGTT TACTGAAGAA TTTGAGACAC TGAAAGAGTT TTACCAGGAA 2760 
GTGCAGAGCT GTACTGTTGA CTTAGGTATT ACAGCAGACA GCTCCAACCA CCCAGACAAC 2820 
AAGCACAAGA ATCGATACAT AAATATCGTT GCCTATGATC ATAGCAGGGT TAAGCTAGCA 2880 
CAGCTTGCTG AAAAGGATGG CAAACTGACT GATTATATCA ATGCCAATTA TGTTGATGGC 2940 
TACAACAGAC CAAAAGCTTA TATTGCTGCC CAAGGCCCAC TGAAATCCAC AGCTGAAGAT 3000 
TTCTGGAGAA TGATATGGGA ACATAATGTG GAAGTTATTG TCATGATAAC AAACCTCGTG 3060 
GAGAAAGGAA GGAGAAAATG TGATCAGTAC TGGCCTGCCG ATGGGAGTGA GGAGTACGGG 3120 
AACTTTCTGG TCACTCAGAA GAGTGTGCAA GTGCTTGCCT ATTATACTGT GAGGAATTTT 3180 
ACTCTAAGAA ACACAAAAAT AAAAAAGGGC TCCCAGAAAG GAAGACCCAG TGGACGTGTG 3240 
GTCACACAGT ATCACTACAC GCAGTGGCCT GACATGGGAG TACCAGAGTA CTCCCTGCCA 3300 
GTGCTGACCT TTGTGAGAAA GGCAGCCTAT GCCAAGCGCC ATGCAGTGGG GCCTGTTGTC 3360 
GTCCACTGCA GTGCTGGAGT TGGAAGAACA GGCACATATA TTGTGCTAGA CAGTATGTTG 3420 
CAGCAGATTC AACACGAAGG AACTGTCAAC ATATTTGGCT TCTTAAAACA CATCCGTTCA 3480 
CAAAGAAATT ATTTGGTACA AACTGAGGAG CAATATGTCT TCATTCATGA TACACTGGTT 3540 
GAGGCCATAC TTAGTAAAGA AACTGAGGTG CTGGACAGTC ATATTCATGC CTATGTTAAT 3600 
GCACTCCTCA TTCCTGGACC AGCAGGCAAA ACAAAGCTAG AGAAACAATT CCAGCTCCTO 3660 
AGCCAGTCAA ATATACAGCA GAGTGACTAT TCTGCAGCCC TAAAGCAATG CAACAGGGAA 3720 
AAGAATCGAA CTTCTTCTAT CATCCCTGTG GAAAGATCAA GGGTTGGCAT TTCATCCCTG 3780 
AGTGGAGAAG GCACAGACTA CATCAATGCC TCCTATATCA TGGGCTATTA CCAGAGCAAT 3840 
GAATTCATCA TTACCCAGCA CCCTCTCCTT CATACCATCA AGGATTTCTG GAGGATGATA 3900 
TGGGACCATA ATGCCCAACT GGTGGTTATG ATTCCTGATG GCCAAAACAT GGCAGAAGAT 3960 
GAATTTGTTT ACTGGCCAAA TAAAGATGAG CCTATAAATT GTGAGAGCTT TAAGGTCACT 4020 
CTTATGGCTG AAGAACACAA ATGTCTATCT AATGAGGAAA AACTTATAAT TCAGGACTTT 4080 
ATCTTAGAAG CTACACAGGA TGATTATGTA CTTGAAGTGA GGCACTTTCA GTGTCCTAAA 4140 
TGGCCAAATC CAGATAGCCC CATTAGTAAA ACTTTTGAAC TTATAAGTGT TATAAAAGAA 4200 
GAAGCTGCCA ATAGGGATGG GCCTATGATT GTTCATGATG AGCATGGAGG AGTGACGGCA 4260 
GGAACTTTCT GTGCTCTGAC AACCCTTATG CACCAACTAG AAAAAGAAAA TTCCGTGGAT 4320 
GTTTACCAGG TAGCCAAGAT GATCAATCTG ATGAGGCCAG GAGTCTTTGC TGACATTGAG 4380 
CAGTATCAGT TTCTCTACAA AGTGATCCTC AGCCTTGTGA GCACAAGGCA GGAAGAGAAT 4440 
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CCATCCACCT CTCTGGACAG TAATGGTGCA GCATTGCCTG ATGGAAATAT AGCTGAGAGC 4500 

TTAGAGTCTT TAGTTTAACA CAGAAAGGGG TGGGGGGACT CACATCTGAG CATTGTTTTC 4560 

CTCTTCCTAA AATTAGGCAG GAAAATCAGT CTAGTTCTGT TATCTGTTGA TTTCCCATCA 4620 

CCTGACAGTA ACTTTCATGA CATAGGATTC TGCCGCCAAA TTTATATCAT TAACAATGTG 4680 

TGCCTTTTTG CAAGACTTGT AATTTACTTA TTATGTTTGA ACT AAAAT GA TTGAATTTTA 4740 

CAGTATTTCT AAGAATGGAA TTGTGGTATT TTTTTCTGTA TTGATTTTAA CAGAAAATTT 4800 

CAATTTATAG AGGTTAGGAA TTCCAAACTA CAGAAAATGT TTGTTTTTAG TGTCAAATTT 4860 

TTAGCTGTAT TTGTAGCAAT TATCAGGTTT GCTAGAAATA TAACTTTTAA TACAGTAGCC 4920 

TGTAAATAAA ACACTCTTCC ATATGATATT CAACATTTTA CAACTGCAGT ATTCACCTAA 4980 

AGTAGAAATA ATCTGTTACT TATTGTAAAT ACTGCCCTAG TGTCTCCATG GACCAAATTT 5040 

ATATTTATAA TTGTAGATTT TTATATTTTA CTACTGAGTC AAGTTTTCTA GTTCTGTGTA 5100 

ATTGTTTAGT TTAATGACGT AGTTCATTAG CTGGTCTTAC TCTACCAGTT TTCTGACATT 5160 

GTATTGTGTT ACCTAAGTCA TTAACTTTGT TTCAGCATGT AATTTTAACT TTTGTGGAAA 5220 

ATAGAAATAC CTTCATTTTG AAAGAAGTTT TTATGAGAAT AACACCTTAC CAAACATTGT 5280 

TCAAATGGTT TTTATCCAAG GAATTGCAAA AATAAATATA AATATTGCCA TTAAAAAAAA 5340 
AAAAAAAAAA AAAAAAAAAA AAAAAAA 



Seq ID NO: 374 Protein sequence i 

Protein Accession #: built from XP_031379 

1 11 21 31 41 51 

| | I I I I 

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVBEIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120 

FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180 

ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240 

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300 

TGKEEIHEAV CSSEPENVQA DPENYTSLLV THERPRWYD TMIEKFAVLY QQLDGEDQTK 360 

HSFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420 

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540 

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720 

TEVTPHAFTP SSRQQDLVST VNWYSQTTQ PVYNAEASNS SHESRIGLAE GLESEKKAVI 780 

PLVIVSALTF ICLWIjVGIL IYWRKCFQTA HFYLEDSTSP RVISTPPTPI FPISDDVGAI 840 

PIKHFPKHVA DLHASSGFTE EFETLKEFYQ EVQSCTVDLG ITADSSNHPD NKHKNRYINI 900 

VAYDHSRVKL AQLAEKDGKL TDYINANYVD GYNRPKAYIA AQGPLKSTAE DFWRMIWEHN 960 

VEVIVMITNL VEKGRRKCDQ YWPADGSEEY GNFLVTQKSV QVLAYYTVRN FTLRNTKIKK 1020 

GSQKGRPSGR WTQYHYTQW PDMGVPEYSL PVLTFVRKAA YAKRHAVGPV WHCSAGVGR 1080 

TGTYIVLDSM LQQIQHEGTV NIFGFLKHIR SQRNYLVQTE EQYVFIHDTL VEAILSKETE 1140 

VLDSHIHAYV NALLIPGPAG KTKLEKQFQL LSQSNIQQSD YSAALKQCNR EKNRTSSIIP 1200 

VERSRVGISS LSGEGTDYIN ASYIMGYYQS NEFIITQHPL LHTIKDFWRM IWDHNAQLW 1260 

MIPDGQNMAE DEFVYWPNKD EPINCESFKV TLMAEEHKCIi SNEEKLIIQD FILEATQDDY 1320 

VLEVRHFQCP KWPNPDSPIS KTFELISVIK EEAANRDGPM IVHDEHGGVT AGTFCALTTL 1380 

MHQLEKENSV DVYQVAKMIN LMRPGVFADI EQYQFLYKVI LSLVSTRQEE NPSTSLDSNG 1440 
AALPDGNIAE SLESLV 

Seq ID NO: 375 DNA sequence 

Nucleic Acid Accession #: EOS sequence 

Coding sequence: 148-4494 

1 11 21 31 41 51 

I I I I I I 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCAGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 
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GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 
AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 
AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG C TCCA GTCCC 1920 
GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 
5 GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 
GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 
GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 
AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 
TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 
10 CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 
TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACOG 2400 
GTATACAATG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 2460 
GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 2520 
CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 2580 
15 TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 2640 
ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 2700 
CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGGAAGTGC AGAGCTGTAC TGTTGACTTA 2760 
GGTATTACAG CAGACAGCTC CAACCACCCA GACAACAAGC ACAAGAATCG ATACATAAAT 2820 
ATCGTTGCCT ATGATCATAG CAGGGTTAAG CTAGCACAGC TTGCTGAAAA GGATGGCAAA 2880 
20 CTGACTGATT ATATCAATGC CAATTATGTT GATGGCTACA ACAGACCAAA AGCTTATATT 2940 
GCTGCCCAAG GCCCACTGAA ATCCACAGCT GAAGATTTCT GGAGAATGAT ATGGGAACAT 3000 
AATGTGGAAG TTATTGTCAT GATAACAAAC CTCGTGGAGA AAGGAAGGAG AAAATGTGAT 3060 
CAGTACTGGC CTGCCGATGG GAGTGAGGAG TACGGGAACT TTCTGGTCAC TCAGAAGAGT 3120 
GTGCAAGTGC TTGCCTATTA TACTGTGAGG AATTTTACTC TAAGAAACAC AAAAATAAAA 3180 
25 AAGGGCTCCC AGAAAGGAAG ACCCAGTGGA CGTGTGGTCA CACAGTATCA CTACACGCAG 3240 
TGGCCTGACA TGGGAGTACC AGAGTACTCC CTGCCAGTGC TGACCTTTGT GAGAAAGGCA 3300 
GCCTATGCCA AGCGCCATGC AGTGGGGCCT GTTGTCGTCC ACTGCAGTGC TGGAGTTGGA 3360 
AGAACAGGCA CATATATTGT GCTAGACAGT ATGTTGCAGC AGATTCAACA CGAAGGAACT 3420 
GTCAACATAT TTGGCTTCTT AAAACACATC CGTTCACAAA GAAATTATTT GGTACAAACT 3480 
30 GAGGAGCAAT ATGTCTTCAT TCATGATACA CTGGTTGAGG CCATACTTAG TAAAGAAACT 3540 
GAGGTGCTGG ACAGTCATAT TCATGCCTAT GTTAATGCAC TCCTCATTCC TGGACCAGCA 3600 
GGCAAAACAA AGCTAGAGAA ACAATTCCAG CTCCTGAGCC AGTCAAATAT ACAGCAGAGT 3660 
GACTATTCTG CAGCCCTAAA GCAATGCAAC AGGGAAAAGA ATCGAACTTC TTCTATCATC 3720 
CCTGTGGAAA GATCAAGGGT TGGCATTTCA TCCCTGAGTG GAGAAGGCAC AGACTACATC 3780 
35 AATGCCTCCT ATATCATGGG CTATTACCAG AGCAATGAAT TCATCATTAC CCAGCACCCT 3840 
CTCCTTCATA CCATCAAGGA TTTCTGGAGG ATGATATGGG ACCATAATGC CCAACTGGTG 3900 
GTTATGATTC CTGATGGCCA AAACATGGCA GAAGATGAAT TTGTTTACTG GCCAAATAAA 3960 
GATGAGCCTA TAAATTGTGA GAGCTTTAAG GTCACTCTTA TGGCTGAAGA ACACAAATGT 4020 
CTATCTAATG AGGAAAAACT TATAATTCAG GACTTTATCT TAGAAGCTAC ACAGGATGAT 4080 
40 TATGTACTTG AAGTGAGGCA CTTTCAGTGT CCTAAATGGC CAAATCCAGA TAGCCCCATT 4140 
AGTAAAACTT TTGAACTTAT AAGTGTTATA AAAGAAGAAG CTGCCAATAG GGATGGGCCT 4200 
ATGATTGTTC ATGATGAGCA TGGAGGAGTG ACGGCAGGAA CTTTCTGTGC TCTGACAACC 4260 
CTTATGCACC AACTAGAAAA AGAAAATTCC GTGGATGTTT ACCAGGTAGC CAAGATGATC 4320 
AATCTGATGA GGCCAGGAGT CTTTGCTGAC ATTGAGCAGT ATCAGTTTCT CTACAAAGTG 4380 
45 ATCCTCAGCC TTGTGAGCAC AAGGCAGGAA GAGAATCCAT CCACCTCTCT GGACAGTAAT 4440 
GGTGCAGCAT TGCCTGATGG AAATATAGCT GAGAGCTTAG AGTCTTTAGT TTAACACAGA 4500 
AAGGGGTGGG GGGACTCACA TCTGAGCATT GTTTTCCTCT TOCTAAAATT AGGCAGGAAA 4560 
ATCAGTCTAG TTCTGTTATC TGTTGATTTC CCATCACCTG ACAGTAACTT TCATGACATA 4620 
GGATTCTGCC GCCAAATTTA TATCATTAAC AATGTGTGCC TTTTTGCAAG ACTTGTAATT 4680 
50 TACTTATTAT GTTTGAACTA AAATGATTGA ATTTTACAGT ATTTCTAAGA ATGGAATTGT 4740 
GGTATTTTTT TCTGTATTGA TTTTAACAGA AAATTTCAAT TTATAGAGGT TAGGAATTCC 4800 
AAACTACAGA AAATGTTTGT TTTTAGTGTC AAATTTTTAG CTGTATTTGT AGCAATTATC 4860 
AGGTTTGCTA GAAATATAAC TTTTAATACA GTAGCCTGTA AATAAAACAC TCTTCCATAT 4920 
GATATTCAAC ATTTTACAAC TGCAGTATTC ACCTAAAGTA GAAATAATCT GTT ACTTA TT 4980 
55 GTAAATACTG CCCTAGTGTC TCCATGGACC AAATTTATAT TTATAATTGT AGATTTTTAT 5040 
ATTTTACTAC TGAGTCAAGT TTTCTAGTTC TGTGTAATTG TTTAGTTTAA TGACGTAGTT 5100 
CATTAGCTGG TCTTACTCTA CCAGTTTTCT GACATTGTAT TGTGTTACCT AAGTCATTAA 5160 
CTTTGTTTCA GCATGTAATT TTAACTTTTG TGGAAAATAG AAATACCTTC ATTTTGAAAG 5220 
AAGTTTTTAT GAGAATAACA CCTTACCAAA CATTGTTCAA ATGGTTTTTA TCCAAGGAAT 5280 
60 TGCAAAAATA AATATAAATA TTGCCATTAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 5340 
AAA 

Seq ID NO: 376 Protein sequence : 
Protein Accession #t EOS sequence 

65 

1 11 21 31 41 51 

I I I I I I 

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 
QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120 

70 FKASKITFHW GKQJMSSDGS EHSLEGQKFP LEKQIYCFDA DRFSSFEEAV KGKGKLRAIiS 180 
ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPHSTDKYYI YNGSLTSPPC 240 
TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300 
TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRWYD TMIEKFAVLY QQLDGEDQTK 360 
HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420 

75 LIGTEEIIKE EEEGKDIEEG ATVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 
RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540 
GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 
ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSB ESLKDPSMEG NVWFPSSTDI 660 

0 _ TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720 

80 TEVTPHAFTP SSRQQDLVST VNWYSQTTQ PVYNEASKSS KESRI GLAEG IiESEKKAVIP 780 
liVIVSALTFI CLWLVGILI YWRKCFQTAH FYLEDSTSPR VISTPPTPIF PISDDVGAIP 840 
IKHFPKHVAD LHASSGFTEE FEEVQSCTVD LGITADSSNH PDNKHKNRYI NIVAYDHSRV 900 
KLAQLAEKDG KLTDYINANY VDGYNRPKAY IAAQGPLKST AEDFWRMIWE HNVEVIVMIT 960 
NLVEKGRRKC DQYWPADGSE EYGNFLVTQK SVQVLAYYTV RNFTLRNTKI KKGSQKGRPS 1020 
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GRWTQYHYT QWPDMGVPEY SLPVLTFVRK AAYAKRHAVG PWVHCSAGV GRTGTYIVLD 1080 

SMLQQIQREG TVNIFGPLKH IRSQRNYLVQ TEEQYVPIHD TLVBAILSKE TEVLDSHIHA 1140 

YVNALLIPGP AGKTKLEKQP QLLSQSNIQQ SDYSAALKQC NREKNRTSSI IPVERSRVGI 1200 

SSLSGEGTDY INASYIMGYY QSNEPIITQH PLLHTIKDFW RMIWDHNAQL WMIPDGQNM 1260 

AEDEFVYWPN KDEPINCESP KVTLMAEEHK CLSNEEKLII QDFILEATQD DYVLEVRHFQ 1320 

CPKWPNPDSP ISKTPELISV IKEEAANRDG PMIVHDEHGG VTAGTFCALT TLMHQLEKEN 1380 

SVDVYQVAKM INLMRPGVFA DIEQYQFLYK VILSLVSTRQ EENPSTSLDS NGAALPDGNI 1440 
AESLESLV 



Seq ID NO: 377 DMA sequence 

Nucleic Acid Accession 8* BOS sequence 

Coding sequence: 501-4514 

1 11 21 31 41 51 

| | I I I 1 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 
CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 
CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 
CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 
CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAT TGGGGAAAGA 300 
AATATCCAAC ATGTAATAGC CCAAAACAAT CTCCTATCAA TATTGATGAA GATCTTACAC 360 
AAGTAAATGT GAATCTTAAG AAACTTAAAT TTCAGGGTTG GGATAAAACA TCATTGGAAA 420 
ACACATTCAT TCATAACACT GGGAAAACAG TGGAAATTAA TCTCACTAAT GACTACCGTG 480 
TCAGCGGAGG AGTTTCAGAA ATGGTGTTTA AAGCAAGCAA GATAACTTTT CACTGGGGAA 540 
AATGCAATAT GTCATCTGAT GGATCAGAGC ATAGTTTAGA AGGACAAAAA TTTCCACTTG 600 
AGATGCAAAT CTACTGCTTT GATGCGGACC GATTTTCAAG TTTTGAGGAA GCAGTCAAAG 660 
GAAAAGGGAA GTTAAGAGCT TTATCCATTT TGTTTGAGGT TGGGACAGAA GAAAATTTGG 720 
ATTTCAAAGC GATTATTGAT GGAGTCGAAA GTGTTAGTCG TTTTGGGAAG CAGGCTGCTT 780 
TAGATCCATT CATACTGTTG AACCTTCTGC CAAACTCAAC TGACAAGTAT TACATTTACA 840 
ATGGCTCATT GACATCTCCT CCCTGCACAG ACACAGTTGA CTGGATTGTT TTTAAAGATA 900 
CAGTTAGCAT CTCTGAAAGC CAGTTGGCTG TTTTTTGTGA AGTTCTTACA ATGCAACAAT 960 
CTGGTTATGT CATGCTGATG GACTACTTAC AAAACAATTT TCGAGAGCAA CAGTACAAGT 1020 
TCTCTAGACA GGTGTTTTCC TCATACACTG GAAAGGAAGA GATTCATGAA GCAGTTTGTA 1080 
GTTCAGAACC AGAAAATGTT CAGGCTGACC CAGAGAATTA TACCAGCCTT CTTGTTACAT 1140 
GGGAAAGACC TCGAGTCGTT TATGATACCA TGATTGAGAA GTTTGCAGTT TTGTACCAGC 1200 
AGTTGGATGG AGAGGACCAA ACCAAGCATG AATTTTTGAC AGATGGCTAT CAAGACTTGG 1260 
GTGCTATTCT CAATAATTTG CTACCCAATA TGAGTTATGT TCTTCAGATA GTAGCCATAT 1320 
GCACTAATGG CTTATATGGA AAATACAGCG ACCAACTGAT TGTCGACATG CCTACTGATA 1380 
ATCCTGAACT TGATCTTTTC CCTGAATTAA TTGGAACTGA AGAAATAATC AAGGAGGAGG 1440 
AAGAGGGAAA AGACATTGAA GAAGGCGCTA TTGTGAATCC TGGTAGAGAC AGTGCTACAA 1500 
ACCAAATCAG GAAAAAGGAA CCCCAGATTT CTACCACAAC ACACTACAAT CGCATAGGGA 1560 
CGAAATACAA TGAAGCCAAG ACTAACCGAT CCCCAACAAG AGGAAGTGAA TTCTCTGGAA 1620 
AGGGTGATGT TCCCAATACA TCTTTAAATT CCACTTCCCA ACCAGTCACT AAATTAGCCA 1680 
CAGAAAAAGA TATTTCCTTG ACTTCTCAGA CTGTGACTGA ACTGCCACCT CACACTGTGG 1740 
AAGGTACTTC AGCCTCTTTA AATGATGGCT CTAAAACTGT TCTTAGATCT CCACATATGA 1800 
ACTTGTCGGG GACTGCAGAA TCCTTAAATA CAGTTTCTAT AACAGAATAT GAGGAGGAGA 1860 
GTTTATTGAC CAGTTTCAAG CTTGATACTG GAGCTGAAGA TTCTTCAGGC TCCAGTCCCG 1920 
CAACTTCTGC TATCCCATTC ATCTCTGAGA ACATATCCCA AGGGTATATA TTTTCCTCCG 1980 
AAAACCCAGA GACAATAACA TATGATGTCC TTATACCAGA ATCTGCTAGA AATGCTTCCG 2040 
AAGATTCAAC TTCATCAGGT TCAGAAGAAT CACTAAAGGA TCCTTCTATG GAGGGAAATG 2100 
TGTGGTTTCC TAGCTCTACA GACATAACAG CACAGCCCGA TGTTGGATCA GGCAGAGAGA 2160 
GCTTTCTCCA GACTAATTAC ACTGAGATAC GTGTTGATGA ATCTGAGAAG ACAACCAAGT 2220 
CCTTTTCTGC AGGCCCAGTG ATGTCACAGG GTCCCTCAGT TACAG ATCTG GAAATGCCAC 2280 
ATTATTCTAC CTTTGCCTAC TTCCCAACTG AGGTAACACC TCATGCTTTT ACCCCATCCT 2340 
CCAGACAACA GGATTTGGTC TCCACGGTCA ACGTGGTATA CTCGCAGACA ACCCAACCGG 2400 
TATACAATGA GGCCAGTAAT AGTAGCCATG AGTCTCGTAT TGGTCTAGCT GAGGGGTTGG 2460 
AATCCGAGAA GAAGGCAGTT ATACCCCTTG TGATCGTGTC AGCCCTGACT TTTAT CTGTC 2520 
TAGTGGTTCT TGTGGGTATT CTCATCTACT GGAGGAAATG CTTCCAGACT GCACACTTTT 2580 
ACTTAGAGGA CAGTACATCC CCTAGAGTTA TATCCACACC TCCAACACCT ATCTTTCCAA 2640 
TTTCAGATGA TGTCGGAGCA ATTCCAATAA AGCACTTTCC AAAGCATGTT GCAGATTTAC 2700 
ATGCAAGTAG TGGGTTTACT GAAGAATTTG AGACACTGAA AGAGTTTTAC CAGGAAGTGC 2760 
AGAGCTGTAC TGTTGACTTA GGTATTACAG CAGACAGCTC CAACCACCCA GACAACAAGC 2820 
ACAAGAATCG ATACATAAAT ATCGTTGCCT ATGATCATAG CAGGGTTAAG CTAGCACAGC 2880 
TTGCTGAAAA GGATGGCAAA CTGACTGATT ATATCAATGC CAATTATGTT GATGGCTACA 2940 
ACAGACCAAA AGCTTATATT GCTGCCCAAG GCCCACTGAA ATCCACAGCT GAAGATTTCT 3000 
GGAGAATGAT ATGGGAACAT AATGTGGAAG TTATTGTCAT GATAACAAAC CTCGTGGAGA 3060 
AAGGAAGGAG AAAATGTGAT CAGTACTGGC CTGCCGATGG GAGTGAGGAG TACGGGAACT 3120 
TTCTGGTCAC TCAGAAGAGT GTGCAAGTGC TTGCCTATTA TACTGTGAGG AATTTTACTC 3180 
TAAGAAACAC AAAAATAAAA AAGGGCTCCC AGAAAGGAAG ACCCAGTGGA CGTGTGGTCA 3240 
CACAGTATCA CTACACGCAG TGGCCTGACA TGGGAGTACC AGAGTACTCC CTGCCAGTGC 3300 
TGACCTTTGT GAGAAAGGCA GCCTATGCCA AGCGCCATGC AGTGGGGCCT GTTGTCGTCC 3360 
ACTGCAGTGC TGGAGTTGGA AGAACAGGCA CATATATTGT GCTAGACAGT ATGTTGCAGC 3420 
AGATTCAACA CGAAGGAACT GTCAACATAT TTGGCTTCTT AAAACACATC CGTTCACAAA 3480 
GAAATTATTT GGTACAAACT GAGGAGCAAT ATGTCTTCAT TCATGATACA CTGGTTGAGG 3540 
CCATACTTAG TAAAGAAACT GAGGTGCTGG ACAGTCATAT TCATGCCTAT GTTAATGCAC 3600 
TCCTCATTCC TGGACCAGCA GGCAAAACAA AGCTAGAGAA ACAATTCCAG CTCCTGAGCC 3660 
AGTCAAATAT ACAGCAGAGT GACTATTCTG CAGCCCTAAA GCAATGCAAC AGGGAAAAGA 3720 
ATCGAACTTC TTCTATCATC CCTGTGGAAA GATCAAGGGT TGGCATTTCA TCCCTGAGTG 3780 
GAGAAGGCAC AGACTACATC AATGCCTCCT ATATCATGGG CTATTACCAG AGCAATGAAT 3840 
TCATCATTAC CCAGCACCCT CTCCTTCATA CCATCAAGGA TTTCTGGAGG ATGATATGGG 3900 
ACCATAATGC CCAACTGGTG GTTATGATTC CTGATGGCCA AAACATGGCA GAAGATGAAT 3960 
TTGTTTACTG GCCAAATAAA GATGAGCCTA TAAATTGTGA GAGCTTTAAG GTCACTCTTA 4020 
TGGCTGAAGA ACACAAATGT CTATCTAATG AGGAAAAACT TATAATTCAG GACTTTATCT 4080 
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TAGAAGCTAC ACAGGATGAT TATGTACTTG AAGTGAGGCA CTTTCAGTGT CCTAAATGGC 4140 

CAAATCCAGA TAGCCCCATT AGTAAAACTT TTGAACTTAT AAGTGTTATA AAAGAAGAAG 4200 

CTGCCAATAG GGATGGGCCT ATGATTGTTC ATGATGAGCA TGGAGGAGTG ACGGCAGGAA 4260 

CTTTCTGTGC TCTGACAACC CTTATGCACC AACTAGAAAA AGAAAATTCC GTGGATGTTT 4320 

5 ACCAGGTAGC CAAGATGATC AATCTGATGA GGCCAGGAGT CTTTGCTGAC ATTGAGCAGT 4380 

ATCAGTTTCT CTACAAAGTG ATCCTCAGCC TTGTGAGCAC AAGGCAGGAA GAGAATCCAT 4440 

CCACCTCTCT GGACAGTAAT GGTGCAGCAT TGCCTGATGG AAATATAGCT GAGAGCTTAG 4500 

AGTCTTTAGT TTAACACAGA AAGGGGTGGG GGGACTCACA TCTGAGCATT GTTTTCCTCT 4560 

TCCTAAAATT AGGCAGGAAA ATCAGTCTAG TTCTGTTATC TGTTGATTTC CCATCACCTG 4620 

10 ACAGTAACTT TCATGACATA GGATTCTGCC GCCAAATTTA TATCATTAAC AATGTGTGCC 4680 

TTTTTGCAAG ACTTGTAATT TACTTATTAT GTTTGAACTA AAATGATTGA ATTTTACAGT 4740 

ATTTCTAAGA ATGGAATTGT GGTATTTTTT TCTGTATTGA TTTTAACAGA AAATTTCAAT 4800 

TTATAGAGGT TAGGAATTCC AAACTACAGA AAATGTTTGT TTTTAGTGTC AAATTTTTAG 4860 

CTGTATTTGT AGCAATTATC AGGTTTGCTA GAAATATAAC TTTTAATACA GTAGCCTGTA 4920 

15 AATAAAACAC TCTTCCATAT GATATTCAAC ATTTTACAAC TGCAGTATTC ACCTAAAGTA 4980 

GAAATAATCT GTTACTTATT GTAAATACTG CCCTAGTGTC TCCATGGACC AAATTTATAT 5040 

TTATAATTGT AGATTTTTAT ATTTTACTAC TGAGTCAAGT TTTCTAGTTC TGTGTAATTG 5100 

TTTAGTTTAA TGACGTAGTT CATTAGCTGG TCTTACTCTA CCAGTTTTCT GACATTGTAT 5160 

TGTGTTACCT AAGTCATTAA CTTTGTTTCA GCATGTAATT TTAACTTTTG TGGAAAATAG 5220 

20 AAATACCTTC ATTTTGAAAG AAGTTTTTAT GAGAATAACA CCTTACCAAA CATTGTTCAA 5280 

ATGGTTTTTA TCCAAGGAAT TGCAAAAATA AATATAAATA TTGCCATTAA AAAAAAAAAA 5340 
AAAAAAAAAA AAAAAAAAAA AAA 

Seq ID NO: 378 Protein sequence* 
25 Protein Accession #: EOS sequence 

I 11 21 31 41 51 

II I I I I 
MVFKASKITP HWGKCNMSSD GSEHSLEGQK FPLEMQIYCF DADRFSSFEE AVKGKGKLRA 

30 LSILFEVGTE ENLDPKAI1D GVESVSRFGK QAALDPFILL NLLPNSTDKY YIYNGSLTSP 
PCTDTVDWIV FKDTVSISES QLAVFCEVLT KQQSGYVMLM DYLQNNFREQ QYKFSRQVFS 
SYTGKEEIHE AVCSSEPENV QADPENYTSL LVTWERPRW YDTMIEKFAV LYQQLDGEDQ 
TKHEFLTDGY QDLGAILNNL LPNMSYVLQI VAICTNGLYG KYSDQLXVDM PTDNPELDLF 
PEL I GTE EI I KEEEEGKDIE EGATVNPGRD SATNQIRKKE PQISTTTHYN RIGTKYNEAK 
35 TNRSPTRGSE FSGKGDVPNT SLNSTSQPVT KLATEKDISL TSQTVTELPP HTVEGTSASL 
NDGSKTVLRS PHMNLSGTAE SLNTVSITEY EEESLLTSFK LDTGAEDSSG SSPATSAIPF 
ISENISQGYI FSSENPETIT YDVLIPESAR NASEDSTSSG SEESLKDPSM EGNVHFPSST 
DITAQPDVGS GRBSFLQTNY TEIRVDESEK TTKSFSAGPV MSQGPSVTDL EMPHYSTFAY 
FPTEVTPHAF TPSSRQQDLV STVNWYSQT TQPVYNEASN SSHESRIGLA EGLESEKKAV 
40 IPLVIVSALT FICLWLVGI LIYWRKCFQT AHFYliEDSTS PRVISTPPTP IFPISDDVGA 
IPIKHFPKHV ADLHASSGFT EEFETLKEFY QEVQSCTVDI* GITADSSNHP DNKHKNRYIN 
IVAYDHSRVK LAQLAEKDGK LTDYINANYV DGYNRPKAYI AAQGPLKSTA EDFWRMIWEH 
NVBVIVMITN LVEKGRRKCD QYWPADGSEE YGNFLVTQKS VQVLAYYTVR NFTLRNTKIK 
KGSQKGRPSG RWTQYHYTQ WPDMGVPEYS LFVLTFVRKA AYAKRHAVGP VWHCSAGVG 
45 RTGTYIVLDS MLQQIQHEGT VNIFGFLKHI RSQRNYLVGT EEQYVFIHDT LVEAILSKET 
EVLDSHIHAY WALL I PGP A GKTKLEKQFQ LLSQSNIQQS DYSAALKQCK REKNRTSSII 
PVERSRVGIS SLSGEGTDYI NASYIMGYYQ SNEFIITQHP LLHTIKDFWR MIWDHNAQLV 
VMIPDGQNMA EDEFVYWPNK DEPINCESFK VTLMAEEHKC LSNEEKLIIQ DFILEATQDD 
YVLEVRHFQC PKHPNPDSPI SKTFELISVI KEEAANRDGP MIVHDEHGGV TAGTFCALTT 
50 LMHQLEKENS VDVYQVAKMI NLMRPGVFAD IEQYQFLYKV ILSLVSTRQE ENPSTSLDSN 
GAALPDGNIA ESLESLV 

Seq ID NO: 379 DNA sequence 
Nucleic Acid Accession #: EOS sequence 
55 Coding sequence* 148-4632 

1 11 21 31 41 51 

CACACATAOG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 

60 CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AACGTTTCCT CGCTTGCATT 180 

CAGCTCCrCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

65 CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

70 GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

75 TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAG TACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTOGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

80 GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 
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ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 
AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 
ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTQ AACTGCCACC TCACACTGTG 1740 
GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 180 0 
5 AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG I860 
AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 
GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 
GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 
GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 
10 GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 
AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 
TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 
CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 
TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 
15 GTATACAATG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAG GGGTTG 2460 
GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 2520 
CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 2580 
TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 2640 
ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 2700 
20 CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGACACTGA AAGAGTTTTA CCAGGAAGTG 2760 
CAGAGCTGTA CTGTTGACTT AGGTATTACA GCAGACAGCT CCAACCACCC AGACAACAAG 2820 
CACAAGAATC GATACATAAA TATCGTTGCC TATGATCATA GCAGGGTTAA GCTAGCACAG 2880 
CTTGCTGAAA AGGATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC 2940 
AACAGACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA AATCCACAGC TGAAGATTTC 3000 
25 TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTGGAG 3060 
AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCCGATG GGAGTGAGGA GTACGGGAAC 3120 
TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 3180 
CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA GACCCAGTGG ACGTGTGGTC 3240 
ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 3300 
30 CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 3360 
CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG TGCTAGACAG TATGTTGCAG 3420 
CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT TAAAACACAT CCGTTCACAA 3480 
AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGG TTGAG 3540 
GCCATACTTA GTAAAGAAAC TGAGGTGCTG GACAGTCATA TTCATGCCTA TGTTAATGCA 3600 
35 CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA AACAATTCCA GGGTCTCACT 3660 
CTGTCACCCA GGCTGGAGTG CAGAGGCACA ATCTCGGCTC ACTGCAACCT TCCTCTCCCT 3720 
GGCTTAACTG ATCCTCCTAC CTCAGCCTCC CGAGTGGCTG GGACTATACT CCTGAGCCAG 3780 
TCAAATATAC AGCAGAGTGA CTATTCTGCA GCCCTAAAGC AATGCAACAG GGAAAAGAAT 3840 
CGAACTTCTT CTATCATCCC TGTGGAAAGA TCAAGGGTTG GCATTTCATC CCTGAGTGGA 3900 
40 GAAGGCACAG ACTACATCAA TGCCTCCTAT ATCATGGGCT ATTACCAGAG CAATGAATTC 3960 
ATCATTACCC AGCACCCTCT CCTTCATACC ATCAAGGATT TCTGGAGGAT GATATGGGAC 4020 
CATAATGCCC AACTGGTGGT TATGATTCCT GATGGCCAAA ACATGGCAGA AGATGAATTT 40 80 
GTTTACTGGC CAAATAAAGA TGAGCCTATA AATTGTGAGA . GCTTTAAGGT CACTCTTATG 4140 
GCTGAAGAAC ACAAATGTCT ATCTAATGAG GAAAAACTTA TAATTCAGGA CTTTATCTTA 4200 
45 GAAGCTACAC AGGATGATTA TGTACTTGAA GTGAGGCACT TTCAGTGTCC TAAATGGCCA 4260 
AATCCAGATA GCCCCATTAG TAAAACTTTT GAACTTATAA GTGTTATAAA AGAAGAAGCT 4320 
GCCAATAGGG ATGGGCCTAT GATTGTTCAT GATGAGCATG GAGGAGTGAC GGCAGGAACT 4380 
TTCTGTGCTC TGACAACCCT TATGCACCAA CTAGAAAAAG AAAATTCCGT GGATGTTTAC 4440 
CAGGTAGCCA AGATGATCAA TCTGATGAGG CCAGGAGTCT TTGCTGACAT TGAGCAGTAT 4500 
50 CAGTTTCTCT ACAAAGTGAT CCTCAGCCTT GTGGGCACAA GGCAGGAAGA GAATCCATCC 4560 
ACCTCTCTGG ACAGTAATGG TGCAGCATTG CCTGATGGAA ATATAGCTGA GAGCTTAGAG 4620 
TCTTTAGTTT AACACAGAAA GGGGTGGGGG GACTCACATC TGAGCATTGT TTTCCTCTTC 4680 
CTAAAATTAG GCAGGAAAAT CAGTCTAGTT CTGTTATCTG TTGATTTCCC ATCACCTGAC 4740 
AGTAACTTTC ATGACATAGG ATTCTGCCGC CAAATTTATA TCATTAACAA TGTGTGCCTT 4800 
55 TTTGCAAGAC TTGTAATTTA CTTATTATGT TTGAACTAAA ATGATTGAAT TTTACAGTAT 4860 
TTCTAAGAAT GGAATTGTGG TATTTTTTTC TGTATTGATT TTAACAGAAA ATTTCAATTT 4920 
ATAGAGGTTA GGAATTCCAA ACTACAGAAA ATGTTTGTTT TTAGTGTCAA ATTTTTAGCT 4980 
GTATTTGTAG CAATTATCAG GTTTGCTAGA AATATAACTT TTAATACAGT AGCCTGTAAA 5040 
TAAAACACTC TTCCATATGA TATTCAACAT TTTACAACTG CAGTATTCAC CTAAAGTAGA 5100 
60 AATAATCTGT TACTTATTGT AAATACTGCC CTAGTGTCTC CATGGACCAA ATTTATATTT 5160 
ATAATTGTAG ATTTTTATAT TTTACTACTG AGTCAAGTTT TCTAGTTCTG TGTAATTGTT 5220 
TAGTTTAATG ACGTAGTTCA TTAGCTGGTC TTACTCTACC AGTTTTCTGA CATTGTATTG 52.80 
TGTTACCTAA GTCATTAACT TTGTTTCAGC ATGTAATTTT AACTTTTGTG GAAAATAGAA 5340 
ATACCTTCAT TTTGAAAGAA GTTTTTATGA GAATAACACC TTACCAAACA TTGTTCAAAT 5400 
65 GGTTTTTATC CAAGGAATTG CAAAAATAAA TATAAATATT GCCATTAAAA AAAAAAAAAA 5460 
AAAAAAAAAA AAAAAAAAAA A 



Seq ID NOt 380 Protein sequence ! 
Protein Accession #: EOS sequence 

70 

1 11 21 31 41 51 

| | 1 1 I I 

MRILKRFIAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDL TQVNVNLKKL KPQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120 

75 PKASKITFHW GXCNMSSDGS EHSLBGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180 

ILFEVGTEEN liDFKAIIDGV ESVSRFGKQA ALDPFILLNI* LPNSTDKYYI YNGSLTSPPC 240 

TDTVDWIVFK DTVSISESQL AVPCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVPSSY 300 

TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRWYD TMIEKFAVLY QQLDGEDQTK 360 

HEPLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SD QLIVD MPT DNPELDLFPE 420 

80 LI GT EE I IKE BEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540 

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSPKLD TGAEDSSGSS PATSAIPPIS 600 

ENISQGYIFS SENPET1TYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSPSAGPVMS QGPSVTDLEM PHYSTFAYFP 720 
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TEVTPHAFTP SSRQQDLVST VNWYSQTTQ PVYNEASNSS HESRIGLAEG LESEKKAVIP 780 

LVIVSALTFI CLWLVGILI YWRKCPQTAR FYLEDSTSPR VISTPPTPIF PISDDVGAIP 840 

IKHFPKHVAD LHASSGFTEE FETLKEFYQE VQSCTVDLGI TADSSNHPDN KHKNRYINIV 900 

AYDHSRVKLA QLAEKDGKLT DYINANYVDG YNRPKAYIAA QGPLKSTAED FWRMIWEHNV 960 

EVIVMITNLV EKGRRKCDQY WPADGSEEYG NFLVTQKSVQ VLAYYTVRNF TLRNTKIKKG 1020 

SQKGRPSGRV VTQYHYTQWP DMGVPEYSLP VLTFVRKAAY AKRHAVGPW VHCSAGVGRT 1080 

GTYIVLDSML QQIQHEGTVN IFGFLKHIRS QRNYLVQTEE QYVFIHDTLV KAILS KETEV 1140 

LDSH1HAYVN ALL IPG PAG K TKLEKQFQGL TLSPRLECRG TISAHCNLPL PGLTDPPTSA 1200 

SRVAGTILLS QSNIQQSDYS AALKQCNREK NRTSSIIPVE RSRVGISSLS GEGTDYINAS 1260 

YIMGYYQSNE FIITQHPLLH TIKDFWRMIW DHNAQLWMI PDGQNNAEDE FVYWPNKDEP 1320 

INCESFKVTL MAEEHKCLSN EEKLIIQDFI LEATQDDYVL EVRHFQCPKW PNPDSPISKT 1380 

FELISVIKEE AANRDGPMIV HDEHGGVTAG TFCALTTLMH QLEKENSVDV YQVAKMINLM 1440 
RPGVFADIEQ YQFLYKVILS LVGTRQEENP STSLDSNGAA LPDGNIAESL ESLV 



Seq ID NO: 381 DNA sequence 

Nucleic Acid Accession #: NM_002 851.1 

Coding sequence: 148.. 7092 

1 11 21 31 41 51 

111)11 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAG TACA AG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG . AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAG ATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

GTATACAATG GTGAGACACC TCTTCAACCT TCCTACAGTA GTGAAGTCTT TCCTCTAGTC 2460 

ACCCCTTTGT TGCTTGACAA TCAGATCCTC AACACTACCC CTGCTGCTTC AAGTAGTGAT 2520 

TCGGCCTTGC ATGCTACGCC TGTATTTCCC AGTGTCGATG TGTCATTTGA ATCCATCCTG 25B0 

TCTTCCTATG ATGGTGCACC TTTGCTTCCA TTTTCCTCTG CTTCCTTCAG TAGTGAATTG 2640 

TTTCGCCATC TGCATACAGT TTCTCAAATC CTTCCACAAG TTACTTCAGC TACCGAGAGT 2700 

GATAAGGTGC CCTTGCATGC TTCTCTGCCA GTGGCTGGGG GTGATTTGCT ATTAGAGCCC 2760 

AGCCTTGCTC AGTATTCTGA TGTGCTGTCC ACTACTCATG CTGCTTCAGA GACGCTGGAA 2820 

TTTGGTAGTG AATCTGGTGT TCTTTATAAA ACGCTTATGT TTTCTCAAGT TGAACCACCC 2880 

AGCAGTGATG CCATGATGCA TGCACGTTCT TCAGGGCCTG AACCTTCTTA TGCCTTGTCT 2940 

GATAATGAGG GCTCCCAACA CATCTTCACT GTTTCTTACA GTTCTGCAAT ACCTGTGCAT 3000 

GATTCTGTGG GTGTAACTTA TCAGGGTTCC TTATTTAGCG GCCCTAGCCA TATACCAATA 3060 

CCTAAGTCTT CGTTAATAAC CCCAACTGCA TCATTACTGC AGCCTACTCA TGCCCTCTCT 3120 

GGTGATGGGG AATGGTCTGG AGCCTCTTCT GATAGTGAAT TTCTTTTACC TGACACAGAT 3180 

GGGCTGACAG CCCTTAACAT TTCTTCACCT GTTTCTGTAG CTGAATTTAC ATATACAACA 3240 

TCTGTGTTTG GTGATGATAA TAAGGCGCTT TCTAAAAGTG AAATAATATA TGGAAATGAG 3300 

ACTGAACTGC AAATTCCTTC TTTCAATGAG ATGGTTTACC CTTCTGAAAG CACAGTCATG 3360 

CCCAACATGT ATGATAATGT AAATAAGTTG AATGCGTCTT TACAAGAAAC CTCT GTTTCC 3420 

ATTTCTAGCA CCAAGGGCAT GTTTCCAGGG TCCCTTGCTC ATACCACCAC TAAGGTTTTT 3480 
GATCATGAGA TTAGTCAAGT TCCAGAAAAT AACTTTTCAG TTCAACCTAC ACATACTGTC 3540 
TCTCAAGCAT CTGGTGACAC TTCGCTTAAA CCTGTGCTTA GTGCAAACTC AGAGCCAGCA 3600 
TCCTCTGACC CTGCTTCTAG TGAAATGTTA TCTCCTTCAA CTCAGCTCTT ATTTTATGAG 3660 
ACCTCAGCTT CTTTTAGTAC TGAAGTATTG CTACAACCTT CCTTTCAGGC TTCTGATGTT 3720 
GACACCTTGC TTAAAACTGT TCTTCCAGCT GTGCCCAGTG ATCCAATATT GGTTGAAACC 3780 
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CCCAAAGTTG ATAAAATTAG TTCTACAATQ TTGCATCTCA TTGTATCAAA TTCTGCTTCA 3840 
AGTGAAAACA TGCTGCACTC TACATCTGTA CCAGTTTTTG ATGTGTCGCC TACTTCTCAT 3900 
ATGCACTCTG CTTCACTTCA AGGTTTGACC ATTTCCTATG CAAGTGAGAA ATATGAACCA 3960 
GTTTTGTTAA AAAGTGAAAG TTCCCACCAA GTGGTACCTT CTTTGTACAG TAATGATGAG 4020 
5 TTGTTCCAAA CGGCCAATTT GGAGATTAAC CAGGCCCATC CCCCAAAAGG AAGGCATGTA 4080 
TTTGCTACAC CTGTTTTATC AATTGATGAA CCATTAAATA CACTAATAAA TAAGCTTATA 4140 
CATTCCGATG AAATTTTAAC CTCCACCAAA AGTTCTGTTA CTGGTAAGGT ATTTGCTGGT 4200 
ATTCCAACAG TTGCTTCTGA TACATTTGTA TCTACTGATC ATTCTGTTCC TATAGGAAAT 4260 
GGGCATGTTG CCATTACAGC TGTTTCTCCC CACAGAGATG GTTCTGTAAC CTCAACAAAG 4320 
10 TTGCTGTTTC CTTCTAAGGC AACTTCTGAG CTGAGTCATA GTGCCAAATC TGATGCCGGT 4380 
TTAGTGGGTG GTGGTGAAGA TGGTGACACT GATGATGATG GTGATGATGA TGATGACAGA 4440 
GATAGTGATG GCTTATCCAT TCATAAGTGT ATGTCATGCT CATCCTATAG AGAATCACAG 4500 
GAAAAGGTAA TGAATGATTC AGACACCCAC GAAAACAGTC TTATGGATCA GAATAATCCA 4560 
ATCTCATACT CACTATCTGA GAATTCTGAA GAAGATAATA GAGTCACAAG TGTATCCTCA 4620 
15 GACAGTCAAA CTGGTATGGA CAGAAGTCCT GGTAAATCAC CATCAGCAAA TGGGCTATCC 4680 
CAAAAGCACA ATGATGGAAA AGAGGAAAAT GACATTCAGA CTGGTAGTGC TCTGCTTCCT 4740 
CTCAGCCCTG AATCTAAAGC ATGGGCAGTT CTGACAAGTG ATGAAGAAAG TGGATCAGGG 4800 
CAAGGTACCT CAGATAGCCT TAATGAGAAT GAGACTTCCA CAGATTTCAG TTTTGCAGAC 4860 
ACTAATGAAA AAGATGCTGA TGGGATCCTG GCAGCAGOTG ACTCAGAAAT AACTCCTGGA 4920 
20 TTCCCACAGT CCCCAACATC ATCTGTTACT AGCGAGAACT CAGAAGTGTT CCACGTTTCA 4980 
GAGGCAGAGG CCAGTAATAG TAGCCATGAG TCTCGTATTG GTCTAGCTGA GGGGTTGGAA 5040 
TCCGAGAAGA AGGCAGTTAT ACCCCTTGTG ATCGTGTCAG CCCTGACTTT TATCTGTCTA 5100 
GTGGTTCTTG TGGGTATTCT CATCTACTGG AGGAAATGCT TCCAGACTGC ACACTTTTAC 5160 
TTAGAGGACA GTACATCCCC TAGAGTTATA TCCACACCTC CAACACCTAT CTTTCCAATT 5220 
25 TCAGATGATG TCGGAGCAAT TCCAATAAAG CACTTTCCAA A GCAT GTTGC AGATTTACAT 5280 
GCAAGTAGTG GGTTTACTGA AGAATTTGAG ACACTGAAAG AGTTTTACCA GGAAGTGCAG 5340 
AGCTGTACTG TTGACTTAGG TATTACAGCA GACAGCTCCA ACCACCCAGA CAACAAGCAC 5400 
AAGAATCGAT ACATAAATAT CGTTGCCTAT GATCATAGCA GGGTTAAGCT AGCACAGCTT 5460 
GCTGAAAAGG ATGGCAAACT GACTGATTAT ATCAATGCCA ATTATGTTGA TGGCTACAAC 5520 
30 AGACCAAAAG CTTATATTGC TGCCCAAGGC CCACTGAAAT CCACAGCTGA AGATTTCTGG 5580 
AGAATGATAT GGGAACATAA TGTGGAAGTT ATTGTCATGA TAACAAACCT CGTGGA GAAA 5640 
GGAAGGAGAA AATGTGATCA GTACTGGCCT GCCGATGGGA GTGAGGAGTA CGGGAACTTT 5700 
CTGGTCACTC AGAAGAGTGT GCAAGTGCTT GCCTATTATA CTGTGAGGAA TTTTACTCTA 5760 
AGAAACACAA AAATAAAAAA GGGCTCCCAG AAAGGAAGAC CCAGTGGACG TGTGGTCACA 5820 
35 CAGTATCACT ACACGCAGTG GCCTGACATG GGAGTACCAG AGTACTCCCT GCCAGTGCTG 5880 
ACCTTTGTGA GAAAGGCAGC CTATGCCAAG CGCCATGCAG TGGGGCCTGT TGTCGTCCAC 5940 
TGCAGTGCTG GAGTTGGAAG AACAGGCACA TATATTGTGC TAGACAGTAT GTTGCAGCAG 6000 
ATTCAACACG AAGGAACTGT CAACATATTT GGCTTCTTAA AACACATCCG TTCACAAAGA 6060 
AATTATTTGG TACAAACTGA GGAGCAATAT GTCTTCATTC ATGATACACT GGTTGAGGCC 6120 
40 ATACTTAGTA AAGAAACTGA GGTGCTGGAC AGTCATATTC ATGCCTATGT TAATGCACTC 6180 
CTCATTCCTG GACCAGCAGG CAAAACAAAG CTAGAGAAAC AATTCCAGCT CCTGAGCCAG 6240 
TCAAATATAC AGCAGAGTGA CTATTCTGCA GCCCTAAAGC AATGCAACAG GGAAAAGAAT 63 00 
CGAACTrCTT CTATCATCCC TGTGGAAAGA TCAAGGGTTG GCATTTCATC CCTGAGTGGA 6360 
GAAGGCACAG ACTACATCAA TGCCTCCTAT ATCATGGGCT ATTACCAGAG CAATGAATTC 6420 
45 ATCATTACCC AGCACCCTCT CCTTCATACC ATCAAGGATT TCTGGAGGAT GATATGGGAC 6480 
CATAATGCCC AACTGGTGGT TATGATTCCT GATGGCCAAA ACATGGCAGA AGATGAATTT 6540 
GTTTACTGGC CAAATAAAGA TGAGCCTATA AATTGTGAGA GCTTTAAGGT CACT CTTATG 6600 
GCTGAAGAAC ACAAATGTCT ATCTAATGAG GAAAAACTTA TAATTCAGGA CTTTATCTTA 6660 
GAAGCTACAC AGGATGATTA TGTACTTGAA GTGAGGCACT TTCAGTGTCC TAAATGGCCA 6720 
50 AATCCAGATA GCCCCATTAG TAAAACTTTT GAACTTATAA GTGTTATAAA AGAAGAAGCT 6780 
GCCAATAGGG ATGGGCCTAT GATTGTTCAT GATGAGCATG GAGGAGTGAC GGCAGGAACT 6840 
TTCTGTGCTC TGACAACCCT TATGCACCAA CTAGAAAAAG AAAATTCCGT GGATGTTTAC 6900 
CAGGTAGCCA AGATGATCAA TCTGATGAGG CCAGGAGTCT TTGCTGACAT TGAGCAGTAT 6960 
CAGTTTCTCT ACAAAGTGAT CCTCAGCCTT GTGAGCACAA GGCAGGAAGA GAATCCATCC 7020 
55 ACCTCTCTGG ACAGTAATGG TGCAGCATTG CCTGATGGAA ATATAGCTGA GAGCTTAGAG 7080 
TCTTTAGTTT AACACAGAAA GGGGTGGGGG GACTCACATC TGAGCATTGT TTTCCTCTTC 7140 
CTAAAATTAG GCAGGAAAAT CAGTCTAGTT CTGTTATCTG TTGATTTCCC ATCACCTGAC 7200 
AGTAACTTTC ATGACATAGG ATTCTGCCGC CAAATTTATA TCATTAACAA TGTGTGCCTT 7260 
TTTGCAAGAC TTGTAATTTA CTTATTATGT TTGAACTAAA ATGATTGAAT TTTACAGTAT 7320 
60 TTCTAAGAAT GGAATTGTGG TATTTTTTTC TGTATTGATT TTAACAGAAA ATTTCAATTT 7380 
ATAGAGGTTA GGAATTCCAA ACTACAGAAA ATGTTTGTTT TTAGTGTCAA ATTTTTAGCT 7440 
GTATTTGTAG CAATTATCAG GTTTGCTAGA AATATAACTT TTAATACAGT AGCCTGTAAA 7500 
TAAAACACTC TTCCATATGA TATTCAACAT TTTACAACTG CAGTATTCAC CTAAAGTAGA 7560 
AATAATCTGT TACTTATTGT AAATACTGCC CTAGTGTCTC CATGGACCAA ATTTATATTT 7620 
65 ATAATTGTAG ATTTTTATAT TTTACTACTG AGTCAAGTTT TCTAGTTCTG TGTAATTGTT 7680 
TAGTTTAATG ACGTAGTTCA TTAGCTGGTC TTACTCTACC AGTTTTCTGA CATTGTATTG 7740 
TGTTACCTAA GTCATTAACT TTGTTTCAGC ATGTAATTTT AACTTTTGTG GAAAATAGAA 7800 
ATACCTTCAT TTTGAAAGAA GTTTTTATGA GAATAACACC TTACCAAACA TTGTTCAAAT 7860 
GGTTTTTATC CAAGGAATTG CAAAAATAAA TATAAATATT GCCATTAAAA AAAAAAAAAA 7920 
70 AAAAAAAAAA AAAAAAAAAA A 



Seq ID NO; 382 Protein sequence 
Protein Accession ftt NP_002842.l 

75 i ii 21 

I I I 

MRIIiKRFLAC IQLLCVCRLD WANGYYRQQR 
QSPINIDEDL TQVNVNLKKL KFQGWDKTSL 
FKASKITFHW GKCNMSSDGS EHSLEGQKPP 
80 ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA 
TDTVDWIVFK DTVSISESQI* AVFCEVLTMQ 
TGKEEIHEAV CSSEPENVQA DPENYTSLI»V 
HEFLTDGYQD LGAILNNLLP NMSYVLQIVA 
LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA 



31 41 51 

I I I 

KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

BNTFIHNTGK TVEINLTNDY RVSGGVSEMV 120 

LEMQIYCFDA DRFSSPEEAV KGKGKLRALS 180 

ALDPPILLNL LPNSTDKYYI YNGSLTSPPC 240 

QSGYVMLMDY LQNNFREQQY KFSRQVPSSY 300 

TWERPRWYD TMIEKPAVLY QQLDGEDQTK 360 

ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420 

TNQIRKKBPQ ISTTTHYNRI GTKYNEAKTN 480 
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RSPTRGSEFS GKGDVPNTSL. NSTSQPVTKL ATEKDISLTS QTVTEIiPPHT VEGTSASLND 540 

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSPKLD TGAEDSSGSS PATSAIPPIS 600 
ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 
TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720 
TEVTPHAFTP SSRQQDLVST VNWYSQTTQ PVYNGETPLQ PSYSSEVFPL VTPLLLDNQI 780 
LNTTPAASSS DSALHATPVF PSVDVSFESI LSSYDGAPLL PFSSASFSSE LFRHLHTVSQ 840 
ILPQVTSATE SDKVPLHASL PVAGGDLLLE PSLAQYSDVL STTHAASETL EFGSESGVLY 900 
KTLMFSQVEP PSSDAMMHAR SSGPEPSYAL SDNEGSQHIP TVSYSSAIPV HDSVGVTYQG 960 

SLFSGPSHIP IPKSSLITPT ASLLQPTHAL SGDGEWSGAS SDSEFLLPDT DGLTALNISS 1020 

PVSVAEFTYT TSVFGDDNKA LSKSEIIYGN ETELQIPSFN EMVYPSESTV MPNMYDNVNK 1080 

LNASLQETSV SISSTKGMFP GSLAHTTTKV FDHEISQVPE NNFSVQPTHT VSQASGDTSL 1140 

KPVLSANSEP ASSDPASSEM LSPSTQLLFY ETSASFSTEV LLQPSFQASD VDTLLKTVLP 1200 

AVPSDPILVE TPKVDKISST MLHLIVSNSA SSENMLHSTS VPVFDVSPTS HMHSASLQGL 1260 

TISYASEKYE PVLLKSESSH QWPSLYSND ELFQTANLEI NQAHPPKGRH VFATPVLSID 1320 

EPLNTLINKL IHSDEILTST KSSVTGKVFA GIPTVASDTF VSTDHSVPIG NGHVAITAVS 1380 

PHRDGSVTST KLLFPSKATS ELSHSAKSDA GLVGGGEDGD TDDDGDDDDD RDSDGLSIHK 1440 

CMSCSSYRES QEKVMNDSDT HENSLMDQNN PISYSLSENS EEDNRVTSVS SDSQTGMDRS 1500 

PGKSPSANGL SQKHNDGKEE NDIQTGSALL PLSPESKAWA VLTSDEESGS GQGTSDSLNE 1560 

NETSTDFSFA DTNEKDADGI LAAGDSEITP GFPQSPTSSV TSENSEVFHV SEAEASNSSH 1620 

ESRIGLAEGL ESEKKAVIPL VIVSALTFIC LWLVGILIY WRKCFQTAHF YLEDSTSPRV 1680 

ISTPPTPIFP ISDDVGAIPI KHFPKHVADL HASSGFTEEF ETLKEFYQEV QSCTVDLGIT 1740 

ADSSNHPDNK HKNRYINIVA YDHSRVKLAQ LAEKDGKLTD YINANYVDGY NRPKAYIAAQ 1800 

GPLKSTAEDF WRMIWEHNVE VIVMITNLVE KGRRKCDQYW PADGSEEYGN FLVTQKSVQV 1860 

LAYYTVRNFT LRNTKIKKGS QKGRPSGRW TQYHYTQWPD KGVPEYSLPV LTFVRKAAYA 1920 

KRHAVGPVW HCSAGVGRTG TYIVLDSMLQ QIQHEGTVNI FGFLKHIRSQ RKYLVQTEEQ 1980 

YVFIHDTLVE AILSKETEVL* DSH1HAYVNA LLIPGPAGKT KLEKQFQLLS QSNIQQSDYS 2040 

AALKQCNREK NRTSSIIFVE RSRVGISSLS GEGTDYINAS YIMGYYQSNE FIITQHPLLH 2100 

TIKDFWRMIW DHNAQLWMI PDGQNMAEDE FVYWPNKDEP INCESFKVTL KAEEHKCLSN 2160 

EEKLIIQDFI LEATQDDYVL EVRHFQCPKW PNPDSPISKT FEL1SVIKEE AANRDGPMIV 2220 

HDEHGGVTAG TFCALTTLMH QLEKENSVDV YQVAKMINLM RPGVFADIEQ YQFLYKVILS 2280 
LVSTRQEENP STSLDSNGAA LPDGNIAESL ESLV 

Seq ID NO: 383 DNA sequence 

Nucleic Acid Accession 8: NM_005688.1 

Coding sequence: 126. .4439 

1 11 21 31 41 51 

| | | I i I 

CCGGGCAGGT GGCTCATGCT CGGGAGCGTG GTTGAGCGGC TGGCGCGGTT GTCCTGGAGC 60 

AGGGGCGCAG GAATTCTGAT GTGAAACTAA CAGTCTGTGA GCCCTGGAAC CTCCGCTCAG 120 

AGAAGATGAA GGATATCGAC ATAGGAAAAG AGTATATCAT CCCCAGTCCT GGGTA TAGAA 180 

GTGTGAGGGA GAGAACCAGC ACTTCTGGGA CGCACAGAGA CCGTGAAGAT TCCAAGTTCA 240 

GGAGAACTCG ACCGTTGGAA TGCCAAGATG CCTTGGAAAC AGCAGCCCGA GCCGAGGGCC 300 

TCTCTCTTGA TGCCTCCATG CATTCTCAGC TCAGAATCCT GGATGAGGAG CATCCCAAGG 360 

GAAAQTACCA TCATGGCTTG AGTGCTCTGA AGCCCATCOG GACTACTTCC AAACACCAGC 420 

ACCCAGTGGA CAATGCTGGG CTTTTTTCCT GTATGACTTT TTCGTGGCTT TCTTCTCTGG 480 

CCCGTGTGGC CCACAAGAAG GGGGAGCTCT CAATGGAAGA CGTGTGGTCT CTGTCCAAGC 540 

ACGAGTCTTC TGACGTGAAC TGCAGAAGAC TAGAGAGACT GTGGCAAGAA GAGCTGAATG 600 

AAGTTGGGCC AGACGCTGCT TCCCTGCGAA GGGTTGTGTG GATCTTCTGC CGCACCAGGC 660 

TCATCCTGTC CATCGTGTGC CTGATGATCA CGCAGCTGGC TGGCTTCAGT GGACCAGCCT 720 

TCATGGTGAA ACACCTCTTG GAGTATACCC AGGCAACAGA GTCTAACCTG CAGTACAGCT 780 

TGTTGTTAGT GCTGGGCCTC CTCCTGACGG AAATCGTGCG GTCTTGGTCG CTTGCACTGA 840 

CTTGGGCATT GAATTACCGA ACCGGTGTCC GCTTGCGGGG GGCCATCCTA ACCATGGCAT 900 

TTAAGAAGAT CCTTAAGTTA AAGAACATTA AAGAGAAATC CCTGGGTGAG CTCATCAACA 960 

TTTGCTCCAA OGATGGGCAG AGAATGTTTG AGGCAGCAGC CGTTGGCAGC CTGCTGGCTG 1020 

GAGGACCCGT TGTTGCCATC TTAGGCATGA TTTATAATGT AATTATTCTG GGACCAACAG 1080 

GCTTCCTGGG ATCAGCTGTT TTTATCCTCT TTTACCCAGC AATGATGTTT GCATCACGGC 1140 

TCACAGCATA TTTCAGGAGA AAATGCGTGG CCGCCACGGA TGAACGTGTC CAG AAGATG A 1200 

ATGAAGTTCT TACTTACATT AAATTTATCA AAATGTATGC CTGGGTCAAA GCATTTTCTC 1260 

AGAGTGTTCA AAAAATCCGC GAGGAGGAGC GTOGGATATT GGAAAAAGCC GGGTACTTCC 1320 

AGGGTATCAC TGTGGGTGTG GCTCCCATTG TGGTGGTGAT TGCCAGCGTG GTGACCTTCT 1380 

CTGTTCATAT GACCCTGGGC TTCGATCTGA CAGCAGCACA GGCTTTCACA GTGGTGACAG 1440 

TCTTCAATTC CATGACTTTT GCTTTGAAAG TAACACOGTT TTCAGTAAAG TCCCTCTCAG 1500 

AAGCCTCAGT GGCTGTTGAC AGATTTAAGA GTTTGTTTCT AATGGAAGAG GTTCACATGA 1560 

TAAAGAACAA ACCAGCCAGT CCTCACATCA AGATAGAGAT GAAAAATGCC ACCTTGGCAT 1620 

GGGACTCCTC CCACTCCAGT ATCCAGAACT CGCCCAAGCT GACCCCCAAA ATGAAAAAAG 1680 

ACAAGAGGGC TTCCAGGGGC AAGAAAGAGA AGGTGAGGCA GCTGCAGCGC ACTGAGCATC 1740 

AGGCGGTGCT GGCAGAGCAG AAAGGCCACC TCCTCCTGGA CAGTGACGAG CGGCCCAGTC 1800 

CCGAAGAGGA AGAAGGCAAG CACATCCACC TGGGCCACCT GCGCTTACAG AGGACACTGC 1860 

ACAGCATCGA TCTGGAGATC CAAGAGGGTA AACTGGTTGG AATCTGCGGC AGTGTGGGAA 1920 

GTGGAAAAAC CTCTCTCATT TCAGCCATTT TAGGCCAGAT GACGCTTCTA GAGGGCAGCA 1980 

TTGCAATCAG TGGAACCTTC GCTTATGTGG CCCAGCAGGC CTGGATCCTC AATGCTACTC 2040 

TGAGAGACAA CATCCTGTTT GGGAAGGAAT ATGATGAAGA AAGATACAAC TCTGTGCTGA 2100 

ACAGCTGCTG CCTGAGGCCT GACCTGGCCA TTCTTCCCAG CAGCGACCTG ACGGAGATTG 2160 

GAGAGCGAGG AGCCAACCTG AGCGGTGGGC AGCGCCAGAG GATCAGCCTT GCCCGGGCCT 2220 

TGTATAGTGA CAGGAGCATC TACATCCTGG ACGACCCCCT CAGTGCCTTA GATGCCCATG 2280 

TGGGCAACCA CATCTTCAAT AGTGCTATCC GGAAACATCT CAAGTCCAAG ACAGTTCTGT 2340 

TTGTTACCCA CCAGTTACAG TACCTGGTTG ACTGTGATGA AGTGATCTTC ATGAAAGAGG 2400 

GCTGTATTAC GGAAAGAGGC ACCCATGAGG AACTGATGAA TTTAAATGGT GACTATGCTA 2460 

CCATTTTTAA TAACCTGTTG CTGGGAGAGA CACCGCCAGT TGAGATCAAT TCAAAAAAGG 2520 

AAACCAGTGG TTCACAGAAG AAGTCACAAG ACAAGGGTCC TAAAACAGGA TCAGTAAAGA 2580 

AGGAAAAAGC AGTAAAGCCA GAGGAAGGGC AGCTTGTGCA GCTGGAAGAG AAAGGGCAGG 2640 

GTTCAGTGCC CTGGTCAGTA TATGGTGTCT ACATCCAGGC TGCTGGGGGC CCCTTGGCAT 2700 

TCCTGGTTAT TATGGCCCTT TTCATGCTGA ATGTAGGCAG CACCGCCTTC AGCACCTGGT 2760 



1105 



WO 03/042661 



GGTTGAGTTA CTGGATCAAG CAAGGAAGCG GGAACACCAC TGTGACTCGA GGGAAOGAGA 2820 
CCTCGGTGAG TGACAGCATG AAGGACAATC CTCATATGCA GTACTATGCC AGCATCTACG 2880 
CCCTCTCCAT GGCAGTCATG CTGATCCTGA AAGCCATTCG AGGAGTTGTC TTTGTCAAGG 2940 
GCACGCTGCG AGCTTCCTCC CGGCTGCATG ACGAGCTTTT CCGAAGGATC CTTCGAAGCC 3000 
CTATGAAGTT TTTTGACACG ACCCCCACAG GGAGGATTCT CAACAGGTTT TCCAAAGACA 3060 
TGGATGAAGT TGACGTGCGG CTGCCGTTCC AGGCCGAGAT GTTCATCCAG AACGTTATCC 3120 
TGGTGTTCTT CTGTGTGGGA ATGATCGCAG GAGTCTTCCC GTGGTTCCTT GTGGCAGTGG 3180 - 
GGCCCCTTGT CATCCTCTTT TCAGTCCTGC ACATTGTCTC CAGGGTCCTG ATTCGGGAGC 3240 
TGAAGCGTCT GGACAATATC ACGCAGTCAC CTTTCCTCTC CCACATCACG TCCAGCATAC 3300 
AGGGCCTTGC CACCATCCAC GCCTACAATA AAGGGCAGGA GTTTCTGCAC AGATACCAGG 3360 
AGCTGCTGGA TGACAACCAA GCTCCTTTTT TTTTGTTTAC GTGTGCGATG CGGTGGCTGG 3420 
CTGTGCGGCT GGACCTCATC AGCATCGCCC TCATCACCAC CACGGGGCTG ATGATCGTTC 3480 
TTATGCACGG GCAGATTCCC CCAGCCTATG CGGGTCTCGC CATCTCTTAT GCTGTCCAGT 3540 
TAACGGGGCT GTTCCAGTTT ACGGTCAGAC TGGCATCTGA GACAGAAGCT CGATTCACCT 3600 
CGGTGGAGAG GATCAATCAC TACATTAAGA CTCTGTCCTT GGAAGCACCT GCCAGAATTA 3660 
AGAACAAGGC TCCCTCCCCT GACTGGCCCC AGGAGGGAGA GGTGACCTTT GAGAACGCAG 3720 
AGATGAGGTA CCGAGAAAAC CTCCCTCTTG TCCTAAAGAA AGTATCCTTC ACGATCAAAC 3780 
CTAAAGAGAA GATTGGCATT GTGGGGCGGA CAGGATCAGG GAAGTCCTCG CTGGGGATGG 3840 
CCCTCTTCCG TCTGGTGGAG TTATCTGGAG GCTGCATCAA GATTGATGGA GTGAGAATCA 3900 
GTGATATTGG CCTTGCCGAC CTCCGAAGCA AACTCTCTAT CATTCCTCAA GAGCCGGTGC 3960 
TGTTCAGTGG CACTGTCAGA TCAAATTTGG ACCCCTTCAA CCAGTACACT GAAGACCAGA 4020 
TTTGGGATGC CCTGGAGAGG ACACACATGA AAGAATQTAT TGCTCAGCTA CCTCTGAAAC 4080 
TTGAATCTGA AGTGATGGAG AATGGGGATA ACTTCTCAGT GGGGGAACGG CAGCTCTTGT 4140 
GCATAGCTAG AGCCCTGCTC CGCCACTGTA AGATTCTGAT TTTAGATGAA GCCACAGCTG 4200 
CCATGGACAC AGAGACAGAC TTATTGATTC AAGAGACCAT CCGAGAAGCA TTTGCAGACT 4260 
GTACCATGCT GACCATTGCC CATCGCCTGC ACACGGTTCT AGGCTCCGAT AGGATTATGG 4320 
TGCTGGCCCA GGGACAGGTG GTGGAGTTTG ACACCCCATC GGTCCTTCTG TCCAACGACA 4380 
GTTCCCGATT CTATGCCATG TTTGCTGCTG CAGAGAACAA GGTCGCTGTC AAGGGCTGAC 4440 
TCCTCCCTGT TGACGAAGTC TCTTTTCTTT AGAGCATTGC CATTCCCTGC CTGGGGCGGG 4500 
CCCCTCATCG CGTCCTCCTA CCGAAACCTT GCCTTTCTCG ' ATTTTATCTT TCGCACAGCA 4560 
GTTCCGGATT GGCTTGTGTG TTTCACTTTT AGGGAGAGTC ATATTTTGAT TATTGTATTT 4620 
ATTCCATATT CATGTAAACA AAATTTAGTT TTTGTTCTTA ATTGCACTCT AAAAGGTTCA 4680 
GGGAACCGTT ATTATAATTG TATCAGAGGC CTATAATGAA GCTTTATACG TGTAGCTATA 4740 
TCTATATATA ATTCTGTACA TAGCCTATAT TTACAGTGAA AATGTAAGCT GTTTATTTTA 4800 
TATTAAAATA AGCACTGTGC TAATAACAGT GCATATTCCT TTCTATCATT TTTGTACAGT 4860 
TTGCTGTACT AGAGATCTGG TTTTGCTATT AGACTGTAGG AAGAGTAGCA TTTCATTCTT 4920 
CTCTAGCTGG TGGTTTCACG GTGCCAGGTT TTCTGGGTGT CCAAAGGAAG ACGTGTGGCA 4980 
ATAGTGGGCC CTCCGACAGC CCCCTCTGCC GCCTCCCCAC AGCCGCTCCA GGGGTGGCTG 5040 
GAGACGGGTG GGCGGCTGGA GACCATGCAG AGCGCCGTGA GTTCTCAGGG CTCCTGCCTT 5100 
CTGTCCTGGT GTCACTTACT GTTTCTGTCA GGAGAGCAGC GGGGOGAAGC CCAGGCCCCT 5160 
TTTCACTCCC TCCATCAAGA ATGGGGATCA CAGAGACATT CCTCCGAGCC GGGGAGTTTC 5220 
TTTCCTGCCT TCTTCTTTTT GCTGTTGTTT CTAAACAAGA ATCAGTCTAT CCACAGAGAG 5280 
TCCCACTGCC TCAGGTTCCT ATGGCTGGCC ACTGCACAGA GCTCTCCAGC TCCAAGACCT 5340 
GTTGGTTCCA AGCCCTGGAG CCAACTGCTG CTTTTTGAGG TGGCACTTTT TCATTTGOCT 5400 
ATTCCCACAC CTCCACAGTT CAGTGGCAGG GCTCAGGATT TCGTGGGTCT GTT TTCC TTT S460 
CTCACCGCAG TCGTCGCACA GTCTCTCTCT CTCTCTCCCC TCAAAGTCTG CAACTTTAAG 5520 
CAGCTCTTGC TAATCAGTGT CTCACACTGG CGTAGAAGTT TTTGTACTGT AAAGAGACCT 5580 
ACCTCAGGTT GCTGGTTGCT GTGTGGTTTG GTGTGTTCCC GCAAACCCCC TTTGTGCTGT 5640 
GGGGCTGGTA GCTCAGGTGG GCGTGGTCAC TGCTGTCATC AGTTGAATGG TCAGCGTTGC 5700 
ATGTCGTGAC CAACTAGACA TTCTGTCGCC TTAGCATGTT TGCTGAACAC CTTGTGGAAG 5760 
CAAAAATCTG AAAATGTGAA TAAAATTATT TTGGATTTTG TAAAAAAAAA AAAAAAAAAA 5820 
AAAAAAAAAA AAAAAAAA 

Seq ID NO: 384 Protein sequence 
Protein Accession #: NP_005679.1 

1 11 21 31 41 51 

I i I I t I 

MKDIDIGKEY IIPSPGYRSV RERTSTSGTH RDREDSKFRR TRPLECQDAL ETAARAEGLS 60 

LDASMHSQLR ILDEEHPKGK YHHGLSALKP IRTTSKHQHP VDNAGLFSCM TFSWLSSLAR 120 

VAHKKGELSM EDVWSLSKHE SSDVNCRRLE RLWQEELNEV GPDAASLRRV VWIFCRTRLI 180 

LSIVCLMITQ IAGPSGPAPM VKHLLEYTQA TESNLQYSLL LVLGLLLTEI VRSWSLALTW 240 

ALNYRTGVRL RGAILTMAFX KILKLKNIKE KSLGELINIC SNDGQRMFEA AAVGSLLAGG 300 

PWAILGMIY NVIILGPTGF LGSAVFILFY PAMMFASRLT AYFRRKCVAA TDERVQKMNE 360 

VLTYIKFIKM YAWVKAFSQS VQKIREEERR ILEKAGYFQG ITVGVAPIW VIASWTFSV 420 

HMTLGFDLTA AQAFTWTVF NSMTFALKVT PFSVKSLSEA SVAVDRFKSL FLMEEVHMIK 480 

NKPASPHIKI EMKNATIiAWD SSHSSIONSP KLTPKMKKDK RASRGKKEKV RQLQRTEHQA 540 

VLAEQKGHLL LDSDERPSPB EEEGKHIHLG HLRLQRTLHS IDLEIQEGKL VGICGSVGSG 600 

KTSLISAILG QMTLLEGSIA ISGTFAYVAQ QAWILNATLR DNILFGKEYD EERYNSVLNS 660 

CCLRPDLAIL PSSDLTEIGE RGANLSGGQR QRISLARALY SDRSIYILDD PLSALDAHVG 720 

NHIFNSAIRK HLKSKTVLFV THQLQYLVDC DEVIFMKEGC ITERGTHEEL MNLNGDYATI 780 

FKNLIiLGETP PVEINSKKET SGSQKKSQDK GPKTGSVKKE KAVKPEEGQL VQLEEKGQGS 840 

VPWSVYGVYI QAAGGPLAFL VIMALFMLNV GSTAFSTWWL SYWIKQGSGN TTVTRGNETS 900 

VSDSMKDNPH MQYYASIYAL SMAVMLILKA IRGWFVKGT LRASSRLHDE LFRRILRSPM 960 

KFFDTTPTGR ILNRFSKDMD EVDVRLPFQA EMFIQNVILV FFCVGMIAGV FPWFLVAVGP 1020 

LVILFSVLHI VSRVLIRELK RU)MITQSPF LSHITSSIQG LATIHAYNKG QEFLHRYQEL 1080 

LDDNQAPFFL FTCAMRWUVV RLDLISIALI TTTGLMIVLM HGQIPPAYAG LAISYAVQLT 1140 

GLFQFTVRLA SETEARFTSV ERINHYIKTL SLEAPARIKN KAPSPDWPQE GEVTFENAEM 1200 

RYRENLPLVL KKVSFTIKPK EKIGIVGRTG SGKSSLGMAL FRLVELSGGC IKIDGVRISD 1260 

IGLADLRSKL SIIPQEPVLF SGTVRSNLDP FNQYTEDQIW DALERTHMKE CIAQLPLKLE 1320 

SEVMENGDNF SVGERQLLCI ARALLRHCKI LILDEATAAM DTETDLLIQE TIREAFADCT 1380 
MLTIAHRLHT VLGSDRIMVL AQGQWEFDT PSVLLSNDSS RFYAMFAAAE NKVAVKG 
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Seq ID NO: 385 DNA sequence 

Nucleic Acid Accession #: NMJJ01327.1 

Coding sequence: 89. -631 

1 11 21 31 41 SI 

] | | I ! I 

AGCAGGGGGC GCTGTGTGTA CCQAGAATAC GAGAATACCT CGTGGGCCCT GACCTTCTCT 60 

CTGAGAGCCG GGCAGAGGCT CCGGAGCCAT GCAGGCCGAA GGCCGGGGCA CAGGGGGTTC 120 

GACGGGCGAT GCTGATGGCC CAGGAGGCCC TGGCATTCCT GATGGCCCAG GGGGCAATGC 180 

TGGCGGCCCA GGAGAGGCGG GTGCCACGGG CGGCAGAGGT CCCCGGGGCG CAGGGGCAGC 240 

AAGGGCCTCG GGGCCGGGAG GAGGCGCCCC GCGGGGTCCG CATGGCGGCG CGGCTTCAGG 300 

GCTGAATGGA TGCTGCAGAT GCGGGGCCAG GGGGCCGGAG AGCCGCCTGC TTGAGTTCTA 360 

CCTCGCCATG CCTTTCGCGA CACCCATGGA AGCAGAGCTG GCCCGCAGGA GCCTGGCCCA 420 

GGATGCCCCA CCGCTTCCCG TGCCAGGGGT GCTTCTGAAG GAGTTCACTG TGTCCGGCAA 480 

CATACTGACT ATCCGACTGA CTGCTGCAGA CCACCGCCAA CTGCAGCTCT CCATCAGCTC 540 

CTGTCTCCAG CAGCTTTCCC TGTTGATGTG GATCACGCAG TGCTTTCTGC CCGTGTTTTT 600 

GGCTCAGCCT CCCTCAGGGC AGAGGCGCTA AGCCCAGCCT GGCGCCCCTT CCTAGGTCAT 660 

GCCTCCTCCC CTAGGGAATG GTCCCAGCAC GAGTGGCCAG TTCATTGTGG GGGCCTGATT 720 
GTTTGTCGCT GGAGGAGGAC GGCTTACATG TTTGTTTCTG TAGAAAATAA AACTGAGCTA 

Seq ID NO: 386 Protein sequence 
Protein Accession #: NP_0013lB.l 

1 11 21 31 41 51 

| t I t I I 

MQAEGRGTGG STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPGGGA 60 
PRGPHGGAAS GLNGCCRCGA RG P ESRLLE P YLAMPFATPM EAKLARRSLA QDAPPLPVPG 120 
VLLKEFTVSG NILTIRLTAA DHRQLQLSIS SCLQQLSLLM WITQCFLPVP LAQPPSGQRR 

Seq ID NO: 387 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 52.. 459 

1 11 21 31 41 ' 51 

I I I I I I 

CCTCGTGGGC CCTGACCTTC TCTCTGAGAG CCGGGCAGAG GCTCOGGAGC CATGCAGGCC 60 

GAAGGCCAGG GCACAGGGGG TTCGACGGGC GATGCTGATG GCCCAGGAGG CCCTGGCATT 120 

CCTGATGGCC CAGGGGGCAA TGCTGGCGGC CCAGGAGAGG CGGGTGCCAC GGGCGGCAGA 180 

GGTCCCCGGG GCGCAGGGGC AGCAAGGGCC TCGGGGCCGA GAGGAGGCGC CCCGCGGGGT 240 

CCGCATGGCG GTGCCGCTTC TGCGCAGGAT GGAAGGTGCC CCTGCGGGGC CAGGAGGCCG 300 

GACAGCCGCC TGCTTCAGTT CCGACTGACT GCTGCAGACC ACCGCCAACT GCAGCTCTCC 360 

ATCAGCTCCT GTCTCCAGCA GCTTTCCCTG TTGATGTGGA TCACGCAGTG CTTTCTGCCC 420 

GTGTTTTTGG CTCAGGCTCC CTCAGGGCAG AGGCGCTAAG CCCAGCCTGG CGCCCCTTCC 480 

TAGGTCATGC CTCCTCCCCT AGGGAATGGT CCCAGCACGA GTGGCCAGTT CATTGTGGGG 540 

GCCTGATTGT TTGTCGCTGG AGGAGGACGG CTTACATGTT TGTTTCTGTA GAAAATAAAG 600 
CTGAGCTA 

Seq ID NO: 388 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

MQAEGQGTGG STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPRGGA 60 
PRGPHGGAAS AQDGRCPCGA RRPDSRLLQF RLTAADHRQL QLSISSCLQQ LSLLMWITQC 120 
FLPVFLAQAP SGQRR 

Seq ID NO: 389 DNA sequence 
Nucleic Acid Accession #: NM_005562.1 
Coding sequence: 90. .3671 

1 n 21 31 41 51 

ACAGCGGAGC GCAGAGTGAG AACCACCAAC OGAGGOGCCG GGCAGCGACC CCTGCAGCGG 60 

AGACAGAGAC TGAGCGGCCC GGCACCGCCA TGCCTGCGCT CTGGCTGGGC TGCTGCCTCT 120 

GCTTCTCGCT CCTCCTGCCC GCAGCCCGGG CCACCTCCAG GAGGGAAGTC TGTGATTGCA 180 

ATGGGAAGTC CAGGCAGTGT ATCTTTGATC GGGAACTTCA CAGACAAACT GGTAATGGAT 240 

TCCGCTGCCT CAACTGCAAT GACAACACTG ATGGCATTCA CTGCGAGAAG TGCAAGAATG 300 

GCTTTTACOG GCACAGAGAA AGGGACCGCT GTTTGCCCTG CAATTGTAAC TCCAAAGGTT 360 

CTCTTAGTGC TCGATGTGAC AACTCTGGAC GGTGCAGCTG TAAACCAGGT GTGACAGGAG 420 

CCAGATGCGA CCGATGTCTG CCAGGCTTCC ACATGCTCAC GGATGCGGGG TGCACCCAAG 480 

ACCAGAGACT GCTAGACTCC AAGTGTGACT GTGACCCAGC TGGCATCGCA GGGCCCTGTG 540 

ACGCGGGCCG CTGTGTCTGC AAGCCAGCTG TTACTGGAGA ACGCTGTGAT AGGTGTCGAT 600 

CAGGTTACTA TAATCTGGAT GGGGGGAACC CTGAGGGCTG TACCCAGTGT TTCTGCTATG 660 

GGCATTCAGC CAGCTGCCGC AGCTCTGCAG AATACAGTGT CCATAAGATC ACCTCTACCT 720 

TTCATCAAGA TGTTGATGGC TGGAAGGCTG TCCAACGAAA TGGGTCTCCT GCAAAGCTCC 780 

AATGGTCACA GCGCCATCAA GATGTGTTTA GCTCAGCCCA ACGACTAGAC CCTGTCTATT 840 

TTGTGGCTCC TGCCAAATTT CTTGGGAATC AACAGGTGAG CTATGGGCAA AGCCTGTCCT 900 

TTGACTACCG TGTGGACAGA GGAGGCAGAC ACCCATCTGC CCATGATGTG ATTCTGGAAG 960 

GTGCTGGTCT ACGGATCACA GCTCCCTTGA TGCCACTTGG CAAGACACTG CCTTGTGGGC 1020 

TCACCAAGAC TTACACATTC AGGTTAAATG AGCATCCAAG CAATAATTGG AGCCCCCAGC 1080 

TGAGTTACTT TGAGTATCGA AGGTTACTGC GGAATCTCAC AGCCCTCCGC ATCCGAGCTA 1140 

CATATGGAGA ATACAGTACT GGGTACATTG ACAATGTGAC CCTGATTTCA GCCCGCCCTG 1200 

TCTCTGGAGC CCCAGCACCC TGGGTTGAAC AGTGTATATG TCCTGTTGGG TAC AAGGGG C 1260 

AATTCTGCCA GGATTGTGCT TCTGGCTACA AGAGAGATTC AGCGAGACTG GGGCCTTTTG 1320 
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GCACCTGTAT TCCTTGTAAC TGTCAAGGGG 
GTTATTCAGG GGATGAGAAT CCTGACATTG 
ACGATCCGCA CGACCCCCGC AGCTGCAAGC 
CAGTGATGCC GGAGACGGAG GAGGTGGTGT 
CCCGCTGTGA GCTCTGTGCT GATGGCTACT 
TGAGGCCTTG TCAGCCCTGT CAATGCAACA 
GTGACCGGCT GACAGGCAGG TGTTTGAAGT 
ACCAGTGCAA AGCAGGCTAC TTCGGGGACC 
GAGCTTGCAA CTGTAACCCC ATGGGCTCAG 
GTGTTTGCAA GCCAGGATTT GGTGGCCCCA 
CTTGCTATAA TCAAGTGAAG ATTCAGATGG 
AGGCCCTGAT TTCAAAGGCT CAGGGTGGTG 
GCAGGATGCA GCAGGCTGAG CAGGCCCTTC 
AAGGTGCTAG CAGATCCCTT GGTCTCCAGT 
ACCAGAGCCG CCTGGATGAC CTCAAGATGA 
AGTACCAGAA CCGAGTTCGG GATACTCACA 
CAGAAAGTGA AGCTTCCTTG GGAAACACTA 
CAAATGGCTT TAAAAGTCTG GCTCAGGAGG 
CAGCCAGTAA CATGGAGCAA CTGACAAGGG 
CACTGGTGCG CAAGGCCCTG CATGAAGGAG 
CTGTGGTGCA AGGGCTTGTG GAAAAATTGG 
CAAGGGAGGC CACTCAAGCG GAAATTGAAG 
TCCTGGATTC AGTGTCTCGG CTTCAGGGAG 
CAAAGAGGAT CAAACAAAAA GCGGATTCAC 
AGTTCAAGCG TACACAAAAG AATCTGGGAA 
AGAATGGAAA AAGTGGGAGA GAGAAATCAG 
AAAGCAGAGC ACAAGAAGCA CTGAGTATGG 
TCCTTAAAAA CCTCAGAGAG TTTGACCTGC 
AAGCCATGAA GAGACTCTCC TACATCAGCC 
AGCAAGCAGA AAGAGCCCTG GGGAGCGCTG 
CCGGGGAGGC CCTGGAAATC TCCAGTGAGA 
AAGCCAATGT GACAGCAGAT GGAGCCTTGG 
GTGAGATGAG GGAAGTGGAA GGAGAGCTGG 
TGGATGCAGT ACAGATGGTG ATTACAGAAG 
CTGGGGTTAC AATCCAAGAC ACACTCAACA 
AGCCTCTCAG TGTAGATGAA GAGGGGCTGG 
AGACCCAGAT CAACAGCCAA CTGCGGCCCA 
AGCAGAGGGG CCACCTCCAT TTGCTGGAGA 
AGAACTTGGA GAACATTAGG GACAACCTGC 
AGCAACAGTG AAGCTGCCAT AAATATTTCT 
GGCTOGGGAG CCATGTCATG TGAGTGGGTG 
TATGCTCAGG TCAACTGACC TGACCCCATT 
TGCACCATAC TCCTTGCTTC CTGATGCTGG 
ATGATCAAGG ATCTGGACCC CAAAGAATAG 
ATGTTTGCCT CATAATAGTC GTAAGTGGAG 
ATAGTCAACT TATTCTTTGA GTAATGTGAC 
ATGAAATTCT TCCTAATGTC AGAACAGAGT 
ACTATTGCCT CATATTGTCC TCTGCAAGCT 
ACCCAGGGTG TGAACATGTT CTCCATTTTC 
AGGACCTGTA AGGCAGGCCC ATTCAGAGCT 
GTTCTGGACC TGGGCATGAC ATCCTTTCTT 
ATTTTTATTA AAGCATTTCC TACCAGCAAA 
GTTTCAAAGT GATAGAAAAG TGTGGCTTGG 
ATTAGTCCTA ATTCAATCCT ACTTTTCGAA 
ATCTTATTTT CTCAATCTCC TCTCTCTTTC 
CACACTTCAG CTGGGTCACA TCCATCCCTC 
TTACCTCCAT CCATCCTTCC AACATATATT 
GTGGGACAGT GGTGACATAG TCTCTGCCCT 
AGCATTTTTA AAAAATAAAT TTAAACTTAC 
GCAATAACCG CTTGGTTTGC AACCTCTTTG 
CATGGGGGCA CTTGAGTTTT GGCAAGGCTG 
CATTCCAGCT GTCACTCTGT GCCTTTCTAC 
TAACACCAGT GGGAATTGCT GGAGGAACCA 
TGGTGCTGCC TTGCTTCTGT ATTTCCTTGG 
CAATTGTTAG ATGCC 



GAGGGGCCTG TGATCCAGAC ACAGGAGATT 1380 
AGTGTGCTGA CTGCCCAATT GGTTTCTACA 1440 
CATGTCCCTG TCATAACGGG TTCAGCTGCT 1500 
GCAATAACTG CCCTCCCGGG GTCACCGGTG 1560 
TTGGGGACCC CTTTGGTGAA CATGGCCCAG 1620 
ACAATGTGGA CCCCAGTGCC TCTGGGAATT 1680 
GTATCCACAA CACAGCCGGC ATCTACTGCG 1740 
CATTGGCTCC CAACCCAGCA GACAAGTGTC 1800 
AGCCTGTAGG ATGTCGAAGT GATGGCACCT 1860 
ACTGTGAGCA TGGAGCATTC AGCTGTCCAG 1920 
ATCAGTTTAT GCAGCAGCTT CAGAGAATGG 1980 
ATGGAGTAGT ACCTGATACA GAGCTGGAAG 2040 
AGGACATTCT GAGAGATGCC CAGATTTCAG 2100 
TGGCCAAGGT GAGGAGCCAA GAGAACAGCT 2160 
CTGTGGAAAG AGTTCGGGCT CTGGGAAGTC 2220 
GGCTCATCAC TCAGATGCAG CTGAGCCTGG 2280 
ACATTCCTGC CTCAGACCAC TACGTGGGGC 2340 
CCACAAGATT AGCAGAAAGC CACGTTGAGT 2400 
AAACTGAGGA CTATTCCAAA CAAGCCCTCT 2460 
TCGGAAGCGG AAGCGGTAGC CCGGACGGTG 2520 
AGAAAACCAA GTCCCTGGCC CAGCAGTTGA 2580 
CAGATAGGTC TTATCAGCAC AGTCTCCGCC 2640 
TCAGTGATCA GTCCTTTCAG GTGGAAGAAG 2700 
TCTCAACGCT GGTAACCAGG CATATGGATG 2760 
ACTGGAAAGA AGAAGCACAG CAGCTCTTAC 2820 
ATCAGCTGCT TTCCOGTGCC AATCTTGCTA 2880 
GCAATGCCAC TTTTTATGAA GTTGAGAGCA 2940 
AGGTGGACAA CAGAAAAGCA GAAGCTGAAG 3000 
AGAAGGTTTC AGATGCCAGT GACAAGACCC 3060 
CTGCTGATGC ACAGAGGGCA AAGAATGGGG 3120 
TTGAACAGGA GATTGGGAGT CTGAACTTGG 3180 
CCATGGAAAA GGGACTGGCC TCTCTGAAGA 3240 
AAAGGAAGGA GCTGGAGTTT GACACGAATA 3300 
CCCAGAAGGT TGATACCAGA GCCAAGAACG 3360 
CATTAGACGG CCTCCTGCAT CTGATGGACC 3420 
TCTTACTGGA GCAGAAGCTT TCCCGAGCCA 3480 
TGATGTCAGA GCTGGAAGAG AGGGCACGTC 3540 
CAAGCATAGA TGGGATTCTG GCTGATGTGA 3600 
CCCCAGGCTG CTACAATACC CAGGCTCTTG 3660 
CAACTGAGGT TCTTGGGATA CAGATCTCAG 3720 
GGATGGGGAC ATTTGAACAT GTTTAATGGG 3780 
CCTGATCCCA TGGCCAGGTG GTTGTCTTAT 3840 
GCAATGAGGC AGATAGCACT GGGTGTGAGA 3900 
ACTGGATGGA AAGACAAACT GCACAGGCAG 3960 
TCCTGGAATT TGGACAAGTG CTGTTGGGAT 4020 
TAAAGGAAAA AACTTTGACT TTGCCCAGGC 4080 
GCAACCCAGT CACACTGTGG CCAGTAAAAT 4140 
TCTTGCTGAT CAGAGTTCCT CCTACTTACA 4200 
AAGCTGGAAG AAGTGAGCAG TGTTGGAGTG 4260 
ATGGTGCTTG CTGGTGCCTG CCACCTTCAA 4320 
TTAATGATGC CATGGCAACT TAGAGATTGC 4380 
GCAAATGTTG GGAAAGTATT TACTTTTTCG 4440 
GCATTGAAAG AGGTAAAATT CTCTAGATTT 4500 
CACCAAAAAT GATGCGCATC AATGTATTTT 4560 
CTCCACCCAT AATAAGAGAA TGTTCCTACT 4620 
CATTCATCCT TCCATCCATC TTTCCATCCA 4680 
TATTGAGTAC CTACTGTGTG CCAGGGGCTG 4740 
CATAGAGTTG ATTGTCTAGT GAGGAAGACA 4800 
AAACTTTGTT TGTCACAAGT GGTGTTTATT 4860 
CTCAACAGAA CATATGTTGC AAGACCCTCC 4920 
ACAGAGCTCT GGGTTGTGCA CATTTCTTTG 4980 
AACTGATTGC AACAGACTGT TGAGTTATGA 5040 
GAGGCACTTC CACCTTGGCT GGGAAGACTA 5100 
ATTTTCCTGA AAGTGTTTTT AAATAAAGAA 5160 



Seq ID NO: 390 Protein sequence 
Protein Accession #s NP__005553.1 



1 
I 

MPAIjWIjGCCL 
DGIHCEKCKN 
HMLTDAGCTQ 
PBGCTQCFCY 
SSAQRLDPVY 
MPLGKTLPCG 
DNVTLISARP 
GGACDPDTGD 
CNNCPPGVTG 
CIHNTAGIYC 
NCEHGAFSCP 
QDILRDAQIS 
RLITQMQLSL 



11 
I 

CPSLLLPAAR 
GFYRHRERDR 
DQRLIiDSKCD 
GHSASCRSSA 
FVAPAKFLGN 
LTKTYTPRLN 
VSGAPAPWVE 
CYSGDENPDI 
ARCELCADGY 
DQCKAGYFGD 
ACYNQVKIQM 
EGASRSLGLQ 
AESBASLGNT 



21 
I 

ATSRREVCDC 
CLPCNCNSKG 
CDPAGIAGPC 
EYSVHKITST 
QQVSYGQSLS 
EHPSNNWSPQ 
QCICPVGYKG 
ECADCPIGPY 



PLAPNPADKC 
DQFMQQLQRM 
LAKVRSQENS 
NTPASDHYVG 



31 
I 

NGKSRQCIPD 
SLSARCDNSG 
DAGRCVCKPA 
FHQDVDGWKA 
FDYRVDRGGR 
LSYFEYRRliL 
QFCQDCASGY 
NDPHDPRSCK 
VRPCQPCQCN 
RACNCNPMGS 
EALISKAOGG 
YQSRLDDLKM 



41 
1 

RELHRQTGNG 
RCSCKPGVTG 
VTGERCDRCR 
VQRNGSPAKL 
HPSAHDVILE 
RNLTALRIRA 
KRDSARLGPF 
PCPCHNGFSC 
NNVDPSASGN 
EPVGCRSDGT 
DGWPDTELE 
TVERVRALGS 
ATRLAESHVE 



51 
I 

FRCLNCNDNT 
ARCDRCLPGF 
SGYYNLDGGN 
QWSQRHQDVF 
GAGLRITAPL 
TYGEYSTGYI 
GTCIPCNCQG 
SVMPETEEW 
CDRLTGRCLK 
CVCKPGFGGP 
GRMQQAEQAL 
QYQNRVRDTH 
SASNMEQLTR 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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ETEDYSKQAL SLVRKALHEG VGSGSGSPDG AWQGLVEKL EKTKSLAQQL TREATQAEIE B40 

ADRSYQHSLR LLDSVSRLQG VSDQSFQVEE AKRIKQKADS LSTLVTRHMD EFKRTQKNLG 900 

NWKEEAQQLL QNGKSGREKS DQLLSRANLA KSRAQEALSM GNATFYEVES ILKNLREFDL 960 

QVDNRKAEAE EAMKRLSYIS QKVSDASDKT QQAERALGSA AADAQRAKNG AGEALEISSE 1020 

5 IEQEIGSLNL EANVTADGAL AMEKGLASLK SEMREVEGEL ERKELEFDTN MDAVQMVITE 1080 

AQKVDTRAKN AGVTIQDTLN TLDGLLHLMD QPLSVDEEGL VLLEQKLSRA KTQINSQLRP 1140 
MMSELEERAR QQRGHLHLLE TSIDGILADV KNLENIRDNL PPGCYNTQAL EQQ 

Seq ID NO: 391 DMA sequence 
10 Nucleic Acid Accession ft: AF1010S1.1 
Coding sequence: 221.856 

1 11 21 31 41 51 

I I I I I I 

1!> GAGCAACCTC AGCTTCTAGT ATCCAGACTC CAGCGCCGCC CCGGGCGCGG ACCCCAACCC 60 
CGACCCAGAG CTTCTCCAGC GGCGGCGCAG CGAGCAGGGC TCCCCGCCTT AACTTCCTCC 120 
GCGGGGCCCA GCCACCTTCG GGAGTCCGGG TTGCCCACCT GCAAACTCTC CGCCTTCTGC 180 
ACCTGCCACC CCTGAGCCAG CGCGGGCGCC CGAGCGAGTC ATGGCCAACG CGGGGCTGCA 240 
GCTGTTGGGC TTCATTCTCG CCTTCCTGGG ATGGATCGGC GCCATCGTCA GCACTGCCCT 300 
20 GCCCCAGTGG AGGATTTACT CCTATGCCGG CGACAACATC GTGACCGCCC AGGCCATGTA 360 
CGAGGGGCTG TGGATGTCCT GCGTGTCGCA GAGCACCGGG CAGATCCAGT GCAAAGTCTT 420 
TGACTCCTTG CTGAATCTGA GCAGCACATT GCAAGCAACC CGTGCCTTGA TGGTGGTTGG 4 80 
CATCCTCCTG GGAGTGATAG CAATCTTTGT GGCCACCGTT GGCATGAAGT GTATGAAGTG 540 
CTTGGAAGAC GATGAGGTGC AGAAGATGAG GATGGCTGTC ATTGGGGGTG CGATATTTCT 600 
25 TCTTGCAGGT CTGGCTATTT TAGTTGCCAC AGCATGGTAT GGCAATAGAA TCGTTCAAGA 660 
ATTCTATGAC CCTATGACCC CAGTCAATGC CAGGTACGAA TTTGGTCAGG CTCTCTTCAC 720 
TGGCTGGGCT GCTGCTTCTC TCTGCCTTCT GGGAGGTGCC CTACTTTGCT GTTCCTGTCC 780 
CCGAAAAACA ACCTCTTACC CAACACCAAG GCCCTATCCA AAACCTGCAC CTTCCAGCGG 840 
GAAAGACTAC GTGTGACACA GAGGCAAAAG GAGAAAATCA TGTTGAAACA AACCGAAAAT 900 
30 GGACATTGAG ATACTATCAT TAACATTAGG ACCTTAGAAT TTTGGGTATT GTAATCTGAA 960 
GTATGGTATT ACAAAACAAA CAAACAAACA AAAAACCCAT GTGTTAAAAT ACTCAGTGCT 1020 
AAACATGGCT TAATCTTATT TTATCTTCTT TCCTCAATAT AGGAGGGAAG ATTTTACCAT 1080 
TTGTATTACT GCTTCCCATT GAGTAATCAT ACTCAAATGG GGGAAGGGGT GCTCCTTAAA 1140 
TATATATAGA TATGTATATA TACATGTTTT TCTATTAAAA ATAGACAGTA AAATACTATT 1200 
35 CTCATTATGT TGATACTAGC ATACTTAAAA TATCTCTAAA ATAGGTAAAT GTATTTAATT 1260 
CCATATTGAT GAAGATGTTT ATTGGTATAT TTTCTTTTTC GTCCTTATAT ACATATGTAA 1320 
CAGTCAAATA TCATTTACTC TTCTTCATTA GCTTTGGGTG CCTTTGCCAC AAGACCTAGC 1380 
CTAATTTACC AAGGATGAAT TCTTTCAATT CTTCATGCGT GCCCTTTTCA TATACTTATT 1440 
TTATTTTTTA CCATAATCTT ATAGCACTTG CATCGTTATT AAGCCCTTAT TTGTTTTGTG 1500 
40 TTTCATTGGT CTCTATCTCC TGAATCTAAC ACATTTCATA GCCTACATTT TAGTTTCTAA 1560 
AGCCAAGAAG AATTTATTAC AAATCAGAAC TTTGGAGGCA AATCTTTCTG CATGACCAAA 1620 
GTGATAAATT CCTGTTGACC TTCCCACACA ATCCCTGTAC TCTGACCCAT AGCACTCTTG 1680 
TTTGCTTTGA AAATATTTGT CCAATTGAGT AGCTGCATGC TGTTCCCCCA GGTGTTGTAA 1740 
CACAACTTTA TTGATTGAAT TTTTAAGCTA CTTATTCATA GTTTTATATC CCCCTAAACT 1800 
45 ACCTTTTTGT TCCCCATTCC TTAATTGTAT TGTTTTCCCA AGTGTAATTA TCATGCGTTT 1860 
TATATCTTCC TAATAAGGTG TGGTCTGTTT GTCTGAACAA AGTGCTAGAC TTTCTGGAGT 1920 
GATAATCTGG TGACAAATAT TCTCTCTGTA GCTGTAAGCA AGTCACTTAA TCTTTCTACC 1980 
TCTTTTTTCT ATCTGCCAAA TTGAGATAAT GATACTTAAC CAGTTAGAAG AGGTAGTGTG 2040 
AATATTAATT AGTTTATATT ACTCTCATTC TTTGAACATG AACTATGCCT ATGTAGTGTC 2100 
50 TTTATTTGCT CAGCTGGCTG AGACACTGAA GAAGTCACTG AACAAAACCT ACACACGTAC 2160 
CTTCATGTGA TTCACTGCCT TCCTCTCTCT ACCAGTCTAT TTCCACTGAA CAAAACCTAC 2220 
ACACATACCT TCATGTGGTT CAGTGCCTTC CTCTCTCTAC CAGTCTATTT CCACTGAACA 2280 
AAACCTACGC ACATACCTTC ATGTGGCTCA GTGCCTTCCT CTCTCTACCA GTCTATTTCC 2340 
ATTCTTTCAG CTGTGTCTGA CATGTTTGTG CTCTGTTCCA TTTTAACAAC TGCTCTTACT 2400 
55 TTTCCAGTCT GTACAGAATG CTATTTCACT TGAGCAAGAT GATGTATGGA AAGGGTGTTG 2460 
GCACTGGTGT CTGGAGACCT GGATTTGAGT CTTGGTGCTA TCAATCACCG TCTGTGTTTG 2520 
AGCAAGGCAT TTGGCTGCTG TAAGCTTATT GCTTCATCTG TAAGCGGTGG TTTGTAATTC 2580 
CTGATCTTCC CACCTCACAG TGATGTTGTG GGGATCCAGT GAGATAGAAT ACATGTAAGT 2640 
GTGGTTTTGT AATTTGAAAA GTGCTATACT AAGGGAAAGA ATTGAGGAAT TAACTGCATA 2700 
60 CGTTTTGGTG TTGCTTTTCA AATGTTTGAA AATAAAAAAA TGTTAAGAAA TGGGTTTCTT 2760 
GCCTTAACCA GTCTCTCAAG TGATGAGACA GTGAAGTAAA ATTGAGTGCA CTAAACGAAT 2820 
AAGATTCTGA GGAAGTCTTA TCTTCTGCAG TGAGTATGGC CCAATGCTTT CTGTGGCTAA 2880 
ACAGATGTAA TGGGAAGAAA TAAAAGCCTA CGTGTTGGTA AATCCAACAG CAAGGGAGAT 2940 
TTTTGAATCA TAATAACTCA TAAGGTGCTA TCTGTTCAGT GATGCCCTCA GAGCTCTTGC 3000 
65 TGTTAGCTGG CAGCTGACGC TGCTAGGATA GTTAGTTTGG AAATGGTACT TCATAATAAA 3060 
CTACACAAGG AAAGTCAGCC ACCGTGTCTT ATGAGGAATT GGACCTAATA AATTTT AGTG 3120 
TGCCTTCCAA ACCTGAGAAT ATATGCTTTT GGAAGTTAAA ATTTAAATGG CTTTTGCCAC 3180 
ATACATAGAT CTTCATGATG TGTGAGTGTA ATTCCATGTG GATATCAGTT ACCAAACATT 3240 
ACAAAAAAAT TTTATGGCCC AAAATGACCA ACGAAATTGT TACAATAGAA TTTA TCCAAT 3300 
70 TTTGATCTTT TTATATTCTT CTACCACACC TGGAAACAGA CCAATAGACA TTTTGGGGTT 3360 
TTATAATGGG AATTTGTATA AAGCATTACT CTTTTTCAAT AAATTGTTTT TTAATTTAAA 3420 
AAAAGGAAAA AAAAAAAAAA AAA 

Seq ID NO: 392 Protein sequence 
75 Protein Accession #: AAD16433.1 

I 11 21 31 41 51 

II I 1 I I 

MANAGLQLLG F I LAP LOW I G AIVSTALPQW RIYSYAGDNI VTAQAMYEGL WMSCVSQSTG 60 
80 QIQCKVFDSL LNLSSTLQAT RALMWGILL GVIAIFVATV GMKCMKCLED DEVQKMRMAV 120 
IGGAIPLLAG LAI LVATAWY GNRIVQEFYD PMTPVNARYE FGQALFTGWA AASLCLLGGA 180 
LLCCSCPRKT TSYPTPRPYP KPAPSSGKDY V 
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Seq ID NO: 393 DNA sequence 

Nucleic Acid Accession #: NM_006180.1 

Coding sequence : 352 . . 2820 

1 11 21 31 41 51 

I I I I I I 

CCCCCATTCG CATCTAACAA GGAATCTGCG CCCCAGAGAG TCCCGGACGC CGCCGGTOGG 60 

TGCCCGGCGC GCCGGGCCAT GCAGCGACGG CCGCCGCGGA GCTCCGAGCA GCGGTAGCGC 120 

CCCCCTGTAA AGCGGTTCGC TATGCCGGGA CCACTGTGAA CCCTGCCGCC TGCCGGAACA 180 

CTCTTCGCTC CGGACCAGCT CAGCCTCTGA TAAGCTGGAC TCGGCACGCC CGCAACAAGC 240 

ACCGAGGAGT TAAGAGAGCC GCAAGCGCAG GGAAGGCCTC CCCGCACGGG TGGGGGAAAG 300 

CGGCCGGTGC AGCGCGGGGA CAGGCACTCG GGCTGGCACT GGCTGCTAGG GATGTCGTCC 360 

TGGATAAGGT GGCATGGACC CGCCATGGCG CGGCTCTGGG GCTTCTGCTG GCTGGTTGTG 420 

GGCTTCTGGA GGGCCGCTTT CGCCTGTCCC ACGTCCTGCA AATGCAGTGC CTCTCGGATC 480 

TGGTGCAGCG ACCCTTCTCC TGGCATCGTG GCATTTCCGA GATTGGAGCC TAACAGTGTA 540 

GATCCTGAGA ACATCACCGA AATTTTCATC GCAAACCAGA AAAGGTTAGA AATCATCAAC 600 

GAAGATGATG TTGAAGCTTA TGTGGGACTG AGAAATCTGA CAATTGTGGA TTCTGGATTA 660 

AAATTTGTGG CTCATAAAGC ATTTCTGAAA AACAGCAACC TGCAGCACAT CAATTTTACC 720 

CGAAACAAAC TGACGAGTTT GTCTAGGAAA CATTTCCGTC ACCTTGACTT GTCTGAACTG 780 

ATCCTGGTGG GCAATCCATT TACATGCTCC TGTGACATTA TGTGGATCAA GACTCTCCAA 840 

GAGGCTAAAT CCAGTCCAGA CACTCAGGAT TTGTACTGCC TGAATGAAAG CAGCAAGAAT 900 

ATTCCCCTGG CAAACCTGCA GATACCCAAT TGTGGTTTGC CATCTGCAAA TCTGGCCGCA 960 

CCTAACCTCA CTGTGGAGGA AGGAAAGTCT ATCACATTAT CCTGTAGTGT GGCAGGTGAT 1020 

CCGGTTCCTA ATATGTATTG GGATGTTGGT AACCTGGTTT CCAAACATAT GAATGAAACA 1080 

AGCCACACAC AGGGCTCCTT AAGGATAACT AACATTTCAT CCGATGACAG TGGGAAGCAG 1140 

ATCTCTTGTG TGGCGGAAAA TCTTGTAGGA GAAGATCAAG ATTCTGTCAA CCTCACTGTG 1200 

CATTTTGCAC CAACTATCAC ATTTCTCGAA TCTCCAACCT CAGACCACCA CTGGTGCATT 1260 

CCATTCACTG TGAAAGGCAA CCCCAAACCA GCGCTTCAGT GGTTCTATAA CGGGGCAATA 1320 

TTGAATGAGT CCAAATACAT CTGTACTAAA ATACATGTTA CCAATCACAC GGAGTACCAC 1380 

GGCTGCCTCC AGCTGGATAA TCCCACTCAC ATGAACAATG GGGACTACAC TCTAATAGCC 1440 

AAGAATGAGT ATGGGAAGGA TGAGAAACAG ATTTCTGCTC ACTTCATGGG CTGGCCTGGA 1500 

ATTGACGATG GTGCAAACCC AAATTATCCT GATGTAATTT ATGAAGATTA TGGAACTGCA 1560 

GCGAATGACA TCGGGGACAC CACGAACAGA AGTAATGAAA TCCCTTCCAC AGACGTCACT 1620 

GATAAAACCG GTCGGGAACA TCTCTCGGTC TATGCTGTGG TGGTGATTGC GTCTGTGGTG 1680 

GGATTTTGCC TTTTGGTAAT GCTGTTTCTG CTTAAGTTGG CAAGACACTC CAAGTTTGGC 1740 

ATGAAAGGCC CAGCCTCCGT TATCAGCAAT GATGATGACT CTGCCAGCCC ACTCCATCAC 1800 

ATCTCCAATG GGAGTAACAC TCCATCTTCT TCGGAAGGTG GCCCAGATGC TGTCATTATT 1860 

GGAATGACCA AGATCCCTGT CATTGAAAAT CCCCAGTACT TTGGCATCAC CAACAGTCAG 1920 

CTCAAGCCAG ACACATTTGT TCAGCACATC AAGOGACATA ACATTGTTCT GAAAAGGGAG 1980 

CTAGGCGAAG GAGCCTTTGG AAAAGTGTTC CTAGCTGAAT GCTATAACCT CTGTCCTGAG 2040 

CAGGACAAGA TCTTGGTGGC AGTGAAGACC CTGAAGGATG CCAGTGACAA TGCACGCAAG 2100 

GACTTCCACC GTGAGGCCGA GCTCCTGACC AACCTCCAGC ATGAGCACAT CGTCAAGTTC 2160 

TATGGCGTCT GCGTGGAGGG CGACCCCCTC ATCATGGTCT TTGAGTACAT GAAGCATGGG 2220 

GACCTCAACA AGTTCCTCAG GGCACACGGC CCTGATGCCG TGCTGATGGC TGAGGGCAAC 2280 

CCGCCCACGG AACTGACGCA GTCGCAGATG CTGCATATAG CCCAGCAGAT CGCCGCGGGC 2340 

ATGGTCTACC TGGCGTCCCA GCACTTCGTG CACCGCGATT TGGCCACCAG GAACTGCCTG 2400 

GTCGGGGAGA ACTTGCTGGT GAAAATCGGG GACTTTGGGA TGTCCCGGGA CGTGTACAGC 2460 

ACTGACTACT ACAGGGTCGG TGGCCACACA ATGCTGCCCA TTCGCTGGAT GCCTCCAGAG 2520 

AGGATCATGT ACAGGAAATT CACGACGGAA AGCGACGTCT GGAGCCTGGG GGTCGTGTTG 2580 

TGGGAGATTT TCACCTATGG CAAACAGCCC TGGTACCAGC TGTCAAACAA TGAGGTGATA 2640 

GAGTGTATCA CTCAGGGCCG AGTCCTGCAG CGACCCCGCA CGTGCCCCCA GGAGGTGTAT 2700 

GAGCTGATGC TGGGGTGCTG GCAGCGAGAG CCCCACATGA GGAAGAACAT CAAGGGCATC 2760 

CATACCCTCC TTCAGAACTT GGCCAAGGCA TCTCCGGTCT ACCTGGACAT TCTAGGCTAG 2820 

GGCCCTTTTC CCCAGACCGA TCCTTCCCAA CGTACTCCTC AGACGGGCTG AGAGGATGAA 2880 

CATCTTTTAA CTGCCGCTGG AGGCCACCAA GCTGCTCTCC TTCACTCTGA CAGTATTAAC 2940 

ATCAAAGACT CCGAGAAGCT CTCGAGGGAA GCAGTGTGTA CTTCTTCATC CATAGACACA 3000 

GTATTGACTT CTTTTTGGCA TTATCTCTTT CTCTCTTTCC ATCTCCCTTG GTTGTTCCTT 3060 

TTTCTTTTTT TAAATTTTCT TTTTCTTCTT TTTTTTCGTC TTCCCTGCTT CACGATTCTT 3120 

ACCCTTTCTT TTGAATCAAT CTGGCTTCTG CATTACTATT AACTCTGCAT AGACAAAGGC 3180 

CTTAACAAAC GTAATTTGTT ATATCAGCAG ACACTCCAGT TTGCCCACCA CAACTAACAA 3240 

TGCCTTGTTG TATTCCTGCC TTTGATGTGG ATGAAAAAAA GGGAAAACAA ATATTTCACT 3300 

TAAACTTTGT CACTTCTGCT GTACAGATAT CGAGAGTTTC TATGGATTCA CTTCTATTTA 3360 

TTTATTATTA TTACTGTTCT TATTGTTTTT GGATGGCTTA AGCCTGTGTA TAAAAAAGAA 3420 

AACTTGTGTT CAATCTGTGA AGCCTTTATC TATGGGAGAT TAAAACCAGA GAGAAAGAAG 3480 

ATTTATTATG AACCGCAATA TGGGAGGAAC AAAGACAACC ACTGGGATCA GCTGGTGTCA 3540 

GTCCCTACTT AGGAAATACT CAGCAACTGT TAGCTGGGAA GAATGTATTC GGCACCTTCC 3600 

CCTGAGGACC TTTCTGAGGA GTAAAAAGAC TACTGGCCTC TGTGCCATGG ATGATTCTTT 3660 
TCCCATCACC AGAAATGATA GCGTGCAGTA GAGAGCAAAG ATGGCTT 

Seq ID NO» 394 Protein sequence 
Protein Accession ft: NP_006171.1 

1 11 21 31 41 51 

I I I I I I 

MSSWIRWHGP AMARLWGFCW LWGFWRAAF ACPTSCKCSA SRIWCSDPSP GIVAFPRLEP 60 

NSVDPENITE IFIANQKRLE IINEDDVEAY VGLRNLTIVD SGLKFVAHKA FLKNSNLQHI 120 

NFTRNKLTSL SRKHPRHLDL SELILVGNPF TCSCDIMWIK TLQEAKSSPD TQDLYCLNES 180 

SKNIPLANLQ IPNCGLPSAN LAAPNLTVSE GKSITLSCSV AGDPVPNMYW DVGNLVSKHM 240 

NETSHTQGSL RITNISSDDS GKQISCVAEN LVGEDQDSVN LTVHFAPTIT FLESPTSDHH 300 

WCIPFTVKGN PKPALQWFYN GAILNESKYI CTKIHVTNHT EYHGCLQLDN PTHMNNGDYT 360 

LIAKNEYGKD EKQISAHFMG WPGIDDGANP NYPDVIYEDY GTAANDIGDT TNRSNEIPST '420 

DVTDKTGREH LSVYAWVIA SWGFCLLVM LFLLKLARHS KFGMKGPASV ISNDDDSASP 480 

LHHISNGSNT PSSSEGGPDA VIIGMTKIPV IENPQYFGIT NSQLKPDTFV QHIKRHNIVL 540 

KRELGEGAFG KVFLAECYNL CPEQDKILVA VKTLKDASDN ARKDFHREAE LLTNLQHEHI 600 
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VKFYGVCVEG DPLIMVFEYM KHGDLNKFLR AHGPDAVLMA EGNPPTELTQ SOMLHIAQQI 660 

AAQMVYLASQ HFVHRDLATR NCLVGENLLV KIGDFGMSRD VYSTDYYRVG GHTMLPIRWM 720 

PPESIMYRKF TTESDVWSLG WLWEIFTYG KQPWYQLSNN EVIECITQGR VLQRPRTCPQ 780 
EVYELMLGCW QREPHMRKNI KGIHTLLQNL AKASPVYLDI LG 

Seq ID NO: 395 DNA sequence 
Nucleic Acid Accession 8: AF410899 
Coding sequence: 483.-2999 

X 11 21 31 41 51 

I I I I I 1 

GGGAGCAGGA GCCTCGCTGG CTGCTTCGCT CGCGCTCTAC GCGCTCAGTC CCCGGCGGTA 60 

GCAGGAGCCT GGACCCAGGC GCCGGCGGCG GGCGTGAGGC GCCGGAGCCC GGCCTCGAGG 120 

TGCATACCGG ACCCCCATTC GCATCTAACA AGGAATCTGC GCCCCAGAGA GTCCCGGACG 180 

CCGCCGGTCG GTGCCCGGCG CGCCGGGCCA TGCAGCGACG GCCGCCGCGG AGCTCCGAGC 240 

AGCGGTAGCG CCCCCCTGTA AAGCGGTTCG CTATGCCGGG ACCACTGTGA ACCCTGCCGC 300 

CTGCCGGAAC ACTCTTCGCT CCGGACCAGC TCAGCCTCTG ATAAGCTGGA CTCGGCACGC 360 

CCGCAACAAG CACCGAGGAG TTAAGAGAGC CGCAAGCGCA GGGAAGGCCT CCCCGCACGG 420 

GTGGGGGAAA GCGGCCGGTG CAGCGCGGGG ACAGGCACTC GGGCTGGCAC TGGCTGCTAG 480 

GGATGTCGTC CTGGATAAGG TGGCATGGAC CCGCCATGGC GCGGCTCTGG GGCTTCTGCT S40 

GGCTGGTTGT GGGCTTCTGG AGGGCCGCTT TCGCCTGTCC CACGTCCTGC AAATGCAGTG 600 

CCTCTCGGAT CTGGTGCAGC GACCCTTCTC CTGGCATCGT GGCATTTCCG AGATTGGAGC 660 

CTAACAGTGT AGATCCTGAG AACATCACCG AAATTTTCAT CGCAAACCAG AAAAGGTTAG 720 

AAATCATCAA CGAAGATGAT GTTGAAGCTT ATGTGGGACT GAGAAATCTG ACAATTGTGG 780 

ATTCTGGATT AAAATTTGTG GCTCATAAAG CATTTCTGAA AAACAGCAAC CTGCAGCACA 840 

TCAATTTTAC CCGAAACAAA CTGACGAGTT TGTCTAGGAA ACATTTCCGT CACCTTGACT 900 

TGTCTGAACT GATCCTGGTG GGCAATCCAT TTACATGCTC CTGTGACATT ATGTGGATCA 960 

AGACTCTCCA AGAGGCTAAA TCCAGTCCAG ACACTCAGGA TTTGTACTGC CTGAATGAAA 1020 

GCAGCAAGAA TATTCCCCTG GCAAACCTGC AGATACCCAA TTGTGGTTTG CCATCTGCAA 1080 

ATCTGGCCGC ACCTAACCTC ACTGTGGAGG AAGGAAAGTC TATCACATTA TCCTGTAGTG 1140 

TGGCAGGTGA TCCGGTTCCT AATATGTATT GGGATGTTGG TAACCTGGTT TCCAAACATA 1200 

TGAATGAAAC AAGCCACACA CAGGGCTCCT TAAGGATAAC TAACATTTCA TCCGATGACA 1260 

GTGGGAAGCA GATCTCTTGT GTGGCGGAAA ATCTTGTAGG AGAAGATCAA GATTCTGTCA 1320 

ACCTCACTGT GCATTTTGCA CCAACTATCA CATTTCTCGA ATCTCCAACC TCAGACCACC 1380 

ACTGGTGCAT TCCATTCACT GTGAAAGGCA ACCCCAAACC AGCGCTTCAG TGGTTCTATA 1440 

ACGGGGCAAT ATTGAATGAG TCCAAATACA TCTGTACTAA AATACATGTT ACCAATCACA 1500 

CGGAGTACCA CGGCTGCCTC CAGCTGGATA ATCCCACTCA CATGAACAAT GGGGACTACA 1560 

CTCTAATAGC CAAGAATGAG TATGGGAAGG ATGAGAAACA GATTTCTGCT CACTTCATGG 1620 

GCTGGCCTGG AATTGACGAT GGTGCAAACC CAAATTATCC TGATGTAATT TATGAAGATT 1680 

ATGGAACTGC AGCGAATGAC ATCGGGGACA GCAGGAACAG AAGTAATGAA ATCCCTTCCA 1740 

CAGACGTCAC TGATAAAACC GGTCGGGAAC ATCTCTOGGT CTATGCTGTG GTGGTGATTG 1800 

CGTCTGTGGT GGGATTTTGC CTTTTGGTAA TGCTGTTTCT GCTTAAGTTG GCAAGACACT 1860 

CCAAGTTTGG CATGAAAGAT TTCTCATGGT TTGGATTTGG GAAAGTAAAA TCAAGACAAG 1920 

GTGTTGGCCC AGCCTCCGTT ATCAGCAATG ATGATGACTC TGCCAGCCCA CTCCATCACA 1980 

TCTCCAATGG GAGTAACACT CCATCTTCTT CGGAAGGTGG CCCAGATGCT GTCATTATTG 2040 

GAATGACCAA GATCCCTGTC ATTGAAAATC CCCAGTACTT TGGCATCACC AACAGTCAGC 2100 

TCAAGCCAGA CACATTTGTT CAGCACATCA AGCGACATAA CATTGTTCTG AAAAGGGAGC 2160 

TAGGCGAAGG AGCCTTTGGA AAAGTGTTCC TAGCTGAATG CTATAACCTC TGTCCTGAGC 2220 

AGGACAAGAT CTTGGTGGCA GTGAAGACCC TGAAGGATGC CAGTGACAAT GCACGCAAGG 2280 

ACTTCCACCG TGAGGCCGAG CTCCTGACCA ACCTCCAGCA TGAGCACATC GTCAAGTTCT 2340 

ATGGCGTCTG CGTGGAGGGC GACCCCCTCA TCATGGTCTT TGAGTACATG AAGCATGGGG 2400 

ACCTCAACAA GTTCCTCAGG GCACACGGCC CTGATGCCGT GCTGATGGCT GAGGGCAACC 2460 

CGCCCACGGA ACTGACGCAG TCGCAGATGC TGCATATAGC CCAGCAGATC GCCGCGGGCA 2520 

TGGTCTACCT GGCGTCCCAG CACTTCGTGC ACCGCGATTT GGCCACCAGG AACTGCCTGG 2580 

TCGGGGAGAA CTTX3CTGGTG AAAATCGGGG ACTTTGGGAT GTCCCGGGAC GTGTACAGCA 2640 

CTGACTACTA CAGGGTCGGT GGCCACACAA TGCTGCCCAT TCGCTGGATG CCTCCAGAGA 2700 

GCATCATGTA CAGGAAATTC ACGACGGAAA GCGACGTCTG GAGCCTGGGG GTCGTGTTGT 2760 

GGGAGATTTT CACCTATGGC AAACAGCCCT GGTACCAGCT GTCAAACAAT GAGGTGATAG 2820 

AGTGTATCAC TCAGGGCCGA GTCCTGCAGC GACCCCGCAC GTGCCCCCAG GAGGTGTATG 2880 

AGCTGATGCT GGGGTGCTGG CAGCGAGAGC CCCACATGAG GAAGAACATC AAGGGCATCC 2940 

ATACCCTCCT TCAGAACTTG GCCAAGGCAT CTCCGGTCTA CCTGGACATT CTAGGCTAGG 3000 

GCCCTTTTCC CCAGACCGAT CCTTCCCAAC GTACTCCTCA GACGGGCTGA GAGGATGAAC 3060 

ATCTTTTAAC TGCCGCTGGA GGCCACCAAG CTGCTCTCCT TCACTCTGAC AGTATTAACA 3120 

TCAAAGACTC CGAGAAGCTC TCGAGGGAAG CAGTGTGTAC TTCTTCATCC ATAGACACAG 3180 

TATTGACTTC TTTTTGGCAT TATCTCTTTC TCTCTTTCCA TCTCCCTTGG TTGTTCCTTT 3240 

TTCTTTTTTT AAATTTTCTT TTTCTTCTTT TTTTTCGTCT TCCCTGCTTC ACGATTCTTA 3300 

CCCTTTCTTT TGAATCAATC TGGCTTCTGC ATTACTATTA ACTCTGCATA GACAAAGGCC 3360 

TTAACAAACG TAATTTGTTA TATCAGCAGA CACTCCAGTT TGCCCACCAC AACTAACAAT 3420 

GCCTTGTTGT . ATTCCTGCCT TTGATGTGGA TGAAAAAAAG GGAAAACAAA TATTTCACTT 3480 

AAACTTTGTC ACTTCTGCTG TACAGATATC GAGAGTTTCT ATGGATTCAC TTCTATTTAT 3540 

TTATTATTAT TACTGTTCTT ATTGTTTTTG GATGGCTTAA GCCTGTGTAT AAAAAAGAAA 3600 

ACTTGTGTTC AATCTGTGAA GCCTTTATCT ATGGGAGATT AAAACCAGAG AGAAAGAAGA 3660 

TTTATTATGA ACCGCAATAT GGGAGGAACA AAGACAACCA CTGGGATCAG CTGGTGTCAG 3720 

TCCCTACTTA GGAAATACTC AGCAACTGTT AGCTGGGAAG AATGTATTCG GCACCTTCCC 3780 

CTGAGGACCT TTCTGAGGAG TAAAAAGACT ACTGGCCTCT GTGCCATGGA TGATTCTTTT 3840 

CCCATCACCA GAAATGATAG CGTGCAGTAG AGAGCAAAGA TGGCTTCCGT GAGACACAAG 3900 

ATGGCGCATA GTGTGCTCGG ACACAGTTTT GTCTTCGTAG GTTGTGATGA TAGCACTGGT 3 960 

TTGTTTCTCA AGCGCTATCC ACAGAACCTT TGTCAACTTC AGTTGAAAAG AGGTGGATTC 4020 
ATGTCCAGAG CTCATTTCGG GGTCAGGTGG GAAAGCC 

Seq ID NO: 396 Protein sequence 
Protein Accession #: AAL67965.1 
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MSSWIRWHGP AMARLWGFCW LWGFWRAAF ACPTSCKCSA SRIWCSDPSP GIVAPPRLEP 60 

NSVDPENITE IFIANQKRLE IINEDDVBAY VGLRNLTIVD SGLKFVAHKA FLKNSNLQHI 120 

NFTRNKLTSL SRKHFRHLDL SELILVGNPF TCSCDIMWIK TLQEAKSSPD TQDLYCLNES 180 

SKNIPLANLQ IPNCGLPSAN LAAPNLTVEE GKSITLSCSV AGDPVPNMYW DVGNLVSKHM 240 

NETSHTQGSL RITNISSDDS GKQISCVAEN LVGEDQDSVN LTVHFAPTIT FLESPTSDHH 300 

WCIPFTVKGN PKPALQWFYN GAILNESKYI CTKIHVTNHT EYHGCLQLDN PTHMNNGDYT 360 

LIAKNEYGKD EKQISAHFMG WPGIDDGANP MYPDVIYEDY GTAAND1GDT TNRSNEIPST 420 

DVTDKTGREH LSVYAWVIA SWGFCLLVM LFLLKLARHS KFGMKDFSWF GPGKVKSRQG 480 

VGPASVISND DDSASPLHHI SNGSNTPSSS EGGPDAVIIG MTKIPVIENP QYFGITNSQL 540 

KPDTFVQHIK RHNIVLKREb GEGAFQKVFL AECYNLCPEQ DKILVAVKTL KDASDNARKD 600 

FHREAELLTN LQHEHIVKFY GVCVEGDPLI MVPEYMKHGD LNKFLRAHGP DAVLMAEGNP 660 

PTELTQSQML HIAQQIAAGM VYLASQHFVH RDLATRNCLV GENLLVKIGD FGMSRDVYST 720 

DYYHVGGHTM LPIRWMPPES IMYRKFTTES DVWSLGWLW EIFTYGKQPW YQLSNNEVIE 780 
CITQGRVLQR PRTCPQEVYE LMLGCWQREP HMRKNIKGIH TLLQNLAKAS PVYLDILG 

Seq ID NO: 397 DNA sequence 
Nucleic Acid Accession g: AB052906 
Coding sequence: 74.. 814 

1 11 21 31 41 51 

I I I ! I I 

AAAACCTTGA GGTGATTCAT CTTCCAGGCT CTCCTTCCAT CAAGTCTCTC CTCCCTAGCG 60 

CTCTGGGTCC TTAATGGCAG CAGCCGCCGC TACCAAGATC CTTCTGTGCC TCCCGCTTCT 120 

GCTCCTGCTG TCCGGCTGGT CCCGGGCTGG GCGAGCCGAC CCTCACTCTC TTTGCTATGA 180 

CATCACCGTC ATCCCTAAGT TCAGACCTGG ACCACGGTGG TGTGCGGTTC AAGGCCAGGT 240 

GGATGAAAAG ACTTTTCTTC ACTATGACTG TGGCAACAAG ACAGTCACAC CTGTCAGTCC 300 

CCTGGGGAAG AAACTAAATG TCACAACGGC CTGGAAAGCA CAGAACCCAG TACTGAGAGA 360 

GGTGGTGGAC ATACTTACAG AGCAACTGCG TGACATTCAG CTGGAGAATT ACACACCCAA 420 

GGAACCCCTC ACCCTGCAGG CCAGGATGTC TTGTGAGCAG AAAGCTGAAG GACACAGCAG 480 

TGGATCTTGG CAGTTCAGTT TCGATGGGCA GATCTTCCTC CTCTTTGACT CAGAGAAGAG 540 

AATGTGGACA ACGGTTCATC CTGGAGCCAG AAAGATGAAA GAAAAGTGGG AGAATGACAA 600 

GGTTGTGGCC ATGTCCTTCC ATTACTTCTC AATGGGAGAC TGTATAGGAT GGCTTGAGGA 660 

CTTCTTGATG GGCATGGACA GCACCCTGGA GCCAAGTGCA GGAGCACCAC TOGCCATGTC 720 

CTCAGGCACA ACCCAACTCA GGGCCACAGC CACCACCCTC ATCCTTTGCT GCCTCCTCAT 780 

CATCCTCCCC TGCTTCATCC TCCCTGGCAT CTGAGGAGAG TCCTTTAGAG TGACAGGTTA 840 

AAGCTGATAC CAAAAGGCTC CTGTGAGCAC GGTCTTGATC AAACTCGCCC TTCTGTCTGG 900 

CCAGCTGCCC ACGACCTACG GTGTATGTCC AGTGGCCTCC AGCAGATCAT GATGACATCA 960 

TGGACCCAAT AGCTCATTCA CTGCCTTGAT TCCTTTTGCC AACAATTTTA CCAGCAGTTA 1020 

TACCTAACAT ATTATGCAAT TTTCTCTTGG TGCTACCTGA TGGAATTCCT GCACTTAAAG 1080 

TTCTGGCTGA CTAAACAAGA TATATCATTT TCTTTCTTCT CTTTTTGTTT GGAAAATCAA 1140 

GTACTTCTTT GAATGATGAT CTCTTTCTTG CAAATGATAT TGTCAGTAAA ATAATCACGT 1200 

TAGACTTCAG ACCTCTGGGG ATTCTTTCCG TGTCCTGAAA GAGAATTTTT AAATTATTTA 1260 

ATAAGAAAAA ATTTATATTA ATGATTGTTT CCTTTAGTAA TTTATTGTTC TGTACTGATA 1320 
TTTAAATAAA GAGTTCTATT TCCCAAAAAA AAAAAAAAAA AA 

Seq ID NO: 398 Protein sequence 
Protein Accession #: BAB61048.1 

1 11 21 31 41 51 

I I I I I I 

MAAAAATKIL LCLPLLLLLS GWSRAGRADP HSLCYDITVI PKPRPGPRWC AVQGQVDEKT 60 

FLHYDCGNKT VTPVSPLGKK LNVTTAWKAQ NPVLREWDI LTEQLRDIQL ENYTPKEPLT 120 

LQARMSCEQK AEGHSSGSWQ PSPDGQIFLL FDSEKRMWTT VHPGARKMKE KWENDKWAM 180 

SFHYFSMGDC IGWLEDFLMG MDSTLEPSAG APLAMSSGTT QLRATATTLI LCCLLIILPC 240 
FILPGI 

Seq ID NO: 399 DNA sequence 

Nucleic Acid Accession #: NM_001B98.1 

Coding sequence: 57.. 482 

1 11 21 31 41 51 

I I I I I I 

GGCTCTCACC CTCCTCTCCT GCAGCTCCAG CTTTGTGCTC TGCCTCTGAG GAGACCATGG 60 

CCCAGTATCT GAGTACCCTG CTGCTCCTGC TGGCCACCCT AGCTGTGGCC CTGGCCTGGA 120 

GCCCCAAGGA GGAGGATAGG ATAATCCCGG GTGGCATCTA TAACGCAGAC CTCAATGATG 180 

AGTGGGTACA GCGTGCCCTT CACTTCGCCA TCAGCGAGTA TAACAAGGCC ACCAAAGATG 240 

ACTACTACAG AOGTCCGCTG CGGGTACTAA GAGCCAGGCA ACAGACCGTT GGGGGGGTGA 300 

ATTACTTCTT CGACGTAGAG GTGGGCCGCA CCATATGTAC CAAGTCCCAG CCCAACTTGG 360 

ACACCTGTGC CTTCCATGAA CAGCCAGAAC TGCAGAAGAA ACAGTTGTGC TCTTTCGAGA 420 

TCTAOGAAGT TCCCTGGGAG AACAGAAGGT CCCTGGTGAA ATCCAGGTGT CAAGAATCCT 480 

AGGGATCTGT GCCAGGCCAT TCGCACCAGC CACCACCCAC TCCCACCCCC TGTAGTGCTC 540 

CCACCCCTGG ACTGGTGGCC CCCACCCTGC GGGAGGCCTC CCCATGTGCC TGCGCCAAGA 600 

GACAGACAGA GAAGGCTGCA GGAGTCCTTT GTTGCTCAGC AGGGCGCTCT GCCCTCCCTC 660 

CTTCCTTCTT GCTTCTAATA GCCCTGGTAC ATGGTACACA CCCCCCCACC TCCTGCAATT 720 
AAACAGTAGC ATCGCC 

Seq ID NO: 400 Protein sequence 
Protein Accession #: NP__O01889.1 

1 11 21 31 41 51 

I I I I I I 

MAQYLSTLIsL LIiATLAVALA WSPKEEDRII PGGIYNADLN DEWVQRALHF AISEYNKATK 60 

DDYYRRPLRV LRARQQTVGG VNYFFDVEVG RTICTKSQPN LDTCAFHEQP ELQKKQLCSF 120 
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Seq ID NO: 401 DNA sequence 

Nucleic Acid Accession #: NM_003976.2 

Coding sequence: 299.961 



1 
I 

CTCTGAGCTT 
CATGGAGTTG 
CTACTTCTGC 
GGGTGGCAGG 
CAGGAGGGTG 
GGAACTTGGA 
TGCCCTGTGG 
GGGCTCCGCG 
CGCCGGCCAC 
GCCGCCGCAG 
CGGGGGCCGC 
GGGCTGCCGC 
OGACGAGCTG 
CGACCTCAGC 
GCCCGTCAGC 
CAACAGCACC 
AGGGCTCGCT 
CCTCCCGCAG 
AGGCCCCTAC 
CAGCCCCAGA 
GGAGCCCTTC 
CCCTCCTCTG 
ACAGCATTTG 
CCTGTACTCA 



11 
I 

CTCTGAGCCT 
TGAAAGAATA 
TGGGTTGAGT 
CCGGTCCCCC 
GGGGAACAGC 
CTTGGAGGCC 
CCCACCCTGG 
CCCCGCAGCC 
CTGCCGGGGG 
CCTTCTCGGC 



CTGCGCTCGC 
GTGCGTTTCC 
CTGGCCAGCC 
CAGCCCTGCT 
TGGAGAACCG 
CCAGGGCTTT 
AGTCCCACTA 
CGGTGGGTGA 
GCCCTCACCC 
GGACCCACTT 
ATGAACACTA 
AAGGACACAT 
CTCATGGGAG 



21 
I 

TGTTTGCTCA 
GCTGCAAAGC 
CTAGCTGTGT 
ACAAAAGATA 
TCAACAATGG 
TCTCCACGCT 
CCGCTCTGGC 
CTGCCCCCCG 
GACGCACGGC 
CCGCGCCCCC 
CTGGGGGCCC 
AGCTGGTGCC 
GCTTCTGCAG 
TACTGGGCGC 
GCCGACCCAC 
TGGACCGCCT 
GCAGACTGGA 
GCCAGCGGCC 
TGGATATCAT 
TGOGGATCCC 
CTCACAGACT 
CAGTGGCTGA 
ATTGCAGTTG 
CTGGCCCC 



31 
I 

TCTGGAAAAA 
ACCTAACACA 
AGGCCCCTTG 
ACTCATCTCT 
CTGATGGGCG 
GTCCCACTGC 
TCTGCTGAGC 
CGAAGGCCCC 
CCGCTGGTGC 
GCCGCCTGCA 
GGGCAGCCGC 
GGTGCGCGCG 
CGGCTCCTGC 
CGGGGCCCTG 
GCGCTACGAA 
CTCCGCCACC 
CCCTTACCGG 
TCAGCCAGGG 
CCCOGAACAG 
AGCCTAAAAG 
CTGGCACTGG 
GGCATCAGCC 
CTTGGTTGAA 




ACGAAGGCCT 
GTGAAGGGAC 
ACACCAGAGA 
CCAGGCCTCG 
CCCGCCCAGG 
AGTGCCTGTG 



51 
1 

CCATTTACCT 
CCCAGTGCAG 
GGAGAAACTG 
GCTGCCTCAA 
TGATAGAGAT 
GGCGGCAGCC 
AGGCCTCCCT 
TGGCGTCCCC 
CCCGGCGGCC 
CTCTTCCCCG 
CGGGGGCGCG 
GCCACCGCTC 
GCTCTCCACA 
CGGGCTCCCG 
TCATGGACGT 
GCCTGGGCTG 
TGCCTGGGAC 
CAAAGCTGAG 
AACTGACTAG 
CCTCAGCTAT 
AACCTGGGAC 
CCCTGTAGGG 
CTGGAACTGG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



Seq ID NO: 402 Protein sequence 
Protein Accession #: NP_003967.1 

1 11 21 31 41 51 

I I I I I I 

MELGLGGLST LSHCPWPRRQ PALWPTLAAL ALLSSVAEAS LGS APRS PAP REGPPPVLAS 60 

PAGHIiPGGRT ARWCSGRARR PPPQPSRPAP PPPAPPSALP RGGRAARAGG PGSRARAAGA 120 

RGCRLRSQLV PVRALGLGHR SDELVRFRFC SGSCRRARSP HDLSLASLLG AGALRPPPGS 180 
RPVSQPCCRP TRYEAVSFMD VNSTWRTVDR LSATACGCLG 

Seq ID NO: 403 DNA sequence 
Nucleic Acid Accession ft: NM_057091.1 
Coding sequence: 783.. 1445 

1 11 21 31 41 51 

I I I I I I 

ACTGGCCGCT GAGAGAAGAA TCGGGTGGAG CAGAGAGCAG CTGCTGCAGG GCAGACAGCC 60 

GGACCCCCAA ATCTGCACGT ACCAGCAGTC AGCCGCCCCA CGCAGGGACC GGCTTACCCC 120 

TCGCTCCCCG CCCTCACTCA CTTTCTCCCG CCCTCGGCCC GGCCTCCCAG CTCTCTACTT 180 

CGCGTGTCTA CAAACTCAAC TCCOGGTTTC CGTGCCTCTC CACCGCTCGA GTTCTCTACT 240 

CTCCATATCC GAGGGGCCCC TCCCAGCATC TACCCCCCTC CCAACCTCGG GGGACCTAGC 300 

CAAGCTAGGG GGGACTGGAT CCGACGGGTG GAGCAGCCAG GTOAGCCCCG AAAGGTGGGG 360 

CGGGGCAGGG GCGCTCCCAO CCCCACCCCG GGATCTGGTG ACGCTGGGGC TGGAATTTGA 420 

CACCGGACGG CTGCGGCGGC GGGCAGGAGG CTGCTGAGGG ATGGAGTTGG GCCCGGCCCC 480 

CAGACAAGGC CCGGGGGCTC CGCCAGCAGC AGGTCCCTCG GGCCCCAGCC CTCGCTGCCA 540 

CCOGGGCCTG GAGCCCCACA CCCGAGGGTG CAGACTGGCT GCCAAGGCCA CACTTTTGGC 600 

TAAAAGAGGC ACTGCCAGGT GTACAGTCCT GGGCATGCGC TGTTTGAGCT TOGGGGGAGA 660 

GCCCAGCACT GGTCCCCGGA AAGGTGCCTA GAAGAACAAG GTGCAGGACC CCGTGCTGCC 720 

TCAACAGGAG GGTGGGGGAA CAGCTCAACA ATGGCTGATG GGCGCTCCTG GTGTTGATAG 780 

AGATGGAACT TGGACTTGGA GGCCTCTCCA CGCTGTCCCA CTGCCCCTGG CCTAGGCGGC 840 

AGCCTGCCCT GTGGCCCACC CTGGCCGCTC TGGCTCTGCT GAGCAGCGTC GCAGAGGCCT 900 

CCCTGGGCTC CGCGCCCCGC AGCCCTGCCC CCCGOGAAGG CCCCCCGCCT GTCCTGGCGT 960 

CCCCCGCCGG CCACCTGCCG GGGGGACGCA CGGCCCGCTG GTGCAGTGGA AGAGCCCGGC 1020 

GGCCGCCGCC GCAGCCTTCT CGGCCCGCGC CCCCGCCGCC TGCACCCCCA TCTGCTCTTC 1080 

CCCGCGGGGG CCGCGCGGCG CGGGCTGGGG GCCCGGGCAG CCGCGCTCGG GCAGCGGGGG 1140 

CGCGGGGCTG COGCCTGCGC TCGCAGCTGG TGCOGGTGCG CGCGCTCGGC CTGGGCCACC 1200 

GCTCCQACGA GCTGGTGCGT TTCCGCTTCT GCAGCGGCTC CTGCCGCCGC GCGCGCTCTC 1260 

CACACGACCT CAGCCTGGCC AGCCTACTGG GCGCCGGGGC CCTGCGACCG CCCCCGGGCT 1320 

CCCGGCCCGT CAGCCAGCCC TGCTGCCGAC CCACGOGCTA CGAAGCGGTC TCCTTCATGG 1380 

ACGTCAACAG CACCTGGAGA ACCGTGGACC GCCTCTCOGC CACCGCCTGC GGCTGCCTGG 1440 

GCTGAGGGCT CGCTCCAGGG CTTTGCAGAC TGGACCCTTA CCGGTGGCTC TTCCTGCCTG 1500 

GGACCCTCCC GCAGAGTCCC ACTAGCCAGC GGCCTCAGCC AGGGACGAAG GCCTCAAAGC 1560 

TGAGAGGCCC CTACCGGTGG GTGATGGATA TCATCCCCGA ACAGGTGAAG GGACAACTGA 1620 

CTAGCAGCCC CAGAGCCCTC ACCCTGCGGA TCCCAGCCTA AAAGACACCA GAGACCTCAG 1680 

CTATGGAGCC CTTCGGACCC ACTTCTCACA GACTCTGGCA CTGGCCAGGC CTCGAACCTG 1740 

GGACCCCTCC TCTGATGAAC ACTACAGTGG CTGAGGCATC AGCCCCCGCC CAGGCCCTGT 1800 

AGGGACAGCA TTTGAAGGAC ACATATTGCA GTTGCTTGGT TGAAAGTGCC TGTGCTGGAA 1860 
CTGGCCTGTA CTCACTCATG GGAGCTGGCC CC 



Seq ID NO: 404 Protein sequence 
Protein Accession tf: NP 003967.1 



1113 



WO 03/042661 



PCT/US02/36810 



1 11 21 31 41 SI 

I I I I I I 

MELGLGGLST LSHCPWPRRQ PALHPTLAAL ALLSSVAEAS LGSAPRSPAP REGPPPVLAS 60 
5 PAGHLPGGRT ARWCSGRARR PPPQPSRPAP PP PAP PS ALP RGGRAARAGG PGSRARAAGA 120 
RGCRLRSQLV PVRALGLGHR SDELVRPRFC SGSCRRARSP HDLSLASLLG AGALRPPPGS 180 
RPVSQPCCRP TRYEAVSFMD VNSTWRTVDR LSATACGCLG 



10 Seq ID NO j 405 DNA sequence 

Nucleic Acid Accession 8: NM_057160.1 
Coding sequence: 1..714 

1 11 21 31 41 51 

15 | | | | I I 

ATGCCCGGCC TGATCTCAGC CCGAGGACAG CCCCTCCTTG AGGTCCTTCC TCCCCAAGCC 60 

CACCTGGGTG CCCTCTTTCT CCCTGAGGCT CCACTTGGTC TCTCCGCGCA GCCTGCCCTG 120 

TGGCCCACCC TGGCCGCTCT GGCTCTGCTG AGCAGCGTCG CAGAGGCCTC CCTGGGCTCC 180 

GCGCCCCGCA GCCCTGCCCC CCGCGAAGGC CCCCCGCCTG TCCTGGCGTC CCCCGCCGGC 240 

20 CACCTGCCGG GGGGACGCAC GGCCCGCTGG TGCAGTGGAA GAGCCCGGCG GCCGCCGCCG 300 

CAGCCTTCTC GGCCCGCGCC CCCGCCGCCT GCACCCCCAT CTGCTCTTCC CCGCGGGGGC 360 

CGCGCGGCGC GGGCTGGGGG CCCGGGCAGC CGCGCTCGGG CAGCGGGGGC GCGGGGCTGC 420 

CGCCTGCGCr CGCAGCTGGT GCCGGTGCGC GCGCTCGGCC TGGGCCACCG CTCCGACGAG 480 

CTGGTGCGTT TCCGCTTCTG CAGCGGCTCC TGCCGCCGCG CGCGCTCTCC ACACGACCTC S40 

25 AGCCTGGCCA GCCTACTGGG CGCCGGGGCC CTGCGACCGC CCCCGGGCTC CCGGCCCGTC 600 

AGCCAGCCCT GCTGCCGACC CACGCGCTAC GAAGCGGTCT CCTTCATGGA CGTCAACAGC 660 

ACCTGGAGAA CCGTGGACCG CCTCTCCGCC ACCGCCTGCG GCTGCCTGGG CTGAGGGCTC 720 

GCTCCAGGGC TTTGCAGACT GGACCCTTAC CGGTGGCTCT TCCTGCCTGG GACCCTCCCG 780 

CAGAGTCCCA CTAGCCAGCG GCCTCAGCCA GGGACGAAGG CCTCAAAGCT GAGAGGCCCC 840 

30 TACOGGTGGG TGATGGATAT CATCCCCGAA CAGGTGAAGG GACAACTGAC TAGCAGCCCC 900 

AGAGCCCTCA CCCTGCGGAT CCCAGCCTAA AAGACACCAG AGACCTCAGC TATGGAGCCC 960 

TTCGGACCCA CTTCTCACAG ACTCTGGCAC TGGCCAGGCC TCGAACCTGG GACCCCTCCT 1020 

CTGATGAACA CTACAGTGGC TGAGGCATCA GCCCCCGCCC AGGCCCTGTA GGGACAGCAT 1080 

TTGAAGGACA CATATTGCAG TTGCTTGGTT GAAAGTGCCT GTGCTGGAAC TGGCCTGTAC 1140 

35 TCACTCATGG GAGCTGGCCC C 

Seq ID NO: 406 Protein sequence 
Protein Accession #: NP_476501.1 

40 1 11 21 31 41 : 51 

I I I I I I 

MPGLISARGQ PLLEVLPPQA HLGALFLPEA PLGLSAQPAL WPTLAALALL SSVAEASLG3 60 

APRSPAPREG PPPVLASPAG HLPGGRTARW CSGRARRPPP QPSRPAPPPP APPSALPRGG 120 

RAARAGGPGS RARAAGARGC RLRSQIiVPVR ALGLGHRSDB LVRFRFCSGS CRRARSPHDL 180 
45 SLASLLGAGA LRPPPGSRPV SQPCCRPTRY EAVSFMDVNS TWRTVDRLSA TACGCLG 



Seq ID NO: 407 DNA sequence 
NUcleic Acid Accession ft: NMJJS7090.1 
^ Coding sequence: 29.. 715 

1 11 21 31 41 51 

I I I I I I 

CTGATGGGCG CTCCTGGTGT TGATAGAGAT GGAACTTGGA CTTGGAGGCC TCTCCACGCT 60 

GTCCCACTGC CCCTGGCCTA GGCGGCAGGC TCCACTTGGT CTCTCCGCGC AGOCTGCCCT 120 

55 GTGGCCCACC CTGGCCGCTC TGGCTCTGCT GAGCAGCGTC GCAGAGGCCT CCCTGGGCTC 180 

CGCGCCCCGC AGCCCTGCCC CCCGCGAAGG CCCCCCGCCT GTCCTGGCGT CCCCCGCCGG 240 

CCACCTGCCG GGGGGACGCA CGGCCCGCTG GTGCAGTGGA AGAGCCCGGC GGCCGCCGCC 300 

GCAGCCTTCT CGGCCCGCGC CCCCGCCGCC TGCACCCCCA TCIGCTCTTC CCCGCGGGGG 360 

CCGCGCGGCG CGGGCTGGGG GCCCGGGCAG CCGCGCTCGG GCAGCGGGGG CGCGGGGCTG 420 

60 CCGCCTGCGC TCGCAGCTGG TGCCGGTGCG CGCGCTCGGC CTGGGCCACC GCTCCGACGA 480 

GCTGGTGCGT TTCCGCTTCT GCAGCGGCTC CTGCCGCCGC GCGCGCTCTC CACACGACCT 540 

CAGCCTGGCC AGCCTACTGG GCGCCGGGGC CCTGCGACCG CCCCCGGGCT CCCGGCCCGT 600 

CAGCCAGCCC TGCTGCCGAC CCACGCGCTA CGAAGOGGTC TCCTTCATGG ACGTCAACAG 660 

CACCTGGAGA ACCGTGGACC GCCTCTCCGC CACCGCCTGC GGCTGCCTGG GCTGAGGGCT 720 

65 CGCTCCAGGG CTTTGCAGAC TGGACCCTTA CCGGTGGCTC TTCCTGCCTG GGACCCTCCC 780 

GCAGAGTCCC ACTAGCCAGC GGCCTCAGCC AGGGACGAAG GCCTCAAAGC TGAGAGGCCC 840 

CTACCGGTGG GTGATGGATA TCATCCCCGA ACAGGTGAAG GGACAACTGA CTAGCAGCCC 900 

CAGAGCCCTC ACCCTGCGGA TCCCAGCCTA AAAGACACCA GAGACCTCAG CTATGGAGCC 960 

CTTCGGACCC ACTTCTCACA GACTCTGGCA CTGGCCAGGC CTCGAACCTG GGACCCCTCC 1020 

70 TCTGATGAAC ACTACAGTGG CTGAGGCATC AGCCCCCGCC CAGGCCCTGT AGGGACAGCA 1080 

TTTGAAGGAC ACATATTGCA GTTGCTTGGT TGAAAGTGCC TGTGCTGGAA CTGGCCTGTA 1140 
CTCACTCATG GGAGCTGGCC CC 



Seq ID NO: 408 Protein sequence 
75 Protein Accession 8: NP_476431.1 

1 11 21 31 41 51 

I I I I I I 

MELGLGGLST LSHCPWPRRQ APLGLSAQPA LWPTLAALAL LSSVAEASLG SAPRSPAPRE 60 
80 GPPPVLASPA GHLPGGRTAR WCSGRARRPP PQPSRPAPPP PAPPSALPRG GRAARAGGPG 120 
SRARAAGARG CRLRSQLVPV RALGLGHRSD ELVRFRPCSG SCRRARSPHD LSLASLLGAG 180 
ALRPPPGSRP VSQPCCRPTR YEAVSFMDVN STWRTVDRLS ATACGCLG 



1114 



WO 03/042661 



Seq ID NOt 409 DNA sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding sequence: l.. 1*746 

1 11 21 31 41 Si 

{ | I I I I 

ATGCCACTGA AGCATTATCT CCTTTTGCTG GTGGGCTGCC AAGCCTGGGG TGCAGGGTTG 60 
GCCTACCATG GCTGCCCTAG CGAGTGTACC TGCTCCAGGG OTCCCAGGT GGAGTGCACC 120 
GGGGCACGCA TTGTGGCGGT GCCCACCCCT CTGCCCTGGA ACGCCATGAG CCTGCAGATC 180 
CTCAACACGC ACATCACTGA ACTCAATGAG TCCCCGTTCC TCAATATCTC AGCCCTCATC 240 
GCCCTGAGGA TTGAGAAGAA TGAGCTGTCG CGCATCACGC CTGGGGCCTT CCGAAACCTG 300 
GGCTCGCTGC GCTATCTCAG CCTCGCCAAC AACAAGCTGC AGGTTCTGCC CATCGGCCTC 360 
TTCCAGGGCC TGGACAGCCT TGAGTCTCTC CTTCTGTCCA GTAACCAGCT GTTGCAGATC 420 
CAGCCGGCCC ACTTCTCCCA GTGCAGCAAC CTCAAGGAGC TGCAGTTGCA CGGCAACCAC 480 
CTGGAATACA TCCCTGACGG AGCCTTCGAC CACCTGGTAG GACTCACGAA GCTCAATCTG 540 
GGCAAGAATA GCCTCACCCA CATCTCACCC AGGGTCTTCC AGCACCTGGG CAATCTCCAG 600 
GTCCTCCGGC TGTATGAGAA CAGGCTCACG GATATCCCCA TGGGCACTTT TGATGGGCTT 660 
GTTAACCTGC AGGAACTGGC TCTACAGCAG AACCAGATTG GACTGCTCTC CCCTGGTCTC 720 
TTCCACAACA ACCACAACCT CCAGAGACTC TACCTGTCCA ACAACCACAT CTCCCAGCTG 780 
CCACCCAGCA TCTTCATGCA GCTGCCCCAG CTCAACCGTC TTACTCTCTT TGGGAATTCC 840 
CTGAAGGAGC TCTCTCTGGG GATCTTCGGG CCCATGCCCA ACCTGCGGGA GCTTTGGCTC 900 
TATGACAACC ACATCTCTTC TCTACCCGAC AATGTCTTCA GCAACCTCCG CCAGTTGCAG 960 
GTCCTGATTC TTAGCCGCAA TCAGATCAGC TTCATCTCCC CGGGTGCCTT CAAOGGGCTA 1020 
ACGGAGCTTC GGGAGCTGTC CCTCCACACC AACGCACTGC AGGACCTGGA CGGGAATGTC 1080 
TTCOGCATGT TGGCCAACCT GCAGAACATC TCCCTGCAGA ACAATCGCCT CAGACAGCTC 1140 
CCAGGGAATA TCTTCGCCAA CGTCAATGGC CTCATGGCCA TCCAGCTGCA GAACAACCAG 1200 
CTGGAGAACT TGCCCCTCGG CATCTTCGAT CACCTGGGGA AACTGTGTGA GCTGCGGCTG 1260 
TATGACAATC CCTGGAGGTG TGACTCAGAC ATCCTTCCGC TCCGCAACTG GCTCCTGCTC X320 
AACCAGCCTA GGTTAGGGAC GGACACTGTA CCTGTGTGTT TCAGCCCAGC CAATGTCCGA 1380 
GGCCAGTCCC TCATTATCAT CAATGTCAAC GTTGCTGTTC CAAGCGTCCA TGTCCCTGAG 1440 
GTGCCTAGTT ACCCAGAAAC ACCATGGTAC CCAGACACAC CCAGTTACCC TGACACCACA 1500 
TCCGTCTCTT CTACCACTGA GCTAACCAGC CCTGTGGAAG ACTACACTGA TCTGACTACC 1560 
ATTCAGGTCA CTGATGACCG CAGCGTTTGG GGCATGACCC AGGCCCAGAG CGGGCTGGCC 1620 
ATTGCCGCCA TTGTAATTGG CATTGTCGCC CTGGCCTGCT CCCTGGCTGC CTGCGTCGGC 1680 
TGTTGCTGCT GCAAGAAGAG GAGCCAAGCT GTCCTGATGC AGATGAAGGC ACCCAATGAG 1740 
TGTTAAAGAG GCAGGCTGGA GCAGGGCTGG GGAATGATGG GACTGGAGGA CCTGGGAATT 1800 
TCATCTTTCT GCCTCCACCC CTGGGTCCAT GGAGCTTTCC CGTGATTGCT CTTTCTGGCC 1860 
CTAGATAAAG GTGTGCCTAC CTCTTCCTGA CTTGCCTGAT TCTCCCGTAG AGAAGCAGGT 1920 
CGTGCCGGAC CTTCCTACAA TCAGGAAGAT AGATCCAACT GGCCATGGCA AAAGCCCTGG 1980 
GGATTTCCGA TTCATACCCC TGGGCTTCCT TCGAGAGGGC TCTTCCTCCA AATCCTCCCC 2040 
ACCTGTCCTC CAAGAACAGC CTTCCCTGCG CCCAGGCCCC CTCCGGGCCT CTGTAGACTC 2100 
AGTTAGTCCA CAGCCTGCTC ACTTCGTGGG AATAGTTCTC CGCTGAGATA GCCCCT CTCG 2160 
CCTAAGTATT ATGTAAGTTG ATTTCCCTTC TTTTGTTTCT CTTGTTTGTG CTATGGCTTG 2220 
ACCCAGCATG TCCCCTCAAA TGAAAGTTCT CCCCTTGATT TTCTGCTCCT GAAGGCAGGG 2280 
TGAGTTCTCT CCTCAAAGAA GACTTCAAAC CATTTAACTG GTTTCTTAAG AGCCGTCAAT 2340 
CAGCCTGGTT TTGGGGATGC TATGAAAGAG AGAAGGAAAA TCATGCCGCT CAGTTCCTGG 2400 
AGACAGAAGA GCCGTCATCA GTGTCTCACT TGTGATTTTT ATCTGGAAAA GGAAGAAACA 2460 
CCCCAGCACA GCAAGCTCAG CCTTTTAGAG AAGGATATTT CCAAACTGCA AACTTTGCTT 2520 
TGAAAAGTTT AGCCCTTTAA GGAATGAAAT CATGTAGAAT TTTGGACTTC TAAAAACATT 2580 
AAAATCAGCT TATTAATACG GGATAGAGAA AGAAATCTGG TGCCTGGGGG TCCCTGTGTT 2640 
CACCCCTAGA GTTTGTTTTA AAATTTTTAA TTGAAGCATG TGAAGTGTAC STGCAGAAAA 2700 
GTGGGAACAT GATAGTGTAT GGCTTGGTGG ATTTTCACAA ACTGAACATA CCTGT GTAAT 2760 
CAGCATCTAG ACCCAGACCC AGAGCATCAC AAATATCCCC CATCCTGGGC TTTTCCCAGA 2820 
GGAGATGGGG GCTTCTGAAG ATGGACTTAC CTGGGACCTG CCCCCCATGA GCCAGGACGG 2880 
TCCCCCCACA GTCAGCCTGT GCAAAGGCCC CGTGGCCAGG GGTGGAGGAG AATATGTGGG 2940 
TGTGGACAGG ATGGGAGACT GTGGCCTGAA CAGGAGATTT TATTATATCT GGAGACCCTG 3000 
AGAGAOCCTG AGACCTGGGG CACCATGGCT GGCCAGGTCA GAAGCATCCT GACTGCAGAG 3060 
GTCCGTGCAG CCACACCCTC TTCCCTGCCA GCAAGTTGTC TGCGGC TCAT CGGAGGCCCC 3120 
TCCGCCTGGA GCCTTCTATG GACGTGATAT GCCTGTATCT GTTTTTAATT TTCATTCTTC 3180 
ACTTAGGGGA AGTGAAATCG CTCAGAGATG AGATCCTTTA ATTGAAAACG AAGTGTAACG 3240 
GAATCTAGTG TCTTTCTAAT GTGGTAAAAT TCTCCATCAA CATCACAGTC AGCTGGCAGC 3300 
TGAACTTCAG AATCTCACTT ACAGCAGGCG ACACGGGGGT ACACCGATGG GTCACACTGG 3360 
GTCTGGGGGC TCCCTGGAGC TCCTCCTGCG TGTGGTCTGG TTAGGAGTTG AGTTGTTTGC 3420 
TCCAGGGTTA TTCTCCTCCT CGAGTCACAG TCACAOGAAT ACCTGCCTTC TCTGGCTTTC 3480 
CTGCTATACA CATATTCACA TGGCGCTCAA GAAGTTAGGC TCATGGCAAC GTGTGTCTTT 3540 
CTCTGGACAA CTGGCCCAGT TTACAGTGAA ATGGAGAATT TCAGGTCTCC ACGTCTGCCC 3600 
AGGAAAGAAC TTCAGCTGAC TCCACGGGGA TCTGGAAATC CACGACCAAT CCCGATCGGC 3660 
TCTTATTAGC TCCCCGCTCC ACAAGACACC TGTGCTTTGG AAATCCACCA CCAATCCCGA 3720 
TCGGCTCTTA TTAGCTCCCC GCTCCACAAG ACACCTGTGA TCTGGAAATC TACCACCAAT 3780 
CCCGATCGGC TCTTATTAGC TCCCCGCTCC ACAAGACACC TGTGACATCC TCCAGGGCCA 3840 
CAGGAGCACG TGCTGACCAG TTTTCCCTTC CAGTTCCTGC ACAAAAAGTG TCGAGAGGGC 3900 
TGTTTGCAAA CACTAGTGCA CTTTGTAGCT TTTCACCCTC TGTCCCAGGG AATCTAGGAG 3960 
AGATGAGGCC CGTCAGAGTC AAGAGATGTC ATCCCCCCAG GGTCTCCAAG GCATTTCCAC 4020 
ACTATTGGTG GCACCTGGAG GACATGCACC AAGGCTTGCC AGAGCCAACA GGAAGTGAGC 4080 
CCAGAGCATG GCACATGAGC ATCACCCGCT GATGGTGGCC TGCTGTGCCT GGTGCCAACA 4140 
GGGGCATCCC GGCCCGTACC CCTCCAGACA GGAAGCATGG GTTTGCCCAC AGACCTGTCG 4200 
GGTGCTCCTG TGAGTGGCCT CCAGATGTCT TTGTGCATAG GCACAAGTGG GCCAGGGCTG 4260 
GAGGGAGGTG GGAAACCTCA TCATCCGGTG GGCCCTGCCA ATCTTAACCC AGAACCCTTA 4320 
GGTATTCCTG GCAGTAGCCA TGACATTGGA GCACCTTCCT CTCCAGCCAG AGGCTGACCT 4380 
GAGGGCCACT GTCCTCAGAT GACACCACCC AGGAGCACCC TAGGTGAGGG GTGAGGGCCC 4440 
CCTTATGTGA ACCTCTTGCC TCTTCCTTTC TCCCATCAGA GTGGTTGGAT GGAGCCATTG 4500 
GCCTCCTTTT CTTCAGCGGG CCCTTCAACC TCTCTGCACC ATGTTGTCTG GCTGAGGAGC 4560 
TACTAGAAAA GCTGAGTGGA GTCTCCTTTC CAACAGGATG ATGCATTTGC TCAATTCTCA 4620 
GGGCTGGAAT GAGCCGGCTG GTCCCCCAGA AAGCTGGAGT GGGGTACAGA GTTCAGTTTT 4680 
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5 

10 
15 
20 



CCTCTCTGTT 
GTGTTGGAGA 
TGCACAGATA 
GGTAGGAGTG 
GGTGTTCAAT 
TTGTCTTGGG 
TTAGTCTTGG 
GGAAAAAATA 
TGGGCTGTAT 
AACTTTTCAT 
GAACTTCCAA 
AGTTGGTCGA 
GCCCCCAGAT 
GGAAGGAAGC 



CTTCATGCTG 
GCCCCAGTGC 
AGCCCTGGTG 
GGTGTACAGA 



30 

35 

40 

45 

50 
55 
60 
65 
70 
75 
80 



TACAGCTCCT 
AGAAACAACA 
CTCTTCAAGC 
CCGCCTCTAC 
AGGCTGGGAG 
CTTTCGTCAT 
TCATCAGAAC 
AACTCTTCCA 
GTATATTGTT 
GGACACAATT 
ACTCAGGAAG 
CAGATGTTAG 
CCCACAGTCA 
CATGGCTGTG 
CCCAGGTTTC 
CCTTCAAAGC 
TTGGCGATGC 
GGCAGGGTTG 
AT CAACAAT A 



TGACAGTCCC 
AAAGCCAATT 
ACTGGACGTG 
CCACTTGTGA 
TTTTATTTAT 
TAAACCAAAG 
CTCACTTGGT 
TCCCTTAAAG 
CTTCCTCCTT 
TCCACAACCT 
TTTGCAGAGA 
ATGTATCCTA 
GAACTGAATC 
GTTCAGAGAG 
TTCTTCTCTT 
TAGATCATGT 
ATTTACAGAT 
GGGGGTCTGT 
AATAATATAC 



ACGCCCATCT 
AGAACCACTA 
GATTCTCTCT 
TGGGGTACAG 
CTCTTCAAAC 
GAAATGGAAG 
ACCATATAGA 
AATAGAATAG 
AGAATTTAGA 
TTCAGATGCT 
GCAGACAGCT 
GCTTTTAGCC 
TGCGTTGTTG 
GGTGGGCTGG 
AAGGAGAGAT 
TTGCCTTGCT 
TTCTAGGCCC 
CTTCTGCTGG 
ATGTAT 



GGAGTGGGAG 
TTTTTAAAAA 
CTAGCCCTCA 
AGGCACTTGC 
TTTGTACAAG 
CCATTCCCCT 
TCAAAAGCTT 
TTTGTCCCTC 
GATACAAGAG 
GATGTAGAGC 
AGAGATAACT 
ATAAACCACT 
GGAAGCCAGC 
CAAGCCACTT 
TGTTCTCACC 
TAGAGAATTA 
TCAGGGTTTT 
ATGCTGCTTG 



CTGGGAGTTA 
GTGCTTACTG 
GCACCCCTGC 
TCTTCTGCAT 
AGCTCATGGC 
GTTGCTCTCC 
TGTAACCACA 
TCATGGGAAT 
TTCTACTTAG 
TATTGGGAAA 
CGGGACCCAG 
CAAAGATTCA 
AGTGGCCTTG 
CCGGGGAAAA 
AACCCGCTGC 
CTGCAAATCA 
GTAGAGTGTG 
TAATCCATTT 



4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 



Seq ID NO: 410 Protein sequence 
Protein Accession fl: BAB84587.1 



25 , 



MPLKHYLLLL 
LNTHITELNE 
FQGLDSLESI* 
GKNSLTHISP 
FHNNHNLQRL 
YDNHISSLPD 
FRMLANLQNI 
YDNPWRCDSD 
VPSYPETPWY 
IAAIVIGIVA 



11 
I 

VGCQAWGAGL 
SPFLNISALI 
LLSSNQLLQI 
RVFQHLGNLQ 
YLSNNHISQIi 
NVFSNLRQLQ 
SLQNNRLRQL 
I LPLRNWLLL 
PDTPSYPDTT 
LACSLAACVG 



21 
I 

AYHGCPSECT 
ALRIEKNELS 
QPAHFSQCSN 
VLRLYENRLT 
PPSIFMQLPQ 
VLILSRNQIS 
PGNIFANVNG 
NQPRLGTDTV 
SVSSTTELTS 
CCCCKKRSQA 



31 
I 

CSRASQVECT 
RITPGAFRNli 
LKELQLHGNH 
DIFMGTFDGL 
LNRLTLFGNS 
FISPGAFNGL 
LMAIQLQNNQ 
PVCFSPANVR 
PVEDYTDLTT 
VLMQMKAPNE 



41 
I 

GARIVAVPTP 
GSLRYLSLAN 
LEYIPDGAFD 
VNLQELALQQ 
LKELSLGIFG 
TELRELSLHT 
LENIiPLGIFD 
GQSLIIINVN 
IQVTDDRSVW 
C 



51 
I 

LPWNAMSLQI 
NKLQVLPIGL 
HLVGLTKLNL 
NQIGLLSPGL 
PMPNLRELWIi 
NALQDLDGNV 
HLGKLCELRL 
VAVPSVHVPE 
GMTQAQSGLA 



60 
120 
180 
240 
300 
360 
420 
480 
540 



Seq ID NO: 411 DNA sequence 
Nucleic Acid Accession #« XMJ>981S1 
Coding sequence : 1 . . 447 



ATGATGCATT 
AGTGGGGTAC 
TCTGGAGTGG 
CTATTTTTAA 
TCTCTAGCCC 
CAGAGGCACT 
AACTTTGTAC 
AAGCCATTCC 



11 
I 

TGCTCAATTC 
AGAGTTCAGT 
GAGCTGGGAG 
AAAGTGCTTA 
TCAGCACCCC 
TGCTCTTCTG 
AAGAGCTCAT 
CCTGTTGCTC 



21 
I 

TCAGGGCTGG 
TTTCCTCTCT 
TCAGTGTTGG 
CTGTGCACAG 
TGCGGTAGGA 
CATGGTGTTC 
GGCTTGTCTT 
TCCTTAG 



31 
I 

AATGAGCCGG 
GTTTACAGCT 
AGAAGAAACA 
ATACTCTTCA 
GTGCCGCCTC 
AATAGGCTGG 
GGGCTTTCGT 



41 
I 

CTGGTCCCCC 
CCTTGACAGT 
ACAAAAGCCA 
AGCACTGGAC 
TACCCACTTG 
GAGTTTTATT 
CATTAAACCA 



51 
I 

AGAAAGCTGG 
CCCACGCCCA 
ATTAGAACCA 
GTGGATTCTC 
TGATGGGGTA 
TATCTCTTCA 
AAGGAAATGG 



Seq ID NO: 412 Protein sequence 
Protein Accession #: XP_098151 

21 



51 



1 11 21 31 41 

I I I I I I 

MMHLLNSQGW NEPAGPPESW SGVQSSVFLS VYSSLTVPRP SGVGAGSQCW RRNNKSQLEP 
LFLKSAYCAQ ILFKHWTWIL STiALSTPAVG VPPLPTCDGV QRHLLFCMVF NRLGVLFISS 
NFVQELMACL GLSSLNQRKW KPFPCCSP 

Seq ID NO: 413 DNA sequence 

Nucleic Acid Accession fh NM__00265a.l 

Coding sequence: 77.. 1372 



1 
I 

GTCCCCGCAG 
CCCCGACCTC 
GAGCGACTCC 
TGGAGGAACA 
GAAATTCGGA 
TCACTTTTAC 
CTCTGCCACT 
CCTGGGGAAA 
GCAGGTGGGC 
AAAGCCCTCC 
CCGCTTTAAG 
CATCTACAGG 
CCCTTGCTGG 
CATCGTCTAC 
GGTGGAAAAC 
CATTGCCTTG 
ACAGACCATC 
CACTGGCTTT 



11 
I 

CGCCGTCGCG 
GCCACCATGA 
AAAGGCAGCA 
TGTGTGTCCA 
GGGCAGCACT 
CGAGGAAAGG 
GTCCTTCAGC 
CATAATTACT 
CTAAAGCCGC 
TCTCCTCCAG 
ATTATTGGGG 
AGGCACCGGG 
GTGATCAGCG 
CTGGGTCGCT 
CTCATCCTAC 
CTGAAGATCC 
TGCCTGCCCT 
GGAAAAGAGA 



21 
I 

CCCTCCTGCC 
GAGCCCTGCT 
ATGAACTTCA 
ACAAGTACTT 
GTGAAATAGA 
CCAGCACTGA 
AAACGTACCA 
GCAGGAACCC 
TTGTCCAAGA 
AAGAATTAAA 
GAGAATTCAC 
GGGGCTCTGT 
CCACACACTG 
CAAGGCTTAA 
ACAAGGACTA 
GTTCCAAGGA 
CGATGTATAA 
ATTCTACCGA 



31 
I 

GCAGGCCACC 
GGCGCGCCTG 
TCAAGTTCCA 
CTCCAACATT 
TAAGTCAAAA 
CACCATGGGC 
TGCCCACAGA 
AGACAACCGG 
GTGCATGGTG 
ATTTCAGTGT 
CACCATCGAG 
CACCTACGTG 
CTTCATTGAT 
CTCCAACACG 
CAGCGCTGAC 
GGGCAGGTGT 
CGATCCCCAG 
CTATCTCTAT 



41 
I 

GAGGCCGCCG 
CTTCTCTGCG 
TCGAACTGTG 
CACTGGTGCA 
ACCTGCTATG 
CGGCCCTGCC 
TCTGATGCTC 
AGGCGACCCT 
CATGACTGCG 
GGCCAAAAGA 
AACCAGCCCT 
TGTGGAGGCA 
TACCCAAAGA 
CAAGGGGAGA 
ACGCTTGCTC 
GCGCAGCCAT 
TTTGGCACAA 
CCGGAGCAGC 



60 
120 
180 
240 
300 
360 
420 



60 
120 



51 
I 

CCGTCTAGCG 
TCCTGGTCGT 
ACTGTCTAAA 
ACTGCCCAAA 
AGGGGAATGG 
TGCCCTGGAA 
TTCAGCTGGG 
GGTGCTATGT 
CAGATGGAAA 
CTCTGAGGCC 
GGTTTGCGGC 
GCCTCATCAG 
AGGAGGACTA 
TGAAGTTTGA 
ACCACAACGA 
CCCGGACTAT 
GCTGTGAGAT 
TGAAAATGAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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TGTTGTGAAG CTGATTTCCC ACCGGGAGTG TCAGCAGCCC CACTACTACG GCTCTGAAGT 
CACCACCAAA ATGCTATGTG CTGCTGACCC CCAATGGAAA ACAGATTCCT GCCAGGGAGA 
CTCAGGGGGA CCCCTCGTCT GTTCCCTCCA AGGCCGCATG ACTTTGACTG GAATTGTGAG 
CTGGGGCCGT GGATGTGCCC TGAAGGACAA GCCAGGCGTC TACACGAGAG TCTCACACTT 
CTTACCCTGG ATCCGCAGTC ACACCAAGGA AGAGAATGGC CTGGCCCTCT GAGGGTCCCC 
AGGGAGGAAA CGGGCACCAC CCGCTTTCTT GCTGGTTGTC ATTTTTGCAG TAGAGTCATC 
TCCATCAGCT GTAAGAAGAG ACTGGGAAGA TAGGCTCTGC ACAGATGGAT TTGCCTGTGG 
CACCACCAGG GTGAACGACA ATAGCTTTAC CCTCACGGAT AGGCCTGGGT GCTGGCTGCC 
CAGACCCTCT GGCCAGGATG GAGGGGTGGT CCTGACTCAA CATGTTACTG ACCAGCAACT 
TGT CTTTTT C TGGACTGAAG CCTGCAGGAG TTAAAAAGGG CAGGGCATCT CCTGTGCATG 
GGCTCGAAGG GAGAGCCAGC TCCCCCGACC GGTGGGCATT TGTGAGGCCC ATGGTTGAGA 
AATGAATAAT TTCCCAATTA GGAAGTGTAA GCAGCTGAGG TCTCTTGAGG GAGCTTAGCC 
AATGTGGGAG CAGCGGTTTG GGGAGCAGAG ACACTAACGA CTTCAGGGCA GGGCTCTGAT 
ATTCCATGAA TGTATCAGGA AATATATATG TGTGTGTATG TTTGCACACT TGTTGTGTGG 
GCTGTGAGTG TAAGTGTGAG TAAGAGCTGG TGTCTGATTG TTAAGTCTAA ATATTTCCTT 
AAACTGTGTG GACTGTGATG CCACACAGAG TGGTCTTTCT GGAGAGGTTA TAGGTCACTC 
CTGGGGCCTC TTGGGTCCCC CACGTGACAG TGCCTGGGAA TGTACTTATT CTGCAGCATG 
ACCTGTGACC AGCACTGTCT CAGTTTCACT TTCACATAGA TGTCCCTTTC TTGGCCAGTT 
ATCCCTTCCT TTTAGCCTAG TTCATCCAAT CCTCACTGGG TGGGGTGAGG ACCACTCCTT 
ACACTGAATA TTTATATTTC ACTATTTTTA TTTATATTTT TGTAATTTTA AATAAAAGTG 
ATCAATAAAA TGTGATTTTT CTGA 

Seq ID NO: 414 Protein sequence 
Protein Accession #: NP_002649.l 

1 11 21 31 41 51 

| I I I I 

MRALLARLLL CVLWSDSKG SNELHQVPSN CDCLNGGTCV SNKYFSNIHW CNCPKKFGGQ 

HCEIDKSKTC YEGNGHFYRG KASTDTMGRP CLPWNSATVL QQTYHAHRSD ALQLGLGKHN 120 

YCRNPDNRRR PWCYVQVGLK PLVQECMVHD CADGKKPSSP PEELKFQCGQ KTLRPRPKII 180 

GGEFTTIENQ PWFAAIYRRH RGGSVTYVCG GSIiISPCWVI SATHCPIDYP KKEDYIVYIX3 240 

RSRUJSNTQG EMKFEVENLI LHKDYSADTL AHHNDIALLK IRSKEGRCAQ PSRTIQTICL 300 

PSMYNDPQFG TSCEITGFGK ENSTDYLYPE QLKMtWKLI SHRECQQPHY YGSEVTTKML 360 

CAADPQWKTD SOQGDSGGPL VCSLQGRMTTi TGIVSWGRGC ALKDKPGVYT RVSHFTiPWIR 420 
SHTKEENGLA L 

Seq ID NO: 415 DNA sequence 

Nucleic Acid Accession #: NM_024422.1 

Coding sequence: 202.. 2907 

1 11 21 31 41 51 

I I I I I t 

CGCCAAAGGA AAAGCCCCTT GGATGAGAGG CAGGCGCTTC AGAGAAGCTA AGAAAAGCAC 60 

CTCTCCGCGC GCCCCACCTC CTCCGCCTCG CGCTCCTCCT GAGCAGCGGG CCCAGACTGC 120 

GCTCCGGCCG CGGCCCTCGC CCCGCGGAGC CCTCCTACCC CGGCCCGACG CTCGGCCCGC 180 

GACCTGCCCC GAGCCCTCTC CATGGAGGCA GCCCGCCCCT CCGGCTCCTG GAACGGAGCC 240 

CTCTGCCGGC TGCTCCTGCT GACCCTCGCG ATCTTAATAT TTGCCAGTGA TGCCTGCAAA 300 

AATGTGACAT TACATGTTCC CTCCAAACTA GATGCCGAGA AACTTGTTGG TAGAGTTAAC 360 

CTGAAAGAGT GCTTTACAGC TGCAAATCTA ATTCATTCAA GTGATCCTGA CTTCCAAATT 420 

TTGGAGGATG GTTCAGTCTA TACAACAAAT ACTATTCTAT TGTCCTCGGA GAAGAGAAGT 480 

TTTACCATAT TACTTTCCAA CACTGAGAAC CAAGAAAAGA AGAAAATATT TGTCTTTTTG 540 

GAGCATCAAA CAAAGGTCCT AAAGAAAAGA CATACTAAAG AAAAAGTTCT AAGGCGCGCC 600 

AAGAGAAGAT GGGCTCCAAT TCCTTGTTCG ATGCTAGAAA ACTCCTTGGG TCCTTTTCCA 660 

CTTTTCCTTC AACAGGTTCA ATCTGACACG GCCCAAAACT ATACCATATA CTATTCCATA 720 

AGAGGTCCTG GAGTTGACCA AGAACCTCGG AATTTATTTT ATGTGGAGAG AGACACTGGA 780 

AACTTGTATT GTACTCGTCC TGTAGATCGT GAGCAGTATG AATCTTTTGA GATAATTGCC 840 

TTTGCAACAA CTCCAGATGG GTATACTCCA GAACTTCCAC TGCCCCTAAT AATCAAAATA 900 

GAGGATGAAA ATGATAACTA CCCAATTTTT ACAGAAGAAA CTTATACTTT TACAATTTTT 960 

GAAAATTGCA GAGTGGGCAC TACTGTGGGA CAAGTGTGTG CTACTGACAA AGATGAGCCT 1020 

GACACGATGC ACACACGCCT GAAGTACTCC ATCATTGGGC AGGTGCCACC ATCACCCACC 1080 

CTATTTTCTA TGCATCCAAC TACAGGOGTG ATCACCACAA CATCATCTCA GCTAG ACAGA 1140 

GAGTTAATTG ACAAGTACCA GTTGAAAATA AAAGTACAAG ACATGGATGG TCAGTATTTT 1200 

GGTCTACAGA CAACTTCAAC TTGTATCATT AACATTGATG ATGTAAATGA CCACTTGCCA 1260 

ACATTTACTC GTACTTCTTA TGTGACATCA GTGGAAGAAA ATACAGTTGA TGTGGAAATC 1320 

TTACGAGTTA CTGTTGAGGA TAAGGACTTA GTGAATACTG CTAACTGGAG AGCTAATTAT 1380 

ACCATTTTAA AGGGCAATGA AAATGGCAAT TTTAAAATTG TAACAGATGC CAAAACCAAT 1440 

GAAGGAGTTC TTTGTGTAGT TAAGCCTTTG AATTATGAAG AAAAGCAACA GATGATCTTG 1500 

CAAATTGGTG TAGTTAATGA AGCTCCATTT TCCAGAGAGG CTAGTCCAAG ATCAGCCATG 1560 

AGCACAGCAA CAGTTACTGT TAATGTAGAA GATCAGGATG AGGGCCCTGA GTGTAACCCT 1620 

CCAATACAGA CTGTTCGCAT GAAAGAAAAT GCAGAAGTGG GAACAACAAG CAATGGATAT 1680 

AAAGCATATG ACCCAGAAAC AAGAAGTAGC AGTGGCATAA GGTAT AAGAA ATTAACTGAT 1740 

CCAACAGGGT GGGTCACCAT TGATGAAAAT ACAGGATCAA TCAAAGTTTT CAGAAGCCTG 1800 

GATAGAGAGG CAGAGACCAT CAAAAATGGC ATATATAATA TTACAGTCCT TGCATCAGAC 1860 

CAAGGAGGGA GAACATGTAC GGGGACACTG GGCATTATAC TTCAAGACGT GAATGATAAC 1920 

AGCCCATTCA TACCTAAAAA GACAGTGATC ATCTGCAAAC CCACCATGTC ATCTGCGGAG 1980 

ATTGTTGCGG TTGATCCTGA TGAGCCTATC CATGGCCCAC CCTTTGACTT TAGTCTGGAG 2040 

AGTTCTACTT CAGAAGTACA GAGAATGTGG AGACTGAAAG CAATTAATGA TACAGCAGCA 2100 

CGTCTTTCCT ATCAGAATGA TCCTCCATTT GGCTCATATG TAGTACCTAT AACAGTGAGA 2160 

GATAGACTTG GCATGTCTAG TGTCACTTCA TTGGATGTTA CACTGTGTGA CTGCATTACC 2220 

GAAAATGACT GCACACATCG TGTAGATCCA AGGATTGGCG GTGGAGGAGT ACAACTTGGA 2280 

AAGTGGGCCA TCCTTGCAAT ATTGTTGGGC ATAGCATTGC TCTTTTGCAT CCTGTTTACG 2340 

CTGGTCTGTG GGGCTTCTGG GACGTCTAAA CAACCAAAAG TAATTCCTGA TGATTTAGCC 2400' 

CAGCAGAACC TAATTGTATC AAACACAGAA GCTCCTGGAG ATGACAAAGT GTATTCTGCG 2460 

AATGGCTTCA CAACCCAAAC TGTGGGCGCT TCTGCTCAGG GAGTTTGTGG CACCGTGGGA 2520 
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TCAGGAATCA AAAACGGAGG TCAGGAGACC 
TCGGAATCCT GCCGGGGGGC TGGCCACCAT 
ACGGAGGTGG ACAACTGCAG ATACACTTAC 
CTTGGTGAAA AAGTGTATCT GTGTAATCAA 
5 GTCCTGACAT ATAACTATGA AGGAAGAGGA 
GAACGACAAG AAGAAGATGG GCTTGAATTT 
CTAGCAGAAG CATGCATGAA GAGATGAGTG 
TTTATGACTT TTAAAAAAAA TTACAAACCA 
TGGGGGTTTT TCTCTCATTA TTTGGATGGA 

10 AGACACTATA AACAAGTACA CAAATTTTTC 
TCTATCCAAG GAGGTCTACA GAGAAATTAA 
ATGACAACAG CCAATTTATA GTGCAATAAA 
TATTTGAAGC ACAACCTAAT GGAAAATTGT 
AATTAAGTGT TCATGTGGTG CTTGGAAACT 

15 ACTGCATTCT TGCTATTATT TTATTCTTGT 
TTTCTAGCCA GGCATTGACT ATTACAATTT 



ATCGAAATGG TGAAAGGAGG ACACCAGACC 2S80 

CACACCCTGG ACTCCTGCAG GGGAGGACAC 2640 

TCGGAGTGGC ACAGTTTTAC TCAGCCCCGT 2700 

GATGAAAATC ACAAGCATGC CCAAGACTAT 2760 

TCGGTGGCTG GGTCTGTAGG TTGTTGCAGT 2820 

TTGGATAATT TGGAGCCCAA ATTTAGGACA 2880 

TGTTCTAATA AGTCTCTGAA AGCCAGTGGC 2940 

AGAATTTTTT AAAGCAGAAG ATGCTATTTG 3000 

ATCTCTTTGG TCAAATGCAC ATTTACAGAG 3060 

AATTTTTACA TATTTTTAAA TTACTTATCT 3120 

AGTCTGCCTT ATTTGTTACA TTTGGGTATA 3180 

ATGTAATTAA TTCAAGTCCT TATTATAGAC 3240 

AGAGACCTTG CTTTAACATT ATCTCCAGTT 3300 

GTTGTTTTCC TGAACATCTA AAGTGTGTAG 3360 

AATGTGACCT TTTCACTGTG CAAAGGGAGA 3420 
CATT 



20 



Seq ID N0» 416 Protein sequence 
Protein Accession #: NP_077740.1 



11 21 31 41 51 

| | ! ! 1 1 

MEAARPSGSW NGALCRLLLL TLAILIFASD ACKNVTLHVP SKLDAEKLVG RVNIiKECFTA 60 

ANLIHSSDPD FQILEDGSVY TTNTILLSSE KRSPTILLSN TENQBKKKIF VFLEHQTKVL 120 

25 KKRHTKEKVL RRAKRRWAPI PCSMLENSLG PFPLFLQQVQ SDTAQNYTIY YSIRGPGVDQ 180 

EPRNLFYVER DTGNLYCTRP VDREQYESFE 1 1 AFATTPDG YTPELPLPLI IKIEDENDNY 240 

PIFTEETYTP TIFENCRVGT TVGQVCATDK DEPDTMHTRL KYSIIGQVPP SPTLFSMHPT 300 

TGVITTTSSQ LDRELIDKYQ LKIKVQDMDG QYFGLQTTST CIINIDDVMD HLPTFTRTSY 360 

VTSVEENTVD VEILRVTVED KDLVNTANWR ANYTILKGNE NGNFKIVTDA KTNEGVLCW 420 

30 KPLNYEEXQQ MILQIGWNE APFSREASPR SAMSTATVTV NVEDQDEGPE CNPPIQTVRW 480 

KENAEVGTTS NGYKAYDPET RSSSGIRYKK LTDPTGWVTI DENTGSIKVF RSLDREAETI 540 

KNGIYNITVL ASDQGGRTCT GTLGIILQDV NDNSPFIPKK TVIICKPTMS SAEIVAVDPD 600 

EPIHGPPFDF SLESSTSEVQ RMWRLKAIND TAARLSYQND PPFGSYWPI TVRDRLGMSS 660 

VTSLDVTLCD CITENDCTHR VDPRIGGGGV QLGKWAILAI LLGIALLFCI LFTLVCGASG 720 

35 TSKQPKVIPD DLAQQNLIVS NTEAPGDDKV YSANGFTTQT VGASAQGVCG TVGSGIKNGG 780 

QETIEMVKGG HQTSESCRGA GHHHTLDSCR GGHTEVDNCR YTYSEWHSFT QPRLGEKVYL 840 

CNQDENHKHA QDYVLTYNYE GRGSVAGSVG CCSERQEEDG LEFLDNLEPK FRTIAEACMK 900 
R 

40 Seq ID NO: 417 DNA sequence 

Nucleic Acid Accession #: NM_004949.1 
Coding sequence: 202.. 2745 

1 11 21 31 41 51 

45 | | I I I i 

CGCCAAAGGA AAAGCCCCTT GGATGAGAGG CAGGCGCTTC AGAGAAGCTA AGAAAAGCAC 60 

CTCTCCGCGC GCCCCACCTC CTCCGCCTCG CGCTCCTCCT GAGCAGCGGG CCCAGACTGC 120 

GCTCCGGCCG CGGCCCXCGC CCCGCGGAGC CCTCCTACCC CGGCCCGACG CTCGGCCCGC 180 

GACCTGCCCC GAGCCCTCTC CATGGAGGCA GCCCGCCCCT CCGGCTCCTG GAACGGAGCC 240 

50 CTCTGCCGGC TGCTCCTGCT GACCCTCGCG ATCTTAATAT TTGCCAGTGA TGCCTGCAAA 300 

AATGTGACAT TACATGTTCC CTCCAAACTA GATGCCGAGA AACTTGTTGG TAGAGTTAAC 360 

CTGAAAGAGT GCTTTACAGC TGCAAATCTA ATTCATTCAA GTGATCCTGA CTTCCAAATT • 420 

TTGGAGGATG GTTCAGTCTA TACAACAAAT ACTATTCTAT TGTCCTCGGA GAAGA GAAG T 480 

TTTACCATAT TACTTTCCAA CACTGAGAAC CAAGAAAAGA AGAAAATATT TGTCTTTTTG 540 

55 GAGCATCAAA CAAAGGTCCT AAAGAAAAGA CATACTAAAG AAAAAGTTCT AAGGCGCGCC 600 

AAGAGAAGAT GGGCTCCAAT TCCTTGTTCG ATGCTAGAAA ACTCCTTGGG TCCTTTTCCA 660 

CTTTTCCTTC AACAGGTTCA ATCTGACACG GCCCAAAACT ATACCATATA CTATTCCATA 720 

AGAGGTCCTG GAGTTGACCA AGAACCTCGG AATTTATTTT ATGTGGAGAG AGACACTGGA 780 

AACTTGTATT GTACTCGTCC TGTAGATCGT GAGCAGTATG AATCTTTTGA GATAATTGCC B40 

60 TTTGCAACAA CTCCAGATGG GTATACTCCA GAACTTCCAC TGCCCC TAAT AATC AAAATA 900 

GAGGATGAAA ATGATAACTA CCCAATTTTT ACAGAAGAAA CTTATACTTT TACAATTTTT 960 

GAAAATTGCA GAGTGGGCAC TACTGTGGGA CAAGTGTGTG CTACTGACAA AGATGAGCCT 1020 

GACACGATGC ACACACGCCT GAAGTACTCC ATCATTGGGC AGGTGCCACC ATCACCCACC 1080 

CTATTTTCTA TGCATCCAAC TACAGGCGTG ATCACCACAA CATCATCTCA GCTAG ACAGA 1140 

65 GAGTTAATTG ACAAGTACCA GTTGAAAATA AAAGTACAAG ACATGGATGG TCAGTATTTT 1200 

GGTCTACAGA CAACTTCAAC TTGTATCATT AACATTGATG ATGTAAATGA CCACTTGCCA 1260 

ACATTTACTC GTACTTCTTA TGTGACATCA GTGGAAGAAA ATACAGTTGA TGTGGAAATC 1320 

TTACGAGTTA CTGTTGAGGA TAAGGACTTA GTGAATACTG CTAACTGGAG AGCTAATTAT 1360 

ACCATTTTAA AGGGCAATGA AAATGGCAAT TTTAAAATTG TAACAGATGC CAAAACCAAT 1440 

70 GAAGGAGTTC TTTGTGTAGT TAAGCCTTTG AATTATGAAG AAAAGCAACA GATGATCTTG 1500 

CAAATTGGTG TAGTTAATGA AGCTCCATTT TCCAGAGAGG CTAGTCCAAG ATCAGCCATG 1560 

AGCACAGCAA CAGTTACTGT TAATGTAGAA GATCAGGATG AGGGCCCTGA GTGTAACCCT 1620 

CCAATACAGA CTGTTCGCAT GAAAGAAAAT GCAGAAGTGG GAACAACAAG CAATGGATAT 1680 

AAAGCATATG ACCCAGAAAC AAGAAGTAGC AGTGGCATAA GGTATAAGAA ATTAACTGAT 1740 

75 CCAACAGGGT GGGTCACCAT TGATGAAAAT ACAGGATCAA TCAAAGTTTT CAGAAGCCTG 1800 

GATAGAGAGG CAGAGACCAT CAAAAATGGC ATATATAATA TTACAGTCCT TGCATCAGAC 1860 

CAAGGAGGGA GAACATGTAC GGGGACACTG GG CATT AT AC TTCAAGACGT GAATGATAAC 1920 

AGCCCATTCA TACCTAAAAA GACAGTGATC ATCTGCAAAC CCACCATGTC ATCTGCGGAG 19 B0 

ATTGTTGCGG TTGATCCTGA TGAGCCTATC CATGGCCCAC CCTTTGACTT TAGTCTGGAG 2040 

80 AGTTCTACTT CAGAAGTACA GAGAATGTGG AGACTGAAAG CAATTAATGA TACAGCAGCA 2100 

CGTCTTTCCT ATCAGAATGA TCCTCCATTT GGCTCATATG TAGTACCTAT AACAGTGAGA 2160 

GATAGACTTG GCATGTCTAG TGTCACTTCA TTGGATGTTA CACTGTGTGA CTGCATTACC 2220 

GAAAATGACT GCACACATCG TGTAGATCCA AGGATTGGCG GTGGAGGAGT ACAACTTGGA 2280 

AAGTGGGCCA TCCTTGCAAT ATTGTTGGGC ATAGCATTGC TCTTTTGCAT CCTGTTTACG 2340 
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10 

15 

20 

25 

30 

35 
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CTGGTCTGTG 
CAGCAGAACC 
AATGGCTTCA 
TCAGGAATCA 
TCGGAATCCT 
ACGGAGGTGG 
CTTGGTGAAG 
GTATCTGTGT 
CTATGAAGGA 
AGATGGGCTT 
CATGAAGAGA 
AAAAAATTAC 
TCATTATTTG 
AGTACACAAA 
TCTACAGAGA 
TTTATAGTGC 
CCTAATGGAA 
GTGGTGCTTG 
ATTATTTTAT 
TTGACTATTA 



GGGCTTCTGG 
TAATTGTATC 
CAACCCAAAC 
AAAACGGAGG 
GCCGGGGGGC 
ACAACTGCAG 
AATCCATTAG 
AATCAAGATG 
AGAGGATCGG 
GAATTTTTGG 
TGAGTGTGTT 
AAACCAAGAA 
GATGGAATCT 
TTTTTCAATT 
AATTAAAGTC 
AATAAAATGT 
AATTGTAGAG 
GAAACTGTTG 
TCTTGTAATG 
CAATTTCATT 



GACGTCTAAA 
AAACACAGAA 
TGTGGGCGCT 
TCAGGAGACC 
TGGCCACCAT 
ATACACTTAC 
AGGACACACT 
AAAATCACAA 
TGGCTGGGTC 
ATAATTTGGA 
CTAATAAGTC 
TTTTTTAAAG 
CTTTGGTCAA 
TTTACATATT 
TGCCTTATTT 
AATTAATTCA 
ACCTTGCTTT 
TTTTCCTGAA 
TGACCTTTTC 



CAACCAAAAG 
GCTCCTGGAG 
TCTGCTCAGG 
ATCGAAATGG 
CACACCCTGG 
TCGGAGTGGC 
CTGATTAAAA 
GCATGCCCAA 
TGTAGGTTGT 
GCCCAAATTT 
TCTGAAAGCC 
CAGAAGATGC 
ATGCACATTT 
TTTAAATTAC 
GTTACATTTG 
AGTCCTTATT 
AACATTATCT 
CATCTAAAGT 
ACTGTGCAAA 



TAATTCCTGA 
ATGACAAAGT 
GAGTTTGTGG 
TGAAAGGAGG 
ACTCCTGCAG 
ACAGTTTTAC 
ATTAAACAAT 
GACTATGTCC 
TGCAGTGAAC 
AGGACACTAG 
AGTGGCTTTA 
TATTTGTGGG 
ACAGAGAGAC 
TTATCTTCTA 
GGTATAATGA 
ATAGACTATT 
CCAGTTAATT 
GTGTAGACTG 
GGGAGATTTC 



TGATTTAGCC 
GTATTCTGCG 
CACCGTGGGA 
ACACCAGACC 
GGGAGGACAC 
TCAGCCCCGT 
GAAAGAAAGT 
TGACATATAA 
GACAAGAAGA 
CAGAAGCATG 
TGACTTTTAA 
GGTTTTTCTC 
ACTATAAACA 
TCCAAGGAGG 
CAACAGCCAA 
TGAAGCACAA 
AAGTGTTCAT 
CATTCTTGCT 
TAGCCAGGCA 



2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3160 
3240 
3300 
3360 
3420 
3480 



Seq ID NO: 418 Protein sequence 
Protein Accession ft: NP 004940.1 



1 
I 

MBAARPSGSW 
ANIiIHSSDPD 
KKRHTKEKVIi 
EPRNLFYVER 
PIFTEETYTF 
TGVITTTSSQ 
VTSVEENTVD 
KPLNYEEKQQ 
KENAEVGTTS 
KNGIYNITVL 
EPIHGPPFDF 
VTSLDVTLCD 
TSKQPKVIPD 
QETIEMVKGG 
GHTLIKN 



11 
I 

NGALCRLLLL 
FQILEDGSVY 
RRAKRRWAPI 
DTGNLYCTRP 
TIFENCRVGT 
LDRELTDKYQ 
VEILRVTVED 
MILQIGWNE 
NGYKAYDPET 
ASDQGGRTCT 
SLESSTSEVQ 
CITENDCTHR 
DLAQQNLIVS 
HQTSESCRGA 



21 
I 

TLAILIFASD 
TTNTILLSSE 
PCSMLENSLG 
VDREQYESFE 
TVGQVCATDK 
LKIKVQDMDG 
KDLVNTANWR 
APPSREASPR 
RSSSGIRYKK 
GTLGIIIiQDV 
RMWRliKAIND 
VDPRIQGGGV 
NTEAPGDDKV 
GHHHTLDSCR 



31 
I 

ACKNVTLHVP 
XRSFTILLSN 
PFPLFLQQVQ 
IIAFATTPDG 
DEPDTMHTRL 
QYFGLQTTST 
ANYTILKGNE 
SAMSTATVTV 
LTDPTGWVTI 
NDNSPFIPKK 
TAARLSYQND 
QLGKWAILAI 
YSANGFTTQT 
GGHTEVDNCR 



41 
I 

SKLDAEKLVG 
TENQEKKKIF 
SDTAQNYTIY 
YTPELPLPLI 
KYSIIGQVPP 
CIINIDDVND 
NGNFKIVTDA 
NVEDQDEGPE 
DENTGSIKVF 
TVIICKPTMS 
PPFGSYWPI 
LLGIALLPCI 
VGASAQGVCG 
YTYSEWHSFT 



51 
I 

RVNUCECFTA 
VFLEHQTKVL 
YSIRGPGVDQ 
IKIED END NY 
SPTLFSMHPT 
HLPTFTRTSY 
KTNEGVLCW 
CNPPIQTVRM 
RSLDREAETI 
SAEIVAVDPD 
TVRDRLGMSS 
LFTLVCGASG 
TVGSGIKNGG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



Seq ID NO: 419 DNA sequence 
Nucleic Acid Accession #; NM_002722.1 
Coding sequence: 14.. 301 



1 
I 

ACTCTGGACT 
GCGTGGCTCT 
ACCCAGGGGA 
ACATCAACAT 
TCTCGGAGTG 
AATGCCACCT 
ACCCTTGGCT 
AAGCC 



11 
I 

CCGGATGGCT 
GTTACTACAG 
CAATGCCACA 
GCTGACCAGG 
GGGGTCCCCG 
TCTGTCTCCT 
CTGGCCAAAG 



21 
I 

GOCGCACGCC 
CCACTGCTGG 
CCAGAGCAGA 
CCTAGGTATG 
CATGCTGCTG 
ACGACTCCAT 
CTTGCTCCCT 



31 
I 

TCTGCCTCTC 
GTGCCCAGGG 
TGGCCCAGTA 
GGAAAAGACA 
TCCCCAGGGA 
GAGCAGCGCC 
GCTCCCACAC 



41 

I 

CCTGCTGCTC 
AGCCCCACTG 
TGCAGCTGAT 
CAAAGAGGAC 
GCTCAGCCCG 
AGCCCAGCTC 
AGGCTCAATA 



51 
I 

CTGTCCACCT 
GAGCCAGTGT 
CTCCGTAGAT 
AGGCTGGCCT 
CTGGACTTAT 
TCCCCTCTGC 
AAGCAAGTCA 



Seq ID NO: 420 Protein sequence 
Protein Accession #: NP_002713.1 

1 11 21 31 41 51 

I I I 1 I I 

MAAARLCLSL LliLSTCVALL LQPIiLGAQGA PLEPVYPGDN ATPEQMAQYA ADLRRYINMI* 
TRPRYGKRHK EDTLAFSEWG SPHAAVPREL SPLDL 

Seq ID NO: 421 DNA sequence 

Nucleic Acid Accession #: NM_032545.1 

Coding sequence: 46.. 718 



l 
I 

AAACTGATCT 
CCATGTCAGG 
CTATCAAAGA 
GCACCGACAG 
CGAGGGCTGG 
CGCGCGGCCG 
CCCGGCCCAC 
CCTGGAGCAC 
CCTGCACTGC 
CCACGCTCAC 
ACTCCTGCAC 
CGTCCTCCAG 
TTCTATGTTG 



11 
I 

TCAATGCACT 
CTTCTGTTTA 
GAGAAACATA 
TCACCGCTCA 
GGGCCGGAGG 
CGCTGCTGCA 
TTCACCGGCC 
GGAGCCTGGA 

GGGCCGAGCG 
CGCCTCCTGC 



TAAATAATAG 



21 
I 

AAGAGAAGGA 
CGGTCAGTTT 
ACGGCGGTAG 
ACTGGACCTC 
AGCCGCTCCC 
GGAACGGCGG 
GCTACTGCGA 
CCCTCCGCGC 
AGACGCCTGA 
CCGGGGGCGC 
GCCCGGATGC 
GCCCCTGCGG 
ATGTGTTTAG 



31 
I 

GACTCTCAAA 
GGCATTACAG 
AGAGGAAGTC 
CAGTCATTTC 
CTACTCCCGG 
TACCTGCGTG 
GCATGACCAG 
CTGCCACCTC 
CCGCTGTGAC 
GCCCAGCCTG 
GCCCGCGCAC 
AAGGCCGGGA 
TTTACCGTAA 



41 
I 

CCAAAAATGA 
ATCATCAATT 
ACCAAGGTTG 
GGAGAGGTGA 
GCTTTCGGAG 
CTGGGCAGCT 
AGGCGCAGTG 
TGCAGGTGCA 
CCGAAAGACT 
CTACTCTTGC 
CCTCGGTCCC 
CTTGGGCATC 
GCTGAAGCAC 



60 
120 
180 
240 
300 
360 
420 



60 



51 
I 

CCTGGAGGCA 
TGGGAAACAG 
CCACTCAGAA 
CTGGGAGCGC 
AGGGTGCGTC 
TCTGCGTGTG 
AATGCGGCGC 
TCTTCGGGGC 
TCCTGGCCTC 
TGCCCTGCGC 
TGGTCCCTTC 
GCCTTTAATT 
TGGGTGAATA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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TTTTTATTGG GTAATAAATA TTTTCATGAA AGCGCCAAAA AAAAAAAAAA AAAAAAAAAA 
AAAAAA 

Seq ID NO: 422 Protein sequence 
Protein Accession Jh NP_1 15934.1 



840 



11 



21 31 41 51 

I I I I 

MTWRHHVRLL FTVSLALQII NLGNSYQR5K HNGGREEVTK VATQKHRQSP IiNWTSSHPGE 60 
VTGSAEGWGP EEPLPYSRAF GEGASARPRC CRNGGTCVLG SFCVCPAHFT GRYCEHDQRR 120 
SECGALEHGA WTLRACHLCR CIPGALHCLP LQTPDRCDPK DFLASHAHGP SAGGAPSLLL 180 
LLPCALLHRL LRPDAPAHPR SLVPSVLQRE RRPCGRPGLG HRL 

Seq ID NO.* 423 DMA sequence 

Nucleic Acid Accession #: NMJJ06S33.1 

Coding sequence: 72.. 467 



AGGGAGAGAG 
CACAGTCCAC 
TCTCCGGACC 
CGGACCAGGA 
CCGACTGCCG 
AGGGCCGTGG 
CTGCTCGCCT 
GCAAAGTCGA 
CTGGCCCTGC 



11 
I 

GGAGGGGAGG 
GATGGCCCGG 
TGGTGTCAGG 
GTGCAGCCAC 
ATTCCTGACC 
GCGGCTCTTC 
GGGCTATTTC 
TGTGAAGACA 
CGTTTCCCCT 



21 

I 

AAATTGGAGA 
TCCCTGGTGT 
GGTGGTCCTA 
CCTATCTCCA 
ATTCACCGGG 
TGGGGAGGCA 
CCCAGTAGCA 
GACAAATGGG 
CCTTGGGTTT 



31 
I 

CCCCAGCACC 
GCCTTGGTGT 
TGCCCAAGCT 
TGGCTGTGGC 
GCCAAGTGGT 
GCGTTCAGGG 
TTGTCCGAGA 
ATTTCTACTG 
ATGCAAATAC 



41 
1 

CCCTTGCTCA 
CATCATCTTG 
GGCTGACCGG 
CCTTCAGGAC 
GTATGTCTTC 
AGATTACTAT 
GGACCAGACC 
CCAGTGAGCT 
AATCAGCCCA 



51 
I 

CTCTCTTGCT 
CTGTCTGCCT 
AAGCTGTGTG 
TACATGGCCC 
TCCAAGCTGA 
GGAGATCTGG 
CTGAAACCTG 
CAGCCTACCG 
GTGCAAAC 



Seq ID NO: 424 Protein sequence 
Protein Accession #s NP_006524.1 

l 11 



21 31 41 51 • 

I I I I I I 

MARSLVCLGV IILLSAFSGP GVRGGPMPKL ADRKLCADQE CSHPISMAVA LQDYMAPDCR 
FLTIHRGQW YVFSKLKGRG RLFWGGSVQG DYYGDLAARL GYFPSSIVRB DQTLKPGKVD 
VKTDKWDFYC Q 

Seq ID NO: 425 DNA sequence 

Nucleic Acid Accession #: NM_0B0870.1 

Coding sequence: 3.. 710 



I 

AGATGACACA 
AGAAAACCAC 
AAGGGAAAAA 
CCACTGAGAC 
TCACAAAGAC 
CTTCTCATCT 
TTACATCTAG 
CATACCTCAA 
ATTCATTCCC 
TGTTCCTTGG 
AGAACACCCA 
TGATGGAGCA 
GGCGCCCAGC 
AATTTCTATT 
CTTCATCTGT 
AGGGGACAAA 
CTGAGAATGA 
TGAGTCCTAA 
GGTTTATGGG 
TTATTTCCAT 
AGGACCCTTG 
AAGAAGAAAA 
CCAGAAGAAA 
TTAGTTTTCC 
GGACACGCAA 
GACCTTCCCT 
GTAGCTTTAT 
CCTTCACCCC 
ATTTTCTCTC 
AGGAGAATGA 
ATGTT 



11 



I 

AGTCACAGAA 
AAGAACCCCA 
CACACCAGTC 
CATAAAAGCC 
TATAAAACCT 
AAATAAAACT 
AACGAAGCTG 
TAAAGATGGC 
TGCATGGGCC 
CCTGATCTTC 
GTACAATGAT 
GCAGAATCTT 
CCTGGCTCTT 
TCCGGGACTA 
TCTTGAAACT 
GAAGAAAGAA 
AAAGGTGTTT 
AGGAAGGACA 
GAAAGGGAGG 
TCACTATTAC 
CCCCCACCCC 
GAGAATTTAT 
TCATAAATAT 
TAAATGTCTA 
GGAGGGGATT 
GATTGGTGTC 
CTCGTAAAAG 
AGATTCAAGT 
TCTCTTCTTA 
TGATGATAGT 



21 
I 

AAGTCCACAG 
GAAAAGCCTA 
CCAGAAAAGC 
CCAGTAAAGT 
TCAGTCAAGG 
GAAGTTACTC 
AGTTCTATCA 
TCACAGAAAG 
ATAGTTATTG 
TTGGTCTCCT 
GCAGAGGATG 
GGCATGGGCC 
CCATGCTCTG 
CAGGAAGGGC 
GGTTGGGGAA 
TGAATAATAC i 
GATGGACATG 
GGAGCCTTAT 
GACTGAGGGC 
TTAAGAGTTT 
CATTTTTTTA 
TTCCTTCTCC 
CTCTCATCTA 
CAGTGGACGC 
TTTATTTGGC 
TCAGCATTTA 
TTACCCATCT 
TTTCCTCCTT 
TGGCCATTTC 
TTCCTTCTGT 



Seq ID NOs 426 Protein sequence 
Protein Accession #: NP_543146.1 



41 
I 

i AAAGACCACG 
I AGAGAAGACC 
l CCTGGGGAAC 
i CCCAGAAAAA 
v CAAATCTCTC 
I CACTGGTTCT 
! CACAGGAAAC 
I TGGACAGATG 
' GGCTGTGATT 
? GACACGCCGC 
! CAATTCCTAC 
! CCCACGGTGA 
' GGATGAGGAA 
' GACGGTTACC 
i AGCAAGGAGG 
' TCTCTGTAGA 

• ACCAATGCAG 
! CCAGACTGAC 
! GGTTTCAGGA 
: AGGCTCATCT 
i AAAACAAAAA 
t TGCCCTGGAG 

• TCCTCTTCCT 
1 TGGCTTGCTG 
: ACCCACTGAT 
! TCTTCCACCA 

• CCCCATTCTC 

• CATCTGTGTG 
I TGATTGGGTA 
. TTTTTATAAT 



60 
120 
180 
240 
300 
360 
420 
480 




60 
120 



51 
I 

TCAACCACAG 
ATATGCACCA 
ACCACACTGA 
ACAGCAGCAG 
ACTACTACCT 
TTCACCCTCA 
GAGAGCCATC 
GGAGAGAATG 
CTCCTCCTGG 
ACACTAACCC 
CCGGTCTACC 
TCTTGGAGTA 
CCGGACTCAC 
AGTATTAACC 
GTGTAAGTTT 
AGGTAATGGT 
AACACTGCAC 
TTGTGAGTGG 
CAGCATTATG 
CTGAGTTCTC 
AAACGGATCC 
AAAAAAAAGT 
CCCAAATCCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 



1 11 21 31 41 51 

I I I 1 I I 

MTQVTEKSTE HPEKTTSTTE KTTRTPEKPT LYSEKTICTK GKNTPVPEKP TENLGNTTLT 
TETIKAPVKS TENPEKTAAV TKTIKPSVKV TGDKSLTTTS SHLNKTEVTH QVPTGSFTLI 
TSRTKLSSIT SEATGNESHP YLNKDGSQKG IHAGQMGEND SFPAWAIVIV VLVAVILLLV 

1120 



60 
120 
180 



WO 03/042661 



PCT/US02/36810 



FLGLIFLVSY KMRTRRTLTQ NTQYNDAEDE GGPNSYPVYL MEQQNLGMGQ I PS PR 

Seq ID NOs 427 DNA sequence 
Nucleic Acid Accession «: XM_0694B0.1 
5 Coding sequence: 1..4383 

1 11 21 31 41 51 

ATGGACACTG TGCTGGTGCT GCTCCTGGGC CTGCAGGCCT TGGCCGGACC CAGTCCGAAG 60 
10 CCCCAGAAGG ACTCTGTCTC AGACTGGGCC ATTGTGTTGA TCACTCTCAC TTTGGTGGCA 120 
GCAATTGTCA GCCTAATGTA CGGTATCAAG AAGGCCTGCC AGTTCCGGAG GGAGATGAGT 180 
CTGGGGTGTG GCTGTGGCTC TGTGACCCCT TACAGCAGCC ACCATGAGGG GGAGGCTGCC 240 
AGCCAGCGCT ACTCTTGTCA AATGAAAGCT TCTTGGGGGG CAGGTGCTAC TACATTCCAA 300 
GAATATCAGA AAACTGGGGA ACTCTCAACA TCCGATCACA TATTTCCCCT CACTCCAGGC 360 
15 CTTGTTTATA GTATCCCTTT TGATCACATT GTTCTGCATT CAGGACAAAG ACCTCCAGAG 420 
CTCCCTAAAT CTACAGAAAT CCATGAGCAA AAACGCCACT GCAACACCAC ACGCCATTCT 480 
AAGCCAACTG ACAAGCCTAC AGGCAACTCC AAAACTATAG ACCACAAAAG CTCTACAGAT 540 
AATCATGAGG CTCCTCCCAC TTCTGAAGAA AACTCCAGCA ACCAAGGGAA AGACCCAATG 600 
ATCCGGAACC AGCGCTCTGT TGATCCTGCT GACTCCACTA CCACACATAA AGAATCCGCT 660 
20 GGAAAAAAAC ATATAACGCC AGCACCCAAG AGCAAAATAA ACTGTCGTAA GTCCACAACA 720 
GGCAAATCAA CGGTAACAAG AAAATCAGAT AAAACTGGAA GACCTTTGGA AAAGTCCATG 780 
AGTACTTTGG ATAAGACAAG TACCAGCTCA CATAAGACTA CAACTTCCTT CCACAACTCA 840 
GGCAATTCAC AGACCAAGCA AAAAAGCACA TCTTTTCCAG AAAAAATCAC AGCAGCCTCA 900 
AAAACAACAT ACAAGACCAC AGGAACCCCA GAAGAGTCAG AAAAAACTGA AGATTCCAGA 960 
25 ACAACAGTTG CCTCAGACAA GCTCCTGACA AAAACTACAA AAAACATACA AGAGACCATA 1020 
TCAGCCAATG AGCTCACACA ATCTCTAGCA GAGCCTACAG AACATGGAGG AAGGACAGCC 1080 
AATGAGAACA ACACACCATC CCCAGCAGAG CCTACAGAAA ATAGAGAAAG GACAGCCAAT 1140 
GAGAACACCA CACTATCCCC AGCAGAGCCT ACAGAAAATA GAGAAAGGAC AGCCAATGAG 1200 
AACACCGCAC CATTCCCAGC AGGGCCTACA GAAAATAGAG AAATGACAGC CAATGAGAAT 1260 
30 ACCACACTAT TCCCAGCAGA GCCTACAGAA CATGGAGAAA GGACAGCCAA TGAGAACACC 1320 
ACACCATCCC CAGCAGAGCC TACAGAACAT GGAGAAAGGA CAGCCAATGA GAACACTACA 1380 
CCATCCCCAG CAGAGCCTAC AGAACATGGA GAAAGGACCC CATTTGCCAA TGACAAAACC 1440 
ACATCATCCT CAGCAGAGTC TACAGAACAT GGAGAAAGGA CCCCACTGGC CAACGAGAAC 1500 
ACCACACCAT CCCCAGCAGA GCCTACAGAA AATAGAGAAA GGACAGCCAA TGAGAACACC 1560 
35 ACACCATCCC CAGCAGGGCC TACAGAAAAC AGAGAAACGA CAGCCAACGA GAAGACCACA 1620 
CTATCCCCAG TAGAGCCTAC AGAAAATAGA GAAACAACAG CCAATGAGAA GACCACACCA 1680 
TCCCCAGCAG AGCCTACAGA AAATGGACAA AGGACCCCAT TTGCCAATGA GAAAACCACA 1740 
TCATCCTCAG CAGAGCCTAC AGAACACGGA GAAAGGACCC CACTGGCCAA TGAGAACACC 1800 
ACACCATCCC CAGCAGAGCC TACAGAAAAT AGAGAAAGGA CAGCC AATGA GAAGACCACA 1860 
40 CCATCCCCAG CAGAGCCTAC AGAAAATGGA GACAGGACTC CTTTGGCCAA TGAGAACACC 1920 
ACGCCATCTC TAGCAGAGCC TACAGAAAAT GGACAAAGGA CCCCATTTGC CAATGAGAAG 1980 
ACCACATCAT CCTCAGCAGA GCCTACAGAA CACGAAGAAA GGACTCCACT GGCCAATGAG 2040 
AACACCACAC CATCCCCGGC AGAGCCTACA GAAAATAGAG AAAGGACAGC CAATGAGAAC 2100 
ACCACACCAT CCCCAGCAGG GCCTACAGAA AATAGAGAAA TGACAGCCAA CGAGAAGACC 2160 
45 ACACTATTCC CAGCAGAGCC TACAGAAAAT AGAGAAAGGA CAGCCAATGA GAAGACCACA 2220 
TCATCCCCAG CAGAGCCTAC AGAAAATGGA CAAAGGACCC CATTTGCCAA TGAGAAAACC 2280 
ACATCATCCC CAGCAGAGCC TACAGAACAC GGAGAAAGGA CCCCACTGGC CAATGAGAAC 2340 
ACCACACTAT CCCCAGCAGA GCCTACAGAA AATAGAGAAA GGACAGCCAA TGAGAAGACC 2400 
ACACCATTCC CAGCAGAGCC TACAGAAAAT AGAGAAAGGA CAGCCAATGA GAACACCACA 2460 
50 CCATCCCCAG CAGAGCCTAC AGAAAATGGA GACAGGACTC CATTGGCCAA TGAGAAGACC 2520 
ACACCATCTC TAGCAGAGCC TACAGAAAAT GGAAAAAGGA CCCCATTTGC CAATGAGAAG 2580 
ACCACATCAT CCTCAGCAGA GCCTACAGAA CACGCAGAAA GGACTCCACT GGCCAATGAG 2640 
AACACCACAT CATCCCCAGC AGAGCCTACA GAAAATAGAG AAAGGACAGC CAATGAGAAG 2700 
ACCACACAAT TCCCAGCAGA GCCTACAGAA AATAGAGAAA GCACAGCCAA TGAGAAGACC 2760 
55 ACACCATTCC CAGCAGAGCC TACAGAAAAT AGAGAATGGA CAGCCAATGA GAACACCACA 2820 
CTATCCCCAG CAGAGCCTAC AGAACATGAA GAAATGACCC CATTGGCCAA TGAGAAGACC 2880 
ACACTATCCC CAGCAGAGCC TACAGAAAAT GGAGAAAGGA CCCCATTTAC CAATGAGAAG 2940 
ACCACACCAT CCTCAGCAGA GCCTACAGAA CATGGAGAAA GGACCCCACT GGCCAATGAG 3000 
ATCACCACAC CATCCCGAGC AGAGCCTACA GAACATGGAG AAAGGATAGC CAATGAGAAG 3060 
60 GCCACACCAT CCCCAGCAAA GCCTACAGAA CATGGAGAAA CGACAGTCAA TGAGGACACC 3120 
ACACCATCCT CAGCAGAGCC TACAGAAAAT GGAGAAAGGA CCCCACTGGC CAATGAGAAC 3180 
ACCACAACAT CCCCAACAGA GTCTACAGAA CATGGAGAAA GGACAGCCAA TGAGAAGACC 3240 
ACACCATCCC CAGCAGAGCC TACAGAACAT GGAGAAAGGA CACCATCAGC CAATGAGAAG 3300 
ACCATACCAT CTCCAGCAAA GCCTACAGAA CACGAAGAAA TGACCCCATC GGCCAATGAG 3360 
65 AACACCACAC CATCCCCAGT AAAGCCTACA GAACATGGAG AAAAGACTAC ATTGGCCAAT 3420 
GAGAAGATCA CACTATCCCC AGAAGGGCCT ACAGAACATG GAGCAAAAAC TACGTCGGCC 3480 
AATGAGAACA TCACACCATC CCTAGCAAAG CCTACAGAAC ATGGAGAAAG GACCACATCA 3540 
CCCAATGACA AGATCACCTC ATCTGCAGCA GAGTCTACAG AACATAGAGA TAGGGCTACA 3600 
TCAGCCAATG TGATCACACC AGCCCCAGCA GAGCCTATAA AACATGCAAA AAGGACCACA 3660 
70 TTGGCCCATG AGAAGATGAC ACAAGTCACA GAAAAGTCCA CAGAACACCC AGAAAAGACC 3720 
ACGTCAACCA CAGAGAAAAC CACAAGAACC CCAGAAAAGC CTACGCTATA CTCAGAGAAG 3780 
ACCATATGCA CCAAAGGGAA AAACACACCA GTCCCAGAAA AGCCTACAGA AAACCTGGGG 3840 
AACACCACAC TGACCACTGA GACCATAAAA GCCCCAGTAA AGTCCACAGA AAACCCAGAA 3900 
AAAACAGCAG CAGTCACAAA GACTATAAAA CCTTCAGTCA AGGTCACAGG AGACAAATCT 3960 
75 CTCACTACTA CCTCTTCTCA TCTAAATAAA ACTGAAGTTA CTCATCAGGT GCCCACTGGT 4020 
TCTTTCACCC TCATTACATC TAGAACGAAG CTGAGTTCTA TCACATCAGA AGCCACAGGA 4080 
AACGAGAGCC ATCCATACCT CAATAAAGAT GGCTCACAGA AAGGTATCCA CGCTGGACAG 4140 
ATGGGAGAGA ATGATTCATT CCCTGCATGG GCCATAGTTA TTGTGGTCCT GGTGGCTGTG 4200 
ATTCTCCTCC TGGTGTTCCT TGGCCTGATC TTCTTGGTCT CCTATATGAT GCGGACACGC 4260 
80 CGCACACTAA CCCAGAACAC CCAGTACAAT GATGCAGAGG ATGAGGGTGG CCCCAATTCC 4320 
TACCCGGTCT ACCTGATGGA GCAGCAGAAT CTTGGCATGG GCCAGATCCC TTCCCCACGG 4380 
TGA 



1121 



WO 03/042661 



PCT/US02/36810 



Seq ID NO: 428 Protein sequence 
Protein Accession fh XP_069480-1 

1 11 21 31 41 51 

MDTVLVLLLG LQALAGPSPK PQKDSVSDWA IVLITLTLVA AIVSLMYGIK KACQFRREMS 60 

LGCGCGSVTP YSSHHEGEAA SQRYSCQMKA SWGAGATTFQ EYQKTGELST SDHTFPLTPG 120 

LVYSIPFDHI VLHSGQRPPE LPKSTEIHEQ KRHCNTTRHS KPTDKPTGNS KTIDHKSSTD 180 

NHEAPPTSEE NSSNQGKDPM IRNQRSVDPA DSTTTHKESA GKKHITPAPK SKINCRKSTT 240 

10 GKSTVTRKSD KTGRPLEKSM STLDKTSTSS HKTTTSFHNS GNSQTKQKST SFPEKITAAS 300 

KTTYKTTGTP EESEKTEDSR TTVASDKLLT KTTKNIQETI SANELTQSLA EPTEHGGRTA 360 

NENNTPSPAE PTENRERTAN ENTTLSPAEP TENRERTANE NTAPFPAGPT ENREMTANEN 420 

TTLFPAEPTE HGERTANENT TPSPAEPTEH GERTAMENTT PSPAEPTEHG ERTPFANDKT 480 

TSSSAESTEH GERTPLANEN TTPSPAEPTE NRERTANENT TPSPAGPTEN RETTANEKTT 540 

15 LSPVEPTENR ETTANEKTTP SPAEPTENGQ RTP FANE KIT SSSAEPTEHG ERTPLANENT 600 

TPSPAEPTEN RERTANEKTT PSPAEPTENG DRTPLANEKT TPSLAEPTEN GQRTPFANEK 660 

TTSSSAEPTE HEERTPLANE NTTPSPAEPT ENRERTANEN TTPSPAGPTE NREMTANEKT 720 

TLFPABPTEN RERTANEKTT SSPAEPTENG QRTPFANEKT TSSPAEPTEH GERTPLANEN 780 

TTLSPAEPTE NRERTANEKT TPFPAEPTEN RERTANENTT PSPAQPTENG DRTPLANEKT 840 

20 TPSLAEPTEN GKRTPFANBK TTSSSAEPTE HAERTPLANE NTTSSPAEPT ENRERTANEK 900 

TTQFPAEPTE NRESTANEKT TPFPAEPTEN REWTANENTT LSPAEPTEHE EMTPLANEKT 960 

TLSPAEPTEN GERTPFTNEK TTPSSAEPTE HGERTPLANE ITTPSRAEPT EHGERIANEK 1020 

ATPSPAKPTE HGETTVNEDT TPSSAEPTEN GERTPLANEN TTTSPTESTE HGERTANEKT 1080 

TPSPAEPTEH GERTPSANEK TIPSPAKPTE HEEMTPSANE NTTPSPVKPT EHGEKTTLAN 1140 

25 BKITLSPEGP TEHGAKTTSA NEKITPSLAK PTEHGERTTS PNDKITSSAA ESTEHRDRAT 1200 

SANVITPAPA EPIKHAKRTT LAHEKMTQVT EKSTEHPEKT TSTTEKTTRT PEKPTLYSEK 1260 

TICTKGKNTP VPEKPTENLG NTTLTTETIK APVKSTENPB KTAAVTKTIK PSVKVTGDKS 1320 

LTTTSSHLNK TEVTHQVPTG SFTLITSRTK LSSITSEATG NESHPYLNKD GSQKGIHAGQ 1380 

MGENDSFPAW AIVIWLVAV ILLLVFLGLI FLVSYMMRTR RTLTQNTQYN DAEDEGGPNS 1440 
30 YPVYLMEQQN LGMGQIPSPR 
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ATGTGGCCTC GCCTGGCCTT TTGTTGCTGG GGTCTGGCGC TCGTTTCGGG CTGGGCGACC 60 

TTTCAGCAGA TGTCCCCGTC GCGCAATTTC AGCTTCCGCC TCTTCCCCGA GACCGCGCCC 120 

40 GGGGCCCCCG GGAGTATCCC CGCGCCGCCC GCTCCTGGCG ACGAAGCGGC GGGGAGCAGA 180 

GTGGAGCGGC TGGGCCAGGC GTTCCGGCGA CGCGTGCGGC TGCTGCGGGA GCTCAGCGAG 240 

CGCCTGGAGC TTGTCTTCCT GGTGGATGAT TCGTCCAGCG TGGGCGAAGT CAACTTCCGC 300 

AGCGAGCTCA TGTTCGTCCG CAAGCTGCTG TCCGACTTCC CCGTGGTGCC CACGGCCACG 360 

CGCGTGGCCA TCGTGACCTT CTCGTCCAAG AACTACGTGG TGCCGCGCGT CGATTACATC 420 

45 TCCACCCGCC GCGCGCGCCA GCACAAGTGC GCGCTGCTCC TCCAAGAGAT CCCTGCCATC 480 

TCCTACCGAG GTGGCGGCAC CTACACCAAG GGCGCCTTCC AGCAAGCCGC GCAAATTCTT 540 

CTTCATGCTA GAGAAAACTC AACAAAAGTT GTATTTCTCA TCACTGATGG ATATTCCAAT 600 

GGGGGAGACC CTAGACCAAT TGCAGCGTCA CTGCGAGATT CAGGAGTGGA GATCTTCACT 660 

TTTGGCATAT GGCAAGGGAA CATTCGAGAG CTGAATGACA TGGCTTCCAC CCCAAAGGAG 720 

50 GAGCACTGTT ACCTGCTACA CAGTTTTGAA GAATTTGAGG CTTTAGCTCG CCGGGCATTG 780 

CATGAAGATC TACCTTCTGG GAGTTTTATT CAAGATGATA TGGTCCACTG CTCATATCTT 840 

TGTGATGAAG GCAAGGACTG CTGTGACCGA ATGGGAAGCT GCAAATGTGG GACACACACA 900 

GGCCATTTTG AGTGCATCTG TGAAAAGGGG TATTACGGGA AAGGTCTGCA GTATGAATGC 960 

ACAGCTTGCC CATCGGGGAC ATACAAACCT GAAGGCTCAC CAGGAGGAAT CAGCAGTTGC 1020 

55 ATTCCATGTC CTGATGAAAA TCACACCTCT CCACCTGGAA GCACATCCCC TGAAGACTGT 1080 

GTCTGCAGAG AGGGATACAG GGCATCTGGC CAGACCTGTG AACTTGTCCA CTGCCCTGCC 1140 

CTGAAGCCTC CCGAAAATGG TTACTTTATC CAAAACACTT GCAACAACCA CTTCAATGCA 1200 

GCCTGTGGGG TCOGATGTCA CCCTGGATTT GATCTTGTGG GAAGCAGCAT CATCTTATGT 1260 

CTACCCAATG GTTTGTGGTC CGGTTCAGAG AGCTACTGCA GAGTAAGAAC ATGTCCTCAT 1320 

60 CTCCGCCAGC CGAAACATGG CCACATCAGC TGTTCTACAA GGGAAATGTT ATATAAGACA 13 B0 

ACATGTTTGG TTGCCTGTGA TGAAGGGTAC AGACTAGAAG GCAGTGATAA GCTTACTTGT 1440 

CAAGGAAACA GCCAGTGGGA TGGGCCAGAA CCCCGGTGTG TGGAGCGCCA CTGTTCCACC 1500 

TTTCAGATGC CCAAAGATGT CATCATATCC CCCCACAACT GTGGCAAGCA GCCAGCCAAA 1560 

TTTGGGACGA TCTGCTATGT AAGTTGCCGC CAAGGGTTCA TTTTATCTGG AGTCAAAGAA 1620 

65 ATGCTGAGAT GTACCACTTC TGGAAAATGG AATGTCGGAG TTCAGGCAGC TGTGTGTAAA 1680 

GACGTGGAGG CTCCTCAAAT CAACTGTCCT AAGGACATAG AGGCTAAGAC TCTGGAACAG 1740 

CAAGATTCTG CCAATGTTAC CTGGCAGATT CCAACAGCTA AAGACAACTC TGGTGAAAAG 1800 

GTGTCAGTCC ACGTTCATCC AGCTTTCACC CCACCTTACC TTTTCCCAAT TGGAGATGTT 1860 

GCTATCGTAT ACACGGCAAC TGACCTATCC GGCAACCAGG CCAGCTGCAT TTTCCATATC 1920 

70 AAGGTTATTG ATGCAGAACC ACCTGTCATA GACTGGTGCA GATCTCCACC TCCCGTCCAG 1980 

GTCTCGGAGA AGGTACATGC CGCAAGCTGG GATGAGCCTC AGTTCTCAGA CAACTCAGGG 2040 

GCTGAATTGG TCATTACCAG AAGTCATACA CAAGGAGACC TTTTCCCTCA AGGGGAGACT 2100 

ATAGTACAGT ATACAGCCAC TGACCCCTCA GGCAATAACA GGACATGTGA TATCCATATT 2160 

GTCATAAAAG GTTCTCCCTG TGAAATTCCA TTCACACCTG TAAATGGGGA TTTTATATGC 2220 

75 ACTCCAGATA ATACTGGAGT CAACTGTACA TTAACTTGCT TGGAGGGCTA TGATTTCACA 2280 

GAAGGGTCTA CTGACAAGTA TTATTGTGCT TATGAAGATG GCGTCTGGAA ACCAACATAT 2340 

ACCACTGAAT GGCCAGACTG TGCCAAAAAA CGTTTTGCAA ACCACGGGTT CAAGTCCTTT 2400 

GAGATGTTCT ACAAAGCAGC TCGTTGTGAT GACACAGATC TGATGAAGAA GTTTTCTGAA 2460 

GCATTTGAGA CGACCCTGGG AAAAATGGTC CCATCATTTT GTAGTGATGC AGAGGACATT 2520 

80 GACTGCAGAC TGGAGGAGAA CCTGACCAAA AAATATTGCC TAGAATATAA TTATGACTAT 2580 

GAAAATGGCT TTGCAATTGG ACCAGGTGGC TGGGGTGCAG CTAATAGGCT GGATTACTCT 2640 

TACGATGACT TCCTGGACAC TGTGCAAGAA ACAGCCACAA GCATCGGCAA TGCCAAGTCC 2700 

TCACGGATTA AAAGAAGTGC CCCATTATCT GACTATAAAA TTAAGTTAAT TTTTAACATC 2760 

ACAGCTAGTG TGCCATTACC CGATGAAAGA AATGATACCC TTGAATGGGA AAATCAGCAA 2820 
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CGACTCCTTC AGACATTGGA AACTATCACA AATAAACTGA AAAGGACTCT CAACAAAGAC 28B0 
CCCATGTATT CCTTTCAGCT TGCATCAGAA ATACTTATAG CCGACAGCAA TTCATTAGAA 2940 
ACAAAAAAGG CTTCCCCCTT CTGCAGACCA GGCTCAGTGC TGAGAGGGCG TATGTGTGTC 3000 
AATTGCCCTT TGGGAACCTA TTATAATCTG GAACATTTCA CCTGTGAAAG CTGCCGGATC 3060 
GGATCCTATC AAGATGAAGA AGGGCAACTT GAGTGCAAGC TTTGCCCCTC TGGGATGTAC 3120 
ACGGAATATA TCCATTCAAG AAACATCTCT GATTGTAAAG CTCAGTGTAA ACAAGGCACC 3180 
TACTCATACA GTGGACTTGA GACTTGTGAA TCGTGTCCAC TGGGCACTTA TCAGCCAAAA 3240 
TTTGGTTCCC GGAGCTGCCT CTCGTGTCCA GAAAACACCT CAACTGTGAA AAGAGGAGCC 3300 
GTGAACATTT CTGCATGTGG AGTTCCTTGT CCAGAAGGAA AATTCTCGCG TTCTGGGTTA 3360 
ATGCCCTGTC ACCCATGTCC TCGTGACTAT TACCAACCTA ATGCAGGGAA GGCCTTCTGC 3420 
CTGGCCTGTC CCTTTTATGG AACTACCCCA TTCGCTGGTT CCAGATCCAT CACAGAATGT 3480 
TCAACTTCAG TTCTGAATAT TACTATTTTC GGTGGATTTG GGCATCTGGA GTTGTTAAAT 3S40 
TGTCCTTCTG AGGTTTTCCA TGAATGCTTC TTTAACCCTT GCCACAATAG TGGAACCTGC 3600 
CAGCAACTTG GGCGTGGTTA TGTTTGTCTC TGTCCACTTG GATATACAGG CTTAAAGTGT 3660 
GAAACAGACA TCGATGAGTG CAGCCCACTG CCTTGCCTCA ACAATGGAGT TTGTAAAGAC 3720 
CTAGTTGGGG AATTCATTTG TGAGTGCCCA TCAGGTTACA CAGGTCAGCG GTGTGAAGAA 3780 
AATATAAATG AGTGTAGCTC CAGTCCTTGT TTAAATAAAG GAATCTGTGT TGATGGTGTG 3840 
GCTGGCTATC GTTGCACATG TGTGAAAGGA TTTGTAGGCC TGCATTGTGA AACAGAAGTC 3900 
AATGAATGCC AGTCAAACCC ATGCTTAAAT AATGCAGTCT GTGAAGACCA GGTTGGGGGA 3960 
TTCTTGTGCA AATGCCCACC TGGATTTTTG GGTACCCGAT GTGGAAAGAA CGTCGATGAG 4020 
TGTCTCAGTC AGCCATGCAA AAATGGAGCT ACCTGTAAAG ACGGTGCCAA TAGCTTCAGA 4080 
TGCCTGTGTG CAGCTGGCTT CACAGGATCA CACTGTGAAT TGAACATCAA TGAATGTCAG 4140 
TCTAATCCAT GTAGAAATCA GGCCACCTGT GTGGATGAAT TAAATTCATA CAGTTGTAAA 4200 
TGTCAGCCAG GATTTTCAGG CAAAAGGTGT GAAACAGAAC AGTCTACAGG CTTTAACCTG 42 SO 
GATTTTGAAG TTTCTGGCAT CTATGGATAT GTCATGCTAG ATGGCATGCT CCCATCTCTC 4320 
CATGCTCTAA CCTGTACCTT CTGGATGAAA TCCTCTGACG ACATGAACTA TGGAACACCA 4380 
ATCTCCTATG CAGTTGATAA CGGCAGCGAC AATACCTTGC TCCTGACTGA TTATAACGGC 4440 
TGGGTTCTTT ATGTGAATGG CAGGGAAAAG ATAACAAACT GTCCCTCGGT GAATGATGGC 4500 
AGATGGCATC ATATTGCAAT CACTTGGACA AGTGCCAATG GCATCTGGAA AGTCTATATC 4S60 
GATGGGAAAT TATCTGACGG TGGTGCTGGC CTCTCTGTTG GTTTGCCCAT ACCTGGTGGT 4620 
GGTGCGTTAG TTCTGGGGCA AGAGCAAGAC AAAAAAGGAG AGGGATTCAG CCCAGCTGAG 4680 
TCTTTTGTGG GCTCCATAAG CCAGCTCAAC CTCTGGGACT ATGTCCTGTC TCCACAGCAG 4740 
GTGAAGTCAC TGGCTACCTC CTGCCCAGAG GAACTCAGTA AAGGAAACGT GTTAGCATGG 4800 
CCTGATTTCT TGTCAGGAAT TGTGGGGAAA GTGAAGATCG ATTCTAAGAG CATATTTTGT 4860 
TCTGATTGCC CACGCTTAGG AGGGTCAGTG CCTCATCTGA GAACTGCATC TGAAGATTTA 4920 
AAGCCAGGTT CCAAAGTCAA TCTGTTCTGT GATCCAGGCT TCCAGCTGGT CGGGAACCCT 4980 
GTGCAGTACT GTCTGAATCA AGGACAGTGG ACACAACCAC TTCCTCACTG TGAACGCATT 5040 
AGCTGTGGGG TGCCACCTCC TTTGGAGAAT GGCTTCCATT CAGCCGATGA CTTCTATGCT 5100 
GGCAGCACAG TAACCTACCA GTGCAACAAT GGCTACTATC TATTGGGTGA CTCAAGGATG 5160 
TTCTGTACAG ATAATGGGAG CTGGAACGGC GTTTCACCAT CCTGCCTTGA TGTCGATGAG S220 
TGTGCAGTTG GATCAGATTG TAGTGAGCAT GCTTCTTGCC TGAACGTAGA TGGATCCTAC 5280 
ATATGTTCAT GTGTCCCACC GTACACAGGA GATGGGAAAA ACTGTGCAGA ACCTATAAAA 5340 
TGTAAGGCTC CAGGAAATCC GGAAAATGGC CACTCCTCAG GTGAGATTTA TACAGTAGGT 5400 
GCCGGAGTCA CATTTTCGTG TCAGGAAGGA TACCAGTTGA TGGGAGTAAC CAAAATCACA 5460 
TGTTTGGAGT CTGGAGAATG GAATCATCTA ATACCATATT GTAAAGCTGT TTCATGTGGT 5520 
AAACCGGCTA TTCCAGAAAA TGGTTGCATT GAGGAGTTAG CATTTACTTT TGGCAGCAAA 5580 
GTGACATATA GGTGTAATAA AGGATATACT CTGGCCGGTG ATAAAGAATC ATCCTGTCTT 5640 
GCTAACAGTT CTTGGAGTCA TTCCCCTCCT GTGTGTGAAC CAGTGAAGTG TTCTAGTCCG 5700 
GAAAATATAA ATAATGGAAA ATATATTTTG AGTGGGCTTA CCTACCTTTC TACTGCATCA 5760 
TATTCATGCG ATACAGGATA CAGCTTACAG GGCCCTTCCA TTATTGAATG CACGGCTTCT 5820 
GGCATCTGGG ACAGAGCGCC ACCTGCCTGT CACCTCGTCT TCTGTGGAGA ACCACCTGCC 5880 
ATCAAAGATG CTGTCATTAC GGGGAATAAC TTCACTTTCA GGAACACCGT CACTTACACT 5940 
TGCAAAGAAG GCTATACTCT TGCTGGTCTT GACACCATTG AATGCCTGGC CGACGGCAAG 6000 
TGGAGTAGAA GTGACCAGCA GTGCCTGGCT GTCTCCTGTG ATGAGCCACC CATTGTGGAC 6060 
CACGCCTCTC CAGAGACTGC CCATCGGCTC TTTGGAGACA TTGCATTCTA CTACTGCTCT 6120 
GATGGTTACA GCCTAGCAGA CAATTCCCAG CTTCTCTGCA ATGCCCAGGG CAAGTGGGTA 6180 
CCCCCAGAAG GTCAAGACAT GCCCOGTTGT ATAGCTCATT TCTGTGAAAA ACCTCCATCG 6240 
GTTTCCTATA GCATCTTGGA ATCTGTGAGC AAAGCAAAAT TTGCAGCTGG CTCAGTTGTG 6300 
AGCTTTAAAT GCATGGAAGG CTTTGTACTG AACACCTCAG CAAAGATTGA ATGTATGAGA 6360 
GGTGGGCAGT GGAACCCTTC CCCCATGTCC ATCCAGTGCA TCCCTGTGCG GTGTGGAGAG 6420 
CCACCAAGCA TCATGAATGG CTATGCAAGT GGATCAAACT ACAGTTTTGG AGCCATGGTG 6460 
GCTTACAGCT GCAACAAGGG GTTCTACATC AAAGGGGAAA AGAAGAGCAC CTGCGAAGCC 6540 
ACAGGGCAGT GGAGTAGTCC TATACCGACG TGCCACCCGG TATCTTGTGG TGAACCACCT 6600 
AAGGTTGAGA ATGGCTTTCT GGAGCATACA ACTGGCAGGA TCTTTGAGAG TGAAGTGAGG 6660 
TATCAGTGTA ACCCGGGCTA TAAGTCAGTC GGAAGTCCTG TATTTGTCTG CCAAGCCAAT 6720 
CGCCACTGGC ACAGTGAATC CCCTCTGATG TGTGTTCCTC TCGACTGTGG AAAACCTCCC 6780 
CCGATCCAGA ATGGCTTCAT GAAAGGAGAA AACTTTGAAG TAGGGTCCAA GGTTCAGTTT 6840 
TTCTGTAATG AGGGTTATGA GCTTGTTGGT GACAGTTCTT GGACATGTCA GAAATCTGGC 6900 
AAATGGAATA AGAAGTCAAA TCCAAAGTGC ATGCCTGCCA AGTGCCCAGA GCCGCCCCTC 6960 
TTGGAAAACC AGCTAGTATT AAAGGAGTTG ACCACCGAGG TAGGAGTTGT GACATTTTCC 7020 
TGTAAAGAAG GGCATGTCCT GCAAGGCCCC TCTGTCCTGA AATGCTTGCC ATCCCAGCAA 7080 
TGGAATGACT CTTTCCCTGT TTGTAAGATT GTTCTTTGTA CCCCACCTCC CCTAATTTCC 7140 
TTTGGTGTCC CCATTCCTTC TTCTGCTCTT CATTTTGGAA GTACTGTCAA GTATTCTTGT 7200 
GTAGGTGGGT TTTTCCTAAG AGGAAATTCT ACCACCCTCT GCCAACCTGA TGGCACCTGG 7260 
AGCTCTCCAC TGCCAGAATG TGTTCCAGTA GAATGTCCCC AACCTGAGGA AATCCCCAAT 7320 
GGAATCATTG ATGTGCAAGG CCTTGCCTAT CTCAGCACAG CTCTCTATAC CTGCAAGCCA 7380 
GGCTTTGAAT TGGTGGGAAA TACTACCACC CTTTGTGGAG AAAATGGTCA CTGGCTTGGA 7440 
GGAAAACCAA CATGTAAAGC CATTGAGTGC CTGAAACCCA AGGAGATTTT GAATGGCAAA 7500 
TTCTCTTACA CGGACCTACA CTATGGACAG ACCGTTACCT ACTCTTGCAA CCGAGGCTTT 7560 
CGGCTCGAAG GTCCCAGTGC CTTGACCTGT TTAGAGACAG GTGATTGGGA TGTA GATGCC 7620 
CCATCTTGCA ATGCCATCCA CTGTGATTCC CCACAACCCA TTGAAAATGG TTTTGTAGAA 7680 
GGTGCAGATT ACAGCTATGG TGCCATAATC ATCTACAGTT GCTTCCCTGG GTTTCAGGTG 7740 
GCTGGTCATG CCATGCAGAC CTGTGAAGAG TCAGGATGGT CAAGTTCCAT CCCAACATGT 7800 
ATGCCAATAG ACTGTGGCCT CCCTCCTCAT ATAGATTTTG GAGACTGTAC TAAACTCAAA 7860 
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GATGACCAGG GATATTTTGA GCAAGAAGAC GACATGATGG AAGTTCCATA TGTGACTCCT 7920 
CACCCTCCTT ATCATTTGGG AGCAGTGGCT AAAACCTGGG AAAATACAAA GGAGTCTCCT 7980 
GCTACACATT CATCAAACTT TCTGTATGGT ACCATGGTTT CATACACCTG TAATCCAGGA 8040 
TATGAACTTC TGGGGAACCC TGTGCTGATC TGCCAGGAAG ATGGAACTTG GAATGGCAGT 8X00 
5 GCACCATCCT GCATTTCAAT TGAATGTGAC TTGCCTACTG CTCCTGAAAA TGGCTTTTTG 8160 
CGTTTTACAG AGACTAGCAT GGGAAGTGCT OTGCAGTATA GCTGTAAACC TGGACACATT 8220 
CTAGCAGGCT CTGACTTAAG GCTTTGTCTA GAGAATAGAA AGTGGAGTGG TGCCTCCCCA 8280 
CGCTGTGAAG CCATTTCATG CAAAAAGCCA AATCCAGTCA TGAATGGATC CATCAAAGGA 8340 
AGCAACTACA CATACCTGAG CACGTTGTAC TATGAGTGTG ACCCCGGATA TGTGCTGAAT 8400 
10 GGCACTGAGA GGAGAACATG CCAGGATGAC AAAAACTGGG ATGAGGATGA GCCCATTTGC 8460 
ATTCCTGTGG ACTGCAGTTC ACCCCCAGTC TCAGCCAATG GCCAGGTGAG AGGAGACGAG 8520 
TACACATTCC AAAAAGAGAT TGAATACACT TGCAATGAAG GGTTCTTGCT TGAGGGAGCC 8S80 
AGGAGTCGGG TTTGTCTTGC CAATGGAAGT TGGAGTGGAG CCACTCCCGA CTGTGTGCCT 8640 
GTCAGATGTG CCACCCCGCC ACAACTGGCC AATGGGGTGA CGGAAGGCCT GGACTATGGC 8700 
15 TTCATGAAGG AAGTAACATT CCACTGTCAC GAGGGCTACA TCTTGCACGG TGCTCCAAAA 8760 
CTCACCTGTC AGTCAGATGG CAACTGGGAT GCAGAGATTC CTCTCTGTAA ACCAGTCAAC 8820 
TGTGGACCTC CTGAAGATCT TGCCCATGGT TTCCCTAATG GTTTTTCCTT TATTCATGGG 8880 
GGCCATATAC AGTATCAGTG CTTTCCTGGT TATAAGCTCC ATGGAAATTC ATCAAGAAGG 8940 
TGCCTCTCCA ATGGCTCCTG GAGTGGCAGC TCACCTTCCT GCCTGCCTTG CAGATGTTCC 9000 
20 ACACCAGTAA TTGAATATGG AACTGTCAAT GGGACAGATT TTGACTGTGG AAAGGCAGCC 9060 
CGGATTCAGT GCTTCAAAGG CTTCAAGCTC CTAGGACTTT CTGAAATCAC CTGTGAAGCC 9120 
GATGGCCAGT GGAGCTCTGG GTTCCCCCAC TGTGAACACA CTTCTTGTGG TTCTCTTCCA 9180 
ATGATACCAA ATGCGTTCAT CAGTGAGACC AGCTCTTGGA AGGAAAATGT GATAACTTAC 9240 
AGCTGCAGGT CTGGATATGT CATACAAGGC AGTTCAGATC TGATTTGTAC AGAGAAAGGG 9300 
25 GTATGGAGCC AGCCTTATCC AGTCTGTGAG CCCTTGTCCT GTGGGTCCCC ACCGTCTGTC 9360 
GCCAATGCAG TGGCAACTGG AGAGG CACAC ACCTATGAAA GTGAAGTGAA ACTCAGATGT 9420 
CTGGAAGGTT ATACGATGGA TACAGATACA GATACATTCA CCTGTCAGAA AGATGGTCGC 9480 
TGGTTCCCTG AGAGAATCTC CTGCAGTCCT AAAAAATGTC CTCTCCCGGA AAACATAACA 9540 
CATATACTTG TACATGGGGA CGATTTCAGT GTGAATAGGC AAGTTTCTGT GTCATGTGCA 9600 
30 GAAGGGTATA CCTTTGAGGG AGTTAACATA TCAGTATGTC AGCTTGATGG AACCTGGGAG 9660 
CCACCATTCT CCGATGAATC TTGCAGTCCA GTTTCTTGTG GGAAACCTGA AAGTCCAGAA 9720 
CATGGATTTG TGGTTGGCAG TAAATACACC TTTGAAAGCA CAATTATTTA TCAGTGTGAG 9780 
CCTGGCTATG AACTAGAGGG GAACAGGGAA CGTGTCTGCC AGGAGAACAG ACAGTGGAGT 9840 
GGAGGGGTGG CAATATGCAA AGAGACCAGG TGTGAAACTC CACTTGAATT TCTCAATGGG 9900 
35 AAAGCTGACA TTGAAAACAG GACGACTGGA CCCAACGTGG TATATTCCTG CAACAGAGGC 9960 
TACAGTCTTG AAGGGCCATC TGAGG CACAC TGCACAGAAA ATGGAACCTG GAGCCACCCA 10020 
GTCCCTCTCT GCAAACCAAA TCCATGCCCT GTTCCTTTTG TGATTCCCGA GAATGCTCTG 10080 
CTGTCTGAAA AGGAGTTTTA TGTTGATCAG AATGTGTCCA TCAAATGTAG GGAAGGTTTT 10140 
CTGCTGCAGG GCCACGGCAT CATTACCTGC AACCCCGACG AGACGTGGAC ACAGACAAGC 10200 
40 GCCAAATGTG AAAAAATCTC ATGTGGTCCA CCAGCTCACG TAGAAAATGC AATTGCTCGA 10260 
GGCGTACATT ATCAATATGG AGACATGATC ACCTACTCAT GTTACAGTGG ATACATGTTG 10320 
GAGGGTTTCC TGAGGAGTGT TTGTTTAGAA AATGGAACAT GGACATCACC TCCTATTTGC 10380 
AGAGCTGTCT GTCGATTTCC ATGTCAGAAT GGGGGCATCT GCCAACGCCC AAATGCTTGT 10440 
TCCTGTCCAG AGGGCTGGAT GGGGCGCCTC TGTGAAGAAC CAATCTGCAT TCTTCCCTGT 10500 
45 CTGAACGGAG GTCGCTGTGT GGCCCCTTAC CAGTGTGACT GCCCGCCTGG CTGGACGGGG 10560 
TCTCGCTGTC ATACAGCTGT TTGCCAGTCT CCCTGCTTAA ATGGTGGAAA ATGTGTAAGA 10620 
CCAAACCGAT GTCACTGTCT TTCTTCTTGG ACGGGACATA ACTGTTCCAG GTAA 

Seq ID NO: 430 Protein sequence 
50 Protein Accession #: FGENESH predicted 

1 11 21 31 41 51 

I I I I I I 

MWPRLAFCCW GLALVSGWAT FQQMSPSRNF SFRLPPSTAP GAPGSIPAPP APGDEAAGSR 
55 VERLGQAFRR RVRLLRELSE RLELVFLVDD SSSVGEVNFR SELMFVRKLL SDFPWPTAT 
RVAIVTPSSK NYWPRVDYI STRRARQHKC ALLLQEIPAI SYRGGGTYTK GAFQQAAQIL 
LHARENSTKV VFLITDGYSN GGDPRPIAAS LRDSGVEIFT FGIWQGNIRE LNDMASTPKE 
EHCYLLHSFE EFEALARRAXi HEDLPSGSPI QDDMVHCSYL CDEGKDCCDR KGSCKCGTHT 
GHFBCICBKG YYGKGLQYEC TACPSGTYKP EGSPGGISSC IPCPDENHTS PPGSTSPEDC 
60 VCREGYRASG QTCELVHCPA LKPPENGYFI QNTCNNHFNA ACGVRCHPGF DLVGSSIILC 
LPNGLWSGSE SYCRVRTCPH LRQPKHGHIS CSTREMLYKT TCLVACDEGY RLEGSDKLTC 
QGNSQWDGPE PRCVERHCST FQMPKDVIIS PHNCGKQPAK FGTICYVSCR QGPILSGVKE 
MLRCTTSGKH NVGVQAAVCK DVEAPQINCP KDIEAKTLEQ GDSANVTWQI PTAKDNSGEK 
VSVHVHPAFT PPYLFPIGDV AIVYTATDLS GMQASCIFHI KVIDAEPPVI DWCRSPPPVQ 
65 VSEKVHAASW DEPQFSDNSG AELVITRSHT QGDLFPQGET IVQYTATDPS GNNRTCDIHI 
VIKGSPCEIP FTPVNGDFIC TPDNTGVNCT LTCLEGYDPT EGSTDKYYCA YEDGVWKPTY 
TTEWPDCAKK RFANHGFKSF EMFYKAARCD DTDLMKKFSE AFETTLGKMV PSFCSDAEDI 
DCRLEEOTjTK KYCLEYNYDY ENGFAIGPGG WGAANRLDYS YDDFLDTVQE TATSIGNAKS 
SRIKRSAPLS DYKIKLIFNI TASVPLPDER NDTLEWENQQ RLLQTLETIT NKLKRTLNKD 
70 PMYSFQLASE ILIADSNSLE TKKASPFCRP GSVLRGRMCV NCPLGTYYNIi BHFTCESCRI 
GSYQDEEGQL ECKXCPSGMY TEYIRSRNIS DCKAQCKQGT YSYSGLETCE SCPLGTYQPK 
FGSRSCLSCP ENTSTVKRGA VNISACGVPC PEGKFSRSGL MPCHPCPRDY YQPNAGKAFC 
LACPFYGTTP FAGSRSITEC STSVLNITIF GGFGHLELLN CPSEVFHECF FNPCHNSGTC 
QQLGRGYVCL CPLGYTGLKC ETDIDECSPL PCLNNGVCKD LVGEFICECP SGYTGQRCEE 
75 NINECSSSPC LNKGICVDGV AGYRCTCVKG FVGLHCETEV NECQSNPCLN NAVCEDQVGG 
FLCKCPPGFL GTRCGKNVDB CLSQPCKNGA TCKDGANSFR CLCAAGPTGS RCELNINECQ 
SNPCRNQATC VDELNSYSCK CQPGFSGKRC ETEQSTGFNL DFEVSGIYGY VMLDGMLPSL 
HALTCTFWMK SSDDMNYGTP ISYAVBNGSD NTLLLTDYNG WVLYVNGREK ITNCPSVNDG 
RWHHIAITWT SANGIWKVYI DGKLSDGGAG LSVGLPIPGG GALVLGQEQD KKGEGFSPAE 
80 SFVGSISQLN LWDYVLSPQQ VKSLATSCPE ELSKGNVLAW PDFLSGIVGK VKIDSKSIFC 
SDCPRLGGSV PHLRTASEDL KPGSKVNLFC DPGFQLVGNP VQYCLNQGQW TQPLPHCERI 
SCGVPPPLEN GFHSADDFYA GSTVTYQCNN GYYbLGDSRM FCTDNGSWNG VSPSCLDVDE 
CAVGSDCSEH ASCLNVDGSY ICSCVPPYTG DGKNCAEPIK CKAPGNPENG HSSGEIYTVG 
AGVTFSCQEG YQLMGVTKIT CLESGEWNHL IPYCKAVSCG KPAIPENGCI EELAFTFGSK 
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VTYRCNKGYT LAGDKESSCL ANSSWSHSPP VCEPVKCSSP ENINNGKYIL SGLTYLSTAS 1920 
YSCDTGYSLQ GPSIIECTAS GIWDRAPPAC HLVFCGEPPA IKDAVITGNN FTFRNTVTYT 1980 
CKEGYTLAGL DTIECLADGK WSRSDQQCLA VSCDEPPIVD HASPETAHRL FGDIAFYYCS 2040 
DGYSLADNSQ LLCNAQGKWV PPEGQDMPRC IAHFCBKPPS VSYSILESVS KAKFAAGSW 2100 
SFKCMEGFVL NTSAKIECMR GGQWNPSPMS IQCIPVRCGB PPSIMNGYAS GSNYSFGAMV 2160 
AYSCNKGFYI KGEKKSTCEA TGQWSSPIPT CHPVSOGEPP KVENGPLEHT TGRIFESEVR 2220 
YQCNPGYKSV GSPVFVCQAN RHWHSESPLM CVPLDCGKPP PIQNGFMKGE NFEVGSKVQF 2280 
FCNEGYELVG DSSWTCQKSG KWNKKSNPKC MPAKCPEPPL LENQLVLKEL TTEVGWTFS 2340 
CKEGHVLQQP SVLKCLPSQQ WNDSFPVCKI VLCTPPPLIS FGVPIPSSAL HFGSTVKYSC 2400 
VGGFFLRGNS TTLCQPDGTW SSPLPECVPV ECPQPEEIPN GIIDVQGIAY LSTALYTCKP 2460 
GFELVGNTTT LCGENGHWLG GKPTCKAIEC LKPKEILNGK FSYTDLHYGQ TVTYSCNRGF 2520 
RLEGPSALTC LETGDWDVDA PSCNAIHCDS PQPIENGFVE GADYSYGAI I IYSCFPGFQV 2580 
AGHAMQTCEE SGWSSSIPTC MPIDCGLPPH IDFGDCTKLK DDQGYFEQED DMMEVPYVTP 2640 
HPPYHLGAVA KTWENTKESP ATHSSNFLYG THVSYTCNPG YBLLGNPVtil CQEDGTWNGS 2700 
APSCISIECD LPTAPENGFL RFTETSMGSA VQYSCKPGHI LAGSDLRLCL ENRKWSGASP 2760 
RCEAISCKKP NPVMNGSIKG SNYTYL.STLY YECDPGYVliN GTERRTCQDD KNWDEDEPIC 2820 
IPVDCSSPPV SANGQVRGDE YTFQKEIEYT CNEGFLLEGA RSRVCLANGS WSGATPDCVP 2880 
VRCATPPQLA NGVTEGLDYG FMKEVTFHCH EGYILHGAPK LTCQSDGNWD AEIPLCKPVN 2940 
CGPPEDLAHG FPNGFSFIHG GHIQYQCFPG YKLHGNSSRR CLSNGSWSGS SPSCLPCRCS 3000 
TPVIEYGTVN GTDFDCGKAA RIQCFKGFKL LGLSEITCEA DGQWSSGFPH CEHTSCGSLP 3060 
MIPNAFISET SSWKENVITY SCRSGYVIQG SSDLICTEKG VWSQPYPVCE PLSCGSPPSV 3120 
ANAVATGEAH TYESEVKLRC LEGYTMDTDT DTPTCQKDGR WFPERISCSP KKCPLPENIT 3180 
HILVHGDDFS VNRQVSVSCA EGYTFEGVNI SVCQLDGTVIE PPFSDESCSP VSCGKPESPE 3240 
HGFWGSKYT FESTIIYQCE PGYELEGNRE RVCQENRQWS GGVAICKETR CETPLEFLNG 3300 
KADIENRTTG PNWYSCNRG YSLEGPSEAH CTEKGTWSHP VPLCKPNPCP VPFVIPENAL 3360 
LSEKEFYVDQ NVSIKCREGF LLQGHGIITC NPDETWTQTS AKCEKISCGP PAHVENAIAR 3420 
GVHYQYGDMI TYSCYSGYML EGFLRSVCIiE NGTWTSPPIC RAVCRPPCQN GGICQRPNAC 3480 
SCPEGWMGRL CEEPICILPC LNGGRCVAPY QCDCPPGWTG SRCHTAVCQS PCLNGGKCVR 3540 
PNRCHCLSSW TGHNCSR 



Seq ID NO: 431 DMA sequence 

Nucleic Acid Accession #s FGENESH predicted 

Coding sequence t 1..390 

1 11 21 31 41 51 

I I I I I 1 

ATGAGGTTCA GTGTCTCAGG CATGAGGACC GACTACCCCA GGAGTGTGCT GGCTCCTGCT 60 
TATGTGTCAG TCTGTCTCCT CCTCTTGTGT CCAAGGGAAG TCATCGCTCC CGCTGGCTCA 120 
GAACCATGGC TGTGCCAGCC GGCACCCAGG TGTGGAGACA AGATCTACAA CCCCTTGGAG 180 
CAGTGCTGTT ACAATGACGC CATCGTGTCC CTGAGCGAGA CCCGCCAATG TGGTCCCCCC 240 
TGCACCTTCT GGCCCTGCTT TGAGCTCTGC TGTCTTGATT CCTTTGGCCT CACAAACGAT 300 
TTTGTTGTGA AGCTGAAGGT TCAGGGTGTG AATTCCCAGT GCCACTCATC TCCCATCTCC 360 
AGTAAATGTG AAAGAGGCCG GATATGTTAG 



Seq ID NO: 432 Protein sequence 
Protein Accession #: FGENESH predicted 



1 11 21 31 41 51 

MRFSVSGMRT DYPRSVLAPA YVSVCLLLLC PREVIAPAGS EPWLCQPAPR CGDKIYNPLE 60 
QCCYNDAIVS LSETRQCGPP CTFWPCFELC CLDSFGLTND FWKLKVQGV NSQCHSSPIS 120 
SKCERGRIC 



Seq ID NO: 433 DNA sequence 

Nucleic Acid Accession #: NM_007231.l 

Coding sequence : 8 9 . . 2 0 17 

1 11 21 31 41 51 

TAGGAACAGG GGAGAGTGCA CCTGCTACCA GTCAAGCTCA GCCAGACTGC AAGAGGAGGC 60 

GAGGCGGAGC CAGCCGAGGG AGTGAACCAT GGACAAGTTG AAATGCCCGA GTTTCTTCAA 120 

GTGCAGGGAG AAGGAGAAAG TGTCGGCTTC ATCAGAGAAT TTCCATGTTG GTGAAAATGA 180 

TGAGAATCAG GACCGTGGTA ACTGGTCCAA AAAATCGGAT TATCTTCTAT CTATGATTGG 240 

ATACGCAGTG GGATTAGGAA ATGTGTGGAG ATTTCCATAT CTGACCTACA GCAATGGTQG 300 

AGGCGCCTTC TTGATACCTT ATGCAATTAT GTTAGCATTG GCTGGTTTAC CTTTGTTCTT 360 

TCTGGAGTGT TCACTGGGAC AATTTGCTAG CTTAGGTCCA GTTTCAGTTT GGAGGATTCT 420 

TCCATTGTTT CAAGGTGTGG GAATTACAAT GGTCCTGATC TCCATTTTTG TGACAATCTA 480 

TTACAATGTC ATAATTGCCT ATAGTCTTTA CTACATGTTT GCTTCTTTTC AAAGTGAACT 540 

ACCATGGAAA AATTGTTCTT CGTGGTCAGA TAAAAACTGT AGCAGATCAC CAATAGTAAC 600 

TCACTGTAAT GTGAGTACAG TGAATAAAGG AATACAAGAG ATCATCCAAA TGAATAAAAG 660 

CTGGGTAGAC ATCAACAATT TTACCTGCAT CAACGGCAGT GAAATTTATC AGCCAGGGCA 720 

GCTTCCCAGT GAACAATATT GGAATAAAGT GGCGCTCCAA CGGTCAAGTG GAATGAATGA 780 

GACTGGAGTA ATTGTTTGGT ATTTAGCACT TTGTCTTCTT CTGGCTTGGC TCATAGTTGG 840 

AGCAGCACTA TTTAAAGGAA TCAAATCGTC TGGCAAGGTG GTATATTTTA CAGCTCTTTT 900 

CCCCTATGTG GTCCTACTCA TCCTGTTAGT ACGAGGTGCA ACTCTGGAGG GTGCTTCAAA 960 

AGGCATTTCA TACTATATTG GAGCCCAGTC AAATTTTACA AAACTTAAGG AAGCTGAGGT 1020 

ATGGAAAGAT GCTGCCACTC AGATATTTTA CTCCCTTTCA GTGGCTTGGG GTGGCTTAGT 1080 

TGCTCTATCA TCTTACAATA AGTTCAAAAA CAACTGCTTC TCTGATGCCA TTGTGGTTTG 1140 

TTTGACAAAC TGTCTCACTA GCGTGTTTGC TGGATTTGCT ATTTTTTCTA TATTGGGACA 1200 

CATGGCCCAT ATATCTGGAA AGGAAGTTTC TCAAGTTGTA AAATCAGGTT TTGATTTGGC 1260 

ATTCATTGCC TATCCAGAGG CTCTAGCCCA ACTCCCAGGT GGTCCATTTT GGTCCATATT 1320 

ATTTTTTTTC ATGCTTTTAA CTTTGGGTCT CGATTCTCAG TTTGCTTCGA TTGAAACGAT 1380 

CACAACAACA ATTCAAGATT TATTTCCCAA AGTGATGAAG AAAATGAGGG TTCCCATAAC 1440 

TTTGGGCTGC TGCTTGGTTT TGTTTCTCCT TGGTCTCGTC TGTGTGACTC AGGCTGGAAT 1500 

1125 



WO 03/042661 



PCT/US02/36810 



TTACTGGGTT CATCTGATTG ACCACTTCTG TGCTGGATGG GGCATTTTAA TTGCAGCTAT 1560 
ACTGGAGCTA GTTGGAATCA TCTGGATTTA TGGAGGGAAC AQATTCATTG AGGATACAGA 1620 
AATGATGATT GGAGCAAAGA GGTGGATATT CTGGCTATGG TGGAGAGCTT GCTGGTTTGT 16 BO 
AATTACGCCT ATCCTTTTGA TTGCAATATT TATCTGGTCA TTGGTGCAAT TTCATAGACC 1740 
5 TAATTATGGC GCAATTCCAT ACCCTGACTG GGGAGTTGCT TTAGGCTGGT GTATGATTGT 1800 
TTTCTGCATT ATTTGGATAC CAATTATGGC TATCATAAAA ATAATTCAGG CTAAAGGAAA 1860 
CATCTTTCAA CGCCTTATAA GTTGCTGCAG ACCAGCTTCT AACTGGGGTC CATACCTGGA 1920 
ACAACATCGT GGGGAAAGAT ATAAAGACAT GGTAGATCCT AAAAAAGAGG CTGACCATGA 1980 
AATACCTACT GTTAGTGGCA GCAGAAAACC GGAATGAGAT CTCATTGAAA AAAATATATG 2040 
10 ATTGTATAAT GTGATTTTTT TTAGAATAGG GGGAACCTTA TTTATTTGTG TGTTAACTGA 2100 
ATAGGAAAAT GT A CAT ACT A TGTTCATGAT AGTGTGATTT TTTTCACATT TAAGCAGGAA 2160 
TGCAATATAA AAATGTGAAT CTCTTAATTC TCAGCCATGT GCTTATTATA TTTCTTTTTA 2220 
GATTGTCTAT CTGTATAACA CACACACACA CACCTAAGAG TCTCTATTTC ACAATTATAT 2280 
TTTTGTAAAT AGTATATGCA TTTTTAATAC ATTGGAGGCT TTATTTTGAA CTAATTTCTT 2340 
15 AGAGAATAGT TATATTTTCT ATTACACAAG TTTAAAAATA TTATTAACTT GTATTTTCTT 2400 
AATATACAAT CTATCTTTTC CACAAATATG AGTGGGAAAT AAATCAGCAC ATTTGAAAGA 2460 
AAGTGTTAAA ACTGAAGGCC TCACTTAATT AGAAACGTGA TAAATATATG GACAAATGGA 2S20 
CTATACATAC TATAAGAGGA CTGTAGTTTA ATACTTTTTA CCCAAATATG TTTAAAAACA 2580 
TCGTGCATTT GTTACAGCTC ATGTTTTCTA TATGAACTTA GTCATTAATG TTCTTTATAA 2640 
20 AAAGTGAAAT AAGATGGAAA AATTAGGATC CTACAGCCAG TACGTGATAA ATCTAGAAAA 2700 
TTGAGTTTTG AGTACCTCTT TTCCCATATA CAATCTTCCT TCCTTAGGTA ATTTGGAAGA 2760 
AAACTATGAC CCATTTAATT TCTATTGTGT TTCACCAAAT TCAGTGTTGT TCATTATACC 2820 
TCTCTGAAAT ATAGGTTTAA TTTCAAATAG AATATGGACT TAAATGTTAA TGAGAAACTG 2880 
GCTTTAATCA ATTCTAGCAT TTTATTACTG TAATACAGGG CTGATAGAGT GATTTTGTCT 2940 
25 TATATGAGTC AGTTACTACT TACAGGTGAT AACTTGCATA CTATTGGAAG ATAAAGTTGT 3000 
CAAACTTGTC AAGAATGAGA AAAGCCAAAT TAGAAAATCC TATGTCCTAG TTTCCTTACC 3060 
AAGGATAATT AAATATATCA CTAAGAGCTT TATATATTGA TTATATATTG TTGACAACTG 3120 
GTTTAAGCAT CATAGCCTAT GATGATAAAC ACTGCCTATA TATGTAAATA GCTTTTCATC 3180 
AATTCTTAAA TTTCTTAACC TAGGCTTCAG GGAGCATATG AAACCAAAAT TATATGGAAC 3240 
30 ATTTTCTGTG TGTACATGTA CATGCATTTT TCTAGGGAGA GAGTCCGTAG GTTTATCAGA 3300 
ATATCAAGGA AAACTGTGAC CCAAAGAAGT TTAAGAATCA CATACAGTGC TGCTGGCTTT 3360 
TTGTGCTTGG CAAATGAGTG ACAATAGAAG AAATAATTTT TCTTACACAT TTTA AAACGT 3420 
TTTCTCTTCC TTGTGATTGA AGATGAAAGG AGTAAGAAAT TAAGGCATTT GTTTAATTTA 3480 
TACTGGTAAC TTATTTAGGG GGGAGGGGAC ATGAAGGTAG GTAAATAGGT AGGCCTCTAA 3540 
35 TTGAACCACC TCTCTAAGTT ATGTACGTAT ATATAAGCTG AAATTGTGTT TGACATTCTG 3600 
AGGGTTTTCT TTTTCTTTTT CCTTTTTTTT TTTTTTTGGT GGGGGGCTGG GGGTCAGAGT 3660 
CTTGTTCTGT TGCCTGGGCT GGAGTGCAGT GGCATGATCT CAGCTCACTG CAACCTCTGC 3720 
CTTCTGGATT CAAGTGATTC TCCTGCCTCA GCCTCTTGAG TAGCTGGGAC TACAGGTGCC 3780 
CGCCACCACA CCAGCTAATT TTTGTATTTT TAGTAGAGGC GAAGTTTCCC CATGTTGGCC 3840 
40 AGGCTGGTCT TGAACTCCCG ACCTCAAGTG ATCTGTCTAC CTCGGCCTCC TAAAGTGCTG 3900 
AGATTACAGG TGTGAGCCAC CGTGCCCGGC CCATTCTAAG GGTTTTCTTT GA AGACA GGT 3960 
CAAATGCTGT TAGTAAGTTT CAGGAGATTG TTAATTCCTC AGTTATACCA GATTTTATAA 4020 
AATATTTGAG AATAGATGGC TAACAAGAGG TTAGAAATAC TTTTCCTTAA TTTT AATC CA 4080 
CAGTATGTTA CATGCATTCT ACCACTACAT TTTGGTGCTA TTTAAGGTGT GCAATTTTCT 4140 
45 ATAGGTGACT TTTGCAATTC AGGGAAGATT TGGGCATATT AAATGAAAGA ATATCTAATT 4200 
GGGGGAGGTG TGAAGGGAAA GAAATTCTTT TCAAAAGCTG ACCACAAAGA GTAGTTAAAA 4260 
GTTTTTGTCA CTATCTTCAC AAGTGTGTAA AGCACAGATT TCAACAGAGT GCTTGGCATA 4320 
TTGTAGGGTG CTCAATGGTG GTTTTTATTA TTATTACTCA QATTCCACAG TGGCAAGAAA 4380 
CATCATTCTA CATAATGGAA AACATTTACA TCAAATCCCA CTTACTTTAA TGCGAACTTG 4440 
50 GAGATAATTT ATGGTATTGT ATTGTAAACC ATTAATGAAA ACTTTTTCAC AGTTGAGTGA 4500 
AATTAAAATC ACTATATCTC 



55 



Seq ID NO: 434 Protein sequence 
Protein Accession #: np_009162.1 



L 11 21 31 41 51 

MDKLKCPSFF KCREKEKVSA SSENFHVGEN DENODRGNWS KKSDYLLSMI GYAVGLGNVW 60 

RPPYLTYSNG GGAFLIPYAI MLALAGLPLF FLECSLGQFA SLGPVSVWRI LPIiFQGVGIT 120 

60 MVLISIFVTI YYNVI IAYSL YYMFASFQSE LPWKNCSSWS DKNCSRSPIV THCNVSTVNK 180 

GIQEIIQMNK SWVDINNFTC INGSBIYQPG QLPSEQYWNK VAUQRSSGMN ETGVIVWYLA 240 

LCLLLAWLIV GAALFKGIKS SGKWYFTAL FPYWLLILL VRGATLEGAS KGISYYIGAQ 300 

SNFTKLKEAE VWKDAATQIF YSLSVAWGGL VALSSYNKFK NNCFSDAIW CLTNCLTSVF 360 

AGPAIFSILG HMAHISGKBV SQWKSGFDL AFIAYPEALA QLPGGPFWSI LFFFMLLTLG 420 

65 LDSQFASIET ITTTIQDLFP KVMKKMRVPI TLGCCLVLFL IX3LVCVTQAG IYWVHLIDHF 480 

CAGWGILIAA ILELVGIIWI YGGNRPIEDT EMMIGAKRWI FWLWWRACWF VITPILLIAI 540 

FIWSLVQFHR PNYGAIPYPD WGVALGWCMI VFC1IWIPIM AIIKIIQAKG NIFQRLISCC 600 
RPASNWGPYL EQHRGERYKD MVDPKKEADH EIPTVSGSRK PE 

70 Seq ID NO: 435 DNA sequence 

Nucleic Acid Accession #: M1B72B.1 
Coding sequence: 51.- 1085 

1 ii 21 31 41 51 

75 | i I I I I 

GGAGCTCAAG CTCCTCTACA AAGAGGTGGA CAGAGAAGAC AGCAGAGACC ATGGGACCCC 60 

CCTCAGCCCC TCCCTGCAGA TTGCATGTCC CCTGGAAGGA GGTCCTGCTC ACAGCCTCAC 120 

TTCTAACCTT CTGGAACCCA CCCACCACTG CCAAGCTCAC TATTGAATCC ACGCCATTCA 180 

ATGTCGCAGA GGGGAAGGAG GTTCTTCTAC TCGCCCACAA CCTGCCCCAG AATCGTATTG 240 

80 GTTACAGCTG GTACAAAGGC GAAAGAGTGG ATGGCAACAG TCTAATTGTA GGATATGTAA 300 

TAGGAACTCA ACAAGCTACC CCAGGGCCCG CATACAGTGG TCGAGAGACA ATATACCCCA 360 

ATGCATCCCT GCTGATCCAG AACGTCACCC AGAATGACAC AGGATTCTAT ACCCTACAAG 420 

TCATAAAGTC AGATCTTGTG AATGAAGAAG CAACCGGACA GTTCCATGTA TACCCGGAGC 480 

TGCCCAAGCC CTCCATCTCC AGCAACAACT CCAACCCCGT GGAGGACAAG GATGCTGTGG 540 
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CCTTCACCTG 
GCCTCCCGGT 
GCGTCAAAAG 
ACCGCAGTGA 
CCTCAAAGGC 
ACCCACCTGC 
TCTTTATCCC 
CAGCCACTGG 
TCCTCTCAGC 
TATAGCAGCC 
GAATTCTTCT 
CTCTGCTCCT 
ACCCTCAGGC 
GCAAACCATG 
AACAAGACTC 
TGCCTCTTTC 
GGGTAACTTA 
AGATCCTTTA 
AAATGTACAG 
TTTAATTCAA 
GGAGGAGTCT 
GCTGAGACTA 
CTGACTCATT 
CTCTTGGTAT 
CTCTAAAAGC 
GGCTGGAATT 
ATAAAAGCCC 
TCTCACCTAG 
GTTAAGGAAG 
ACACAGGAGA 
TTTACAAAAA 



TGAACCTGAG 
CAGTCCCAGG 
GAACGATGCA 
CCCAGTCACC 
CAATTACCGT 
ACAGTACTCT 
CAACATCACT 
CCTCAATAGG 
TGTGGCCACC 
CTGGTGTATT 
AGCTCCTCCA 
GAAGCCCTAT 
CTGAGGTGTG 
GTGAGAAATT 
CTCATCATGA 
GCTTGGCAGG 
ACAGAGTGTC 
GTGCACCCAG 
TGGTCCTTTT 
CCCAGCCATG 
GTGCAGTTTC 
AGTTGTAGAA 
CTTTATTCTA 
TACCCTCCTA 
TTTAAATGTC 
ACAAAACTCA 
CAAATGGTGG 
GTGAGCGCAT 
AAGATAGATC 
TTCCAGTCTA 
AGTAACCTGA 
TTCCAATTTG 
ACTTGTAGAG 
ACT 



GTTCAGAACA 
CTGCAGCTGT 
GGATCCTATG 
CTGAATGTCC 
CCAGGGGAAA 
TGGTTTATCA 
GTGAATAATA 
ACCACAGTCA 
GTCGGCATCA 
TTCGATATTT 
ATCCCATTTT 
ATGCTGGAGA 
TGCCACTCAG 
GACGACTTCA 
TAAGGCTCTT 
ATGATGCTGT 
AGATCTATCT 
TGACTGACAT 
CAGAGTTGGA 
CAATGCCAAA 
TGACACTTGT 
ATTAACAAAT 
TTTTAGTTGG 
ATAGTCATAC 
TGCATGCAGC 
GAGAAATGTG 
TAACTGATAA 
TGAGCCAGTG 
CAATTAAAAA 
CTTGAGTTAG 
ACTAATCTGA 
ACAAAACCCA 
TGGTGCTGCT 



CAACCTACCT 
CCAATGGCAA 
AATGTGAAAT 
TCTATGGCCC 
ATCTGAACCT 
ATGGGACGTT 
GCGGATCCTA 
CGATGATCAC 
CGATTGGAGT 
CAGGAAGACT 
ATCCCATGGA 
TGGACAACTC 
AGACTTCACC 
CACTATGGAC 
ACCCCCTTTT 
CATTAGTATT 
TGTCAATCCC 
TAGCAGCATC 
CTTCTAGACT 
TAATAGAATT 
TGTTGAACAT 
GTGCTGCTTG 
TTTGTATCTT 
TAGTAGTCAT 
CAGCCATCAA 
TCATCAGGAG 
TAGCACTAAT 
GTGCTAAATG 
AAATTAAAAC 
CATAATACAG 
TGTTAACCAA 
CTGTTCTTGT 
TTAATTCATA 



GTGGTGGGTA 
CATGACCCTC 
ACAGAACCCA 
AGATGTCCCC 
CTCCTGCCAC 
CCAGCAATCC 
TATGTGCCAA 
AGTCTCTGGA 
GCTGGCCAGG 
GGCAGATTGG 
ACCACTAAAA 
AATGAAAATT 
TAACTAGAGA 
AGCTTTTCCC 
AATTTGTCCT 
TCACAAGAAG 
AACGTTTTAC 
TTTAACACAG 
CACCTGTTCT 
GCTCCCTACC 
GGCTAAATAC 
GTTAAAATGG 
GCCTAAGGTG 
ACTCCCTGGT 
ATAGTGAATG 
AACATCATAA 
GCTTTAAGAT 
CTACATACTC 
CAATTTAAAA 
AAGTCCCCTC 
TGTATTTATT 
ATTGTATTGC 
AATCACAAAT 



AATGGTCAGA 
ACTCTACTCA 
GCGAGTGCCA 
ACCATTTCCC 
GCAGCCTCTA 
ACACAAGAGC 
GCCCATAACT 
AGTGCTCCTG 
GTGGCTCTGA 
ACCAGACCCT 
ACAAGGTCTG 
TAAAGGGAAA 
CAGTCAAACT 
AAGATGTCAA 
TGCTTATGCC 
TAGCTTCAGA 
ATAAAATAAG 
CCGTGTGTTC 
CACTCCCTGT 
AGCTGAACAG 
AATGGGTATC 
CTACACTCAT 
CGTAGTCCAA 
GTAGTGTATT 
GTCTCTCTTT 
CCCATGAAGG 
TTGGTCACAC 
CAACTGAAAT 
AAAAAAAAGA 
TACTTTAACT 
TCTGTGGTTC 
CCAGGGGGAG 
AAAAGCCAAT 



600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 



Seq ID NO: 436 Protein sequence 
Protein Accession #t AAA59907.1 

1 ll 21 31 41 51 

I I I I I I 

MGPPSAPPCR LHVPWKEVLL TASLLTFWNP PTTAKLTIES TPFNVAEGKE VLLLAHNLPQ 
NRIGYSWYKG ERVDGNSLIV GYVIGTQQAT PGPAYSGRET IYPNASLLIQ NVTQNDTGPY 
TUQVIKSDLV NEEATGQFHV YPELPKPSIS SNNSNPVEDK DAVAPTCEPB VQNTTYLWWV 
NGQSLPVSPR LQLSNGNMTL TTiLSVKKNDA GSYECEIQNP ASANRSDPVT LNVLYGPDVP 
TISPSKANYR PGENLNLSCH AASNPPAQYS WFINGTFQQS TQELPIPNIT VNNSGSYMCQ 
AHNSATGLNR TTVTMITVSG SAPVLSAVAT VGITIGVLAR VALI 

Seq ID NO: 437 DNA sequence 
Nucleic Acid Accession #: M18728.1 
Coding sequence: 1355.. 1657 



60 
120 
180 
240 
300 



GGAGCTCAAG 
OCTCAGCCCC 
TTCTAACCTT 
ATGTCGCAGA 
GTTACAGCTG 
TAGGAACTCA 
ATGCATCCCT 
TCATAAAGTC 
TGCCCAAGCC 
CCTTCACCTG 
GCCTCCCGGT 
GCGTCAAAAG 
ACCGCAGTGA 
CCTCAAAGGC 
ACCCACCTGC 
TCTTTATCCC 
CAGCCACTGG 
TCCTCTCAGC 
TATAGCAGCC 
GAATTCTTCT 
CTCTGCTCCT 
ACCCTCAGGC 
GCAAACCATG 
AACAAGACTC 
TGCCTCTTTC 
GGGTAACTTA 
AGATCCTTTA 
AAATGTACAG 
TTTAATTCAA 
GGAGGAGTCT 
GCTGAGACTA 



11 
I 

CTCCTCTACA 
TCCCTGCAGA 
CTGGAACCCA 
GGGGAAGGAG 
GTACAAAGGC 
ACAAGCTACC 
GCTGATCCAG 
AGATCTTGTG 
CTCCATCTCC 
TGAACCTGAG 
CAGTCCCAGG 
GAACGATGCA 
CCCAGTCACC 
CAATTACCGT 
ACAGTACTCT 
CAACATCACT 
CCTCAATAGG 
TGTGGCCACC 
CTGGTGTATT 
AGCTCCTCCA 
GAAGCCCTAT 
CTGAGGTGTG 
GTGAGAAATT 
CTCATCATGA 
GCTTGGCAGG 
ACAGAGTGTC 
GTGCACCCAG 
TGGTCCTTTT 
CCCAGCCATG 
GTGCAGTTTC 
AGTTGTAGAA 



21 
I 

AAGAGGTGGA 
TTGCATGTCC 
CCCACCACTG 
GTTCTTCTAC 
GAAAGAGTGG 
CCAGGGCCCG 
AACGTCACCC 
AATGAAGAAG 
AGCAACAACT 
GTTCAGAACA 
CTGCAGCTGT 
GGATCCTATG 
CTGAATGTCC 
CCAGGGGAAA 
TGGTTTATCA 
GTGAATAATA 
ACCACAGTCA 
GTCGGCATCA 
TTCGATATTT 
ATCCCATTTT 
ATGCTGGAGA 
TGCCACTCAG 
GACGACTTCA 
TAAGGCTCTT 
ATGATGCTGT 
AGATCTATCT 
TGACTGACAT 
CAGAGTTGGA 
CAATGCCAAA 
TGACACTTGT 
ATTAACAAAT 



31 
I 

CAGAGAAGAC 
CCTGGAAGGA 
CCAAGCTCAC 
TCGCCCACAA 
ATGGCAACAG 
CATACAGTGG 
AGAATGACAC 
CAACCGGACA 
CCAACCCCGT 
CAACCTACCT 
CCAATGGCAA 
AATGTGAAAT 
TCTATGGCCC 
ATCTGAACCT 
ATGGGACGTT 
GCGGATCCTA 
CGATGATCAC 
CGATTGGAGT 
CAGGAAGACT 
ATCCCATGGA 
TGGACAACTC 
AGACTTCACC 
CACTATGGAC 
ACCCCCTTTT 
CATTAGTATT 
TGTCAATCCC 
TAGCAGCATC 
CTTCTAGACT 
TAATAGAATT 
TGTTGAACAT 
GTGCTGCTTG 



41 
I 

AGCAGAGACC 
GGTCCTGCTC 
TATTGAATCC 
CCTGCCCCAG 
TCTAATTGTA 
TCGAGAGACA 
AGGATTCTAT 
GTTCCATGTA 
GGAGGACAAG 
GTGGTGGGTA 
CATGACCCTC 
ACAGAACCCA 
AGATGTCCCC 
CTCCTGCCAC 
CCAGCAATCC 
TATGTGCCAA 
AGTCTCTGGA 
GCTGGCCAGG 
GGCAGATTGG 
ACCACTAAAA 
AATGAAAATT 
TAACTAGAGA 
AGCTTTTCCC 
AATTTGTCCT 
TCACAAGAAG 
AACGTTTTAC 
TTTAACACAG 
CACCTGTTCT 
GCTCCCTACC 
GGCTAAATAC 
GTTAAAATGG 



51 
I 

ATGGGACCCC 
ACAGCCTCAC 
ACGCCATTCA 
AATCGTATTG 
GGATATGTAA 
ATATACCCCA 
ACCCTACAAG 
TACCCGGAGC 
GATGCTGTGG 
AATGGTCAGA 
ACTCTACTCA 
GCGAGTGCCA 
ACCATTTCCC 
GCAGCCTCTA 
ACACAAGAGC 
GCCCATAACT 
AGTGCTCCTG 
GTGGCTCTGA 
ACCAGACCCT 
ACAAGGTCTG 
TAAAGGGAAA 
CAGTCAAACT 
AAGATGTCAA 
TGCTTATGCC 
TAGCTTCAGA 
ATAAAATAAG 
CCGTGTGTTC 
CACTCCCTGT 
AGCTGAACAG 
AATGGGTATC 
CTACACTCAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
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CTGACTCATT CTTTATTCTA TTTTAGTTGG TTTGTATCTT GCCTAAGGTG CGTAGTCCAA 1920 

CTCTTGGTAT TACCCTCCTA ATAGTCATAC TAGTAGTCAT ACTCCCTGGT GTAGTGTATT 1980 

CTCTAAAAGC TTTAAATGTC TGCATGCAGC CAGCCATCAA ATAGTGAATG GTCTCTCTTT 2040 

GGCTGGAATT ACAAAACTCA GAGAAATGTG TCATCAGGAG AACATCATAA CCCATGAAGG 2100 

ATAAAAGCCC CAAATGGTGG TAACTGATAA TAGCACTAAT GCTTTAAGAT TTGGTCACAC 2160 

TCTCACCTAG GTGAGCGCAT TGAGCCAQTG GTGCTAAATG CTACATACTC CAACTGAAAT 2220 

GTTAAGGAAG AAGATAGATC CAATTAAAAA AAATTAAAAC CAATTTAAAA AAAAAAAAGA 22 BO 

ACACAGGAGA TTCCAGTCTA CTTGAGTTAG CATAATACAG AAGTCCCCTC TACTTTAACT 2340 

T TTACA AAAA AGTAACCTGA ACTAATCTGA TGTTAACCAA TGTATTTATT TCTGTGGTTC 2400 

TGTTTCCTTG TTCCAATTTG ACAAAACCCA CTGTTCTTGT ATTGTATTGC CCAGGGGGAG 2460 

CTATCACTGT ACTTGTAGAG TGGTGCTQCT TTAATTCATA AATCACAAAT AAAAGCCAAT 2520 
TAGCTCTATA ACT 

Seq ID NO: 438 Protein sequence 
Protein Accession #: AAA5 9908.1 

1 11 21 31 41 51 

I I I I I I 

MDSFSQDVKT RLLIMIRLLP PFNLSLLMPA SFAWQDDAVI SISQEVASEG NLTEOQIYLV 60 
NPNVLHKIRD PLVHPVTDIS SIFNTAVCSN VQWSFSELDP 

Seq ID NO: 439 DNA sequence 
Nucleic Acid Accession #: M18728.1 
Coding sequence: 2370.. 2501 

1 11 21 31 41 51 

I I I I I I 

GGAGCTCAAG CTCCTCTACA AAGAGGTGGA CAGAGAAGAC AGCAGAGACC ATGGGACCCC 60 

CCTCAGCCCC TCCCTGCAGA TTGCATGTCC CCTGGAAGGA GGTCCTGCTC ACAGCCTCAC 120 

TTCTAACCTT CTGGAACCCA CCCACCACTG CCAAGCTCAC TATTGAATCC ACGCCATTCA 180 

ATGTCGCAGA GGGGAAGGAG GTTCTTCTAC TCGCCCACAA CCTGCCCCAG AATCGTATTG 240 

GTTACAGCTG GTACAAAGGC GAAAGAGTGG ATGGCAACAG TCTAATTGTA GGATATGTAA 300 

TAGGAACTCA ACAAGCTACC CCAGGGCCCG CATACAGTGG TCGAGAGACA ATATACCCCA 3 SO 

ATGCATCCCT GCTGATCCAG AACGTCACCC AGAATGACAC AGGATTCTAT ACCCTACAAG 420 

TCATAAAGTC AGATCTTGTG AATGAAGAAG CAACCGGACA GTTCCATGTA TACCCGGAGC 480 

TGCCCAAGCC CTCCATCTCC AGCAACAACT CCAACCCCGT GGAGGACAAG GATGCTGTGG S40 

CCTTCACCTG TGAACCTGAG GTTCAGAACA CAACCTACCT GTGGTGGGTA AATGGTCAGA 600 

GCCTCCCGGT CAGTCCCAGG CTGCAGCTGT CCAATGGCAA CATGACCCTC ACTCTACTCA 660 

GCGTCAAAAG GAACGATGCA GGATCCTATG AATGTGAAAT ACAGAACCCA GCGAGTGCCA 720 

ACCGCAGTGA CCCAGTCACC CTGAATGTCC TCTATGGCCC AGATGTCCCC ACCATTTCCC 780 

CCTCAAAGGC CAATTACCGT CCAGGGGAAA ATCTGAACCT CTCCTGCCAC GCAGCCTCTA 840 

A CCCA CCTGC ACAGTACTCT TGGTTTATCA ATGGGACGTT CCAGCAATCC ACACAAGAGC 900 

TCTTTATCCC CAACATCACT GTGAATAATA GCGGATCCTA TATGTGCCAA GCCCATAACT 960 

CAGCCACTGG CCTCAATAGG ACCACAGTCA CGATGATCAC AGTCTCTGGA AGTGCTCCTG 1020 

TCCTCTCAGC TGTGGCCACC GTCGGCATCA CGATTGGAGT GCTGGCCAGG GTGGCTCTGA 1080 

TATAGCAGCC CTGGTGTATT TTCGATATTT CAGGAAGACT GGCAGATTGG ACCAGACCCT 1140 

GAATTCTTCT AGCTCCTCCA ATCCCATTTT ATCCCATGGA ACCACTAAAA ACAAGGTCTG 1200 

CTCTGCTCCT GAAGCCCTAT ATGCTGGAGA TGGACAACTC AATGAAAATT TAAAGGGAAA 1260 

ACCCTCAGGC CTGAGGTGTG TGCCACTCAG AGACTTCACC TAACTAGAGA CAGTCAAACT 1320 

GCAAACCATG GTGAGAAATT GACGACTTCA CACTATGGAC AGCTTTTCCC AAGATGTCAA 1380 

AACAR GACT C CTCATCATGA TAAGGCTCTT ACCCCCTTTT AATTTGTCCT TGCTTATGCC 1440 

TGCCTCTTTC GCTTGGCAGG ATGATGCTGT CATTAGTATT TCACAAGAAG TAGCTTCAGA 1500 

GGGTA ACTT A ACAGAGTGTC AGATCTATCX TGTCAATCCC AACGTTTTAC ATAAAATAAG 1560 

AGATCCTTTA GTGCACCCAG TGACTGACAT TAGCAGCATC TTTAACACAG CCGTGTGTTC 1620 

AAATGTACAG TGGTCCTTTT CAGAGTTGGA CTTCTAGACT CACCTGTTCT CACTCCCTGT 1680 

TTTAATTCAA CCCAGCCATG CAATGCCAAA TAATAGAATT GCTCCCTACC AGCTGAACAG 1740 

GGAGGAGTCT GTGCAGTTTC TGACACTTGT TGTTGAACAT GGCTAAATAC AATGGGTATC 1800 

GCTGAGACTA AGTT GTAGAA ATTAACAAAT GTGCTGCTTG GTTAAAATGG CTACACTCAT I860 

CTGACTCATT CTTTATTCTA TTTTAGTTGG TTTGTATCTT GCCTAAGGTG CGTAGTCCAA 1920 

CTCTTGGTAT TACCCTCCTA ATAGTCATAC TAGTAGTCAT ACTCCCTGGT GTAGTGTATT 1980 

CTCTAAAAGC TTTAAATGTC TGCATGCAGC CAGCCATCAA ATAGTGAATG GTCTCTCTTT 2040 

GGCTGGAATT ACAAAACTCA GAGAAATGTG TCATCAGGAG AACATCATAA CCCATGAAGG 2100 

ATAAAAGCCC CAAATGGTGG TAACTGATAA TAGCACTAAT GCTTTAAGAT TTGGTCACAC 2160 

TCTCACCTAG GTGAGCGCAT TGAGCCAGTG GTGCTAAATG CTACATACTC CAACTGAAAT 2220 

GTTAAGGAAG AAGATAGATC CAATTAAAAA AAATTAAAAC CAATTTAAAA AAAAAAAAGA 2260 

ACACAGGAGA TTCCAGTCTA CTTGAGTTAG CATAATACAG AAGTCCCCTC TACTTTAACT 2340 

TTTACAAAAA AGTAACCTGA ACTAATCTGA TGTTAACCAA TGTATTTATT TCTGTGGTTC 2400 

TGTTTCCTTG TTCCAATTTG ACAAAACCCA CTGTTCTTGT ATTGTATTGC CCAGGGGGAG 2460 

CTATCACTGT ACTTGTAGAG TGGTGCTGCT TTAATTCATA AATCACAAAT AAAAGCCAAT 2520 
TAGCTCTATA ACT 

Seq ID NO: 440 Protein sequence 
Protein Accession #: AAA59909.1 

1 11 21 31 41 51 

I I I I I I 

MLTNVPISW LFPCSNLTKP TVLVLYCPGG AITVLVEWCC PNS 



Seq ID NO: 441 DNA sequence 

Nucleic Acid Accession #: NH_002381.2 

Coding sequence: 64.. 1524 



11 21 31 41 51 

1128 



WO 03/042661 



I I I I I i 

AAATCCGAGC CTCGCGTGGG CTCCTGGCCC CCGACGGACA CCACCAGGCC CACGGAGCCC 60 

ACCATGCCGC GCCCGGCCCC CGCGCGCCGC CTCCCGGGAC TCCTCCTGCT GCTCTGGCCG 120 

CTGCTGCTGC TGCCCTCCGC CGCCCCCGAC CCCGTGGCCC GCCCGGGCTT CCGGAGGCTG 180 

GAGACCCGAG GTCCCGGGGG CAGCCCTGGA CGCCGCCCCT CTCCTGCGGC TCCCGACGGC 240 

GCGCCCGCTT CCGGGACCAG CGAGCCTGGC CGCGCCCGCG GTGCAGGTGT TTGCAAGAGC 300 

AGACCCTTGG ACCTGGTGTT TATCATTGAT AGTTCTCGTA GCGTACGGCC CCTGGAATTC 360 

ACCAAAGTGA AAACTTTTGT CTCCCGGATA ATCGACACTC TGGACATTGG GCCAGCCGAC 420 

ACGCGGGTGG CAGTGGTGAA CTATGCTAGC ACTGTGAAGA TCGAGTTCCA ACTCCAGGCC 480 

TACACAGATA AGCAGTCCCT GAAGCAGGCT GTGGGTCGAA TCACACCCTT GTCAACAGGC 540 

ACCATGTCAG GCCTAGCCAT CCAGACAGCA ATGGACGAAG CCTTCACAGT GGAGGCAGGG 600 

GCTCGAGAGC CCTCTTCTAA CATCCCTAAG GTGGCCATCA TTGTTACAGA TGGGAGGCCC 660 

CAGGACCAGG TGAATGAAGT GGCGGCTCGG GCCCAAGCAT CTGGTATTGA GCTCTATGCT 720 

GTGGGCGTGG ACCGGGCAGA CATGGCGTCC CTCAAGATGA TGGCCAGTGA GCCCCTAGAG 780 

GAGCATGTTT TCTACGTGGA GACCTATGGG GTCATTGAGA AACTTTCCTC TAGATTCCAG 840 

GAAACCTTCT GTGCGCTGGA CCCCTGTGTG CTTGGAACAC ACCAGTGCCA GCACGTCTGC 900 

ATCAGTGATG GGGAAGGCAA GCACCACTGT GAGTGTAGCC AAGGATACAC CTTGAATGCC 960 

GACAAGAAAA CGTGTTCAGC TCTTGATAGG TGTGCTCTTA ACACCCACGG ATGTGAGCAC 1020 

ATCTGTGTGA ATGACAGAAG TGGCTCTTAT CATTGTGAGT GCTATGAAGG TTATACCTTG 10 BO 

AATGAAGACA GGAAAACTTG TTCAGCTCAA GATAAATGTG CTTTGGGTAC CCATGGGTGT 1140 

CAGCACATTT GTGTGAATGA CAGAACAGGG TCCCATCATT GTGAATGCTA TGAGGGCTAC 1200 

ACTCTGAATG CAGATAAAAA AACATGTTCA GTCCGTGACA AGTGTGCCCT AGGCTCTCAT 1260 

GGTTGCCAGC ACATTTGTGT GAGTGATGGG GCCGCATCCT ACCACTGTGA TTGCTATCCT 1320 

GGCTACACCT TAAATGAGGA CAAGAAAACA TGTTCAGCCA CTGAGGAAGC ACGAAGACTT 1380 

GTTTCCACTG AAGATGCTTG TGGATGTGAA GCTACACTGG CATTCCAGGA CAAGGTCAGC 1440 

TCGTATCTTC AAAGACTGAA CACTAAACTT GATGACATTT TGGAGAAGTT GAAAATAAAT 1500 

GAATATGGAC AAATACATCG TTAAATTGCT CCAATTTCTC ACCTGAAAAT GTGGACAGCT 1560 

TGGTGTACTT AATACTCATG CATTCTTTTG CACACCTGTT ATTGCCAATG TTCCTGCTAA 1620 

TAATTTGCCA TTATCTGTAT TAATGCTTGA ATATTACTGG ATAAATTGTA TGAAGATCTT 1680 

CTGCAGAATC AGCATGATTT TTCCAAGGAA ATACATATGC AOATACTTAT TAAGAGCAAA 1740 

CTTTAGTGTC TCTAAGTTAT GACTGTGAAA TGATTGGTAG GAAATAGAAT GAAAAGTTTA 1800 

GTGTTTCTTT ATCTACTAAT TGAGCCATTT AATTTTTAAA TGTTTATATT AGATAACCAT 1860 

ATTCACAATG GAAACTTTAG GTCTAGTTTC TTTTGATAGT ATTTATAATA TAAATCAATC 1920 

TTATTACTGA GAGTGCAAAT TGTACAAGGT ATTTACACAT ACAACTTCAT ATAACTGAGA 1980 

TGAATGTAAT TTTGAACTGT TTAACACTTT TTGTTTTTTG CTTATTTTGT TGGAGTATTA 2040 

TTGAAGATGT GATCAATAGA TTGTAATACA CATATCTAAA AATAGTTAAC ACAGATCAAG 2100 

TGAACATTAC ATTGCCATTT TTAATTCATT CTGGTCTTTG AAAGAAATGT ACTACTAAAG 2160 

AGCACTAGTT GTGAATTTAG GGTGTTAAAC TTTTTACCAA GTACAAAAAT CCCAAATTCA 2220 

CTTTATTATT TTGCTTCAGG ATCCAAGTGA CAAAGTTATA TATTTATAAA ATTGCTATAA 2280 

ATCGACAAAA TCTAATGTTG TCTTTTTAAT GTTAGTGATC CACCTGCCTC AGCCTCCCAA 2340 

AGTGCTGGGA TTACAGGCTT GAAAGTCTAA CTTTTTTTTA CTTATATATT TGATACATAT 2400 

AATTCTTTTG GCTTTGAAAC TTGCAACTTT GAGAACAAAA CAGTCCTTTA AATTTTGCAC 2460 

TGCTCAATTC TGTTTTTCGT TTGCATTGTC TTTAATATAA TAAAAGTTAT TACCTTTACA 2520 

TATTATCATG TCTATTTTTG ATGACTCATC AATTTTGTCT ATTAAAGATA TTTCTTTAAA 2580 
TTAAAAAAAA AAAAAAAAA 

Seq ID KOj 442 Protein sequence 
Protein Accession #: NP_002372.1 

1 11 21 31 41 SI 

I I I I I I 

MPRPAPARRL PGLLLLLWPL LLLPSAAPDP VARPGFRRLE TRGPGGSPGR RPSPAAPDGA 60 
PASGTSBPGR ARGAGVCKSR PLDLVFIIDS SRSVRPLEPT KVKTFVSRII DTLDIGPADT 120 
RVAWNYAST VKIEPQLQAY TDKQSLKQAV GRITPLSTGT MSGLAIQTAM DEAFTVEAGA 180 
REPSSNIPKV AIIVTDGRPQ DQVNKVAARA QASGIELYAV GVDRADMASI* KMMASEPLBB 240 
HVFYVETYGV IEKLSSRFQE TFCALDPCVL GTHQCQHVCI SDGEGKHHCE CSQGYTLNAD 300 
KKTCSALDRC ALNTHGCEHI CVNDRSGSYH CECYEGYTLN EDRKTCSAQD KCALGTHGCQ 360 
HICVNDRTGS HHCECYEGYT LNADKKTCSV RDKCALGSHG CQHICVSDGA ASYHCDCYPG 420 
YTLNEDKKTC SATEEARRLV STEDACGCEA TLAPQDKVSS YLQRLNTKLD DILBKLKINB 480 
YGQIHR 

Seq ID NO: 443 DNA sequence 

Nucleic Acid Accession S: NM_016639.1 

Coding sequence: 40.. 429 

1 11 21 31 41 SI 

I I I I I I 

GCGGCGGGCG CAGACAGCGG CGGGCGCAGG AOGTGCACTA TGGCTOGGGG CTCGCTGCGC 60 

CGGTTGCTGC GGCTCCTCGT GCTGGGGCTC TGGCTGGCGT TGCTGCGCTC CGTGGCCGGG 120 

GAGCAAGCGC CAGGCACCGC CCCCTGCTCC OGCX5GCAGCT CCTGGAGCGC GGACCTGGAC 180 

. AAGTGGATGG ACTGGGCGTC TTGCAGGGCG CGACCGCACA GCGACTTCTG CCTGGGCTGC 240 

GCTGCAGCAC CTCCTGGCCC CTTCCGGCTG CTTTGGCCCA TCCTTGGGGG CGCTCTGAGC 300 

CTGACCTTCG TGCTGGGGCT GCTTTCTGGC TTTTTGGTCT GGAGACGATG CCGCAGGAGA 360 

GAGAAGTTCA CCACCCCCAT AGAGGAGACC GGCGGAGAGG GCTGCCCAGC TGTGGCGCTG 420 

ATCCAGTGAC AATGTGCCCC CTGCCAGCCG GGGCTCGCCC ACTCATCATT CATTCATCCA 480 

TTCTAGAGCC AGTCTCTGCC TCCCAGACGC GGCGGGAGCC AAGCTCCTCC AACCACAAGG S40 

GGGGTGGGGG GCGGTGAATC ACCTCTGAGG CCTGGGCCCA GGGTTCAGGG GAACCTTCCA 600 

AGGTGTCTGG TTGCCCTGCC TCTGGCTCCA GAACAGAAAG GGAGCCTCAC GCTGGCTCAC 660 

ACAAAACAGC TGACACTGAC TAAGGAACTG CAGCATTTGC ACAGGGGAGG GGGGTGCCCT 720 

CCTTCCTTAG GACCTGGGGG CCAGGCTGAC TTGGGGGGCA GACTTGACAC TAGGCCCCAC 780 

TCACTCAGAT GTCCTGAAAT TCCACCACGG GGGTCACCCT GGGGGGTTAG GGACCTATTT 840 

TTAACACTAG GGGCTCGCCC ACTAGGAGGG CTGGCCCTAA GATACAGACC CCCCCAACTC 900 

CCCAAAGCGG GGAGGAGATA TTTATTTTGG GGAGAGTTTG GAGGGGAGGG AGAATTTATT 960 
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AATAAAAGAA TCTTTAACTT TAAAAAAAAA AAAAAAAA 

Seq ID NO: 444 Protein sequence 
Protein Accession #» NP_057723.1 

1 11 21 31 41 51 

I I I I I t 

MARGSLRJtLL RLLVLGLWLA LLRSVAGEQA PGTAPCSRGS SWSADLDKCM DCASCRARPH 60 

SDFCLGCAAA PPAPFRLLWP ILGGALSLTF VLGLLSGFLV WRRCRRREKF TTPIEETGGB 120 
GCPAVAIiIQ 

Seq ID NO: 445 DNA sequence 

Nucleic Acid Accession #: AP322916.1 

Coding sequence: 50.. 4300 

1 11 21 31 41 51 

I I I I I I 

GCACTCCGCA GCCTTTAAGG TTCGCGCGGG GGCCAGGCAA GAGTTAGCCA TGAAGAGCCT 60 

CAAGTCCCGC CTGAGGAGGC AGGACGTGCC CGGCCCCGCG TCGTCTGGCG CCGCCGCCGC 120 

CAGCGCGCAT GCAGCAGATT GGAATAAATA TGATGACCGA TTGATGAAAG CAGCAGAAAG 180 

GGGGGATGTA GAAAAAGTGA CCTCAATCCT TGCTAAAAAG GGGGTCAATC CAGGCAAACT 240 

AGATGTGGAA GGCAGATCTG TCTTCCATGT TGTGACCTCA AAGGGGAATC TTGAGTGTTT 300 

GAATGCCATC CTTATACATG GAGTTGATAT TACAACCAGT GACACTGCAG GGAGAAATGC 360 

TCTTCACCTG GCTGCTAAGT ATGGACATGC ATTGTGCCTA CAAAAACTTC TACAGTACAA 420 

TTGTCCCACT GAGCATGCAG ACCTGCAGGG AAGAACTGCA CTTCAAAAAA AAGCAATGGC 480 

AGATTGTCCT TCTAGCATAC AGCTGCTTTG TGACCATGGG GCCTCTGTGA ATGCCAAAGA 540 

TGTAAAAGGG CGGACACCAC TTGTTCTGGC TACTCAGATG AGTAGGCCAA CAATATGTCA 600 

ACTGCTGATA GATAGAGGAG CGGATGTTAA TTCCAGAGAC AAACAAAACA GAACTGCCCT 660 

CATGCTAGGT TGCGAATATG GTTGCAGAGA TGCAGTAGAA GTCTTAATTA AAAATGGTGC 720 

TGATATAAGC TTGCTGGATG CGCTTGGCCA TGATAGTTCT TACTATGCAA GAATTGGTGA 780 

CAATCTGGAC ATTCTAACCT TGTTGAAGAC TGCATCGGAA AATACCAACA AAGGGAGAGA 840 

ACTTTGGAAG AAAGGACCAT CTTTGCAACA GCGAAATTTG ACACACATGC AAGATGAAGT 900 

AAATGTGAAG TCACATCAGA GGGAGCATCA AAATATTCAG GATTTGGAGA TTGAAAATGA 960 

AGATTTGAAA GAGAGGTTGA GAAAAATTCA GCAAGAACAA AGAATACTTT TGGATAAAGT 1020 

CAATGGTTTA CAGTTACAGC TGAATGAGGA AGTTATGGTT GCTGATGATC TGGAAAGCGA 1080 

GAGAGAAAAG CTGAAGTCCC TTTTGGCAGC TAAAGAAAAG CAACATGAAG AAAGCTTAAG 1140 

GACTATTGAG GCTCTGAAAA ATAGATTTAA ATATTTTGAG AGTGATCATT TAGGATCAGG 1200 

AAGTCATTTC AGTAACCGAA AAGAAGATAT GCTTCTTAAA CAAGGTCAGA TGTATATGGC 1260 

AGACTCACAG TGTACTTCCC CAGGTATACC AGCCCATATG CAAAGCAGAT CTATGTTAAG 1320 

ACCTCTGGAA CTATCTTTAC CCAGTCAAAC GTCATACTCT GAAAATGAAA TTTTAAAGAA 13 B0 

AGAGTTAGAA GCAATGCGAA CTTTCTGTGA GTCAGCAAAA CAAGACCGAC TGAAGCTCCA 1440 

AAATGAACTG GCACACAAAG TGGCAGAATG CAAAGCTTTA GCATTAGAAT GTGAAAGGGT 1500 

CAAGGAGGAT TCAGATGAAC AGATAAAGCA ATTAGAAGAT GCATTAAAAG ATGTGCAGAA 1560 

GAGGATGTAT GAGTCAGAAG GTAAAGTTAA ACAAATGCAG ACCCATTTTC TTGCCCTTAA 1620 

AGAACACTTA ACAAGTGAAG CAGCCTCAGG GAATCACAGA CTAACCGAGG AACTGAAGGA 1680 

TCAGTTGAAA GACTTGAAAG TAAAATATGA AGGTGCTTCA GCAGAAGTGG GGAAATTAAG 1740 

AAACCAAATC AAACAAAATG AGATGATAGT AGAAGAGTTT AAGAGGGATG AAGGCAAGCT 1600 

GATAGAAGAA AATAAGCGAT TACAGAAGGA ACTTAGTATG TGTGAAATGG AGCGAGAGAA 1860 

GAAAGGAAGA AAGGTCACAG AGATGGAAGG CCAGGCAAAA GAATTGTCAG CGAAGTTGGC 1920 

CCTTTCCATT CCAGCTGAAA AATTTCAAAA CATGAAGAGC TCATTATCAA ATGAAGTGAA 1980 

TGAGAAAGCA AAAAAATTAG TAGAAATGGA AAGAGAACAT GAAAAATCAC TTAGTGAAAT 2040 

TAGACAGTTA AAGAGAGAAC TTGAGAATGT TAAGGCCAAG CTTGCTCAGC ACGTCAAACC 2100 

AGAGGAACAT GAACAGGTTA AGAGCAGATT AGAACAGAAA TCAGGAGAAC TTGGGAAGAA 2160 

GATCACTGAG TTAACATTGA AAAATCAGAC ACTACAAAAG GAAATTGAAA AAGTTTATTT 2220 

GGATAATAAG CTCCTCAAGG AGCAAGCACA TAACTTAAGA ATTGAAATGA AAAATCATTA 2280 

TGTTCCTTTA AAAGTAAGTG AAGACATGAA AAAGTCACAT GATGCAATTA TTGATGATCT 2340 

TAATAGAAAG CTTTTAGATG TAACACAAAA ATATACAGAA AAGAAGTTGG AAATGGAGAA 2400 

ATTGCTACTG GAAAATGACA GCTTAAGTAA GGATGTAAGC CGCCTAGAAA CTGTGTTTGT 2460 

ACCTCCTGAG AAACATGAAA AAGAGATAAT AGCTCTGAAA TCCAATATTG TTGAACTTAA 2520 

GAAACAGCTG TCTGAACTTA AGAAAAAATG TGGTGAAGAC CAGGAGAAAA TACACGCTCT 2580 

CACATCTGAA AACACTAACT TGAAGAAGAT GATGAGTAAT CAGTATGTGC CAGTTAAAAC 2640 

CCATGAAGAG GTTAAAATGA CACTGAATGA CAOGTTAGCC AAAACTAACA GAGAATTATT 2700 

AGATGTGAAG AAAAAATTTG AAGATATAAA TCAGGAATTT GTAAAAATAA AAGATAAGAA 2760 

TGAAATATTA AAAAGAAACC TGGAAAACAC TCAGAACCAA ATAAAAGCTG AGTACATCAG 2820 

CCTGGCAGAG CACGAGGCAA AGATGAGCTC GCTAAGTCAG AGCATGAGAA AGGTGCAGGA 2880 

TAGTAATGCT GAAATCTTGG CCAACTACAG AAAAGGCCAA GAAGAGATTG TGACACTGCA 2940 

TGCCGAAATT AAAGCCCAGA AGAAGGAGCT CGACACAATA CAAGAATGCA TTAAGGTAAA 3000 

ATATGCCCCA ATTGTCAGCT TTGAGGAGTG CGAGAGAAAA TTTAAAGCAA CAGAGAAAGA 3060 

ACTAAAAGAC CAGTTATCAG AGCAGACACA AAAGTATAGT GTCAGTGAAG AAGAAGTCAA 3120 

GAAAAACAAG CAAGAGAATG ACAAGTTAAA GAAGGAGATT TTTACCCTTC AGAAAGATTT 3180 

GAGAGATAAG ACAGTTCTCA TTGAGAAGTC TCATGAAATG GAAAGAGCAT TAAGCAGAAA 3240 

AACAGACGAG CTAAACAAAC AGTTAAAAGA CTTGTCACAG AAATACACGG AAGTAAAGAA 3300 

TGTGAAAGAG AAGCTAGTAG AAGAAAATGC CAAACAGACT TCTGAGATAC TTGCAGTGCA 3360 

AAATCTTTTG CAAAAACAAC ATGTTCCATT GGAACAGGTT GAGGCTCTGA AAAAATCTCT 3420 

TAATGGCACA ATTGAAAATC TAAAGGAAGA ACTGAAGAGT ATGCAAAGGT GTTACGAGAA 3480 

AGAGCAGCAG ACAGTGACCA AACTGCATCA ATTGTTGGAG AATCAAAAGA ACTCTTCTGT 3540 

ACCCCTGGCA GAGCATTTGC AGATTAAAGA AGCATTTGAG AAAGAAGTTG GAATCATAAA 36 00 

AGCCAGCTTG AGAGAAAAGG AAGAAGAAAG CCAAAACAAA ATGGAAGAAG TCTCCAAACT 3660 

TCAGTCGGAG GTTCAGAATA CTAAACAAGC ATTAAAAAAA TTAGAGACTA GAGAGGTAGT 3720 

TGACTTGTCT AAATATAAAG CAACAAAAAG TGATTTGGAG ACACAGATTT CTAGCTTAAA 3780 

TGAAAAATTG GCCAATCTGA ATAGAAAGTA TGAGGAAGTA TGTGAGGAAG TTTTGCATGC 3840 
CAAAAAGAAG GAAATATCTG CAAAAGATGA GAAGGAATTA CTGCATTTCA GCATTGAGCA 3900 
AGAAATTAAG GATCAGAAGG AACGATGTGA TAAGTCCTTA ACAACAATCA CAGAGTTACA 3960 

AAGAAGAATA CAAGAATCTG CTAAACAAAT AGAAGCAAAA GATAATAAGA TAACTGAACT 4020 
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10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



GCTTAATGAT 
AAGTGGGAAC 
ATCTCTGGAG 
TCGGACACAC 
GCTCCAGATC 
TATCTGTTTT 
CTTACTGTGC 



GTGGAAAGAT 
CCCACCAAGA 
CAACAGCTGG 
CTTCTTAGTG 
ATACAAATGC 
ATCTTGCTGG 
TGGTATAATT 



TAAAACAGGC 
GGCAGAGCCA 
CCGATGCTGA 
CTGCACAGGG 
GGCAGGGGCT 
TGCTGAACAT 
AAAATAAAAT 



ACTCAATGGC 
GCTGATTGAC 
CAGACAGCAC 
TCACATGGAT 
TGTGTGCTAG 
TCTTTGTGCA 
ATATTTTGTT 



CTTTCCCAAC 
ACTCTGCAGC 
CAAGAAGTAA 
GAAGATGTTC 
CCGTTAGCAC 
ACTCCATGGT 
CTGGGTGT 



TCACCTACAC 40 BO 

ACCAAGTGAA 4140 

TTGCAATTTA 4200 

AGGAGGCTCT 4260 

TGACTGCCAG 4320 

CTTTCTGGGC 4380 



Seq ID NO: 446 Protein sequence 
Protein Accession #s AAG49577.1 



1 
I 

MKSLKSRLRR 
PGKLDVEGRS 
LQYNCPTEHA 
TICQLLIDRG 
RIGDNLDILT 
IENEDLKERIi 
ESLRTIEALK 
SMLRPLELSL 
CERVKEDSDE 
ELKDQLKDLK 



LSEIRQIiKRE 
KVYLDNKLLK 
EMEKLIiLEND 
IHALTSENTN 
KDKNEILKRN 
VTLHAEIKAQ 
EEVKKNKQEN 
EVKNVKEKLV 
CYEKEQQTVT 
VSKLQSEVQN 
VLHAKKKEIS 
ITELLNDVER 
IAIYRTHLLS 



11 
I 

QDVPGPASSG 
VFHWTSKGN 
DLQGRTALQK 
ADVNSRDKQN 
LLKTASENTN 
RKIQQEQRIL 
NRFKYFESDH 
PSQTSYSENE 
QIKQLEDALK 
VKYEGASAEV 
EMEGQAKELS 
LENVKAKLAQ 
EQAHNLTIEM 
SLSKDVSRLE 
LKKMMSNQYV 
LENTQNQIKA 
KKELDTIQEC 
DKLKKEIFTL 
EENAKQTSEI 
KLHQLLENQK 
TKQALKKLET 
AKDEKELLHF 
LKQALNGLSQ 
AAQGHMDEDV 



21 

I 

AAAASAHAAD 
LECLNAILIH 
KAMADCPSSI 
RTALMLGCEY 
KGRELWKKGP 
LDKVNGLQLQ 
LGSGSHFSNR 
ILKKELEAMR 
DVQKRMYESE 
GKLRNQIKQN 
AKLAL3IPAE 
HVKPEEHEQV 
KNHYVPLKVS 
TVFVPPEKHE 
PVKTHEBVKM 
EYISLAEHEA 
IKVKYAPIVS 
QKDLRDKTVL 
LAVQNLLQKQ 
NSSVPLAEHL 
REWDItSKYK 
SIEQEIKDQK 
LTYTSGNPTK 
QEALIiQIIQM 



31 
1 

WNKYDDRLMK 
GVDITTSDTA 
QXiLCDHGASV 
GCRDAVEVLI 
SLQQRNLTHM 
LNEEVMVADD 
KEDMLLKQGQ 
TFCESAKQDR 
GKVKQMQTHF 
EMIVEEFKRD 
KFENMKSSLS 



EDMKKSHDAI 
KEIIALKSNI 
TLNDTLAKTN 



IEKSHEHERA 
HVPliEQVEAL 
QIKEAFEKEV 
ATKSDLETQI 
ERCDKSLTTI 
RQSQLIDTLQ 
RQGLVC 



41 

1 

AAERGDVEKV 
GRNALHLAAK 
NAKDVKGRTP 
KNGADISLLD 
QDEVNVKSHQ 
LESEREKLKS 
MYMADSQCTS 
LKLQNELAHK 
LALKEHLTSE 
EGKLIEENKR 
NEVNEKAKKL 
LGKKITELTL 
IDDLNRKLLD 
VELKKQLSEL 
RELLDVKKKF 
KVQDSNAEIL 
TEKELKDQLS 
LSRKTDELNK 
KKSLNGTIEN 
GIIKASLREK 
SSLNEKLANL 
TELQRRIQES 
HQVKSLEQQL 



51 
I 

TSILAKKGVN 

YGHALCLQKL . 

LVLATQMSRP 

ALGHDSSYYA 

REHQNIQDLE 

LLAAKEKQHE 

PGIPAHMQSR 

VAECKALALE 

AASGNHRLTE 

LQKELSMCEM 

VEMEREHEKS 

KNQTLQKEIE 

VTQKYTEKKL 

KKKCGEDQEK 

EDINQEFVKI 

ANYRKGQEEI 

EQTQKYSVSE 

QLKDLSQKYT 

LKEELKSMQR 



NRKYEEVCEE 
AKQIEAKDNK 
ADADRQHQEV 



Seq ID NO i 447 DNA sequence 
Nucleic Acid Accession #: NM_003020.1 
Coding sequence : 29 . . 664 



CGCTCCTCGG 
TGGCCTACTG 
CCCTGACCGG 
GGGCATTGCC 
CCAGAGCATT 
CCCCAACATC 
GGGGTACCCA 
AAACACCCCT 
TCCGGAACAT 
GAAGGGAGGA 
GGATAATGTT 
GTAAAGAGAA 
TGCACGTGTA 
TACAAAGCAG 
AAATTAGAAT 
TAAAAATTAA 
TGCAGTTTAA 
TTTGATTTTG 
GCTGTACTCA 
GCAAGCATTG 



11 
I 

GCTGCCCCTC 
TTTTGGCTGG 
GTCTCAGAAG 
AGGCCCCGAG 
GAAGGTGGAG 
GTGGCAGAGT 
GACCCTCCAA 
GACACTGCAG 
GACTATCCAG 
GAGAGACGAA 
GTTGCAAAGA 
GATGCTAGAC 
AATGGAGTCC 
CTGTATGTAG 
AAGAGCTTTT 
AATGTGAATG 
AATGGTGTCT 
ATTATGTAGT 
AGAGGAGGAG 
GC 



21 
I 

GGTTGACAAT 
CATCTGGATG 
CAGATATCCA 
TGGAATATCC 
CTCATGAAGG 
TGACTGGAGA 
ATCCCTGTCC 
AGTTCAGTCG 
GCTTGGGCAA 
AGCGGAGGAG 
AGTCTGTCCC 
GAAAACCCAC 
CTGTGAATGA 
ATAGTGTATT 
TTGTTTCTTG 
TCAACAATAA 
GAGGTTGTAC 
TCATCCAGCC 
CTGACACATT 



31 
I 

GGTCTCCAGG 
GACTCCAGCA 
GAGGCTGCTT 
AGCTCACCAG 
ACTTCAGCAT 
CAACATTCCT 
TGTTGGAAAA 
AGAGTTCCAG 
GTGGAACAAG 
TGTCAATCCA 
CCATTTTTCA 
ATTACCTGTT 
CAGCATGTTT 
GTCTTCACAC 
GGTTTTTAAA 
AAAGCAAGAC 
TATTTTGGCC 
CTTGGGCATT 
TCACTTGGCT 



41 
I 

ATGGTCTCTA 
TTTGCTTACA 
CATGGTGTTA 
GCCATGAATC 
TTGGGTCCTT 
AAGGACTTTA 
ACAGATGATG 
TTGCACCAGC 
AAACTCCTTT 
TATCTACAAG 
GATGAGGATA 
AGGCCTCAGC 
CTTACATAGA 
CGATGATTCT 
ATGTGAATCT 
TATGAAAGGC 
AAGTCTGTAG 
GTTATACACC 
GCGTCTTAAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



51 
I 

CCATGCTATC 
GCCCCCGGAC 
TGGAGCAATT 
TTGTGGGCCC 
TTGGCAACAT 
GTGAGGATCA 
GATGTCTAGA 
ATCTCTTTGA 
ACQAGAAGAT 
GACAGAGACT 
AGGATCCAGA 
ATGGCTTATG 
TAATTATGGA 
GCTTTTTGCT 
GCAATGATCA 
TCAGATTTCT 
AAAGCTGTCA 
AGTAAAGAAG 
AAACATGAAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



Seq ID NO: 448 Protein sequence 
Protein Accession #: NP_003011.1 

1 11 21 31 41 51 

I I I I I I 

MVSRMVSTML SGLLFWLASG WTPAFAYSPR TPDRVSEADI QRLLHGVMEQ LGIARPRVEY 
PAHQAMNLVG PQSIEGGAHE GLQHLGPFGN IPNIVAELTG DNIPKDFSED QGYPDPPNPC 
PVGKTDDGCL ENTPDTAEFS REFQLHQHLF DPEHDYPGLG KWNKKLLYEK MKGGERRKRR 
SVNPYLQGQR LDNWAKKSV PHFSDEDKDP E 

Seq ID NO: 449 DNA sequence 

Nucleic Acid Accession #: NM_003816.l 

Coding sequence: 79..2S38 

1 11 21 31 41 51 

I I I I I I 

CGGCAGGGTT GGAAAATGAT GGAAGAGGCG GAGGTGGAGG CGACCGAGTG CTGAGAGGAA 
CCTGCGGAAT CGGCCGAGAT GGGGTCTGGC GCGCGCTTTC CCTCGGGGAC CCTTCGTGTC 
CGGTGGTTGC TGTTGCTTGG CCTGGTGGGC CCAGTCCTCG GTGCGGCGCG GCCAGGCTTT 

1131 



60 
120 
180 



60 
120 
180 



WO 03/042661 PCT/US02/36810 



CAACAGACCT CACATCTTTC TTCTTATGAA ATTATAACTC CTTGGAGATT AACTAGAGAA 240 

AGAAGAGAAG CCCCTAGGCC CTATTCAAAA CAAGTATCTT ATGTTATTCA GGCTGAAGGA 300 

AAAGAGCATA TTATTCACTT GGAAAGGAAC AAAGACCTTT TGCCTGAAGA TTTTGTGGTT 360 

TATACTTACA ACAAGGAAGG GACTTTAATC ACTGACCATC CCAATATACA GAATCATTGT 420 

5 CATTATCGGG GCTATGTGGA GGGAGTTCAT AATTCATCCA TTGCTCTTAG CGACTGTTTT 480 

GGACTCAGAG GATTGCTGCA TTTAGAGAAT GCGAGTTATG GGATTGAACC CCTGCAGAAC 540 

AGCTCTCATT TTGAGCACAT CATTTATCGA ATGGATGATG TCTACAAAGA GCCTCTGAAA 600 

TGTGGAGTTT CCAACAAGGA TATAGAGAAA GAAACTGCAA AGGATGAAGA GGAAGAGCCT 660 

CCCAGCATGA CTCAGCTACT TCGAAGAAGA AGAGCTGTCT TGCCACAGAC CCGGTATGTG 720 

10 GAGCTGTTCA TTGTCGTAGA CAAGGAAAGG TATGACATGA TGGGAAGAAA TCAGACTGCT 780 

GTGAGAGAAG AGATGATTCT CCTGGCAAAC TACTTGGATA GTATGTATAT TATGTTAAAT B40 

ATTCGAATTG TGCTAGTTGG ACTGGAGATT TGGACCAATG GAAACCTGAT CAACATAGTT 900 

GGGGGTGCTG GTGATGTGCT GGGGAACTTC GTGCAGTGGC GGGAAAAGTT TCTTATCACA 960 

CGTCGGAGAC ATGACAGTGC ACAGCTAGTT CTAAAGAAAG GTTTTGGTGG AACTGCAGGA 1020 

15 ATGGCATTTG TGGGAACAGT GTGTTCAAGG AGCCACGCAG GCGGGATTAA TGTGTTTGGA 1080 

CAAATCACTG TGGAGACATT TGCTTCCATT GTTGCTCATG AATTGGGTCA TAATCTTGGA 1140 

ATGAATCACG ATGATGGGAG AGATTGTTCC TGTGGAGCAA AGAGCTGCAT CATGAATTCA 12 00 

GGAGCATCGG GTTCCAGAAA CTTTAGCAGT TGCAGTGCAG AGGACTTTGA GAAGTTAACT 1260 

TTAAATAAAG GAGGAAACTG CCTTCTTAAT ATTCCAAAGC CTGATGAAGC CTATAGTGCT 1320 

20 CCCTCCTGTG GTAATAAGTT GGTGGACGCT GGGGAAGAGT GTGACTGTGG TACTCCAAAG 1380 

GAATGTGAAT TGGACCCTTG CTGCGAAGGA AGTACCTGTA AGCTTAAATC ATTTGCTGAG 1440 

TGTGCATATG GTGACTGTTG TAAAGACTGT CGGTTCCTTC CAGGAGGTAC TTTATGCCGA 1500 

GGAAAAACCA GTGAGTGTGA TGTTCCAGAG TACTGCAATG GTTCTTCTCA GTTCTGTCAG 1560 

CCAGATGTTT TTATTCAGAA TGGATATCCT TGCCAGAATA ACAAAGCCTA TTGCTACAAC 1620 

25 GGCATGTGCC AGTATTATGA TGCTCAATGT CAAGTCATCT TTGGCTCAAA AGCCAAGGCT 1680 

GCCCCCAAAG ATTGTTTCAT TGAAGTGAAT TCTAAAGGTG ACAGATTTGG CAATTGTGGT 1740 

TTCTCTGGCA ATGAATACAA GAAGTGTGCC ACTGGGAATG CTTTGTGTGG AAAGCTTCAG 1800 

TGTGAGAATG TACAAGAGAT ACCTGTATTT GGAATTGTGC CTGCTATTAT TCAAACGCCT 1860 

AGTCGAGGCA CCAAATGTTG GGGTGTGGAT TTCCAGCTAG GATCAGATGT TCCAGATCCT 1920 

30 GGGATGGTTA ACGAAGGCAC AAAATGTGGT GCTGGAAAGA TCTGTAGAAA CTTCCAGTGT 1980 

GTAGATGCTT CTGTTCTGAA TTATGACTGT GATGTTCAGA AAAAGTGTCA TGGACATGGG 2040 

GTATGTAATA GCAATAAGAA TTGTCACTGT GAAAATGGCT GGGCTCCCCC AAATTGTGAG 2100 

ACTAAAGGAT ACGGAGGAAG TGTGGACAGT GGACCTACAT ACAATGAAAT GAATACTGCA 2160 

TTGAGGGACG GACTTCTGGT CTTCTTCTTC CTAATTGTTC CCCTTATTGT CTGTGCTATT 2220 

35 TTTATCTTCA TCAAGAGGGA TCAACTGTGG AGAAGCTACT TCAGAAAGAA GAGATCACAA 2280 

ACATATGAGT CAGATGGCAA AAATCAAGCA AACCCTTCTA GACAGCCGGG GAGTGTTCCT 2340 

CGACATGTTT CTCCAGTGAC ACCTCCCAGA GAAGTTCCTA TATATGCAAA CAGATTTGCA 2400 

GTACCAACCT ATGCAGCCAA GCAACCTCAG CAGTTCCCAT CAAGGCCACC TCCACCACAA 2460 

COGAAAGTAT CATCTCAGGG AAACTTAATT CCTGCCCGTC CTGCTCCTGC ACCTCCTTTA 2520 

40 TATAGTTCCC TCACTTGATT TTTTTAACCT TCTTTTTGCA AATGTCTTCA GGGAACTGAG 2580 

CTAATACTTT TTTTTTTTCT TGATGTTTTC TTGAAAAGCC TTTCTGTTGC AACTATGAAT 2640 

GAAAACAAAA CACCACAAAA CAGACTTCAC TAACACAGAA AAACAGAAAC TGAGTGTGAG 2700 

AGTTGTGAAA TACAAGGAAA TGCAGTAAAG CCAGGGAATT TACAATAACA TTTCCGTTTC 2760 

CATCATTGAA TAAGTCTTAT TCAGTCATCG GTGAGGTTAA TGCACTAATC ATGGATTTTT 2820 

45 TGAACATGTT ATTGCAGTGA TTCTCAAATT AACTGTATTG GTGTAAGATT TTTGTCATTA 2880 

AGTGTTTAAG TGTTATTCTG AATTTTCTAC CTTAGTTATC ATTAATGTAG TTCCTCATTG 2940 

AACATGTGAT AATCTAATAC CTGTGAAAAC TGACTAATCA GCTGCCAATA ATATCTAATA 3000 

TTTTTCATCA TGCACGAATT AATAATCATC ATACTCTAGA ATCTTGTCTG TCACTCACTA 3060 

CATGAATAAG CAAATATTGT CTTCAAAAGA ATGCACAAGA ACCACAATTA AGATGTCATA 3120 

50 TTATTTTGAA AGTACAAAAT ATACTAAAAG AGTGTGTGTG TATTCACGCA GTTACTOGCT 3180 

TCCATTTTTA TGACCTTTCA ACTATAGGTA ATAACTCTTA GAGAAATTAA TTTAATATTA 3240 

GAATTTCTAT TATGAATCAT GTGAAAGCAT GACATTCGTT CACAATAGCA CTATTTTAAA 3300 

TAAATTATAA GCTTTAAGGT ACGAAGTATT TAATAGATCT AATCAAATAT GTTGATTCAT 3360 

GGCTATAATA AAGCAGGAGC AATTATAAAA TCTTCAATCA ATTGAACTTT TACAAAACCA 3420 

55 CTTGAGAATT TCATGAGCAC TTTAAAATCT GAACTTTCAA AGCTTGCTAT TAAATCATTT 3480 

AGAATGTTTA CATTTACTAA GGTGTGCTGG GTCATGTAAA ATATTAGACA CTAATATTTT 3540 

CATAGAAATT AGGCTGGAGA AAGAAGGAAG AAATGGTTTT CTTAAATACC TACAAAAAAG 3600 

TTACTGTGGT ATCTATGAGT TATCATCTTA GCTGTGTTAA AAATGAATTT TTACTATGGC 3660 

AGATATGGTA TGGATCGTAA AATTTTAAGC ACTAAAAATT TTTTCATAAC CTTTCATAAT 3720 

60 AAAGTTTAAT AATAGGTTTA TTAACTGAAT TTCATTAGTT TTTTAAAAGT GTTTTTGGTT 3780 

TGTGTATATA TACATATACA AATACAACAT TTACAATAAA TAAAATACTT GAAATTCTCA 3840 
AAAAAAAAAA AAAAAAAAAA AAAAA 

Seq ID NO: 450 Protein sequence 
65 Protein Accession #: NP_003807.l 

1 11 21 31 41 51 

I I I I I I 

MGSGARFPSG TLRVRWLLLL GLVGPVLGAA RPGFQOTSHL SSYEIITPWR LTRERREAPR 60 

70 PYSKQVSYVI QAEGKEHIIH LERNKDLLPE DFWYTYNKE GTLITDHPNI QNHCHYRGYV 120 

EGVHNSSIAL SDCFGLRGLL HLENASYGIE PLQNSSHFEH IIYRMDDVYK EPLKCGVSNK 180 

DIEKETAKDE EEEPPSMTQL LRRRRAVLPQ TRYVELFIW DKERYDMMGR NQTAVREEMI 240 

LLANYLDSMY IMLNIRIVLV GLEIWTNGNL INIVGGAGDV LGNFVQWREK FLITRRRHDS 300 

AQLVLKKGFG GTAGMAPVGT VCSREHAGGI NVFGQITVET PASIVAHELG HNLGMNHDDG 360 

75 RDCSCGAKSC IMNSGASGSR NFSSCSAEDF EKLTLNKGGN CLLNIPKPDE AYSAPSCGNK 420 

LVDAGEECDC GTPKECELDP CCEGSTCKLK SFAECAYGDC CKDCRFLPGG TLCRGKTSEC 480 

DVPEYCNGSS QFCQPDVPIQ NGYPCQNNKA YCYNGMCQYY DAQCQV1FGS KAKAAPKDCP 540 

IBVNSKGDRF GNCGFSGNEY KKCATGNALC GKLQCENVQE IPVFGIVPAI IQTPSRGTKC 600 

WGVDPQLGSD VPDPGMVNEG TKCGAGKICR NFQCVDASVL KYDCDVQKKC HGHGVCNSNK 660 

80 NCHCEKGWAP PNCETKGYGG SVDSGPTYNE MNTALRDGLL VFFFLIVPLI VCAIPIFIKR 720 

DQLWRSYFRK KRSQTYESDG KNQANPSRQP GSVPRHVSPV TPPREVPIYA NRFAVPTYAA 780 
KQPQQFPSRP PPPQPJCVSSQ GNLIPARPAP APPLYSSLT 



1132 
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Seq ID NO: 451 DNA sequence 

Nucleic Acid Accession g: NM_0166S0.1 

Coding sequence: 196.. 7B9 

1 11 21 31 41 51 

I I I I I I 

GGTTTCAATA TATGCAGATG TCTCGATATA GGAATQAAAT TACGTCTTTO GAACAACTTA 60 

AATAAGTCAA ATATACTTGG AGCTTTAAAA ATTAAAAGGA GAGAGATTCQ AGCACCTTTT 120 

CTGCTGCCAT GACAACCATG CAAGGAATGG AACAGGCCAT GCCAGGGTTG GCCTGGTGTG 180 

CCCCAGCTGG GAAACATGGC TGTCATACAT TCACATCTGT GGAAAGGATT GCAAGAGAAG 240 

TTCTTGAAGG GAGAACCCAA AGTCCTTGGG GTTGTGCAGA TTCTGACTGC CCTGATGAGC 300 

CTTAGCATGG GAATAACAAT GATGTGTATG GCATCTAATA CTTATGGAAG TAACCCTATT 360 

TCCGTGCATA TCGGGTACAC AATTTGGGGG TCAGTAATGT TTATTATTTC AGGATCCTTG 420 

TCAATTGCAG CAGGAATTAG AACTACAAAA GGCCTGGTCC GAGGTAGTCT AGGAATGAAT 480 

ATCACCAGCT CTGTACTGGC TGCATCAGGG ATCTTAATCA ACACATTTAG CTTGGCGTTT 540 

TATTCATTCC ATCACCCTTA CTGTAACTAC TATGGCAACT CAAATAATTG TCATGGGACT 600 

ATGTCCATCT TAATGGGTCT GGATGGCATG GTGCTCCTCT TAAGTGTGCT GGAATTCTGC 660 

ATTGCTGTGT CCCTCTCTGC CTTTGGATGT AAAGTGCTCT GTTGTACCCC TGGTGGGGTT 720 

GTGTTAATTC TGCCATCACA TTCTCACATG GCAGAAACAG CATCTCCCAC ACCACTTAAT 780 

GAGGTTTGAG GCCAACAAAA GATCAACAGA CAAATGCTCC AGAAATCTAT GCTGACTGTG 840 

ACACAAGAGC CTCACATGAG AAATTACCAG TATCCAACTT CGATACTGAT AGACGTGTTG 900 

ATATTATTAT TATATGTAAT CCAATTATGA ACTGTGTGTG TATAGAGAGA TAATAAATTC 960 

AAAATTATGT TCTCATTTTT TTCCCTGGAA CTCAATAACT CACTTCACTG GCTCTTTATC 1020 
GAGAGTACTA GGAGTTAAAT TAATAAATAA TGCATTTAAT GAGGCCACAG GAAAAA 

Seq ID NO: 452 Protein sequence 
Protein Accession #: NP 057734.1 



on 1 11 21 31 41 51 

30 | | | | | | 

MAVIHSHLWK GLQEKFLKGE PKVLGWQIL TALMSLSMGI TMMCMASNTY GSNPISVHIG 60 
YTIWGSVMFI ISGSLSIAAG IRTTKGLVRG SLGMNITSSV IAASGILINT FSLAFYSFHH 120 
PYCNYYGNSN NCHGTMSILM GLDGMVLLLS VLHPCIAVSL SAFGCKVLCC TPGGWLILP 180 
SHSHMAETAS PTPLNEV 



Seq ID NO: 453 DNA sequence 

Nucleic Acid Accession #: NM_002091.1 

Coding sequence : 56 . . 503 

1 11 21 31 41 51 

I I I I I I 

AGTCTCTGCT CTTCCCAGCC TCTCCGGCGC GCTCCAAGGG CTTCCCGTCG GGACCATGCG 60 

CGGCAGTGAG CTCCOGCTGG TCCTGCTGGC GCTGGTCCTC TGCCTAGCGC CCCGGGGGCG 120 

AGCGGTCCCG CTGCCTGCGG GCGGAGGGAC CGTGCTGACC AAGATGTACC CGCGCGGCAA 180 

CCACTGGGCG GTGGGGCACT TAATGGGGAA AAAGAGCACA GGGGAGTCTT CTTCTGTTTC 240 

TGAGAGAGGG AGCCTGAAGC AGCAGCTGAG AGAGTACATC AGGTGGGAAG AAGCTGCAAG 300 

GAATTTGCTG GGTCTCATAG AAGCAAAGGA GAACAGAAAC CACCAGCCAC CTCAACCCAA 360 

GGCCTTGGGC AATCAGCAGC CTTCGTGGGA TTCAGAGGAT AGCAGCAACT TCAAAGATGT 420 

AGGTTCAAAA GGCAAAGTTG GTAGACTCTC TGCTCCAGGT TCTCAACGTG AAGGAAGGAA 480 

CCCCCAGCTG AACCAGCAAT GATAATGATG GCCTCTCTCA AAAGAGAAAA ACAAAACCCC 540 

TAAGAGACTG AGTTCTGCAA GCATCAGTTC TACGGATCAT CAACAAGATT TCCTTGTGCA 600 

AAATATTTGA CTATTCTGTA TCTTTCATCC TTGACTAAAT TCGTGATTTT CAAGCAGCAT 660 

CTTCTGGTTT AAACTTGTTT GCTGTGAACA ATTGTCGAAA AGAGTCTTCC AATTAATGCT 720 

TTTTTATATC TAGGCTACCT GTTGGTTAGA TTCAAGGCCC CGAGCTGTTA CCATTCACAA 780 
TAAAAGCTTA AACACAT 

Seq ID NO: 454 Protein sequence 
Protein Accession 8> NP_0020B2.1 

1 11 21 31 41 51 

I I I 1 I I 

MRGSELPLVL LALVLCLAPR GRAVPLPAGG GTVLTKMYPR GNHWAVGHLM GKKSTGESSS 60 
VSERGSUCQQ LREYIRWREA ARNLLGLIEA KENRNHQPPQ PKALGNQQPS WDSEDSSNFK 120 
DVGSKGKVGR LSAPGSQREG RNPQLNQQ 

Seq ID NO: 455 DNA sequence 

Nucleic Acid Accession #: NMJJ16522.1 

Coding sequence: 265.. 1299 

1 11 21 31 41 51 

I I I I I I 

GCGGAAGCAG CGAGGAGGGA GCCCCCTTTG GCCGTCCTCC GTGGAACCGG TTTTCCGAGG 60 

CTGG CAAAAG CCGAGGCTGG ATTTGGGGGA GGAATATTAG ACTCGGAGGA GTCTGCGCGC 120 

TTTTCTCCTC CCCGCGCCTC CCGGTCGCCG CGGGTTCACC GCTCAGTCCC CGCGCTCGCT 180 

CCGCACCCCA CCCACTTCCT GTGCTCGCCC GGGGGGCGTG TGCCGTGCGG CTGCCGGAGT 240 

TOGGGGAAGT TGTGGCTGTC GAGAATGGGG GTCTGTGGGT ACCTGTTCCT GCCCTGGAAG 300 

TGCCTCGTGG TCGTGTCTCT CAGGCTGCTG TTCCTTGTAC CCACAGGAGT GCCCGTGCGC 360 

AGCGGAGATG CCACCTTCCC CAAAGCTATG GACAACGTGA CGGTCCGGCA GGGGGAGAGC 420 

GCCACCCTCA GGTGCACTAT TGACAACCGG GTCACCCGGG TGGCCTGGCT AAACCGCAGC 480 

ACCATCCTCT ATGCTGGGAA TGACAAGTGG TGCCTGGATC CTCGCGTGGT CCTTCTGAGC 540 

AACACCCAAA CGCAGTACAG CATCGAGATC CAGAACGTGG ATGTGTATGA CGAGGGCCCT 600 

TACACCTGCT CGGTGCAGAC AGACAACCAC CCAAAGACCT CTAGGGTCCA CCTCATTGTG 660 

CAAGTATCTC CCAAAATTGT AGAGATTTCT TCAGATATCT CCATTAATGA AGGGAACAAT 720 

ATTAGCCTCA CCTGCATAGC AACTGGTAGA CCAGAGCCTA CGGTTACTTG GAGACACATC 780 
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TCTCCCAAAG 
CGGGAACAGT 
CGGAGAGTAA 
GTCCCCGTGG 
TTCCAGTGGT 
AACAGACCTT 
TACACTTGCG 
CCAGGCGCCG 
CTGCCTCTTC 
CGGGAAAGGC 
CCAATCAGAT 
GGGAGGGGAA 
CCTTGCAGAT 
CCCGGCTTGG 
GGCTCAGCCT 
AAATTCAATC 
CCGGCCCAAG 
GAAACGTGAA 



CAGGGGACTA 
AGGTCACCGT 
GACAAAAGGG 
ACAAGGATGA 
TCCTCTCAAA 
TGGCCTCCAA 
TCAGCGAGGT 
TGGTCTTGCA 
TGCCGCCACC 
ATATACAAAT 
CAAAGAATAC 
ATTTAGGTAC 
ACCCACTGCA 
CTCTGCCCAC 
AGTCCATAGA 
CGTGGCGCTG 
ATAAAAAGAG 



TGTGAGTGAA 
CGAGTGCAGT 
GAACTATCCA 
GACACTGCAG 
CAAAAGACTG 
ACTCATCTTC 
CAAGCTGGGC 
GAGCAACGGC 
CCTGCTTCTC 
ACCACCACCA 
GAAATTAGAA 
TTTGGGGGGA 
AATGGAGTTT 
AGCTGCATCG 
AGACTGCCCC 
GACGAACAGA 
CGGGCACTTT 
CAAAAAAAAA 



GACGAATACT 
GCCTCCAATG 
CCATACATTT 
TGTGAAGCCT 
ATTGAAGGAA 
TTCAATGTCT 
CACACCAATG 
ACGTGGAGGA 
AAATTTTGAT 
ACACAACAGC 
GAAACACAGC 
AAAGAGTTTT 
TCTTTTCCCA 
TGCAACCTCT 
CACGTGGAAC 
ATGAGACCTT 
GGTAGACTGT 
AAAAAAAAA 



TGGAAATTCA 
ACGTGGCCGC 
CAGAAGCCAA 
CAGCAGTCCC 
AGAAAGGGGT 
CTGAACATGA 
CCAGCATCAT 
GGGCAGGCTG 
GTGAGTGCCA 
AATGGCAACA 
CTCATGGGAC 
AAAAAAGAAA 
AACGGGAAGA 
TTGGTGCCAG 
ATTCTGGAGC 
CCGGCCCAAG 
GCCACCACGG 



GGGCATCACC 
GCCCGTGGTA 
GGGTACAGGT 
CTCAGCAGAA 
GAAAGTGGAA 
CTATGGGAAC 
GCTATTTGGT 
CGTCTGGCTG 
CTTCCCCACC 
CCGACAGCAA 
AGAAATTTGA 
TTGAAAATTG 
ACACAGCACA 
TGTGGGCAAG 
TGGCCATCCC 
CGTGGCGCTT 
CGTGTGTTGT 



840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 



Seq ID NO: 456 Protein sequence 
Protein Accession #: NP_0S7606.l 



1 
I 

MGVCGYLFLP 
NRVTRVAWLN 
NHPKTSRVHL 
SEDEYLEIQG 
LQCEASAVPS 
LGHTNASIML 



11 
I 

WKCLWVSLR 
RSTILYAGND 
IVQVSPKIVE 
ITREQSGDYE 
AEFQWYKDDK 
FGPGAVSEVS 



21 
I 

LLFLVPTGVP 
KWCLDPRWL 
ISSDISINEG 
CSASNDVAAP 
RLIEGKKGVK 
NGTSRRAGCV 



31 
I 

VRSGDATFPK 
LSNTQTQYSI 
NNISLTCIAT 
WRRVKVTVN 
VENRPFLSKIi 
WLLPLLVLHL 



41 



51 



AMDNVTVRQG ESATLRCTID 
BIQNVDVYDE GPYTCSVQTD 
GRPEPTVTWR HISPKAVGFV 
YPPYISEAKG TGVPVGQKGT 
IFFNVSEHDY GNYTCVASNX 
LLKF 



Seq ID NO: 457 DNA sequence 

Nucleic Acid Accession ft: NM_012261.I 

Coding sequence? 203.. 1045 



GATTTGCTCT 
ACAGAATACG 
CACTCCAGCG 
CCTCATTCGG 
ACTTCGAGTT 
GGAAAATCTC 
TGGGACGACG 
GGCCAGCAAC 
TGAGGTGAAG 
CGCATATGCA 
GGCGACTTGG 
CAAAGAGGCA 
CACCCCCGCT 
TGATCCGCAG 
TATCTCAGAT 
GGAAGAAACC 
CGOGATTTAC 
ATCCCAGTAT 
CCAACTGGAT 
CATAGCTACA 
AACCCACGGA 
ATGCTGGGGA 
TGACTCTCCA 
TTGAAAACAT 
TGCTCCCTTG 
TCATGCTCCC 
GTTTAGTGAT 
AAAACGACTA 
GGGGGACCTG 
TTCTCTGGC 



11 
I 

GCCAGCAGCT 
CGCTCCCTCC 
GCGACTTTGA 
GGCACTGCGA 
CTCCTGATGT 
TCAGGCCTTT 
TGTCTCATGG 
TACGTAGATC 
GGCCGCTGTG 
CTCAAAATGC 
AGGCTGAGCA 
GTCAGTGCTG 
GGGAAGTCCT 
AAGACGGTCA 
TTTGTCTTCA 
TTGCCCCTGA 
CACGTCCACC 
AAGCACATGG 
CAGGTAGAAC 
ATCAAACAGG 
AGGGGGAGAC 
GGAGGGGAGG 
AAGAGCAATA 
GCTTCTTTGA 
GACACAGCTG 
TGCAGCAAGA 
TGTCTTGGGA 
ATGTAACTAT 
AAGAATCAAT 



21 
I 

GTCGGTGCCG 
CTCCCCCTTC 
GGGATTCCCT 
GTATGGATCT 
TGTTCCATAC 
CCACTAACCC 
CAGAGTTTGC 
TGATCACAGA 
GCCACAGCCA 
TCTTTGTAAA 
AAGTGCAGTT 
GGAAGCACAC 
ATGAGTGTCA 
CCATGATCCT 
GTGAAGAGCA 
TTTTGGGGCT 
ACAAAATGAC 
GCTAGAGGCC 
AACAAAAGCA 
CCTGGGTATC 
TCTTTCGGAT 
AGGGTCTCAG 
AATGCCACTT 
GGAGGAAACC 
GCTTATCCTA 
CCCCTGAAAG 
ATGTTTCACT 
GCAGAGTTGT 
CTGTGTGAGT 



31 
I 

CGCTCGACAC 
TCTGTCCCCC 
CTCTGGCGGC 
CCAAGGAAGA 
AATGGCTCAA 
TGAAAAAGAT 
AGCCAAATTT 
ACAGGCCGAT 
GTCGGAGCTG 
GGAAAGCCAC 
TGTCTACGAC 
AGCCAACTCG 
AGCTCAACAA 



TAAATGCCCA 
CATCTTGGGC 
TGCCAACCAG 
GTTAGGCAGG 
CTTTTCCATC 
TGAGGCTTGC 
TTGTAGGGTG 
ACAGCTTTCG 
GGAGCTGTAT 
CCTTTAGGTT 
TACAGTTGTC 
TGATTCATGC 
GCTACCCGCA 
TTGGACTTCT 
CTGTTTTTCA 




AAATGGCAAT 
TGCTGATGGT 
CTGGCCCCAA 
CAGAAGAATA 
AATGCACACA 
TTCTGGCTGG 
TCCAGCGACT 
TCCTGTGCCA 
AAATGAAATA 



51 
I 

CTAGGCGCTC 
CACCCOGGCC 
GCACAGCCGG 
GCATCGACAG 
AACAAGAAGT 
TGCGGGAAAA 
ATGATGTGTG 
CCCGGGGAGC 
GGGTGGATCG 
AGGGACCTGA 
AAACCCACTT 
CTGCCTTGGT 
TGGCCTCTAG 
CTTTTGACAT 
GGGAGCAACT 
TGGTAACACT 
CTCGGGACAG 
TCCTGCTCCC 
GATACACCAA 
TCCATGCTTA 
TATTCTCTCC 
GGCTTGGCTT 
AGTTTAGGGA 
TGGGGTGCTT 
GAATACAACC 
CATTCTGCAT 
GCAGCACCAG 
GGTCCAAGTC 
AAACACACTA 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



Seq ID NO: 458 Protein sequence 
Protein Accession #: NPJJ36393.1 

1 11 21 31 41 51 

1 I I I I I 

MDLQGRGVPS IDRLRVLLML FHTMAQIMAE QEVENLSGLS TNPEKDIPW RENGTTCLMA 
EPAAKPIVPY DVWASNYVDI* ITEQADLALT RGAEVKGROG HSQSELQVFW VDRAYALKMI* 
PVKESHNMSK GPEATWRLSK VQFVYDSSEK THFKDAVSAG KHTANSHHLS ALVTPAGKSY 
ECQAQQTISL ASSDPQKTVT MILSAVHIQP FDIISDFVFS EEHKCPVDER EQLEETLPLI 
LGLILGLVIM VTIAIYHVHH KMTANQVQIP RDRSQYKHMG 

Seq ID NO: 459 DNA sequence 

Nucleic Acid Accession ft: NM_001169.1 

Coding sequence: 85. .870 



60 
120 
180 
240 



11 



21 



31 



51 

1134 



WO 03/042661 



I I I I I I 

TAGGAGATAA GAGTATCTTG CACAGCAGGT GCAGGTTTCC CAGCAGCTCA GGCAAGAGTC 60 

CGATGTTTGT GCCATCTGAT CCTGATGTCT GGAGAGATAG CCATGTGTGA GCCTGAATTT 120 

GGCAATGACA AGGCCAGGGA GCCGAGCGTG GGTGGCAGGT GGCGAGTGTC CTGGTACGAA 180 

CGGTTTGTGC AGCCATGTCT GGTCGAACTG CTGGGCTCTG CTCTCTTCAT CTTCATCGGG 240 

TGCCTGTCGG TCATTGAGAA TGGGACGGAC ACTGGGCTGC TGCAGCCGGC CCTGGCCCAC 300 

GGGCTGGCTT TGGGGCTCGT GATTGCCACG CTGGGGAATA TCAGTGGTGG ACACTTCAAC 360 

CCTGCGGTGT CCCTGGCAGC CATGCTGATC GGAGGCCTCA ACCTGGTGAT GCTCCTCCCG 420 

TACTGGGTCT CACAGCTGCT CGGGGGGATG CTCGGGGCTG CCTTGGCCAA GGTGGTGAGT 480 

CCTGAGGAGA GGTTCTGGAA TGCATCTGGG GCGGCCTTTG TGACAGTCCA GGAGCAGGGG 540 

CAGGTGGCAG GGGCGTTGGT GGCAGAGATC ATCCTGACGA CGCTGCTGGC CCTGGCTGTA 600 

TGCATGGGTG CCATCAATGA GAAGACAAAG GGCCCTCTGG CCCCGTTCTC CATCGGCTTT 660 

GCCGTCACCG TGGATATCCT GGCTGGGGGC CCTGTGTCTG GAGGCTGCAT GAATCCCGCC 720 

CGTGCTTTTG GACCTGCGGT GGTGGCCAAC CACTGGAACT TCCACTGGAT CTACTGGCTG 780 

GGCCCACTCC TGGCTGGCCT GCTTGTTGGA CTGCTCATTA GGTGCTTCAT TGGAGATGGG 840 

AAGACCCGCC TCATCCTGAA GGCTCGGTGA GCAGAGCTCG TGGGATTCCT GCTGCTCCAG 900 

GTGTCCTCAG CTCACCTGTC CCAGACTGAG GACAGGGGAG TTCCTGCATT TCCTGCCAGG 960 

GCAGAGGCCC AGAGGAGCGA CCCCCTGCTT CCACTGCTTG GGCCTGCTTT CTCAGATAGA 1020 

CTGACTGCTG AGGAGGCTCT AGGTTCTTGG AATTCCTTTG TGCTCATCAG AGACCCCAGC 1080 

CTGGGGAACA CGCTGCCCGC ACTGCCCAGA GAGCAGTGCA AACACCACAA CACGAGCGTG 1140 

TTTCTTGAGA GGAATGTCCC CGAGTTGGAC AAGGAGGCTG TTTCTGCACA TCAGCTCATT 1200 

TCCCGCACCC CATTTCTTGC TTGATTGCTT TGTTGGGGGC CTGGCCACTT CCTTGCTTCT 1260 
CAAGCTGACA ATTCTCACTT TGCAATAAAT AGTCCAGTGT TTCCTTCAT 



Seq ID NO: 460 Protein sequence 
Protein Accession #: NP_001160.1 

1 11 21 31 41 SI 

I I I I I I 

MSGEIAMCEP EFGNDKAREP SVGGRWRVSW YERFVQPCLV ELLGSALFIF IGCLSVIENG 60 
TDTGLLQPAL AHGLAXGTjVI ATLGNISGGH FNPAVSLAAM LIGGLNLVML LPYWVSQLLG 120 
GMUGAALAKV VSPEERFWNA SGAAFVTVQE QGQVAGALVA EIILTTLLAL AVCMGAINEK 180 
TKGPLAPPSI GFAVTVDILA GGPVSGGCMN PARAFGPAW ANHWNFHWIY WLGPLLAGLL 240 
VGLLIRCFIG DGKTRLILKA R 



Seq ID NO» 461 DNA sequence 
Nucleic Acid Accession #• NM_003226.l 
Coding sequence: 2.. 226 

1 11 21 31 41 51 

I I I I 1 I 

GATGCTGGGG CTGGTCCTGG CCTTGCTGTC CTCCAGCTCT GCTGAGGAGT ACGTGGGCCT 60 
GTCTGCAAAC CAGTGTGCCG TGCCGGCCAA GGACAGGGTG GACTGCGGCT ACCCCCATGT 120 
CACCCCCAAG GAGTGCAACA ACCGGGGCTG CTGCTTTGAC TCCAGGATCC CTGGAGTGCC 180 
TTGGTGTTTC AAGCCCCTGA CTAGGAAGAC AGAATGCACC TTCTGAGGCA CCTCCAGCTG 240 
CCCCTGGGAT GCAGGCTGAG CACCCTTGCC CGGCTGTGAT TGCTGCCAGG CACTGTTCAT 300 
CTCAGTTTTT CTGTCCCTTT GCTCCCGGCA AGCTTTCTGC TGAAAGTTCA TATCTGGAGC 360 
CTGATGTCTT AACGAATAAA GGTCCCATGC TCCACCCG 

Seq ID NO: 462 Protein sequence 
Protein Accession #: NPJ>0321*7.1 

1 11 21 31 41 SI 

I I I I I I 

MLGLVLALLS SSSAEEYVGL SANQCAVPAK DRVDCGYPHV TPKECNNRGC CFDSRIPGVP 60 
WCFKPLTRKT ECTF 

Seq ID NO: 463 DNA sequence 

Nucleic Acid Accession #i NM_002993.1 

Coding sequence*. 64.. 408 

1 11 21 31 41 51 

I I I I I I 

GGCACGAGCC AGTCTCCGOG CCTCCACCCA GCTCAGGAAC CCGCGAACCC TCTCTTGACC 60 

ACTATGAGCC TCCCGTCCAG CCGCGCGGCC CGTGTCCCGG GTCCTTCGGG CTCCTTGTGC 120 

GCGCTGCTCG CGCTGCTGCT CCTGCTGACG CCGCCGGGGC CCCTCGCCAG CGCTGGTCCT 180 

GTCTCTGCTG TGCTGACAGA GCTGCGTTGC ACTTGTTTAC GCGTTACGCT GAGAGTAAA'C 240 

CCCAAAACGA TTGGTAAACT GCAGGTGTTC CCCGCAGGCC CGCAGTGCTC CAAGGTGGAA 300 

GTGGTAGCCT CCCTGAAGAA CGGGAAGCAA GTTTGTCTGG ACCCGGAAGC CCCTTTTCTA 360 

AAGAAAGTCA TCCAGAAAAT TTTGGACAGT GGAAACAAGA AAAACTGAGT AACAAAAAAG 420 

ACCATGCATC ATAAAATTGC CCAGTCTTCA GCGGAGCAGT TTTCTGGAGA TCCCTGGACC 480 

CAGTAAGAAT AAGAAGGAAG GGTTGGTTTT TTTCCATTTT CTACATGGAT TCCCTACTTT 540 

GAAGAGTGTG GGGGAAAGCC TACGCTTCTC CCTGAAGTTT ACAGCTCAGC TAATGAAGTA 600 

CTAATATAGT ATTTCCACTA TTTACTGTTA TTTTACCTGA TAAGTTATTG AACCCTTTGG 660 

CAATTGACCA TATTGTGAGC AAAGAATCAC TGGTTATTAG TCTTTCAATG AATATTGAAT 720 

TGAAGATAAC TATTGTATTT CTATCATACA TTCCTTAAAG TCTTACCGAA AAGGCTGTGG 780 

ATTTCGTATG GAAATAATGT TTTATTAGTG TGCTGTTGAG GGAGGTATCC TGTTGTTCTT 840 

ACTCACTCTT CTCATAAAAT AGGAAATATT TTAGTTCTGT TTTCTTGGGG AATATGTTAC 900 

TCTTTACCCT AGGATGCTAT TTAAGTTGTA CTGTATTAGA ACACTGGGTG TGTCATACCG 960 

TTATCTGTGC AGAATATATT TCCTTATTCA GAATTTCTAA AAATTTAAGT TCTGTAAGGG 1020 

CTAATATATT CTCTTCCTAT GGTTTTAGAT GTTTGATGTC TTCTTAGTAT GGCATAATGT 1080 

CATGATTTAC TCATTAAACT TTGATTTTGT ATGCTATTTT TTCACTATAG GATGACTATA 1140 



1135 



WO 03/042661 



PCT/US02/36810 
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ATTCTGGTCA CTAAATATAC ACTTTAGATA GATGAAGAAG CCCAAAAACA GATAAATTCC 
TGATTGCTAA TTTACATAGA AATGTATTCT CTTGGTTTTT TAAATAAAAG CAAAATTAAC 
AATGATCTGT GCTCTGCAAA GTTTTGAAAA TATATTTGAA CAATTTGAAT AT AAA TT CAT 
CATTTAGTCC TCAAAATATA TACAGCATTG CTAAGATTTT CAGATATCTA TTGTGGATCT 
TTTAAAGGTT TTGACCATTT TGTTATGAGG AATTATACAT GTATCACATT CACTATATTA 
AAATTGCACT TTTATTTTTT CCTGTGTGTC ATGTTGGTTT TTGGTACTTG TATTGTCATT 
TGGAGAAACA ATAAAAGATT TCTAAACCAA AAAAAAAAAA AAAAAAA 

Seq ID NO: 464 Protein sequence 
Protein Accession th NP_002984.1 

1 11 21 31 41 51 

I I I I I I 

MSLPSSRAAR VPGPSGSLCA LLALLLLLTP PGPLASAGPV SAVLTBLRCT CLRVTLRVNP 
KTIGKLQVPP AGPQCSKVEV VASLKNGKQV CLDPEAPFLK KVIQKILDSG NKKN 

Seq ID NO: 465 DNA sequence 
Nucleic Acid Accession flr NM_002038.2 
Coding sequence: 108.. SOO 



1200 
1260 
1320 
1380 
1440 
1500 



1 
I 

GAACCGTTTA 
CTTCTCTCCT 
CGGTATCGCT 
AGAAAAAGTG 
TGGCCGTCGG 
GCATCGCGGC 
GCGGCGTGCC 
GCGTCGTCAT 
GTGAGGAGGA 
TCTTCCAGTT 
GTTCTCACTA 
ACTGCAGCCT 
TACAAGCATG 
CCTAGATGTG 



11 
I 

CTCGCTGCTG 
CCAAGGTCTA 
TTTCTTGTGC 
CTCGGAGAGC 
AGGAGGACTC 
CAACTCGGTG 
CGCCGGGGGG 
AGGTAATATT 
TGAGGAGTAG 
AGGATCTAGA 
TATTGTCCAG 
CCAACTCCTA 
CGCCGACGAT 
AAAACAGAAT 



21 
I 

TGCCCATCTA 
GTGACGGAGC 
TACCTGCTGC 
TCGGACAGCG 
GCAGTCGCCG 
GCTGCCTCGC 
CTAGTGGCCA 
GGTGCCCTGA 
CCAGCAGCTC 
ACTTTGCCTT 
GCTAGAGTGC 
GCCTCAAGTG 
GCCCAGAATC 
AAACTTCACC 



31 
I 

TCAGCAGGCT 
CCGCGCGCGG 
TCTTCACTTG 
GCTCCGGGTT 
GGCTGCCCGC 
TGATGAGCTG 
CGCTGCAGAG 
TGGGCTACGC 
CCAGAACCTC 
TTTTTTTTTT 
AGTGGCTATT 
ATCCTCCTGT 
CAGAACTTTG 
CAGAAAA 



41 

I 

CCGGGCTGAA 
CGCCACCATG 
CAGTGGGGTG 
CTGGAAGGCC 
GCTGGGCTTC 
GTCTGCGATC 
CCTCGGGGCT 
CACCCACAAG 
TTCTTCCTTC 
TTTTTTTTTT 
CACAGATGCG 
CTCAACCTCC 
TCTATCACTC 



51 
I 

GATTGCTTCT 
CGGCAGAAGG 
GAGGCAGGTA 
CTGACCTTCA 
ACCGGOGCCG 
CTGAATGGGG 
GGTGGCAGCA 
TATCTCGATA 
TTGGCCTAAC 
TTTGAGATGG 
AACATAGTAC 
CAAGTAGGAT 
TCCCCAACAA 



Seq ID NOt 466 Protein sequence 
Protein Accession #: NP_002029.3 

1 11 21 31 41 51 

I I I I I I 

MRQKAVSLFL CYLLLPTCSG VEAGKKKCSE SSDSGSGFWK ALTFMAVGGG LAVAGLPALG 
PTGAGIAANS VAASLMSWSA ILNGGGVPAG GLVATLQSLG AGGSSWIGN IGALMGYATH 
KYLDSEEDRE 

Seq ID NO: 467 DNA sequence 

Nucleic Acid Accession #: NMJ>03469.2 

Coding sequence: 92.. 1945 

1 11 21 31 41 51 

I I I I I I 

GAAACGGCCC GAGAAGCTCG CCCGGAGAAC GGGGAGGAAT ATGCTGTGGA GCTCCTCTGC 
CATATAAACA AAAAGAGGAA ATCTTTCAAA CATGGCTGAA GCAAAGACCC ACTGGCTTGG 
AGCAGCCCTG TCTCTTATCC CTTTAATTTT CCTCATCTCT GGGGCTGAAG CAGCTTCATT 
TCAGAGAAAC CAGCTGCTTC AGAAAGAACC AGACCTCAGG TTGGAAAATG TCCAAAAGTT 
TCCCAGTCCT GAAATGATCA GGGCTTTGGA GTACATAGAA AACCTCCGAC AACAAGCTCA 
TAAGGAAGAA AGCAGCCCAG ATTATAATCC CTACCAAGGT GTCTCTGTCC CCCTTCAGCA 
AAAAGAAAAT GGCGATGAAA GCCACTTGCC CGAGAGGGAT TCACTGAGTG AAGAAGACTG 
GATGAGAATA ATACTCGAAG CTTTGAGACA GGCTGAAAAT GAGCCTCAGT CTGCACCAAA 
AGAAAATAAG CCCTATGCCT TGAATTCAGA AAAGAACTTT CCAATGGACA TGAGTGATGA 
TTATGAGACA CAGCAGTGGC CAGAAAGAAA GCTTAAGCAC ATGCAATTCC CTCCTATGTA 
TGAAGAGAAT TCCAGGGATA ACCCCTTTAA ACGCACAAAT GAAATAGTGG AGGAACAATA 
TACTCCTCAA AGCCTTGCTA CATTGGAATC TGTCTTCCAA GAGCTGGGGA AACTGACAGG 
ACCAAACAAC CAGAAACGTG AGAGGATGGA TGAGGAGCAA AAACTTTATA CGGATGATGA 
AGATGATATC TACAAGGCTA ATAACATTGC CTATGAAGAT GTGGTCGGGG GAGAAGACTG 
GAACCCAGTA GAGGAGAAAA TAGAGAGTCA AACCCAGGAA GAGGTGAGAG ACAGCAAAGA 
GAATATAGGA AAAAATGAAC AAATCAACGA TGAGATGAAA CGCTCAGGGC AGCTTGGCAT 
CCAGGAAGAA GATCTTCGGA AAGAGAGTAA AGACCAACTC TCAGATGATG TCTCCAAAGT 
AATTGCCTAT TTGAAAAGGT TAGTAAATGC TGCAGGAAGT GGGAGGTTAC AGAATGGGCA 
AAATGGGGAA AGGGCCACCA GGCTTTTTGA GAAACCTCTT GATTCTCAGT CTATTTATCA 
GCTGATTGAA ATCTCAAGGA ATTTACAGAT ACCCCCAGAA GACTTAATTG AGATGCTCAA 
AACTGGGGAG AAGCCGAATG GATCAGTGGA ACCGGAGCGG GAGCTTGACC TTCCTGTTGA 
CCTAGATGAC ATCTCAGAGG CTGACTTAGA CCATCCAGAC CTGTTCCAAA ATAGGATGCT 
CTCCAAGAGT GGCTACCCTA AAACACCTGG TCGTGCTGGG ACTGAGGCCC TACCAGACGG 
GCTCAGTGTT GAGGATATTT TAAATCTTTT AGGGATGGAG AGTGCAGCAA ATCAGAAAAC 
GTCGTATTTT CCCAATCCAT ATAACCAGGA GAAAGTTCTG CCAAGGCTCC CTTATGGTGC 
TGGAAGATCT AGATCGAACC AGCTTCCCAA AGCTGCCTGG ATTCCACATG TTGAAAACAG 
ACAGATGGCA TATGAAAACC TGAACGACAA GGATCAAGAA TTAGGTGAGT ACTTGGCCAG 
GATGCTAGTT AAATACCCTG AGATCATTAA TTCAAACCAA GTGAAGCGAG TTCCTGGTCA 
AGGCTCATCT GAAGATGACC TGCAGGAAGA GGAACAAATT GAGCAGGCCA TCAAAGAGCA 
TTTGAATCAA GGCAGCTCTC AGGAGACTGA CAAGCTGGCC CCGGTGAGCA AAAGGTTCCC 
TGTGGGGCCC CCGAAGAATG ATGATACCCC AAATAGGCAG TACTGGGATG AAGATCTGTT 
AATGAAAGTG CTGGAATACC TCAATCAAGA AAAGGCAGAA AAGGGAAGGG AGCATATTGC 



60 
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TAAGAGAGCA 
CACCCCAAGC 
TAATATCTTT 
TCCTCTGAGC 
ATTTATTATG 
GGCTGTGGCA 
TTTTTAATAT 
AAAAAAATAA 



ATGGAAAATA 
AAATCCCAAC 
AAATGATGTA 
TGTTATCTTG 
TCCATTATTC 
TTGTTGATGC 
TTATTGAATT 
AGCATTATAA 



TGTAAGCTGC 
ATTTCTCTTC 
CAGGCAGATG 
TGTATGGATA 
AAGAAAGATA 
TCACATATGA 
ATTTTGTTAC 
ATATA 



TTTCATTAAT 
AGTGTGTTGA 
AAACCAGGTC 
TGTGTAAATG 
TCTATGACTG 
TAAAAAAGTG 
TGTCTGTAGC 



TACCCTACTT 
CTTCTATCCT 
ACTGGGGAGT 
TTATGACTCC 
TGTTTAATAG 
TCCTATAATT 
GTTTTGTGGA 



TCATTCCTCC 
GTTAACACTG 
CTGCTTCATT 
TTGATAAAAA 
TATATCTAAT 
CTATTGAAAG 
GTACTGGACC 



1980 
2040 
2100 
2160 
2220 
2280 
2340 



Seq ID NO: 468 Protein sequence 
Protein Accession #: NP 003460.1 



MAEAKTHWLG 
YIENLRQQAH 
AENEPQSAPK 
RTNE1VEEQY 
YEDWGGEDW 
DQLSDDVSKV 
PPEDLIEMLK 
RAGTEALPDG 
AAWIPHVENR 
EQIEQAIKEH 
KAEKGREHIA 



11 

1 

AALSLIPLIF 
KEESSPDYNP 
ENKPYALNSE 
TPQSLATLES 
NPVEEKIESQ 
IAYLKRLVNA 
TGEKPNGSVE 
LSVEDILNLL 
QMAYENLNDK 
LNQGSSQETD 
KRAMENM 



21 
I 

LISGAEAASF 
YQGVSVPLQQ 
KNPPMDMSDD 
VFQELGKLTG 
TQEEVRDSKE 
AGSGRLQNGQ 
PERELDLPVD 
GMESAANQKT 
DQELGEYLAR 
KLAPVSKRPP 



31 
I 

QRNQLLQKEP 
KENGDESHLP 
YETQQWPERK 
PNNQKRERMD 
NIGKNEQIND 
NGERATRLPE 
LDDISEADLD 
SYFPNPYNQE 
MLVKYPEIXN 
VGPPKKDDTP 



41 
I 

DLRLENVQKF 
ERDSLSEEDW 
LKHMQFPPMY 
EEQKLYTDDE 
EMKRSGQLGI 
KPLDSQSIYQ 
HPDLFQNRML 
KVLPRLPYGA 
SNQVKRVPGQ 
NRQYWDEDLL 



51 
I 

PSPEMIRALE 
MRIILEALRQ 
EENSRDNPFK 
DDIYKANNIA 
QEEDLRKESK 
LIEISRNLQI 
SKSGYPKTPG 
GRSRSNQLPK 
GSSEDDLQEE 
MKVLEYLNQE 



Seq ID NO: 469 DNA sequence 
Nucleic Acid Accession «: NM_006398. 
Coding sequence: 19.. 516 



1 
I 

GGCCCCTTGT 
GAGGAATGGG 
GAACATGTCC 
AAGATCTTAA 
CACCTTACCC 
TCAGGTGATG 
GTGAAAGCAA 
AATGGAAAGA 
TTACTCTTCC 
AGGGGTCAAA 
TCCCAAAATT 
AGTATATTGC 
TTTACTAATG 



11 
I 

CTGCAGAGAT 
ATTTAATGAC 
GGTCTAAGAC 
AGCCACGGAG 
TGAAAGTGGT 
AGGCAAAGAG 
TGATCGAGAC 
GACTGGAAGA 
TGGCATCTTA 
AAGCTTATTT 
AATGAGAATG 
CCAACTCTAT 
TATTACTGAG 



21 
I 

GGCTCCCAAT 
CTTTGATGCC 
CAAGGTTCCT 
AAGCCTCTCA 
GAAGCCCAGT 
GCACCTCCTC 
TAAGACGGGT 
TGGGAAGATG 
TTGTATTGGA 
CTTTTAATCT 
AGATGAGTAG 
GTTTCTTTGA 
ACTAGTAAAT 



31 
I 

GCTTCCTGCC 
AACCCATATG 
GTGCAGGACC 
TCTTATGGCA 
GATGAGGAGC 
CAGGTGCGAA 
ATAATCCCTG 
ATGGCAGATT 
GGGTGACCAC 
CTTACTCAAC 
AGTAAGATTT 
TTCTAACACA 
AAATTTTTAA 



41 
I 

TCTGTGTGCA 
ACAGCGTGAA 
AGGTTCTTTT 
TTGACAAAGA 
TGCCCTTGTT 
GGTCCAGCTC 
AGACCCAGAT 
ACGGCATCAG 
CCTGGGGATG 
GAACACATCT 
GGGTGGGATG 
ATTAATTAAG 
GGCAAAATAG 



11 



ATGAAGGCCA 
TCCCTGCAGC 
AACTCGGCGC 
TTCCTCTGCC 
CTGGCCAACC 
TGCGCATCGC 
CGCGCGGCCG 
CGCACGCTGG 
CTCCTGGCTG 
CCGGCGTTGC 
GCCGCCCGCG 
GTGGCCCGAG 



CAGCCCGCGC 
GTCACACCTG 
GCGAGGAGGC 



ACTACAGCGC 



TGGTGCTGCT 
ACCTGGCCTT 
TGCGCGGACC 
TGGCTCTGGG 
CAGTGTGCCG 
CCAGCGCCTC 
AGCGGCOGCT 
TCAAGCTGGC 
TGGTCATCCT 
CTGTCTGTTG 
ACCTGACAGC 
AGCGCTACAA 
CTCTCAACCC 
TGCTGCGGAG 



21 
I 

AGAGGAGCGC 
OGCCCTTGTC 
GGCGGTGCGC 
GGTAGACGCG 
AGCGCTCTGG 
TTCGGCCGAA 
CCCGCTGOGC 
CTGGCTAAGC 
GTGCGCGCCC 
CTGCGGAGGC 
GCTGCTGCCG 
CATGCGGTTC 
CGTCTGCCTG 
CCAGGCACGG 
GCTCATCTAC 
TCTGGGGAGA 



31 
I 

TTTCTCCTGC 
CTCCTGTGCT 
GACCCGCGCC 
GGCTTCACTA 
CTGCCGCGCA 
TGCGTCCTCC 
TATGOGGGGC 
GGCCTCACCA 
CGCCTGCTGG 
GAOGGAGACA 
TTTGCCGTCA 
AGCGGAGGCC 
TTCTACGGCT 
GGCAAGTTCG 
ACCCTCAGGA 
GGCCAGGCTG 



41 

! 

TGGGTTTCTC 
ACCTOCTGAC 
TGCACACGCC 
CTAGCGTGGT 
GCCACTGCAC 
TGGCGGTGAT 
TCGTCTCCCC 
ACTCGGTTGC 
ACCACTTCAT 
CTACCGAGAA 
TCCTGGCCTC 
GGAGGAGGGC 
CGGCCATCTA 
TATCGCTCTT 
ATAAGAAAGT 
GGCAGTGA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



51 
I 

TGTCCGTTCC 
AAAAATCAAA 
GCTGGGCTCC 
GAAGACCATC 
TCTTGTGGAG 
AGTGGCACAA 
TGTGACTTGC 
AAAGGGCAAC 



TCTGATGATT 
GGTAGGATGA 
TGACATGATT 
AGCATTC 



Seq ID NO: 470 Protein sequence 
Protein Accession fl: NP_006389.1 

1 11 21 31 41 51 

I I I I I I 

MAPNASCLCV HVRSEEWDLM TFDANPYDSV KKIKEHVRSK TKVPVQDQVL LLGSKILKPR 
RSLSSYGIDK EKTIHLTLKV VKPSDEELPL PLVESGDEAK RHLLQVRRSS SVAQVKAMIE 
TKTGIIPETQ IVTCNGKRIjE DGKMMADYGI RKGNLLPLAS YCIGG 

Seq ID NO: 471 DNA sequence 

Nucleic Acid Accession #: XM_094741.1 

Coding sequence : 1 . . 948 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



51 
I 

CGACTGGCCT 
CTTGACGGGC 
CATGTACTAC 
GCCGCCGCTG 
GGCCCAGCTG 
GGCTCTGGAC 
GCGCCTATGT 
GCAAACCGCG 
CTGTGAGCTG 
CCAGATGTTC 
CTAOGGTGCC 
GGTGGGCACG 
CACCTACCTG 
CTACACCGTG 
GAAGGGGGCA 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



Seq ID NO: 472 Protein sequence 
Protein Accession #: XP_094741.1 

1 -11 21 31 41 51 

I t I I I I 

MKANYSAEER FLLLGPSDWP SLQPVLFALV LLCYLLTLTG NSALVLLAVR DPRLHTPMYY 



60 



1137 



WO 03/042661 



PCT/US02/36810 
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10 

15 

20 
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45 
50 
55 
60 
65 
70 
75 
80 



FLCHLALVDA GFTTSWPPL LANLRGPALW LPRSHCTAQL CASLALGSAE CVLLAVMALD 
RAAAVCRPLR YAGLVSPRLC RTLASASWLS GLTNSVAQTA LLAERPLCAP RLLDHFICEL 
PALLKLACGG DGDTTENQMF AARWILLLP PAVILASYGA VARAVCCMRF SGGRRRAVGT 
CGSHLTAVCL FYGSAIYTYL QPAQRYNQAR GKFVSLFYTV VTPALNPLIY TLRNKKVKGA 
ARRLLRSLGR GQAGQ 

Seq ID NO: 473 DNA sequence 

Nucleic Acid Accession ih NM_001062.1 

Coding sequence: 76.. 1380 



120 
180 
240 
300 



GCTCTCATTA 
TACACTGTTG 
TCTTTTATTC 
CTAAAACCTC 
AATGTTGTGT 
ATCCAACAAA 
GCCTTGATTA 
TACCACCTGA 
CACAATGGCA 
CTGTTCAATG 
AACTATTATT 
ACCTGTGTGA 
AACATCAGTA 
GGTCTCATTG 
GACTATTATA 
ATTTCTCAAG 
GGAAAQACCT 
AACATCTCCG 
GTCAATTACT 
TCTGTCTTCC 
ACAATGGAGG 
AATAATGACA 
GGTAGTTACG 
GCCCAAACTT 
TTATGCCTTC 
TCTCTACATG 



11 
I 

CCTTCTGCCC 
GAGAGATGAG 
CAAGCCAACT 
TGTTGAATAC 
TGTCCCTCAA 
TCAAATACAA 
TACTGGCTTT 
CTGACAAGCT 
CTCCCCTGAC 
GGAACTACTC 
TTGGTAGCCA 
AGAAGAGTCT 
TTTATACAAA 
GAAACACATT 
ATGAAAATGA 
GAGCATTCAG 
TCTTGGATAT 
CTGATGAGCC 
CTGTGAGAAT 
TCAGTGTGAT 
AGCGCTCATG 
GAACCTACTG 
TTGTCCGCAA 
TCCTCAGCTG 
TTCTTCATTT 
TTCAATAAAA 



21 
I 

ATCACTTAAT 
ACAGTCACAC 
ATGCGAGATT 
AATGATCCAG 
ACTTGTTGGA 
TGTGAAAAGC 
GGGAGTATGT 
AGAAAATAAA 
TAACTACTAC 
AACCGCCGAA 
GTTCTCAGTA 
AATAAATGGG 
GTCACTGGTA 
TA6CACAGGA 
CTGGAATTGC 
TAATCCAAAC 
TAACAAAGAC 
TATAACTGTG 
CAATGAAACA 
GGAGAAAGCC 
GGGGCCCTAT 
GGAACTTCTG 
TGGAGAAAAC 
CATAAAATCC 
ATCCCAGTAC 
GTTGTTGAAA 



31 
I 

AAATAGCCAG 
CAGCTGCCCC 
TGTGAGGTAA 
TCAAACTATA 
ATCCAGATCC 
AGATTGTCAG 
CGTAACGCTG 
TTCCAAGCAG 
CAGCTCAGCC 
GTTGTCAACC 
GATACTGGTG 
CAGATCAAAG 
GAAAAGATTC 
GAAGCCATGC 
CAACAAACTC 
GCTGCAGCCC 
TCTTCTTGCG 
ACACCTCCTG 
TATTTCACCA 
CAGAAAATGA 
ATCACCTGTA 
AGTGGAGGCG 
TTGGAGGTTC 
ATTTGCAGTG 
GAGCAGGAGA 
GATTAAC 



41 
I 

CCAATTCATC 
TAGTGGGGCT 
GTGAAGAAAA 
ACAGGGGAAC 
AAACCCTGAT 
ATGTAAGCTC 
AGGAAAACTT 
AAATTGAAAA 
TGGACGTTTT 
ACTTCACTCC 
CAATGGCTGT 
CAGATGAAGG 
TGTCTGAGAA 
AGGCCCTCTT 
TGAATACAGT 
AGGTCTTACC 
TCTCTGCTTC 
ACTCACAATC 
ATGTCACTGT 
ATGATACTAT 
TTCAGGGCCT 
AACCACTGAG 
GCTGGAGCAA 
GAGTTCCATG 
GTTAATAACC 



51 
I 

AACATTCTGG 
CTTACTGTTT 
CTACATCCGC 
CAGCGCTGTC 
GCAAAAGATG 
GGGAGAGCTT 
AATATATGAT 
TATGGAAGCA 
GGCCTTGTGT 
TGAAAATAAA 
CCTGGCTCTG 
CAGTTTAAAG 
AAAAGAAAAT 
TGTATCATCA 
GCTCACGGAA 
TGCCCTGATG 
AGGTAACTTC 
ATATATCTCC 
GCTAAATGGT 

ATGTGCCAAC 
CCAAGGAGCT 
ATACTAATAA 
TTTATTGTCC 
TCCCCTTCTC 



Seq ID NO: 474 Protein sequence 
Protein Accession #: NP 001053.1 



KRQSHQLPLV 
LKLVGIQIQT 
KLENKFQAEI 
SQFSVDTGAM 
TFSTGEAMQA 
DINKDSSCVS 
VMEKAQKMND 
RNGENLEVRW 



11 
I 

GL.LLFSFIPS 
LMQKMIQQIK 
ENMEAHNGTP 
AVLALTCVKK 
LFVSSDYYNE 
ASGNFNISAD 



21 
I 

QLCEICEVSE 
YNVKSRLSDV 
LTNYYQLSLD 
SLINGQIKAD 
NDWNCQQTLN 
EPITVTPPDS 
SWGPYITCIQ 



31 
I 

ENYIRLKPLL 
SSGELALIIL 
VIALCLFNGN 
EGSLKNISIY 
TVLTEISQGA 
QSYISVNYSV 
GLCANNNDRT 



41 
I 

NTMIQSNYNR 
ALGVCRNAEE 
YSTAEWNHF 
TKSLVEK.ILS 
FSNPNAAAQV 
RINSTYFTNV 
YWELLSGGEP 



SKY 



Seq ID NO: 475 DNA sequence 

Nucleic Acid Accession #: NM_004852.1 

Coding sequence: 89.. 1546 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 



GCCCCCGCCC 
GCCTCACCAA 
CACTTTGCAC 
CGGCGGCGGG 

GCTGGGCACG 
GGCCTCGATC 
CATGAGCATG 
GCTGACACCG 
TCACCCGCAC 
CGTCAGCGGC 
CTACAGTCCC 
GCCGCTGGGC 
TCCGCCGGGC 
GACCCGCGGT 
GCACCTGAAC 
CAGTCGCGAG 
GGAAGAAATC 
TATCCCCCAG 
CCTGCTCCGG 
GATGTGGAAG 
GTGCAAACGC 
CCTGGTGTTC 
CCCGTCAAAG 
CAGCAACTTC 



11 
I 

GCCCCGGGCC 
AGACCTAGAA 
GGCGCGCGCG 
GGCCCGGGCC 
CCGCGTGGCT 
GCGGCAGCGG 
CTGGAGGGCG 
TCCTGCGACT 
CTCCAGCCGC 
CACCATCCGC 
AGCTTCACCC 
TACAAGGAGA 
AACGGGCTAG 
CACGACAAAA 
GAGCAACACC 
GGCCTGCACC 
CGGCCACCCT 
AACACCAAAG 
GCGATCTTTG 
AATCCAAAAC 
TGGCTTCAGG 
AAAGAGCAAG 
ACTGACCTCC 
GAGATGCAGA 
TTCATGAACG 



21 
I 

CTGATGGACT 
CGTGCGCCAT 
GCGGCGGCAG 
ATGAGCAGGA 
CGCTGCGGGG 
CGGCAGCGGC 
GCGACTACCG 
CGTCTCCGCC 
TGCCACCCAT 
ACCACCACCA 
TCATGCGCGA 
TGCCCGGCAT 
GCGGCCTCCA 
TGCTCAGCCC 
TGTCCCGCGG 
ACCCGGGCCA 
CGTCCTCATC 
AGGTGGCCCA 
CGCAGAGGGT 
CGTGGAGTAA 
AGCCCGAGTT 
AACCAAACAA 
AACGCCGAAC 
TCACCATTTC 
CCCGGCGCCG 



31 
I 

GAATGAAGGC 
GAACCCGGAG 
TGGCGGGGGC 
GCTGCTGGCC 
CCCTCCGCCG 
GGCGTCGCGC 
GCCCGAGCTC 
TGGCATGGGC 
CTCCACCGTG 
CCACCACCAC 
CGAGCGCGGG 
GAGCCAGAGC 
CAACGCGCAG 
CAACTTCGAC 
CCTGGGCACC 
CACTCAGTCT 
GGGCTCGCAG 
GCGCATCACA 
GCTGTGCCGG 
ACTCAAATCT 
CCAGCGCATG 
AGACAGGAAC 
ACTCTTCGCC 
CCAGCAGCTG 
CAGCCTGGAG 




CAGAGTCTGC 
GCGCACCACA 
CCACCTGCGG 
CACGGGCCGG 
GTGGCCACGT 
GCGGAGCTGA 
TCTCAGGGGA 
GGCAGGGAGA 
TCCGCCTTAC 
AATTCCCAGA 
ATCTTCAAGG 
GGCCTGGAGC 
AAGTGGCAAG 



51 
I 

GTSAVNWLS 
NLIYDYHLTD 
TPENKNYYFG 
EKKENGLIGN 
LPALMGKTFli 
TVLNGSVPLS 
LSOGAGSYW. 



51 
I 

GCCTATCGAT 
AAAGTCTGGG 
GCGGCGGGGG 
CCCACCACGC 
CGCACCAGGA 
TCACCAGCAT 
TGCACCACGC 
CCTACACCAC 
TCCACCACCC 
TGTCCGGCAA 
TGAACAACCT 
TGGCCGCCAC 
CCAACTACGG 
CTGCCATGCT 
CCATGATGTC 
TGCTGGCACC 
CGGGCCAGCT 
AGOGCTACAG 
CTCTCTCCGA 
CCTTCCGCAG 
GCCTGGCAGC 
AGAAGTCCCG 
AGAACAAACG 
TCACAACCGT 
ACGATCTGAG 



60 
120 
180 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 



1138 



WO 03/042661 PCT/US02/36810 



CACAGGGGGC TCCTCGTCCA CCTCCAGCAC GTGTACCAAA GCATGATGGA AGGACTCTCA 1560 
CTTGGGCACA AGTCACCTCC AAATGAGGAC AACAGATACC AAAAGAAAAC AAAGGAAAAA 1620 
GACACCGGAT TCCTAGCTGG GGCCCTTCAC TGGTG 



5 Seq ID NO: 476 Protein sequence 
Protein Accession ft: NP 004843.1 



10 



1 11 21 31 41 SI 

I I I I I I 

MNPELTMESL GTLHGARGGG SGGGGGGGGG GGGGGPGHEQ ELLASPSPHH ARRGPRGSLR 60 

GPPPPPTAHQ ELGTAAAAAA AASRSAMVTS MASILDGGDY RPELSIPLHH AMSMSCDSSP 120 

PGMGMSNTYT TLTPLQPLPP ISTVSDKFHH PHPHHHPHHH HHHHHQRLSG NVSGSFTLMR 180 

DERGLPAMNN LYSPYKEMPG MSQSLSPLAA TPLGNGLGGL HNAQQSLPNY GPPGHDKMLS 240 

PNFDAHHTAM LTRGEQHLSR GLGTPPAAMM SHLNGLHHPG HTQSHGPVLA PSRERPPSSS 300 

15 SGSQVATSGQ LEEINTKEVA QRITAELKRY SIPQAIFAQR VLCRSQGTLS DLLRNPKPWS 360 

KLKSGRBTFR RMWKWLQEPE FQRMSALRLA ACKRKEQEPN KDRNNSQKKS RLVPTDLQRR 420 

TLFAIFKENK RPSKEMQITI SQQLGLELTT VSNFFMNARR RSLEKWQDDL STGGSSSTSS 480 
TCTKA 

20 Seq ID NO: 477 DNA sequence 

Nucleic Acid Accession «: NM_013271.1 
Coding sequence: 27. .809 

„ 1 11 21 31 41 51 

25 | | | | | | 

TCCGGAGCCA GGCTCGCTGG GGCAGCATGG CGGGGTCGCC GCTGCTCTGG GGGCCGCGGG 60 

CCGGGGGCGT CGGCCTTTTG GTGCTGCTGC TGCTCGGCCT GTTTCGGCCG CCCCCCGCGC 120 

TCTGCGCGCG GCCGGTAAAG GAACCCCGCG GCCTAAGCGC AGCGTCTCCG CCCTTGGCTG 180 

AGACTGGCGC TCCTCGCCGC TTCCGGCGGT CAGTGCCCCG AGGTGAGGCG GCGGGGGCGG 240 

30 TGCAGGAGCT GGCGCGGGCG CTGGCGCATC TGCTGGAGGC CGAACGTCAG GAGCGGGCGC 300 

GGGCCGAGGC GCAGGAGGCT GAGGATCAGC AGGCGCGCGT CCTGGCGCAG CTGCTGCGCG 360 

TCTGGGGCGC CCCCCGCAAC TCTGATCCGG CTCTGGGCCT GGACGACGAC CCCGACGCGC 420 

CTGCAGCGCA GCTCGCTCGC GCTCTGCTCC GCGCCCGCCT TGACCCTGCC GCCCTAGCAG 480 

CCCAGCTTGT CCCCGCGCCC GTCCCCGCCG CGGCGCTCCG ACCCCGGCCC CCGGTCTAOG S40 

ACGACGGCCC CGCGGGCCCG GATGCTGAGG AGGCAGGCGA CGAGACACCC GACGTGGACC 600 

CCGAGCTGTT GAGGTACTTG CTGGGACGGA TTCTTGCGGG AAGCGCGGAC TCCGAGGGGG 660 

TGGCAGCCCC GCGCCGCCTC CGCCGTGCCG CCGACCACGA TGTGGGCTCT GAGCTGCCCC 720 

CTGAGGGCGT GCTGGGGGCG CTGCTGCGTG TGAAACGCCT AGAGACCCCG GCGCCCCAGG 780 

TGCCTGCACG CCGCCTCTTG CCACCCTGAG CACTGCCCGG ATCCGGTGCA CCCTGGGACC 840 

CAGAAGTGCC CCCGCCATCC CGCCACCAGG ACTTCTCCCC GCCAGCACGT CCAGAGCAAC 900 

TTACCCCGGC CAGCCAGCCC TCTCACCCGA GGATCCCTAC CCCCTGGCCC ACAATAACAT 960 
GATCTGAGC 



35 



40 



Seq ID NO: 478 Protein sequence 
45 Protein Accession #: NPJ>37403.1 

1 11 21 31 41 51 

I I I I I I 

MAGSPLLWGP RAGGVGLLVL LLLGLFRPPP ALCARPVKEP RGLSAASPPL AETGAPRRFR 60 

50 RSVPRGEAAG AVQELARALA HLLEAERQER ARAEAQEAED QQARVLAQLL RVWGAPRNSD 120 

PALGLDDDPD APAAQLARAL LRARLDPAAL AAQLVPAPVP AAALRPRPPV YDDGPAGPDA 180 

EEAGDETPDV DPELLRYLLG RILAGSADSE GVAAPRRLRR AADHDVGSBL PPEGVLGALL 240 
RVKRLETPAP QVPARRLLPP 

55 Seq ID NO: 479 DNA sequence 

Nucleic Acid Accession #: NMJ>02214 
Coding sequence: 681.. 2990 

1 11 21 31 41 51 

60 | | | | ! | 

CCCAGAGCCG CCTCCCCCTG TTGCTGGCAT CCCGAGCTTC CTCCCTTGCC AGCCAGGACG 60 

CTGCCGACTT GTCTTTGCCC GCTGCTCCGC AGACGGGGCT GCAAAGCTGC AACTAATGGT 120 

GTTGGCCTCC CTGCCCACCT GTGGAAGCAA CTGCGCTGAT TGATGCGCCA CAGACTTTTT 180 

TCCCCTCGAC CTCGCCGGCO TACCCTCCCA CAGATCCAGC ATCACCCAGT GAATGTACAT 240 

TAGGGTGGTT TCCCCCCCAG CTTCGGGCTT TGTTTGGGTT TGATTGTGTT TGGCTCTTCG 300 

CTAAGCTGAT TTATGCAGCA GAAGCCCCAC CGGCTGGAGA GAAACAAAAG CTCTTTTCTT 360 

TGTCCCGGAG CAGGCTGCGG AGCCCTTGCA GAGCCCTCTC TCCAGTCGCC GCCGGGCCCT 420 

TGGCCGTCGA AGGAGGTGCT TCTCGCGGAG ACCGCGGGAC CCGCCGTGCC GAGCCGGGAG 480 

GGCCGTAGGG GCCCTGAGAT GCCGAGCGGT GCCCGGGCCC GCTTACCTGC ACCGCTTGCT 540 

CCGAGCCGCG GGGTCCGCCT GCTAGGCCTG CGGAAAACGT CCTAGCGACA CTCGCCCGCG 600 

GGCCCCGAGG TCGCCCGGGA GGCCGAGCCC GCGTCCGGAA GGCAGCCAGG CGGCGGGCGC 660 

GGGGCGGGCT GTTTTGCATT ATGTGCGGCT CGGCCCTGGC TTTTTTTACC GCTGCATTTG 720 

TCTGCCTGCA AAACGACCGG CGAGGTCCCG CCTCGTTCCT CTGGGCAGCC TGGGTGTTTT 780 

CACTTGTTCT TGGACTGGGC CAAGGTGAAG ACAATAGATG TGCATCTTCA AATGCAGCAT 840 

CCTGTGCCAG GTGCCTTGCG CTGGGTCCAG AATGTGGATG GTGTGTTCAA GAGGATTTCA 900 

TTTCAGGTGG ATCAAGAAGT GAACGTTGTG ATATTGTTTC CAATTTAATA AGCAAAGGCT 960 

GCTCAGTTGA TTCAATAGAA TACCCATCTG TGCATGTTAT AATACCCACT GAAAATGAAA 1020 

TTAATACCCA GGTGACACCA GGAGAAGTGT CTATCCAGCT GCGTCCAGGA GCCGAAGCTA 1080 

ATTTTATGCT GAAAGTTCAT CCTCTGAAGA AATATCCTGT GGATCTTTAT TATCTTGTTG 1140 

ATGTCTCAGC ATCAATGCAC AATAATATAG AAAAATTAAA TTCCGTTGGA AACGATTTAT 1200 

CTAGAAAAAT GGCATTTTTC TCCCGTGACT TTCGTCTTGG ATTTGGCTCA TACGTTGATA 1260 

AAACAGTTTC ACCATACATT AGCATCCACC CCGAAAGGAT TCATAATCAA TGCAGTGACT 1320 

ACAATTTAGA CTGCATGCCT CCCCATGGAT ACATCCATGT GCTGTCTTTG ACAGAGAACA 1380 

TCACTGAGTT TGAGAAAGCA GTTCATAGAC AGAAGATCTC TGGAAACATA GATACACCAG 1440 



65 
70 
75 
80 



1139 



WO 03/042661 



AAGGAGGTTT TGACGCCATG CTTCAGGCAG CTGTCTGTGA AAGTCATATC GGATGGCGAA 1500 
AAGAGGCTAA AAGATTGCTG CTGGTGATGA CAGATCAGAC GTCTCATCTC GCTCTTGATA 1560 
GCAAATTGGC AGGCATAGTG GTGCCCAATG ACGGAAACTG TCATCTGAAA AACAACGTCT 1620 
ACGTCAAATC GACAACCATG GAACACCCCT CACTAGGCCA ACTTTCAGAG AAATTAATAG 1680 
ACAACAACAT TAATGTCATC TTTGCAGTTC AAGGAAAACA ATTTCATTGG TATAAGGATC 1740 
TTCTACCCCT CTTGCCAGGC ACCATTGCTG GTGAAATAGA ATCAAAGGCT GCAAACCTCA 1800 
ATAATTTGGT AGTGGAAGCC TATCAGAAGC TCATTTCAGA AGTGAAAGTT CAGGTGGAAA 1860 
ACCAGGTACA AGGCATCTAT TTTAACATTA CCGCCATCTG TCCAGATGGG TCCAGAAAGC 1920 
CAGGCATGGA AGGATGCAGA AACGTGACGA GCAATGATGA AGTTCTTTTC AATGTAACAG 1980 
TTACAATGAA AAAATGTGAT GTCACAGGAG GAAAAAACTA TGCAATAATC AAACCTATTG 2040 
GTTTTAATGA AACCGCTAAA ATTCATATAC ACAGAAACTG CAGCTGTCAG TGTGAGGACA 2100 
ACAGAGGAGC TAAAGGAAAG TGTGTAGATG AAACTTTTCT AGATTCCAAG TGTTTCCAGT 2160 
GTGATGAGAA TAAATGTCAT TTTGATGAAG ATCAGTTTTC TTCTGAGAGT TGCAAGTCAC 2220 
ACAAGGATCA GCCTGTTTGC AGTGGTCGAG GAGTTTGTGT TTGTGGGAAA TGTTCATGTC 2280 
ACAAAATTAA GCTTGGAAAA GTGTATGGAA AATACTGTGA AAAGGATGAC TTTTCTTGTC 2340 
CATATCACCA TGGAAATCTG TGTGCTGGGC ATGGAGAGTG TGAAGCAGGC AGATGCCAAT 2400 
GCTTCAGTGG CTGGGAAGGT GATCGATGCC AGTGCCCTTC AGCAGCAGCC CAGCACTGTG 2460 
TCAATTCAAA GGGCCAAGTG TGCAGTGGAA GAGGCACGTG TGTGTGTGGA AGGTGTGAGT 2520 
GCACCGATCC CAGGAGCATC GGCCGCTTCT GTGAACACTG CCCCACCTGT TATACAGCCT 2580 
GCAAGGAAAA CTGGAATTGT ATGCAATGCC TTCACCCTCA CAATTTGTCT CAGGCTATAC 2640 
TTGATCAGTG CAAAACCTCA TGTGCTCTCA TGGAACAACA GCATTATGTC GACCAAACTT 2700 
CAGAATGTTT CTCCAGCCCA AGCTACTTGA GAATATTTTT CATCATTTTC ATAGTTACAT 2760 
TCTTGATTGG GTTGCTTAAA GTCCTGATCA TTAGACAGGT GATACTACAA TGGAATAGTA 2820 
ATAAAATTAA GTCCTCATCA GATTACAGAG TGTCAGCCTC AAAAAAGGAT AAGTTGATTC 2880 
TGCAAAGTGT TTGCACAAGA GCAGTCACCT ACCGACGTGA GAAGCCTGAA GAAATAAAAA 2940 
TGGATATCAG CAAATTAAAT GCTCATGAAA CTTTCAGGTG CAACTTCTAA AAAAAGATTT 3000 
TTAAACACTT AATGGGAAAC TGGAATTGTT AATAATTGCT CCTAAAGATT ATAATTTTAA 3060 
AAGTCACAGG AGGAGACAAA TTGCTCACGG TCATGCCAGT TGCTGGTTGT ACACTCGAAC 3120 
GAAGACTGAC AAGTATCCTC ATCATGATGT GACTCACATA GCTGCTGACT TTTTCAGAGA 3180 
AAAATGTGTC TTACTACTGT TTGAGACTAG TGTCGTTGTA GCACTTTACT GTAATATATA 3240 
ACTTATTTAG ATCAGCATAG AATGTAGATC CTCTGAAGAG CACTGATTAC ACTTTACAGG 3300 
TACCTGTTAT CCCTAOGCTT CCCAGAGAGA ACAATGCTGT GAGAGAGTTT AGCATTGTGT 3360 
CACTACAAGG GTACAGTAAT CCCTGCACTG GACATGTGAG GAAAAAAATA ATCTGGCAAG 3420 
TATATTCTAA GGTTGCCAAA CACTTCAACA GTTGGTGGTT GAATAGACAA GAACAGCTAG 3480 
ATGAATAAAT GATTCGTGTT TCACTCTTTC AAGAGGTGAA CAGATACAAC CTTAATCTTA 3540 
AAAGATTATT GCTTTTTAAA GTGTGTAGTT TTATGCATGT GTGTTTATGG TTTGCTTATT 3600 
TTTGCAAGAT GGATACTAAT TCCAGCATTC TCTCCTCTTT GCCTTTATGT TTTGTTTTCT 3660 
TTTTTACAGG ATAAGTTTAT GTATGTCACA GATGACTGGA TTAATTAAGT GCTAAGTTAC 3720 
TACTGCCATA AAAAACTAAT AATACAATGT CACTTTATCA GAATACTAGT TTTAAAAGCT 3780 
GAATGTTAA 

Seq ID NO: 480 Protein sequence 
Protein Accession #: NPJJ02205 

1 11 21 31 41 51 

I I I I I I 

MCGSALAFFT AAFVCLQNDR RGPASFLWAA WVFSLVLGLG QGEDNRCASS NAASCARCLA 60 
LGPECGWCVQ EDP1SGGSRS ERCDIVSNLI SKGCSVDSIE YPSVHVIIPT ENBINTQVTP 120 
GEVSIQLRPG AEANFMLKVH PLKKYPVDLY YLVDVSASMH NNIEKLNSVG NDLSRKMAFF 180 
SRDPRLGFGS YVDKTVSPYI SIHPERIHNQ CSDYNLDCMP PHGYIHVLSL. TENITEFEKA 240 
VHRQKISGNI DTPEGGFDAM LQAAVCESHI GWRKEAKRLL LVMTDQTSHL ALDSKLAGIV 300 
VPNDGNCHLK NNVYVKSTTM EHPSLGQLSE KLIDNNINVI FAVQGKQFHW YKDLLPLLPG 360 
TIAGBIESKA ANLNNLWEA YQKLISEVKV QVENQVQGIY FNITAICPDG SRKPGMEGCR 420 
NVTSNDEVLP NVTVTMKKCD VTGGKNYAII KPIGFNETAK IHIHRNCSCQ GEDNRGPKGK 480 
CVDBTPLDSK CFQCDENKCH PDBDQPSSES CKSHKDQPVC SGRGVCVCGK CSCHKIKLGK 540 
VYGKYCEKDD PSCPYHHGNL CAGHGECEAG RCQCFSGWEG DRCQCPSAAA QHCVNSKGQV 600 
CSGRGTCVCG RCECTDPRSI GRFCEHCPTC YTACKENWNC MQCLHPHNLS QAILDQCKTS 660 
CALMEQQKYV DQTSECFSSP SYLRIPPIIP IVTPLIGLLK VLIIRQVILQ WNSNKIKSSS 720 
DYRVSASKKD KLILQSVCTR AVTYRREKPE EIKMDISKLN AHETFRCNF 

Seq ID NOt 481 DNA sequence 

Nucleic Acid Accession #: NM_003318.1 

Coding sequence: 1..2574 

1 11 21 31 41 51 

1 I I I I I 

ATGGAATCCG AGGATTTAAG TGGCAGAGAA TTGACAATTG ATTCCATAAT GAACAAAGTG 60 

AGAGACATTA AAAATAAGTT TAAAAATGAA GACCTTACTG ATGAACTAAG CTTGAATAAA 120 

ATTTCTGCTG ATACTACAGA TAACTCGGGA ACTGTTAACC AAATTATGAT GATGGCAAAC 180 

AACCCAGAGG ACTGGTTGAG TTTGTTGCTC AAACTAGAGA AAAACAGTGT TCCGCTAAGT 240 

GATGCTCTTT TAAATAAATT GATTGGTCGT TACAGTCAAG CAATTGAAGC GCTTCCCCCA 300 

GATAAATATG GCCAAAATGA GAGTTTTGCT AGAATTCAAG TGAGATTTGC TGAATTAAAA 360 

GCTATTCAAG AGCCAGATGA TGCACGTGAC TACTTTCAAA TGGCCAGAGC AAACTGCAAG 420 

AAATTTGCTT TTGTTCATAT ATCTTTTGCA CAATTTGAAC TGTCACAAGG TAATGTCAAA 480 

AAAAGTAAAC AACTTCTTCA AAAAGCTGTA GAACGTGGAG CAGTACCACT AGAAATGCTG 540 

GAAATTGCCC TGCGGAATTT AAACCTCCAA AAAAAGCAGC TGCTTTCAGA GGAGGAAAAG 600 

AAGAATTTAT CAGCATCTAC GGTATTAACT GCCCAAGAAT CATTTTCCGG TTCACTTGGG 660 

CATTTACAGA ATAGGAACAA CAGTTGTGAT TCCAGAGGAC AGACTACTAA AGCCAGGTTT 720 

TTATATGGAG AGAACATGCC ACCACAAGAT GCAGAAATAG GTTACCGGAA TTCATTGAGA 780 

CAAACTAACA AAACTAAACA GTCATGCCCA TTTGGAAGAG TCCCAGTTAA CCTTCTAAAT 840 

AGCCCAGATT GTGATGTGAA GACAGATGAT TCAGTTGTAC CTTGTTTTAT GAAAAGACAA 900 

ACCTCTAGAT CAGAATGCCG AGATTTGGTT GTGCCTGGAT CTAAACCAAG TGGAAATGAT 960 

TCCTGTGAAT TAAGAAATTT AAAGTCTGTT CAAAATAGTC ATTTCAAGGA ACCTCTGGTG 1020 

TCAGATGAAA AGAGTTCTGA ACTTATTATT ACTGATTCAA TAACCCTGAA GAATAAAACG 1080 



1140 



WO 03/042661 



PCT/US02/36810 



GAATCAAGTC TTCTAGCTAA ATTAGAAGAA ACTAAAGAGT ATCAAGAACC AGAGGTTCCA 1140 

GAGAGTAACC AGAAACAGTG GCAATCTAAG AGAAAGTCAG AGTGTATTAA CCAGAATCCT 1200 

GCTGCATCTT CAAATCACTG GCAGATTCCG GAGTTAGCCC GAAAAGTTAA TACAGAGCAG 1260 

AAACATACCA CTTTTGAGCA ACCTGTCTTT TCAGTTTCAA AACAGTCACC ACCAATATCA 1320 

5 ACATCTAAAT GGTTTGACCC AAAATCTATT TGTAAGACAC CAAGCAGCAA TACCTTGGAT 1380 

GATTACATGA GCTGTTTTAG AACTCCAGTT GTAAAGAATG ACTTTCCACC TGCTTGTCAG 1440 

TTGTCAACAC CTTATGGCCA ACCTGCCTGT TTCCAGCAGC AACAGCATCA AATACTTGCC 1500 

ACTCCACTTC AAAATTTACA GGTTTTAGCA TCTTCTTCAG CAAATGAATG CATTTCGGTT 1560 

AAAGGAAGAA TTTATTCCAT TTTAAAGCAG ATAGGAAGTG GAGGTTCAAG CAAGGTATTT 1620 

10 CAGGTGTTAA ATGAAAAGAA ACAGATATAT GCTATAAAAT ATGTGAACTT AGAAGAAGCA 1680 

GATAACCAAA CTCTTGATAG TTACCGGAAC GAAATAGCTT ATTTGAATAA ACTACAACAA 1740 

CACAGTGATA AGATCATCOG ACTTTATGAT TATGAAATCA CGGACCAGTA CATCTACATG 1800 

GTAATGGAGT GTGGAAATAT TGATCTTAAT AGTTGGCTTA AAAAGAAAAA ATCCATTGAT 1860 

CCATGGGAAC GCAAGAGTTA CTGGAAAAAT ATGTTAGAGG CAGTTCACAC AATCCATCAA 1920 

15 CATGGCATTG TTCACAGTGA TCTTAAACCA GCTAACTTTC TGATAGTTGA TGGAATGCTA 1980 

AAGCTAATTG ATTTTGGGAT TGCAAACCAA ATGCAACCAG ATACAACAAG TGTTGTTAAA 2040 

GATTCTCAGG TTGGCACAGT TAATTATATG CCACCAGAAG CAATCAAAGA TATGTCTTCC 2100 

TCCAGAGAGA ATGGGAAATC TAAGTCAAAG ATAAGCCCCA AAAGTGATGT TTGGTCCTTA 2160 

GGATGTATTT TGTACTATAT GACTTACGGG AAAACACCAT TTCAGCAGAT AATTAATCAG 2220 

20 ATTTCTAAAT TACATGCCAT AATTGATCCT AATCATGAAA TTGAATTTCC CGATATTCCA 2280 

GAGAAAGATC TTCAAGATGT GTTAAAGTGT TGTTTAAAAA GGGACCCAAA ACAGAGGATA 2340 

TCCATTCCTG AGCTCCTGGC TCATCCCTAT GTTCAAATTC AAACTCATCC AGTTAACCAA 2400 

ATGGCCAAGG GAACCACTGA AGAAATGAAA TATGTTCTGG GCCAACTTGT TGGTCTGAAT 2460 

TCTCCTAACT CCATTTTGAA AGCTGCTAAA ACTTTATATG AACACTATAG TGGTGGTGAA 2520 

25 AGTCATAATT CTTCATCCTC CAAGACTTTT GAAAAAAAAA GGGGAAAAAA ATGA 

Seq ID NO: 482 Protein sequence 
Protein Accession #: NPJI03309.1 

30 1 11 21 31 41 51 

I I I I I I 

MESEDLSGRE LTIDSIMNKV RDIKNKFKNE DLTDELSLNK ISADTTDNSG TVNQIMMMAN 60 

NPEDWLSLLL KLEKNSVPLS DALLNKLIGR YSQAIEALPP DKYGQNESFA RIQVRFAELK 120 

AIQEPDDARD YFQMARANCK KFAFVHISFA QFELSQGNVK KSKQLLQKAV ERGAVPLEML 180 

35 EIALRNLNLQ KKQLLSEEEK KNLSASTVLT AQESPSGSLG HLQNRNNSCD SRGQTTKARF 240 

LYGENMPPQD AEIGYRNSLR QTNKTKQSCP FGRVPVNLLN SPDCDVKTDD SWPCFMKRQ 300 

TSRSECRDLV VPGSKPSGND SCELRNLKSV QNSHFKEPLV SDEKSSELII TDSITLKNKT 360 

ESSLLAKLEE TKEYQEPEVP ESNQKQWQSK RKSECINQNP AASSNHWQIP ELARKVNTEQ 420 

KHTTFEQPVF SVSKQSPPIS TSKWFDPKSI CKTPSSNTLD DYMSCFRTPV VKNDFPPACQ 480 

40 LSTPYGQPAC FQQQQHQIIA TPLQNLQV1A SSSANECISV KGRIYSILKQ IGSGGSSKVF 540 

QVLNEKKQIY AIKYVNLEEA DNQTLDSYRN EIAYLNKLQQ HSDKIIRLYD YEITDQYIYM 600 

VMECGNIDLN SWLKKKKSID PWERKSYWKN MLEAVHTIHQ HGIVHSDLKP ANFLIVDGML 660 

KLIDFGIANQ MQPDTTSWK DSQVGTVNYM PPEAIKDMSS SREKGKSKSK ISPKSDVWSL 720 

GCILYYMTYG KTPFQQIINQ ISKLHAIIDP NHEIBFPDIP EKDLQDVLKC CLKRDPKQRI 780 

45 SIPELLAHPY VQIQTHPVNQ MAKGTTEEMK YVLGQLVGLN SPNSILKAAK TLYEHYSGGE 840 
SHNSSSSKTF EKXRGKK 

Seq ID NO: 483 DNA sequence 
Nucleic Acid Accession #: NM_003667.1 
50 Coding sequence: 1. .2651 

1 11 21 31 41 51 

1 I I I I I 

ATGGACACCT CCCGGCTCGG TGTGCTCCTG TCCTTGCCTG TGCTGCTGCA GCTGGCGACC 60 

55 GGGGGCAGCT CTCCCAGGTC TGGTGTGTTG CTGAGGGGCT GCCCCACACA CTGTCATTGC 120 

GAGCCCGACG GCAGGATGTT GCTCAGGGTG GACTGCTCCG ACCTGGGGCT CTCGGAGCTG 180 

CCTTCCAACC TCAGCGTCTT CACCTCCTAC CTAGACCTCA GTATGAACAA CATCAGTCAG 240 

CTGCTCCCGA ATCCCCTGCC CAGTCTCCGC TTCCTGGAGG AGTTACGTCT TGCGGGAAAC 300 

GCTCTGACAT ACATTCCCAA GGGAGCATTC ACTGGCCTTT ACAGTCTTAA AGTTCTTATG 360 

60 CTGCAGAATA ATCAGCTAAG ACACGTACCC ACAGAAGCTC TGCAGAATTT GCGAAGCCTT 420 

CAATCCCTGC GTCTGGATGC TAACCACATC AGCTATGTGC CCCCAAGCTG TTTCAGTGGC 480 

CTGCATTCCC TGAGGCACCT GTGGCTGGAT GACAATGCGT TAACAGAAAT CCCCGTCCAG 540 

GCTTTTAGAA GTTTATCGGC ATTGCAAGCC ATGACCTTGG CCCTGAACAA AATACACCAC 600 

ATACCAGACT ATGCCTTTGG AAACCTCTCC AGCTTGGTAG TTCTACATCT CCATAACAAT 660 

65 AGAATCCACT CCCTGGGAAA GAAATGCTTT GATGGGCTCC ACAGCCTAGA GACTTTAGAT 720 

TTAAATTACA ATAACCTTGA TGAATTCCCC ACTGCAATTA GGACACTCTC CAACCTTAAA 780 

GAACTACATT TCTATGACAA TCCCATCCAA TTTGTTGGGA GATCTGCTTT TCAACATTTA 840 

CCTGAACTAA GAACACTGAC TCTGAATGGT GCCTCACAAA TAACTGAATT TCCTGATTTA 900 

ACTGGAACTG CAAACCTGGA GAGTCTGACT TTAACTGGAG CACAGATCTC ATCTCTTCCT 960 

70 CAAACCGTCT GCAATCAGTT ACCTAATCTC CAAGTGCTAG ATCTGTCTTA CAACCTATTA 1020 

GAAGATTTAC CCAGTTTTTC AGTCTGCCAA AAGCTTCAGA AAATTGACCT AAGACATAAT 1080 

GAAATCTACG AAATTAAAGT TGACACTTTC CAGCAGTTGC TTAGCCTCCG ATCGCTGAAT 1140 

TTGGCTTGGA ACAAAATTGC TATTATTCAC CCCAATGCAT TTTCCACTTT GCCATCCCTA 1200 

ATAAAGCTGG ACCTATCGTC CAACCTCCTG TCGTCTTTTC CTATAACTGG GTTACATGGT 1260 

75 TTAACTCACT TAAAATTAAC AGGAAATCAT GCCTTACAGA GCTTGATATC ATCTGAAAAC 1320 

TTTCCAGAAC TCAAGGTTAT AGAAATGCCT TATGCTTACC AGTGCTGTGC ATTTGGAGTG 1380 

TGTGAGAATG CCTATAAGAT TTCTAATCAA TGGAATAAAG GTGACAACAG CAGTATGGAC 1440 

GACCTTCATA AGAAAGATGC TGGAATGTTT CAGGCTCAAG ATGAACGTGA CCTTGAAGAT 1500 

TTCCTGCTTG ACTTTGAGGA AGACCTGAAA GCCCTTCATT CAGTGCAGTG TTCACCTTCC 1560 

80 CCAGGCCCCT TCAAACCCTG TGAACACCTG CTTGATGGCT GGCTGATCAG AATTGGAGTG 1620 

TGGACCATAG CAGTTCTGGC ACTTACTTGT AATGCTTTGG TGACTTCAAC AGTTTTCAGA 1680 

TCCCCTCTGT ACATTTCCCC CATTAAACTG TTAATTGGGG TCATCGCAGC AGTGAACATG 1740 

CTCACGGGAG TCTCCAGTGC CGTGCTGGCT GGTGTGGATG CGTTCACTTT TGGCAGCTTT 1800 

GCACGACATG GTGCCTGGTG GGAGAATGGG GTTGGTTGCC ATGTCATTGG TTTTTTGTCC 1860 
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ATTTTTGCTT CAGAATCATC TGTTTTCCTG CTTACTCTGG CAGCCCTGGA GCGTGGGTTC 1920 

TCTGTGAAAT ATTCTGCAAA ATTTGAAACG AAAGCTCCAT TTTCTAGCCT GAAAGTAATC 1980 

ATTTTGCTCT GTGCCCTGCT GGCCTTGACC ATGGCCGCAG TTCCCCTGCT GGGTGGCAGC 2040 

AAGTATGGCG CCTCCCCTCT CTGCCTGCCT TTGCCTTTTG GGGAGCCCAG CACCATGGGC 2100 

TACATGGTCG CTCTCATCTT GCTCAATTCC CTTTGCTTCC TCATGATGAC CATTGCCTAC 2160 

ACCAAGCTCT ACTGCAATTT GGACAAGGGA GACCTGGAGA ATATTTGGGA CTQCTCTATG 2220 

GTAAAACACA TTGCCCTGTT GCTCTTCACC AACTGCATCC TAAACTGCCC TGTGGCTTTC 2280 

TTGTCCTTCT CCTCTTTAAT AAACCTTACA TTTATCAGTC CTGAAGTAAT TAAGTTTATC 2340 

CTTCTGGTGG TAGTCCCACT TCCTGCATGT CTCAATCCCC TTCTCTACAT CTTGTTCAAT 2400 

CCTCACTTTA AGGAGGATCT GGTGAGCCTG AGAAAGCAAA CCTACGTCTG GACAAGATCA 2460 

AAACACCCAA GCTTGATGTC AATTAACTCT GATGATGTCG AAAAACAGTC CTGTGACTCA 2520 

ACTCAAGCCT TGGTAACCTT TACCAGCTCC AGCATCACTT ATGACCTGCC TCCCAGTTCC 2580 

GTGCCATCAC CAGCTTATCC AGTGACTGAG AGCTGCCATC TTTCCTCTGT GGCATTTGTC 2640 
CCATGTCTTA A 

Seq ID NO: 484 Protein sequence 
Protein Accession #: NP_00 3658.1 

1 11 21 31 41 51 

I I I I I I 

MDTSRLGVLL SLPVLLQIiAT GGSSPRSGVL LRGCPTHCHC EPDGRMLLRV DCSDLGLSEL 60 

PSNLSVFTSY LDLSMNNISQ LLPNPLPSLR FLEELRLAGN ALTYIPKGAF TGLYSLKVLM 120 

LQNNQLRHVP TEALQNLRSL QSLRLDANHI SYVPPSCPSG LHSLRHLWLD DNALTEIPVQ 180 

AFRSLSALQA MTLALNKIHH IPDYAFGNLS SLWLHLHNN RIHSLGKKCF DGLHSLETLD 240 

LNYNNLDEFP TAIRTLSNLK ELHFYDNPIQ FVGRSAFQHL PELRTLTLNG ASQITEFPDI* 300 

TGTANLESLT LTGAQISSLP QTVCNQLPNL QVLDLSYNLL EDLPSFSVCQ KLQKIDLRHN 360 

EIYEIKVDTF QQLLSLRSLN IAWNKIAIIH PHAFSTLPSL IKLDLSSNLL SSFPITGLHG 420 

LTHLKLTGNH ALQSLISSEN FPELKVIEMP YAYQCCAFGV CENAYKISNQ WNKGDNSSMD 480 

DLHKKDAGMF QAQDBRDLED FLLDFEEDLK ALHSVQCSPS PGPFKPCEHL LDGWLIRIGV 540 

WTIAVLALTC NALVTSTVFR SPLYISPIKL LIGVIAAVNM LTGVSSAVLA GVDAFTFGSF 600 

ARHGAWWENG VGCHVIGFLS IFASESSVFL LTLAALERGF SVKYSAKFET KAPFSSLKVI 660 

ILLCALLALT MAAVPLLGGS KYGASPLCLP LPFGEPSTMG YMVALILLNS LCFLMMTIAY 720 

TKLYCNLDKG DLENIWDCSM VKHIALLLFT NCILNCPVAF LSFSSLINLT FISPEVIKFI 780 

LLVWPLPAC LNPLLYILFN PHFKEDLVSL RKQTYVWTRS KHPSLMSINS DDVEKQSCDS 840 
TQALVTFTSS SITYDLPPSS VPSPAYPVTE SCHLSSVAFV PCL 

Seq ID NO: 485 DNA sequence 

Nucleic Acid Accession #: NM_005756.1 

Coding sequence: 73.. 3117 

1 11 21 31 41 51 

I I i I I I 

AGCCAGCCCG AGGACGCGAG CGGCAGGTGT GCACAGAGGT TCTCCACTTT GTTTTCTGAA 60 

CTCGCGGTCA GGATGGTTTT CTCTGTCAGG CAGTGTGGCC ATGTTGGCAG AACTGAAGAA 120 

GTTTTACTGA CGTTCAAGAT ATTCCTTGTC ATCATTTGTC TTCATGTCGT TCTGGTAACA 180 

TCCCTGGAAG AAGATACTGA TAATTCCAGT TTGTCACCAC CACCTGCTAA ATTATCTGTT 240 

GTCAGTTTTG CCCCCTCCTC CAATGAGGTT GAAACAACAA GCCTCAATGA TGTTACTTTA 300 

AGCTTACTCC CTTCAAACGA AACAGAAAAA ACTAAAATCA CTATAGTAAA AACCTTCAAT 360 

GCTTCAGGCG TCAAACCCCA GAGAAATATC TGCAATTTGT CATCTATTTG CAATGACTCA 420 

GCATTTTTTA GAGGTGAGAT CATGTTTCAA TATGATAAAG AAAGCACTGT TCCCCAGAAT 480 

CAACATATAA CGAATGGCAC CTTAACTGGA GTCCTGTCTC TAAGTGAATT AAAACGCTCA 540 

GAGCTCAACA AAACCCTGCA AACCCTAAGT GAGACTTACT TTATAATGTG TGCTACAGCA 600 

GAGGCCCAAA GCACATTAAA TTGTACATTC ACAATAAAAC TGAATAATAC AATGAATGCA 660 

TGTGCTGCAA TAGCCGCTTT GGAAAGAGTA AAGATTCGAC CAATGGAACA CTGCTGCTGT 720 

TCTGTCAGGA TACCCTGCCC TTCCTCCCCA GAAGAGTTGG GAAAGCTTCA GTGTGACCTG 780 

CAGGATCCCA TTGTCTGTCT TGCTGAGCAT CCACGTGGCC CACCATTTTC TTCCAGCCAA 840 

TCCATCCCAG TGGTGCCTCG GGCCACTGTG CTTTCCCAGG TCCCCAAAGC TACCTCTTTT 900 

GCTGAGCCTC CAGATTATTC ACCTGTGACC CACAATGTTC CCTCTCCAAT AGGGGAGATT 960 

CAACCCCTTT CACCCCAGCC TTCAGCTCCC ATAGCTTCCA GCCCTGCCAT TGACATGCCC 1020 

CCACAGTCTG AAACGATCTC TTCCCCTATG CCCCAAACCC ATGTCTGCGG CACCCCACCT 1080 

CCTGTGAAAG CCTCATTTTC CTCTCCCACC GTGTCTGCCC CTGCGAATGT CAACACTACC 1140 

AGCGCACCTC CTGTCCAGAC AGACATCGTC AACACCAGCA GTATTTCTGA TCTTGAGAAC 1200 

CAAGTGTTGC AGATGGAGAA GGCTCTGTCC TTGGGCAGCC TGGAGCCTAA CCTCGCAGGA 1260 

GAAATGATCA ACCAAGTCAG CAGACTCCTT CATTCCCCGC CTGACATGCT GGCCCCTCTG 1320 

GCTCAAAGAT TGCTGAAAGT AGTGGATGAC ATTGGCCTAC AGCTGAACTT TTCAAACACG 1380 

ACTATAAGTC TAACCTCCCC TTCTTTGGCT CTGGCTGTGA TCAGAGTGAA TGCCAGTAGT 1440 

TTCAACACAA CTACCTTTGT GGCCCAAGAC CCTGCAAATC TTCAGGTTTC TCTGGAAACC 1500 

CAAGCTCCTG AGAACAGTAT TGGCACAATT ACTCTTCCTT CATCGCTGAT GAATAATTTA 1560 

CCAGCTCATG ACATGGAGCT AGCTTCCAGG GTTCAGTTGA ATTTTTTTGA AACACCTGCT 1620 

TTGTTTCAGG ATCCTTCCCT GGAGAACCTC TCTCTGATCA GCTACGTCAT ATCATCGAGT 1680 

GTTGCAAACC TGACCGTCAG GAACTTGACA AGAAACGTGA CAGTCACATT AAAGCACATC 1740 

AACCCGAGCC AGGATGAGTT AACAGTGAGA TGTGTATTTT GGGACTTGGG CAGAAATGGT 1800 

GGCAGAGGAG GCTGGTCAGA CAATGGCTGC TCTGTCAAAG ACAGGAGATT GAATGAAACC I860 

ATCTGTACCT GTAGCCATCT AACAAGCTTC GGCGTTCTGC TGGACCTATC TAGGACATCT 1920 

GTGCTGCCTG CTCAAATGAT GGCTCTGACG TTCATTACAT ATATTGGTTG TGGGCTTTCA 1980 

TCAATTTTTC TGTCAGTGAC TCTTGTAACC TACATAGCTT TTGAAAAGAT CCGGAGGGAT 2040 

TACCCTTCCA AAATCCTCAT CCAGCTGTGT GCTGCTCTGC TTCTGCTGAA CCTGGTCTTC 2100 

CTCCTGGACT CGTGGATTGC TCTGTATAAG ATGCAAGGCC TCTGCATCTC AGTGGCTGTA 2160 

TTTCTTCATT ATTTTCTCTT GGTCTCATTC ACATGGATGG GCCTAGAAGC ATTCCATATG 2220 

TACCTGGCCC TTGTCAAAGT ATTTAATACT TACATCCGAA AATACATCCT TAAATTCTGC 2280 

ATTGTCGGTT GGGGGGTACC AGCTGTGGTT GTGACCATCA TCCTGACTAT ATCCCCAGAT 2340 

AACTATGGGC TTGGATCCTA TGGGAAATTC CCCAATGGTT CACCGGATGA CTTCTGCTGG 2400 

ATCAACAACA ATGCAGTATT CTACATTACG GTGGTGGGAT ATTTCTGTGT GATATTTTTG 2460 

CTGAACGTCA GCATGTTCAT TGTGGTCCTG GTTCAGCTCT GTCGAATTAA AAAGAAGAAG 2520 
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CAACTGGGAG CCCAGCGAAA AACCAGTATT CAAGACCTCA GGAGTATCGC TGGCCTTACA 2580 

TTTTTACTGG GAATAACTTG GGGCTTTGCC TTCTTTGCCT GGGGAOCAGT TAACGTGACC 2640 

TTCATGTATC TGTTTGCCAT CTTTAATACC TTACAAGGAT TTTTCATATT CATCTTTTAC 2700 

TGTGTGGCCA AAGAAAATGT CAGGAAGCAA TGGAGGCGGT ATCTTTGTTG TGGAAAGTTA 2760 

CGGCTGGCTG AAAATTCTGA CTGGAGTAAA ACTGCTACTA ATGGTTTAAA GAAGCAGACT 2820 

GTAAACCAAG GAGTGTCCAG CTCTTCAAAT TCCTTACAGT CAAGCAGTAA CTCCACTAAC 2880 

TCCACCACAC TGCTAGTGAA TAATGATTGC TCAGTACACG CAAGCGGGAA TGGAAATGCT 2940 

TCTACAGAGA GGAATGGGGT CTCTTTTAGT GTTCAGAATG GAGATGTGTG CCTTCACGAT 3000 

TTCACTGGAA AACAGCACAT GTTTAACGAG AAGGAAGATT CCTGCAATGG GAAAGGCCGT 3060 

ATGGCTCTCA GAAGGACTTC AAAGCGGGGA AGCTTACACT TTATTGAGCA AATGTGATTC 3120 

CTTTCTTCTA AAATCAAAGC ATGATGCTTG ACAGTGTGAA ATGTCCAATT TTACCTTTTA 3180 

CACAATGTGA GATGTATGAA AATCAACTCA TTTTATTCTC GGCAACATCT GGAGAAGCAT 3240 

AAGCTAATTA AGGGCGATGA TTATTATTAC AAGAAGAAAC CAAGACATTA CACCATGGTT 3300 

TTTAGACATT TCTGATTTGG TTTCTTATCT TTCATTTTAT AAGAAGGTTG GTTTTAAACA 3360 

ATACACTAAG AATGACTCCT ATAAAGAAAA CAAAAAAAGG TAGTGAACTT TCAGCTACCT 3420 

TTTAAAGAGG CTAAGTTATC TTTGATAACA TCATATAAAG CAACTGTTGA CTTCAGCCTG 3480 

TTGGTGAGTT TAGTTGTGCA TGCCTTTGTT GTATATAAGC TAAATTCTAG TGACCCATGT 3540 

GTCAAAAATC TTACTTCTAC ATTTTTTTGT ATTTATTTTC TACTGTGTAA ATGTATTCCT 3600 

TTGTAGAATC ATGGTTGTTT TGTCTCACGT GATAATTCAG AAAATCCTTG CTCGTTCCGC 3660 

AAATCCTAAA GCTCCTTTTG GAGATGATAT AGGATGTGAA ATACAGAAAC CTCAGTGAAA 3720 

TCAAGAAATA ATGATCCCAG CCAGACTGAG AAAATGTAAG CAGACAGTGC CACAGTTAGC 3780 

TCATACAGTG CCTTTG AG CA AGTTAGGAAA AGATGCCCCC ACTGGGCAGA CACAGCCCTA 3840 

TGGGTCATGG TTTGACAAAC AGAGTGAGAG ACCATATTTT AGCCCCACTC ACCCTCTTGG 3900 

GTGCACGACC TGTACAGCCA AACACAGCAT CCAATATGAA TACCCATCCC CTGACCGCAT 3960 

CCCCAGTAGT CAGATTATAG AATCTGCACC AAGATGTTTA GCTTTATACC TTGGCCACAG 4020 

AGAGGGATGA ACTGTCATCC AGACCATGTG TCAGGAAAAT TGTGAACGTA GATGAGGTAC 4080 

ATACACTGCC GCTTCTCAAA TCCCCAGAGC CTTTAGGAAC AGGAGAGTAG ACTAGGATTC 4140 

CTTCTCTTAA AAAGGTACAT ATATATGGAA AAAAATCATA TTGCCGTTCT TTAAAAGGCA 4200 

ACTGCATGGT ACATTGTTGA TTGTTATGAC TGGTACACTC TGGCCCAGCC AGAGCTATAA 4260 

TTGTTTTTTA AATGTGTCTT GAAGAATGCA CAGTGACAAG GGGAGTAGCT ATTGGGAACA 4320 

GGGAACTGTC CTACACTGCT ATTGTTGCTA CATGTATCGA GCCTTGATTG CTCCTAGTTA 4380 

TATACAGGGT CTATCTTGCT TCCTACCTAC ATCTGCTTGA GCAGTGCCTC AAGTACATCC 4440 

TTATTAGGAA CATTTCAAAC CCCTTTTAGT TAAGTCTTTC ACTAAGGTTC TCTTGCATAT 4500 

ATTTCAAGTG AATGTTGGAT CTCAGACTAA CCATAGTAAT AATACACATT TCTGTGAGTG 4560 

CTGACTTGTC TTTGCAATAT TTCTTTTCTG ATTTATTTAA TTTTCTTGTA TTTATATGTT 4620 
AAAATCAAAA ATGTTAAAAT CAATGAAATA AATTTGCAGT TAAGA 

Seq ID NO: 486 Protein sequence 
Protein Accession #: NP_005747.1 



1 
I 

MVFSVRQCGH 
PSSNEVETTS 
QNQHITNGTL 
NACAVIAALE 
SQSIPWPRA 
MPPQSETISS 
ENQVLQMEKA 
NTTISLTSPS 
NLPAHDMELA 
HINPSODELT 
TSVLPAQMMA 
VFLLDSWIAL 
FCIVGWGVPA 
FLLNVSMFIV 
VTFMYLFAIF 
QTVNQGVSSS 
HDFTGKQHMF 



11 
I 

VGRTEEVLLT 
LNDVTLSLLP 
TGVLSLSELK 
RVKIRPMEHC 
TVLSQVPKAT 
PMPQTHVSGT 
LSLGSLEPNL 
LALAVIRVNA 
SRVQFNFFET 
VRCVFWDLGR 
LTFITYIGCG 
YKMQGLCISV 
WVTIILTIS 
VLVQLCRIKK 
NTLQGFFIFI 



21 
I 

FKIFLVIICL 
SNETGVKPQR 
RSELNKTLQT 
CCSVRIPCPS 
SFABPPDYSP 
PPPVKASFSS 
AGEMINQVSR 
SSFNTTTFVA 
PALFQDPSLB 
NGGRGGWSDN 
LSSIFLSVTL 
AVFLHYFLLV 
PDNYGLGSYG 
KKQLGAQRKT 
FYCVAKENVR 
TNSTTLLVNN 
GRMALRRTSK 



31 
I 

HWLVTSLEE 
NICNbSSICN 
LSBTYFZMCA 
SPEELEKLQC 
VTHNVPSPIG 
PTVSAPANVN 
LLHSPPDMliA 
QDPANLQVSL 
NLSLISYVIS 
GCSVKDRRLN 
VTYIAFEKIR 
SFTWMGLEAF 
KFPNGSPDDF 
SIQDLRSIAG 
RQWRRYLCCG 
DCSVHASGNG 
RGSLHFIEQM 



41 
I 

DTDNSSLSPP 
DSAFFRGEIM 
TAEAQSTLNC 
DLQDPIVCLA 
EIQPLSPQPS 
TTSAPPVQTD 
PLAQRLLKW 
ETQAPENSIG 
SSVANLTVRN 
ETICTCSHLT 
RDYPSKILIQ 
HMYLALVKVF 
CWINNNAVFY 
LTFLLGITWG 
KLRLAENSDH 
NASTERNGVS 



51 
I 

PAKLSWSFA 
FQYDKESTVP 
TFTIKLNNTO 
DHPRGPPFSS 
APIASSPAID 
IVNTSSISDL 
DDIGLQLNFS 
TITLPSSLMN 
LTRNVTVTLK 
SFGVLLDLSR 
LCAALLLLNL 
NTYIRKYIZjK 
ITWGYFCVI 
FAFFAWGPVN 
SKTATNGLKK 
FSVQNGDVCL 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



Seq ID NO i 487 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..2904 



ATGGTTTTCT 
TTCAAGATAT 
GATACTGATA 
AACGAAACAG 
CCCCAGAGAA 
GAGATCATGT 
GGCACCTTAA 
ATAAAACTGA 
ATTCGACCAA 
GAGTTGGAAA 
CGTGGCCCAC 
TCCCAGGTCC 
AATGTTCCCT 
GCTTCCAGCC 
CAAACCCATG 
TCTGCCCCTG 
ACCAGCAGTA 
GGCAGCCTGG 



11 
I 

CTGTCAGGCA 
TCCTTGTCAT 
ATTCCAGTTT 
AAAAAACTAA 
ATATCTGCAA 
TTCAATATGA 
CTGGAGTCCT 
ATAATACAAT 
TGGAACACTG 
AGCTTCAGTG 
CATTTTCTTC 
CCAAAGCTAC 
CTCCAATAGG 
CTGCCATTGA 
TCTCCGGCAC 
CGAATGTCAA 
TTTCTGATCT 
AGCCTAACCT 



21 
1 

GTGTGGCCAT 
CATTTGTCTT 
GTCACCACCA 
AATCACTATA 
TTTGTCATCT 
TAAAGAAAGC 
GTCTCTAAGT 
GAATGCATGT 
CTGCTGTTCT 
TGACCTGCAG 
CAGCCAATCC 
CTCTTTTGCT 
GGAGATTCAA 
CATGCCCCCA 
CCCACCTCCT 
CACTACCAGC 
TGAGAACCAA 
CGCAGGAGAA 



31 

! 

GTTGGCAGAA 
CATGTCGTTC 
CCTGATGTTA 
GTAAAAACCT 
ATTTGCAATG 
ACTGTTCCCC 
GAATTAAACA 
GCTGTAATAG 
GTCAGGATAC 
GATCCCATTG 
ATCCCAGTGG 
GAGCCTCCAG 
CCCCTTTCAC 
CAGTCTGAAA 
GTGAAAGCCT 
GCACCTCCTG 
GTGTTGCAGA 
ATGATCAACC 



41 
I 

CTGAAGAAGT 
TGGTAACATC 
CTTTAAGCTT 
TCAATGCTTC 
ACTCAGCATT 
AGAATCAACA 
CATTAAATTG 
CTGCTTTGGA 
CCTGCCCTTC 
TCTGTCTTGC 
TGCCTCGGGC 
ATTATTCACC 
CCCAGCCTTC 
CGATCTCTTC 
CATTTTCCTC 
TCCAGACAGA 
TGGAGAAGGC 
AAGTCAGCAG 



51 
I 

TTTACTGACG 
CCTGGAAGAA 
ACTCCCTTCA 
AGGCGTCAAA 
TTTTAGAGGT 
TATAACGAAT 
TACATTCACA 
AAGAGTAAAG 
CTCCCCAGAA 
TGACCATCCA 
CACTGTGCTT 
TGTGACCCAC 
AGCTCCCATA 
CCCTATGCCC 
TCCCACCGTG 
CATCGTCAAC 
TCTGTCCTTG 
ACTCCTTCAT 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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TCCCCGCCTG ACATGCTGGC CCCTCTGGCT CAAAGATTGC TGAAAGTAGT GGATGACATT 1140 

GGCCTACAGC TGAACTTTTC AAACACGACT ATAAGTCTAA CCTCCCCTTC TTTGGCTCTG 1200 

GCTGTGATCA GAGTGAATGC CAGTAGTTTC AACACAACTA CCTTTGTGGC CCAAGACCCT 1260 

GCAAATCTTC AGGTTTCTCT GGAAACCCAA GCTCCTGAGA ACAGTATTGG CACAATTACT 1320 

5 CTTCCTTCAT CGCTGATGAA TAATTTACCA GCTCATGACA TGGAGCTAGC TTCCAGGGTT 1380 

CAGTTCAATT TTTTTGAAAC ACCTGCTTTG TTTCAGGATC CTTCCCTGGA GAACCTCTCT 1440 

CTGATCAGCT ACGTCATATC ATCGAGTGTT GCAAACCTGA CCGTCAGGAA CTTGACAAGA 1500 

AACGTGACAG TCACATTAAA GCACATCAAC CCGAGCCAGG ATGAGTTAAC AGTGAGATGT 1S60 

GTATTTTGGG ACTTGGGCAG AAATGGTGGC AGAGGAGGCT GGTCAGACAA TGGCTGCTCT 1620 

10 GTCAAAGACA GGAGATTGAA TGAAACCATC TGTACCTGTA GCCATCTAAC AAGCTTCGGC 1680 

GTTCTGCTGG ACCTATCTAG GACATCTGTG CTGCCTGCTC AAATGATGGC TCTGACGTTC 1740 

ATTACATATA TTGGTTGTGG GCTTTCATCA ATTTTTCTGT CAGTGACTCT TGTAACCTAC 1800 

ATAGCTTTTG AAAAGATCCG GAGGGATTAC CCTTCCAAAA TCCTCATCCA GCTGTGTGCT 1860 

GCTCTGCTTC TGCTGAACCT GGTCTTCCTC CTGGACTCGT GGATTGCTCT GTATAAGATG 1920 

15 CAAGGCCTCT GCATCTCAGT GGCTGTATTT CTTCATTATT TTCTCTTGGT CTCATTCACA 1980 

TGGATGGGCC TAGAAGCATT CCATATGTAC CTGGCCCTTG TCAAAGTATT TAATACTTAC 2040 

ATCCGAAAAT ACATCCTTAA ATTCTGCATT GTCGGTTGGG GGGTACCAGC TGTGGTTGTG 2100 

ACCATCATCC TGACTATATC CCCAGATAAC TATGGGCTTG GATCCTATGG GAAATTCCCC 2160 

AATGGTTCAC CGGATGACTT CTGCTGGATC AACAACAATG CAGTATTCTA CATTACGGTG 2220 

20 GTGGGATATT TCTGTGTGAT ATTTTTGCTG AACGTCAGCA TGTTCATTGT GGTCCTGGTT 2280 

CAGCTCTGTC GAATTAAAAA GAAGAAGCAA CTGGGAGCCC AGCGAAAAAC CAGTATTCAA 2340 

GACCTCAGGA GTATCGCTGG CCTTACATTT TTACTGGGAA TAACTTGGGG CTTTGCCTTC 2400 

TTTGCCTGGG GACCAGTTAA CGTGACCTTC ATGTATCTGT TTGCCATCTT TAATACCTTA 2460 

CAAGGATTTT TCATATTCAT CTTTTACTGT GTGGCCAAAG AAAATGTCAG GAAGCAATGG 2520 

25 AGGCGGTATC TTTGTTGTGG AAAGTTACGG CTGGCTGAAA ATTCTGACTG GAGTAAAACT 2580 

GCTACTAATG GTTTAAAGAA GCAGACTGTA AACCAAGGAG TGTCCAGCTC TTCAAATTCC 2640 

TTACAGTCAA GCAGTAACTC CACTAACTCC ACCACACTGC TAGTGAATAA TGATTGCTCA 2700 

GTACACGCAA GCGGGAATGG AAATGCTTCT ACAGAGAGGA ATGGGGTCTC TTTTAGTGTT 2760 

CAGAATGGAG ATGTGTGCCT TCACGATTTC ACTGGAAAAC AGCACATGTT TAACGAGAAG 2820 

30 GAAGATTCCT GCAATGGGAA AGGCCGTATG GCTCTCAGAA GGACTTCAAA GCGGGGAAGC 2880 
TTACACTTTA TTGAGCAAAT GTGA 

Seq ID NO: 488 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 SI 

I I I 1 I I 

MVPSVRQCGH VGRTEEVLLT FKIFLVIICL HWLVTSLEE DTDNSSLSPP PDVTLSLLPS 60 

NETEKTKITI VKTFNASGVK PQRNICNLSS ICNDSAFFRG EIMFQYDKES TVPQNQHITH 120 

40 GTLTGVLSLS ELNTLNCTPT IKLNNTMNAC AVIAALERVK IRPMEHCCCS VR1PCPSSPE 180 

ELEKI/QCDLQ DPIVCLADHP RGPPFSSSQS IPWPRATVL SQVPKATSFA EPPDYSPVTH 240 

NVPSPIGEIQ PLSPQPSAPI ASSPAIDMPP QSETISSPMP QTHVSGTPPP VKASFSSPTV 300 

SAPANVNTTS APFVQTDIVN TSSISDLENQ VLQMBKALSL GSLEPNLAGE MINQVSRLLH 360 

SPPDMLAPIiA QRLLKWDDI GLQLNFSNTT ISLTSPSIiAL AVIRVNASSF NTTTFVAQDP 420 

45 ANLQVSLETQ APENSIGTIT LPSSLMNNLP AHDMELASRV QFNFFETPAL FQDPSLENLS 480 

LISYVISSSV ANLTVRNLTR NVTVTLKHIN PSQDELTVRC VFWDLGRNGG RGGWSDNGCS 540 

VKDRRLNETI CTCSHLTSFG VLLDLSRTSV LPAQMMALTF ITYIGCGLSS IFLSVTLVTY 600 

IAFEKIRRDY PSKILIQLCA ALLLUTLVFL LDSVJIALYKM QGLCISVAVF LHYFUjVSFT 660 

WMGLEAFHMY LALVKVFNTY IRKYILKFCI VGWGVPAVW TIILTISPDN YGLGSYGKFP 720 

50 NGSPDDFCWI MNNAVFYITV VGYFCVIFLL NVSMPIWLV QLCRIKKKKQ LGAQRKTSIQ 780 

DLRSIAGLTF LLGITWGPAP PAWGPVNVTF MYLFAIFNTL QGFFIFIFYC VAKENVRKQW 840 

RRYLCCGKLR LAENSDWSKT ATNGLKKQTV NQGVSSSSNS LQSSSNSTNS TTLIiVNNDCS 900 

VHASGNGNAS TERNGVSFSV QNGDVCLHDF TGKQHMFNEK EDSCNGKGRM ALRRTSKRGS 960 

55 tHPIBQM 

Seq ID NO: 489 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1. .2611 

60 1 11 21 31 41 51 

I I I I I I 

ATGGTTTTCT CTGTCAGGCA GTGTGGCCAT GTTGGCAGAA CTGAAGAAGT TTTACTGACG 60 

TTCAAGATAT TCCTTGTCAT CATTTGTCTT CATGTCGTTC TGGTAACATC CCTGGAAGAA 120 

GATACTGATA ATTCCAGTTT GTCACCACCA CCTGAGGTTG AAACAACAAG CCTCAATGAT 180 

65 GTTACTTTAA GCTTACTCCC TTCAAACGAA ACAGGCGTCA AACCCCAGAG AAATATCTGC 240 

AATTTGTCAT CTATTTGCAA TGACTCAGCA TTTTTTAGAG GTGAGATCAT GTTTCAATAT 300 

GATAAAGAAA GCACTGTTCC CCAGAATCAA CATATAACGA ATGGCACCTT AACTGGAGTC 360 

CTGTCTCTAA GTGAATTAAA ACGCTCAGAG CTCAACAAAA CCCTGCAAAC CCTAAGTGAG 420 

ACTTACTTTA TAATGTGTGC TACAGCAGAG GCCCAAAGCA CATTAAATTG TACATTCACA 460 

70 ATAAAACTGA ATAATACAAT GAATGCATGT GCTGTAATAG CTGCTTTGGA AAGAGTAAAG 540 

ATTCGACCAA TGGAACACTG CTGCTGTTCT GTCAGGATAC CCTGCCCTTC CTCCCCAGAA 600 

GAGTTGGAAA AGCTTCAGTG TGACCTGCAG GATCCCATTG TCTGTCTTGC TGACCATCCA 660 

CGTGGCCCAC CATTTTCTTC CAGCCAATCC ATCCCAGTGG TGCCTCGGGC CACTGTGCTT 720 

TCCCAGGTCC CCAAAGCTAC CTCTTTTGCT GAGCCTCCAG ATTATTCACC TGTGACCCAC 780 

75 AATGTTCCCT CTCCAATAGG GGAGATTCAA CCCCTTTCAC CCCAGCCTTC AGCTCCCATA 840 

GCTTCCAGCC CTGCCATTGA CATGCCCCCA CAGTCTGAAA CGATCTCTTC CCCTATGCCC 900 

CAAACCCATG TCTCCGGCAC CCCACCTCCT GTGAAAGCCT CATTTTCCTC TCCCACCGTG 960 

TCTGCCCCTG CGAATGTCAA CACTACCAGC GCACCTCCTG TCCAGACAGA CATCGTCAAC 1020 

OA ACCAGCAGTA TTTCTGATCT TGAGAACCAA GTGTTGCAGA TGGAGAAGGC TCTGTCCTTG 1080 

80 GGCAGCCTGG AGCCTAACCT CGCAGGAGAA ATGATCAACC AAGTCAGCAG ACTCCTTCAT 1140 

TCCCCGCCTG ACATGCTGGC CCCTCTGGCT CAAAGATTGC TGAAAGTAGT GGATGACATT 1200 

GGCCTACAGC TGAACTTTTC AAACACGACT ATAAGTCTAA CCTCCCCTTC TTTGGCTCTG 1260 

GCTGTGATCA GAGTGAATGC CAGTAGTTTC AACACAACTA CCTTTGTGGC CCAAGACCCT 1320 

GCAAATCTTC AGGTTTCTCT GGAAACCCAA GCTCCTGAGA ACAGTATTGG CACAATTACT 1380 
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CTTCCTTCAT 
CAGTTCAATT 
CTGATCAGCT 
AACGTGACAG 
GTATTTTGGG 
GTCAAAGACA 
GTTCTGCTGG 
ATTACATATA 
ATAGCTTTTG 
GCTCTGCTTC 
CAAGGCCTCT 
TGGATGGGCC 
ATCCGAAAAT 
ACCATCATCC 
AATGGTTCAC 
GTGGGATATT 
CAGCTCTGTC 
GACCTCAGGA 
TTTGCCTGGG 
CAAGGATTTT 
AGGCGGTATC 
GAGAGGAATG 
GGAAAACAGC 
CTCAGAAGGA 



CGCTGATGAA 
TTTTTGAAAC 
ACGTCATATC 
TCACATTAAA 
ACTTGGGCAG 
GGAGATTGAA 
ACCTATCTAG 
TTGGTTGTGG 
AAAAGATCCG 
TGCTGAACCT 
GCATCTCAGT 
TAGAAGCATT 
ACATCCTTAA 
TGACTATATC 
CGGATGACTT 
TCTGTGTGAT 
GAATTAAAAA 
GTATCGCTGG 
GACCAGTTAA 
TCATATTCAT 
TTTGTTGTGG 
GGGTCTCTTT 
ACATGTTTAA 
CTTCAAAGCG 



TAATTTACCA 
ACCTGCTTTG 
ATOGAGTGTT 
GCACATCAAC 
AAATGGTGGC 
TGAAACCATC 
GACATCTGTG 
GCTTTCATCA 
GAGGGATTAC 
GGTCTTCCTC 
GGCTGTATTT 
CCATATGTAC 
ATTCTGCATT 
CCCAGATAAC 
CTGCTGGATC 
ATTTTTGCTG 
GAAGAAGCAA 
CCTTACATTT 
CGTGACCTTC 
CTTTTACTGT 
AAAGTTACGG 
TAGTGTTCAG 
CGAGAAGGAA 
GGGAAGCTTA 



GCTCATGACA 
TTTCAGGATC 
GCAAACCTGA 
CCGAGCCAGG 
AGAGGAGGCT 
TGTACCTGTA 
CTGCCTGCTC 
ATTTTTCTGT 
CCTTCCAAAA 
CTGGACTCGT 
CTTCATTATT 
CTGGCCCTTG 
GTCGGTTGGG 
TATGGGCTTG 
AACAACAATG 
AACGTCAGCA 
CTGGGAGCCC 
TTACTGGGAA 
ATGTATCTGT 
GTGGCCAAAG 
CTGGCTGAAA 
AATGGAGATG 
GATTCCTGCA 
CACTTTATTG 



TGGAGCTAGC 
CTTCCCTGGA 
CCGTCAGGAA 
ATGAGTTAAC 
GGTCAGACAA 
GCCATCTAAC 
AAATGATGGC 
CAGTGACTCT 
TCCTCATCCA 
GGATTGCTCT 
TTCTCTTGGT 
TCAAAGTATT 
GGGTACCAGC 
GATCCTATGG 
CAGTATTCTA 
TGTTCATTGT 
AGCGAAAAAC 
TAACTTGGGG 
TTGCCATCTT 
AAAATGTCAG 
ATTCTGGAAA 
TGTGCCTTCA 
ATGGGAAAGG 
AGCAAATOTG 



TTCCAGGGTT 
GAACCTCTCT 
CTTGACAAGA 
AGTGAGATGT 
TGGCTGCTCT 
AAGCTTCGGC 
TCTGACGTTC 
TGTAACCTAC 



GTATAAGATG 
CTCATTCACA 
TAATACTTAC 



GAAATTCCCC 
CATTACGGTG 

CAGTATTCAA 
CTTTGCCTTC 
TAATACCTTA 
GAAGCAATGG 
TGCTTCTACA 
CGATTTCACT 
CCGTATGGCT 
A 



1440 
1500 
1560 
1G20 
1680 
1740 
1600 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2260 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
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MVPSVRQCGH 
VTLSLLPSNE 
LSLSELKRSE 
IRPMEHCCCS 
SQVPKATSFA 
QTHVSGTPPP 
GSLEPNLAGE 
AVIRVNASSP 
QFNFFETPAL 
VFWDLGRNGG 
ITYIGCGLSS 
QGLCISVAVF 
TIILTISPDN 
QLCRIKKKKQ 
QGPFIFIFYC 
GKQHMFNEKE 



11 
I 

VGRTEEVLLT 
TGVKPQRNIC 
LNKTLQTLSE 
VRIPCPSSPE 
EPPDYSPVTH 
VKASFSSPTV 
MINQVSRLLH 
NTTTFVAQDP 
FQDPSLEHLS 
RGGWSDNGCS 
IFLSVTLVTY 
LHYFLLVSFT 
YGLGSYGKFP 
LGAQRKTSIQ 
VAKENVRKQW 
DSCNGKGRMA 



21 
I 

FKIFIiVIICL 
NLSSIQJDSA 
TYFIMCATAE 
ELEKLQCDLQ 
NVPSPIGEIQ 
SAPANVNTTS 
SPPDMLAPLA 
ANLQVSLETQ 
LISYVISSSV 
VKDRRLNETI 
IAFEKIRRDY 
WMGLEAFHMY 
KGSPDDFCWZ 
DLRSIAGLTF 
RRYLCCGKLR 
LRRTSKRGSL 



31 
I 

HWLVTSLEE 
FPRGEIMFQY 
AQSTLNCTFT 
DPIVCLADHP 
PLSPQPSAPI 
APPVQTDIVN 
QRLLKWDDI 
APENSIGTIT 
ANLTVRNLTR 
CTCSHLTSFG 
PSKILIQLCA 
LALVKVFNTY 
NNNAVFYITV 
LLGITWGFAF 



41 
I 

DTDNSSLSPP 
DKESTVPQNQ 
IKLNNTMNAC 



ASSPAIDMPP 
TSSISDIiENQ 
GLQLNFSNTT 
LPSSLMNNLP 
NVTVTLKHIN 
VLLDLSRTSV 
ALLLLNLVFL 
IRKYILKFCI 
VGYFCVIPLIi 
FAWGPVNVTP 
ERNGVSFSVQ 
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51 

I 

PEVETTSLND 
HITNGTLTGV 
AVIAALERVK 
IPWPRATVL 
QSETISSPMP 
VLQMEKALSL 
ISLTSPSLAL 
AHDMEIiASRV 
PSQDEIiTVRC 
LPAQMMALTF 
LDSWIALYKM 
VGWGVPAVW 
NVSMPIWLV 
MYLFAIFNTL 
NGDVCLHDFT 



51 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



1 11 21 31 41 

I I I I I I 

ATGGTTTTCT CTGTCAGGCA GTGTGGCCAT GTTGGCAGAA CTGAAGAAGT TTTACTGACG 
TTCAAGATAT TOCTTQTCAT CATTTGTCTT CATGTCGTTC TGGTAACATC C CTGGAAG AA 
GATACTGATA ATTCCAGTTT GTCACCACCA CCTGCTAAAT TATCTGTTGT CAGTTTTGCC 
CCCTCCTCCA ATGAGGTTGA AACAACAAGC CTCAATGATG TTACTTTAAG CTTACTCCCT 
TCAAACGAAA CAGAAAAAAC TAAAATCACT ATAGTAAAAA CCTTCAATGC TTCAGGCGTC 
AAACCCCAGA GAAATATCTG CAATTTGTCA TCTATTTGCA ATGACTCAGC ATTTTTTAGA 
GGTGAGATCA TGTTTCAATA TGATAAAGAA AGCACTGTTC CCCAGAATCA ACATATAACG 
AATGGCACCT TAACTGGAGT CCTGTCTCTA AGTGAATTAA AACGCTCAGA GCTCAACAAA 
ACCCTGCAAA OCCTAAGTGA GACTTACTTT ATAATGTGTG CTACAGCAGA GGCCCAAAGC 
ACATTAAATT GTACATTCAC AATAAAACTG AATAATACAA TGAATGCATG TGCTGCAATA 
GCCGCTTTGG AAAGAGTAAA GATTCGACCA ATGGAACACT GCTGCTGTTC TGTCAGGATA 
CCCTGCCCTT CCTCCCCAGA AGAGTTGGGA AAGCTTCAGT GTGACCTGCA GGATCCCATT 
GTCTGTCTTG CTGACCATCC ACGTGGCCCA CCATTTTCTT CCAGCCAATC CATCCCAGTG 
GTGCCTOGGG CCACTGTGCT TTCCCAGGTC CCCAAAGCTA CCTCTTTTGC TGAG CCTC CA 
GATTATTCAC CTGTGACCCA CAATGTTCCC TCTCCAATAG GGGAGATTCA ACCCCTTTCA 
CfcCCAGCCTT CAGCTCCCAT AGCTTCCAGC CCTGCCATTG ACATGCCCCC ACAGTCTGAA 
ACGATCTCTT CCCCTATGCC CCAAACCCAT GTCTCCGGCA CCCCACCTCC TGTGAAAGCC 
TCATTTTCCT CTCCCACCGT GTCTGCCCCT GOGAATGTCA ACACTACCAG CGCACCTCCT 
GTCCAGACAG ACATCGTCAA CACCAGCAGT ATTTCTGATC TTGAGAACCA AGTGTTGCAG 
ATGGAGAAGG CTCTGTCCTT GGGCAGCCTG GAGCCTAACC TCGCAGGAGA AATGATCAAC 
CAAGTCAGCA GACTCCTTCA TTCCCOGCCT GACATGCTGG CCCCTCTGGC TCAAAGATTG 
CTGAAAGTAG TGGATGACAT TGGCCTACAG CTGAACTTTT CAAACACGAC TATAAGTCTA 
ACCTCCCCTT CTTTGGCTCT GGCTGTGATC AGAGTGAATG CCAGTAGTTT CAACACAACT 
ACCTTTGTGG CCCAAGACCC TGCAAATCTT CAGGTTTCTC TGGAAACCCA AGCTCCTGAG 
AACAGTATTG G CACAATT AC TCTTCCTTCA TCGCTGATGA ATAATTTACC AGCTCATGAC 
ATGGAGCTAG CTTCCAGGGT TCAGTTCAAT TTTTTTGAAA CACCTGCTTT GTTTCAGGAT 
CCTTCCCTGG AGAACCTCTC TCTGATCAGC TACGTCATAT CATCGAGTGT TGCAAACCTG 
ACCGTCAGGA ACTTGACAAG AAACGTGACA GTCACATTAA AGCACATCAA CCCGAGCCAG 
GATGAGTTAA CAGTGAGATG TGTATTTTGG GACTTGGGCA GAAATGGTGG CAGAGGAGGC 
TGGTCAGACA ATGGCTGCTC TGTCAAAGAC AGGAGATTGA ATGAAACCAT CTGTACCTGT 
AGCCATCTAA CAAGCTTCGG CGTTCTGCTG GACCTATCTA GGACATCTGT GCTGCCTGCT 

1145 
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420 
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CAAATGATGG CTCTGACGTT CATTACATAT ATTGGTTGTG GGCTTTCATC AATTTTTCTG 
TCAGTGACTC TTGTAACCTA CATAGCTTTT GAAAAGATCC GGAGGGATTA CCCTTCCAAA 
ATCCTCATCC AGCTGTGTGC TGCTCTGCTT CTGCTGAACC TGGTCTTCCT CCTGGACTCG 
TGGATTGCTC TGTATAAGAT GCAAGGCCTC TGCATCTCAG TGGCTGTATT TCTTCATTAT 
TTTCTCTTGG TCTCATTCAC ATGGATGGGC CTAGAAGCAT TCCATATGTA CCTGGCCCTT 
GTCAAAGTAT TTAATACTTA CATCCGAAAA TACATCCTTA AATTCTGCAT TGTCGGTTGG 
GGGGTACCAG CTGTGGTTGT GACCATCATC CTGACTATAT CCCCAGATAA CTATGGGCTT 
GGATCCTATG GGAAATTCCC CAATGGTTCA CCGGATGACT TCTGCTGGAT CAACAACAAT 
GCAGTATTCT ACATTACGGT GGTGGGATAT TTCTGTGTGA TATTTTTGCT GAACGTCAGC 
ATGTTCATTG TGGTCCTGGT TCAGCTCTGT CGAATTAAAA AGAAGAAGCA ACTGGGAGCC 
CAGCGAAAAA CCAGTATTCA AGACCTCAGG AGTATCGCTG GCCTTACATT TTTACTGGGA 
ATAACTTGGG GCTTTGCCTT CTTTGCCTGG GGACCAGTTA ACGTGACCTT CATGTATCTG 
TTTGCCATCT TTAATACCTT ACAAGGATTT TTCATATTCA TCTTTTACTG TGTGGCCAAA 
GAAAATGTCA GGAAGCAATG GAGGCGGTAT CTTTGTTGTG GAAAGTTACG GCTGGCTGAA 
AATTCTGACT GGAGTAAAAC TGCTACTAAT GGTTTAAAGA AGCAGACTGT AAACCAAGGA 
GTGTCCAGCT CTTCAAATTC CTTACAGTCA AGCAGTAACT CCACTAACTC CACCACACTG 
CTAGTGAATA ATGATTGCTC AGTACACGCA AGCGGGAATG GAAATGCTTC TACAGAGAGG 
AATGGGGTCT CTTTTAGTGT TCAGAATGGA GATGTGTGCC TTCACGATTT CACTGGAAAA 
CAGCACATGT TTAACGAGAA GGAAGATTCC TGCAATGGGA AAGGCCGTAT GGCTCTCAGA 
AGGACTTCAA AGCGGGGAAG CTTACACTTT ATTGAGCAAA TGTGA 

Seq ID NO: 492 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MVFSVRQCGH VGRTEEVLLT FKIFLVIICL HWLVTSLEE DTDNSSLSPP PAKLSWSFA 60 

PSSNEVETTS LNDVTLSLLP SNETEKTKIT IVKTFNASGV KPQRNICNLS SICNDSAFFR 120 

GEIMFQYDKE STVPQNQHIT NGTIiTGVIiSL SELKRSELNK TLQTLSETYF IMCATAEAQS 180 

TLNCTFTIKJj NNTMNACAAI AALERVKIRP MEHCCCSVRI PCPSSPEELG KLQCDLQDPI 240 

VCLADHPRGP PFSSSQSIPV VPRATVLSQV PKATSFAEPP DYSPVTHNVP SPIGEIQPfcS 300 

PQPSAPIASS PAIDMPPQSE TISSPMPQTH VSGTPPPVKA SFSSPTVSAP ANVNTTSAPP 360 

VQTDIVNTSS ISDLENQVLQ MEKALSIiGSL EPNLAGEMIN QVSRLLHSPP DMLAPLAQRL 420 

LKWDDIGLQ LNFSNTTISL TSPSLAIAVI RVNASSFNTT TFVAQDPANL QVSLETQAPE 480 

NSIGTITLPS SLMNNLPAHD MELASRVQFN FFETPALFQD PSLENLSLIS YVISSSVANL 540 

TVRNLTRNVT VTLKHINPSQ DELTVRCVFW DLGRNGGRGG WSDNGCSVKD RRLNETICTC 600 

SHLTSFGVLL DLSRTSVLPA QMMALTFITY IGCGLSSIFL SVTLVTYIAF EKIRRDYPSK 660 

ILIQLCAALL LLNLVFLLDS WIALYKMQGL CISVAVFLHY FLLVSFTWMG LEAFHMYLAL 720 

VKVFNTYIRK YILKFCIVGW GVPAVWTII LTISPDNYGL GSYGKFPNGS PDDFCWINNN 780 

AVFYITWGY FCVIFL.LNVS MFIWLVQLC RIKKKKQLGA QRKTSIQDLR SIAGLTFLLG 840 

ITWGFAFFAW GPVNVTFMYL FAIFNTLQGF FIFIFYCVAK ENVRKQWRRY LCCGKLRLAE 900 

NSDWSKTATN GLKKQTVNQG VSSSSNSLQS SSNSTNSTTL LVNNDCSVHA SGNGNASTER 960 
KGVSFSVQNG DVCLHDFTGK QHMFNEKEDS CNGKGRMALR RTSKRGSLHF IEQM 

Seq ID NO: 493 DMA sequence 
Nucleic Acid Accession ft: NM_015507 
Coding sequence: 241.. 1902 

1 11 21 31 41 51 

I I I I I I 

CCGCAGAGGA GCCTCGGCCA GGCTAGCCAG GGCGCCCCCA GCCCCTCCCC AGGCCGCGAG 60 

CGCCCCTGCC GCGGTGCCTG GCCTCCCCTC CCAGACTGCA GGGACAGCAC CCGGTAACTG 120 

CGAGTGGAGC GGAGGACCCG AGCGGCTGAG GAGAGAGGAG GOGGCGGCTT AGCTGCTACG 180 

GGGTCCGGCC GGCGCCCTCC CGAGGGGGGC TCAGGAGGAG GAAGGAGGAC CCGTGCGAGA 240 

ATGCCTCTGC CCTGGAGCCT TGCGCTCCCG CTGCTGCTCT CCTGGGTGGC AGGTGGTTTC 300 

GGGAACGCGG CCAGTGCAAG GCATCACGGG TTGTTAGCAT CGGCACGTCA GCCTGGGGTC 360 

TGTCACTATG GAACTAAACT GGCCTGCTGC TACGGCTGGA GAAGAAACAG CAAGGGAGTC 420 

TGTGAAGCTA CATGCGAACC TGGATGTAAG TTTGGTGAGT GCGTGGGACC AAACAAATGC 480 

AGATGCTTTC CAGGATACAC CGGGAAAACC TGCAGTCAAG ATGTGAATGA GTGTGGAATG 540 

AAACCCCGGC CATGCCAACA CAGATGTGTG AATACACACG GAAGCTACAA GTGCTTTTGC 600 

CTCAGTGGCC. ACATGCTCAT GCCAGATGCT ACGTGTGTGA ACTCTAGGAC ATGTGCCATG 660 

ATAAACTGTC AGTACAGCTG TGAAGACACA GAAGAAGGGC CACAGTGCCT GTGTCCATCC 720 

TCAGGACTCC GCCTGGCCCC AAATGGAAGA GACTGTCTAG ATATTGATGA ATGTGCCTCT 7 BO 

GGTAAAGTCA TCTGTCCCTA CAATCGAAGA TGTGTGAACA CATTTGGAAG CTACTACTGC 840 

AAATGTCACA TTGGTTTCGA ACTGCAATAT ATCAGTGGAC GATATGACTG TATAGATATA 900 

AATGAATGTA CTATGGATAG CCATACGTGC AGCCACCATG CCAATTGCTT CAATACCCAA 960 

GGGTCCTTCA AGTGTAAATG CAAGCAGGGA TATAAAGGCA ATGGACTTCG GTGTTCTGCT 1020 

ATCCCTGAAA ATTCTGTGAA GGAAGTCCTC AGAGCACCTG GTACCATCAA AGACAGAATC 1080 

AAGAAGTTGC TTGCTCACAA AAACAGCATG AAAAAGAAGG CAAAAATTAA AAATGTTACC 1140 

CCAGAACCCA CCAGGACTCC TACCCCTAAG GTGAACTTGC AGCCCTTCAA CTATGAAGAG 1200 

ATAGTTTCCA GAGGCGGGAA CTCTCATGGA GGTAAAAAAG GGAATGAAGA GAAAATGAAA 1260 

GAGGGGCTTG AGGATGAGAA AAGAGAAGAG AAAGCCCTGA AGAATGACAT AGAGGAGCGA 1320 

AGCCTGCGAG GAGATGTGTT TTTCCCTAAG GTGAATGAAG CAGGTGAATT CGGCCTGATT 1380 

CTGGTCCAAA GGAAAGCGCT AACTTCCAAA CTGGAACATA AAGATTTAAA TATCTCGGTT 1440 

GACTGCAGCT TCAATCATGG GATCTGTGAC TGGAAACAGG ATAGAGAAGA TGATTTTGAC 1500 

TGGAATCCTG CTGATCGAGA TAATGCTATT GGCTTCTATA TGGCAGTTCC GGCCTTGGCA 1560 

GGTCACAAGA AAGACATTGG CCGATTGAAA CTTCTCCTAC CTGACCTGCA ACCCCAAAGC 1620 

AACTTCTGTT TGCTCTTTGA TTACCGGCTG GCCGGAGACA AAGTCGGGAA ACTTCX3AGTG 1680 

TTTGTGAAAA ACAGTAACAA TGCCCTGGCA TGGGAGAAGA CCACGAGTGA GGATGAAAAG 1740 

TGGAAGACAG GGAAAATTCA GTTGTATCAA GGAACTGATG CTACCAAAAG CATCATTTTT 1800 

GAAGCAGAAC GTGGCAAGGG CAAAACCGGC GAAATCGCAG TGGATGGCGT CTTGCTTGTT 1860 

TCAGGCTTAT GTCCAGATAG CCTTTTATCT GTGGATGACT GAATGTTACT ATCTTTATAT 1920 

TTGACTTTGT ATGTCAGTTC CCTGGTTTTT TTGATATTGC ATCATAGGAC CTCTGGCATT 1980 

TTAGAATTAC TAGCTGAAAA ATTGTAATGT ACCAACAGAA ATATTATTGT AAGATGCCTT 2040 
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TCTTGTATAA GATATGCCAA TATTTGCTTT AAATATCATA TCACTGTATC TTCTCAGTCA 2100 

TTTCTQAATC TTTCCACATT ATATTATAAA ATATGGAAAT GTCAGTTTAT CTCCCCTCCT 2160 

CAGTATATCT GATTTGTATA AGTAAGTTGA TGAGCTTCTC TCTACAACAT T TCTA GAAAA 2220 

TAGAAAAAAA AGCACAGAGA AATGTTTAAC TGTTTGACTC TTATGATACT TCTTGGAAAC 2280 

5 TATGACATCA AAGATAGACT TTTGCCTAAG TGGCTTAGCT GGGTCTTTCA TAGCCAAACT 2340 
TGTATATTTA AATTCTTTGT AATAATAATA TCCAAATCAT CAAAAAAAAA AAAAAAAA 
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Seq ID NO: 494 Protein sequence 
Protein Accession ft: NP 056322 



MPLPWSLALP 
CEATCEPGCK 
LSGHMLMPDA 
GKVICPYNRR 
GSFKCKCKQG 
PEPTRTPTPK 
SLRGDVFFPK 
WNPADRDNAI 
FVKNSNNALA 
SGLCPDSLLS 



11 
I 

LLLSWVAGGF 
FGECVGPNKC 
TCVNSRTCAM 
CVNTFGSYYC 
YKGNGLRCSA 
VNLQPFNYEE 
VNEAGBFGLI 
GFYMAVPALA 
WEKTTSEDEK 
VDD 



21 
I 

GNAASARHHG 
RCFPGYTGKT 
INCQYSCEDT 
KCHIGFEtiQY 
IPENSVKEVL 
IVSRGGNSHG 
LVQRKALTSK 
GHKKDIGRLK 
WKTGKIQbYQ 



31 
I 

LIiASARQPGV 
CSQDVNECGM 
EEGPQCLCPS 
ISGRYDCIDI 
RAPGTIKDRI 
GKKGNEEKMK 
LEHKDLNISV 
LLLPDLQPQS 
GTDATKSIIF 



41 
I 

CHYGTKLACC 
KPRPCQHRCV 
SGLRLAPNGR 
NECTMDSHTC 
KKLLAHKNSM 



DCSFNHGICD 
NFCLLFDYRL 
EAERGKGKTG 



Seq ID NO: 49S DNA sequence 

Nucleic Acid Accession ft: NM_003506.1 

Coding sequence: 259.. 2379 



11 



51 

I 

YGWRRNSKGV 
NTHGSYKCFC 
DCLDIDECAS 
SHHANCFNTQ 
KKKAKIKNVT 
KALKNDIEER 
HKQDREDDFD 
AGDKVGKLRV 
EIAVDGVLLV 



51 



60 
120 
180 
240 
300 
360 
420 
480 
540 



21 31 41 

I I I I I I 

GCAGCTCCAG TCCCGGACGC AACCCCGGAG CCGTCTCAGG TCCCTGGGGG GAACGGTGGG 60 

TTAGACGGGG ACGGGAAGGG ACAGCGGCCT TCGACCGCCC CCCGAGTAAT TGACCCAGGA 120 

CTCATTTTCA GGAAAGCCTG AAAATGAGTA AAATAGTGAA ATGAGGAATT TGAACATTTT 180 

ATCTTTGGAT GGGGATCTTC TGAGGATGCA AAGAGTGATT CATCCAAGCC ATGTGGTAAA 240 

ATCAGGAATT TGAAGAAAAT GGAGATGTTT ACATTTTTGT TGACGTGTAT TTTTCTACCC 300 

CTCCTAAGAG GGCACAGTCT CTTCACCTGT GAACCAATTA CTGTTCCCAG ATGTATGAAA 360 

ATGGCCTACA ACATGACGTT TTTCCCTAAT CTGATGGGTC ATTATGACCA GAGTATTGCC 420 

GCGGTGGAAA TGGAGCATTT TCTTCCTCTC GCAAATCTGG AATGTTCACC AAACATTGAA 480 

ACTTTCCTCT GCAAAGCATT TGTACCAACC TGCATAGAAC AAATTCATGT GGTTCCACCT 540 

TGTCGTAAAC TTTGTGAGAA AGTATATTCT GATTGCAAAA AATTAATTGA CACTTTTGGG 600 

ATCCGATGGC CTGAGGAGCT TGAATGTGAC AGATTACAAT ACTGTGATGA GACTGTTCCT 660 

GTAACTTTTG ATCCACACAC AGAATTTCTT GGTCCTCAGA AGAAAACAGA ACAAGTCCAA 720 

AGAGACATTG GATTTTGGTG TCCAAGGCAT CTTAAGACTT CTGGGGGACA AGGATATAAG 780 

TTTCTGGGAA TTGACCAGTG TGCGCCTCCA TGCCCCAACA TGTATTTTAA AAGTGATGAG 840 

CTAGAGTTTG CAAAAAGTTT TATTGGAACA GTTTCAATAT TTTGTCTTTG TGCAACTCTG 900 

TTCACATTCC TTACTTTTTT AATTGATGTT AGAAGATTCA GATACCCAGA GAGACCAATT 960 

ATATATTACT CTGTCTGTTA CAGCATTGTA TCTCTTATGT ACTTCATTGG ATTTTTGCTG 1020 

GGCGATAGCA CAGCCTGCAA TAAGGCAGAT GAGAAGCTAG AACTTGGTGA CACTGTTGTC 1080 

CTAGGCTCTC AAAATAAGGC TTGCACCGTT TTGTTCATGC TTTTGTATTT TTTCACAATG 1140 

GCTGGCACTG TGTGGTGGGT GATTCTTACC ATTACTTGGT TCTTAGCTGC AGGAAGAAAA 1200 

TGGAGTTGTG AAGCCATCGA GCAAAAAGCA GTGTGGTTTC ATGCTGTTGC ATGGGGAACA 1260 

CCAGGTTTCC TGACTGTTAT GCTTCTTGCT CTGAACAAAG TTGAAGGAGA CAACATTAGT 1320 

GGAGTTTGCT TTGTTGGCCT TTATGACCTG GATGCTTCTC GCTACTTTGT ACTCTTGCCA 1380 

CTGTGCCTTT GTGTGTTTGT TGGGCTCTCT CTTCTTTTAG CTGGCATTAT TTCCTTAAAT 1440 

CATGTTCGAC AAGTCATACA ACATGATGGC CGGAACCAAG AAAAACTAAA GAAATTTATG 1500 

ATTCGAATTG GAGTCTTCAG CGGCTTGTAT CTTGTGCCAT TAGTGACACT TCTCGGATGT 1560 

TACGTCTATG AGCAAGTGAA CAGGATTACC TGGGAGATAA CTTGGGTCTC TGATCATTGT 1620 

CGTCAGTACC ATATCCCATG TCCTTATCAG GCAAAAGCAA AAGCTCGACC AGAATTGGCT 1680 

TTATTTATGA TAAAATACCT GATGACATTA ATTGTTGGCA TCTCTGCTGT CTTCTGGGTT 1740 

GGAAGCAAAA AGACATGCAC AGAATGGGCT GGGTTTTTTA A AOGAAAT CG CAAGAGAGAT 1800 

CCAATCAGTG AAAGTCGAAG AGTACTACAG GAATCATGTG AGTTTTTCTT AAAGCACAAT 1860 

TCTAAAGTTA AACACAAAAA GAAGCACTAT AAACCAAGTT CACACAAGCT GAAGGTCATT 1920 

TCCAAATCCA TGGGAACCAG CACAGGAGCT ACAGCAAATC ATGGCACTTC TGCAGTAGCA 1980 

ATTACTAGCC ATGATTACCT AGGACAAGAA ACTTTQACAG AAATCCAAAC CTCACCAGAA 2040 

ACATCAATGA GAGAGGTGAA AGOGGACGGA GCTAGCACCC CCAGGTTAAG AGAACAGGAC 210 0 

TGTGGTGAAC CTGCCTCGCC AGCAGCATCC ATCTCCAGAC TCTCTGGGGA AGAGGTCGAC 2160 

GGGAAGGGCC AGGCAGGCAG TGTATCTGAA AGTGCGCGGA GTGAAGGAAG GATTAGTCCA 2220 

AAGAGTGATA TTACTGACAC TGGCCTGGCA CAGAGCAACA ATTTGCAGGT CCCCAGTTCT 2280 

TCAGAACCAA GCAGCCTCAA AGGTTCCACA TCTCTGCTTG TTCACCCAGT TTCAGGAGTG 2340 

AGAAAAGAGC AGGGAGGTGG TTGTCATTCA GATACTTGAA GAACATTTTC TCTCGTTACT 2400 

CAGAAGCAAA TTTGTGTTAC ACTGGAAGTG ACCTATGCAC TGTTTTGTAA GAATCACTGT 2460 

TACGTTCTTC TTTTGCACTT AAAGTTGCAT TGCCTACTGT TATACTGGAA AAAATAGAGT 2520 

TCAAGAATAA TATGACTCAT TTCACACAAA GGTTAATGAC AACAATATAC CTGAAAACAG 2580 

AAATGTGCAG GTTAATAATA TTTTTTTAAT AGTGTGGGAG GACAGAGTTA GAGGAATCTT 2640 

CCTTTTCTAT TTATGAAGAT TCTACTCTTG GTAAGAGTAT TTTAAGATGT ACTATGCTAT 2700 

TTTACCTTTT TGATATAAAA TCAAGATATT TCTTTGCTGA AGTAT TTAAA TCTTATCCTT 2760 

GTATCTTTTT ATACATATTT GAAAATAAGC TTATATGTAT TTGAACTTTT TTGAAATCCT 2820 

ATTCAAGTAT TTTTATCATG CTATTGTGAT ATTTTAGCAC TTTGGTAGCT TTTACACTGA 2880 

ATTTCTAAGA AAATTGTAAA ATAGTCTTCT TTTATACTGT AAAAAAAGAT ATACCAAAAA 2940 

GTCTTATAAT AGGAATTTAA CTTTAAAAAC CCACTTATTG ATACCTTACC ATCTAAAATG 3000 

TGTGATTTTT ATAGTCTCGT TTTAGGAATT TCACAGATCT AAATTATGTA ACTGAAATAA 3060 

GGTGCTTACT CAAAGAGTGT CCACTATTOA TTGTATTATG CTGCTCACTG ATCCTTCTGC 3120 
ATATTTAAAA TAAAATGTCC TAAAGGGTTA GTAGACAAAA TGTTAGTCTT TTGTATATTA 3180 

GGCCAAGTGC AATTGACTTC CCTTTTTTAA TGTTTCATGA CCACCCATTG ATTGTATTAT 3240 
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AACCACTTAC AGTTGCTTAT ATTTTTTGTT TTAACTTTTG TTTCTTAACA TTTAGAATAT 3300 
TACATTTTGT ATTATACAGT ACCTTTCTCA GACATTTTGT AG 

Seq ID NO: 496 Protein sequence 
Protein Accession #: NP_0O3497.1 

1 11 21 31 41 51 

I I I I I I 

MEMPTFLLTC IFLPLLRGHS LFTCEPITVP RCMKMAYNMT FFPNLMGHYD QSIAAVEMEH 60 

FLPLANLECS PNIETFLCKA FVPTCIEQIH WPPCRKLCE KVYSDCKKLI DTFGIRWPEE 120 

LECDRLQYCD ETVPVTFDPH TEFLGPQKKT EQVQRDIGFW CPRHLKTSGG QGYKFLGIDQ 180 

CAPPCPNMYF KSDELEFAKS FIGTVSIFCL CATLFTFLTF LIDVRRFRYP ERPIIYYSVC 240 

YSIVSLMYFI GFLLGDSTAC NKADEKLELG DTWLGSQNK ACTVLFMLLY FFTMAGTVWW 300 

VILTITWFLA AGRKWSCEAI EQKAVWFHAV AWGTPGFLTV MLLALNKVEG DNISGVCFVG 360 

LYDLDASRYF VLLPLCLCVF VGLSLLLAGI ISLNHVRQVI QHDGRNQEKL KKFMIRIGVF 420 

SGLYLVPLVT LLGCYVYEQV NRITWEITWV SDHCRQYHIP CPYQAKAKAR PELALFMIKY 480 

LMTLIVGISA VFWVGSKKTC TEWAGFFKRN RKRDPISESR RVLQESCEFF LKHNSKVKHK 540 

KKHYKPSSHK LKVISKSMGT STGATANHGT SAVAITSHDY LGQETLTEIQ TSPETSMREV 600 

KADGASTPRL REQDCGEPAS PAASISRLSG EQVDGKGQAG SVSESARSEG RISPKSDITD 660 
TGLAQSNNLQ VPSSSEPSSL KGSTSLLVHP VSGVRKEQGG GCHSDT 

Seq ID NO: 497 DNA sequence 
Nucleic Acid Accession #: NM_005046 
Coding sequence: 16.. 777 

1 11 21 31 41 51 

I I I i I I 

GGATTTCCGG GCTCCATGGC AAGATCCCTT CTCCTGCCCC TGCAGATCCT ACTGCTATCC 60 

TTAGCCTTGG AAACTGCAGG AGAAGAAGCC CAGGGTGACA AGATTATTGA TGGCGCCCCA 120 

TGTGCAAGAG GCTCCCACCC ATGGCAGGTG GCCCTGCTCA GTGGCAATCA GCTCCACTGC 180 

GGAGGCGTCC TGGTCAATGA GCGCTGGGTG CTCACTGCCG CCCACTGCAA GATGAATGAG 240 

TACACCGTGC ACCTGGGCAG TGATACGCTG GGCGACAGGA GAGCTCAGAG GATCAAGGCC 300 

TCGAAGTCAT TCCGCCACCC CGGCTACTCC ACACAGACCC ATGTTAATGA CCTCATGCTC 360 

GTGAAGCTCA ATAGCCAGGC CAGGCTGTCA TCCATGGTGA AGAAAGTCAG GCTGCCCTCC 420 

CGCTGCGAAC CCCCTGGAAC CACCTGTACT GTCTCCGGCT GGGGCACTAC CACGAGCCCA 480 

GATGTGACCT TTCCCTCTGA CCTCATGTGC GTGGATGTCA AGCTCATCTC CCCCCAGGAC 540 

TGCACGAAGG TTTACAAGGA CTTACTGGAA AATTCCATGC TGTGCGCTGG CATCCCCGAC 600 

TCCAAGAAAA ACGCCTGCAA TGGTGACTCA GGGGGACCGT TGGTGTGCAG AGGTACCCTG 660 

CAAGGTCTGG TGTCCTGGGG AACTTTCCCT TGCGGCCAAC CCAATGACCC AGGAGTCTAC 720 

ACTCAAGTGT GCAAGTTCAC CAAGTGGATA AATGACACCA TGAAAAAGCA TCGCTAACGC 780 

CACACTGAGT TAATTAACTG TGTGCTTCCA ACAGAAAATG CACAGGAGTG AGGACGCCGA 840 

TGACCTATGA AGTCAAATTT GACTTTACCT TTCCTCAAAG ATATATTTAA ACCTCATGCC 900 

CTGTTGATAA ACCAATCAAA TTGGTAAAGA CCTAAAACCA AAACAAATAA AGAAACACAA 960 
AACCCTCAA 

Seq ID NO: 498 Protein sequence 
Protein Accession #: NP_005037 

1 11 21 31 41 51 

I I I I I I 

MARSLLLPLQ ILLLSLALET AGEEAQGDKI IDGAPCARGS HPWQVALLSG NQLHCGGVLV 60 

NERWVLTAAH CKMNEYTVHL GSDTLGDRRA QRIKASKSFR HPGYSTQTHV NDLMLVKLNS 120 

QARLSSMVXK VRLPSRCEPP GTTCTVSGWG TTTSPDVTFP SDLMCVDVKL ISPQDCTKVY 180 

KDIiLENSMLC AGIPDSKKNA CNGDSGGPLV CRGTLQGLVS WGTFPCGQPN DPGVYTQVCK 240 
FTKWINDTMK KHR 

Seq ID NO: 499 DNA sequence 
Nucleic Acid Accession U : NM_007196 
Coding sequence: 182.. 962 

1 11 21 31 41 51 

I I I I I 1 

GTTCCCAGAA GCTCCCCAGG CTCTAGTGCA GGAGGAGAAG GAGGAGGAGC AGGAGGTGGA 60 

GATTCCCAGT TAAAAGGCTC CAGAATCGTG TACCAGGCAG AGAACTGAAG TACTGGGGCC 120 

TCCTCCACTG GGTCCGAATC AGTAGGTGAC CCCGCCCCTG GATTCTGGAA GACCTCACCA 180 

TGGGACGCCC COGACCTCGT GCGGCCAAGA CGTGGATGTT CCTGCTCTTG CTGGGGGGAG 240 

CCTGGGCAGG ACACTCCAGG GCACAGGAGG ACAAGGTGCT GGGGGGTCAT GAGTGCCAAC 300 

CCCATTCGCA GCCTTGGCAG GCGGCCTTGT TCCAGGGCCA GCAACTACTC TGTGGCGGTG 360 

TCCTTGTAGG TGGCAACTGG GTCCTTACAG CTGCCCACTG TAAAAAACCG AAATACACAG 420 

TACGCCTGGG AGACCACAGC CTACAGAATA AAGATGGCCC AGAGCAAGAA ATACCTGTGG 480 

TTCAGTCCAT CCCACACCCC TGCTACAACA GCAGCGATGT GGAGGACCAC AACCATGATC 540 

TGATGCTTCT TCAACTGCGT GACCAGGCAT CCCTGGGGTC CAAAGTGAAG CCCATCAGCC 600 

TGGCAGATCA TTGCACCCAG CCTGGCCAGA AGTGCACCGT CTCAGGCTGG GGCACTGTCA 660 

CCAGTCCCCG AGAGAATTTT CCTGACACTC TCAACTGTGC AGAAGTAAAA ATCTTTCCCC 720 

AGAAGAAGTG TGAGGATGCT TACCCGGGGC AGATCACAGA TGGCATGGTC TGTGCAGGCA 780 

GCAGCAAAGG GGCTGACACG TGCCAGGGCG ATTCTGGAGG CCCCCTGGTG TGTGATGGTX3 840 

CACTCCAGGG CATCACATCC TGGGGCTCAG ACCCCTGTGG GAGGTCCGAC AAACCTGGCG 900 

TCTATACCAA CATCTGCCGC TACCTGGACT GGATCAAGAA GATCATAGGC AGCAAGGGCT 960 
GATTCTAGGA TAAGCACTAG ATCTCCCTTA ATAAACTCAC AACTCTC 

Seq ID NO: 500 Protein sequence 
Protein Accession #: NP_0 09127 

1 11 21 31 41 51 
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MGRPRPRAAK TWMFLLLLGG AWAGHSRAQE DKVLGGHECQ PHSQPWQAAL FQGQQLLCGG 60 

VLVGGNWVLT AAHCKKPKYT VRLGDHSLQN KDGPEQEIPV VQSIPHPCYN SSDVEDHNHD 120 

LMLLQLRDQA SLGSKVKPIS LADHCTQPGQ KCTVSGWGTV TSPRENFPDT LNCAEVKIFP 180 

QKKCEDAYPG QITDGMVCAG SSKGADTCQG DSGGPLVCDG ALQGITSKGS DPCGRSDKPG 240 
VYTNICRYLD WIKKIIGSKG 



Seq ID NO: 501 DNA sequence 
Nucleic Acid Accession ft: NM_006103 
Coding sequence: 29.. 406 

1 11 21 31 41 51 

I 1 I I I i 

CACCTGCACC CCGCCCGGGC ATAGCACCAT GCCTGCTTGT CGCCTAGGCC CGCTAGCCGC 60 
CGCCCTCCTC CTCAGCCTGC TGCTGTTCGG CTTCACCCTA GTCTCAGGCA CAGGAGCAGA 120 
GAAGACTGGC GTGTGCCCCG AGCTCCAGGC TGACCAGAAC TGCACGCAAG AGTGCGTCTC 180 
GGACAGCGAA TGCGCCGACA ACCTCAAGTG CTGCAGCGCG GGCTGTGCCA CCTTCTGCCT 240 
TCTCTGCCCA AATGATAAGG AGGGTTCCTG CCCCCAGGTG AACATTAACT TTCCCCAGCT 300 
CGGCCTCTGT CGGGACCAGT GCCAGGTGGA CAGCCAGTGT CCTGGCCAGA TGAAATGCTG 360 
CCGCAATGGC TGTGGGAAGG TGTCCTGTGT CACTCCCAAT TTCTGAGGTC CAGCCACCAC 420 
CAGGCTGAGC AGTGAGGAGA GAAAGTTTCT GCCTGGCCCT GCATCTGGTT CCAGCCCACC 480 
TGCCCTCCCC TTTTTCGGGA CTCTGTATTC CCTCTTGGGC TGACCACAGC TTCTCCCTTT 540 
CCCAACCAAT AAAGTAACCA CTTTCAGCAA AAAAAAAAAA AAAA 



Seq ID NO: 502 Protein sequence 
Protein Accession ft: NP_OOG094 

1 11 21 31 41 51 

I 1 I I I I 

MPACRLGPLA AALLLSLLLF GFTLVSGTGA EKTGVCPELQ ADQNCTQECV SDSECADNLK 60 
CCSAGCATFC LLCPNDKEGS CPQVNINFPQ LGLCRDQCQV DSQCPGQMKC CRNGCGKVSC 120 
VTPNF 



Seq ID NO: 503 DNA sequence 
Nucleic Acid Accession ft: NM_002407 
Coding sequence: 65.. 352 

1 11 21 31 41 SI 

I I I I I I 

CCTCCACAGC AACTTCCTTG ATCCCTGCCA CGCACGACTG AACACAGACA GCAGCCGCCT 60 
CGCCATGAAG CTGCTGATGG TCCTCATGCT GGCGGCCCTC CTCCTGCACT GCTATGCAGA 120 
TTCTGGCTGC AAACTCCTGG AGGACATGGT TGAAAAGACC ATCAATTCCG ACATATCTAT 180 
ACCTGAATAC AAAGAGCTTC TTCAAGAGTT CATAGACAGT GATGCCGCTG CAGAGGCTAT 240 
GGGGAAATTC AAGCAGTGTT TCCTCAACCA GTCACATAGA ACTCTGAAAA ACTTTGGACT 300 
GATGATGCAT ACAGTGTACG ACAGCATTTG GTGTAATATG AAGAGTAATT AACTTTACCC 360 
AAGGCGTTTG GCTCAGAGGG CTACAGACTA TGGCCAGAAC TCATCTGTTG ATTG CTAGAA 420 
ACCACTTTTC TTTCTTGTGT TGTCTTTTTA TGTGGAAACT GCTAGACAAC TGTTGAAACC 480 
TCAAATTCAT TTCCATTTCA ATAACTAACT GCAAATC 



Seq ID NO: 504 Protein sequence 
Protein Accession ft: NP_002398 

1 11 21 31 41 51 

I I I I I I 

MKLLMVLMLA ALLLHCYADS GCKLLEDMVE KTINSDISIP EYKELLQBFI DSDAAAEAMG 60 
KPKQCPLNQS HRTLKNFGLM MHTVYDSIWC NMKSN 



Seq ID NO: 505 DNA sequence 

Nucleic Acid Accession ft: NM_014791.1 

Coding sequence: 171.. 2 12 6 

1 11 21 31 41 51 

I I I I I I 

TTGGCGGGCG GAAGCGGCCA CAACCCGGCG ATCGAAAAGA TTCTTAGGAA OGCCGTACCA 60 

GCCGCGTCTC TCAGGACAGC AGGCCCCTGT CCTTCTGTCG GGCGCCGCTC AGCCGTGCCC 120 

TCOGCCCCTC AGGTTCTTTT TCTAATTCCA AATAAACTTG CAAGAGGACT ATGAAAGATT 180 

ATGATGAACT TCTCAAATAT TATGAATTAC ATGAAACTAT TGGGACAGGT GGCTTTGCAA 240 

AGGTCAAACT TGCCTGCCAT ATCCTTACTG GAGAGATGGT AGCTATAAAA ATCATGGATA 300 

AAAACACACT AGGGAGTGAT TTGCCCCGGA TCAAAACGGA GATTGAGGCC TTGAAGAACC 360 

TGAGACATCA GCATATATGT CAACTCTACC ATGTGCTAGA GACAGCCAAC AAAATATTCA 420 

TGGTTCTTGA GTACTGCCCT GGAGGAGAGC TGTTTGACTA TATAATTTCC CAGGATCGCC 480 

TGTCAGAAGA GGAGACCCGG GTTGTCTTCC GTCAGATAGT ATCTGCTGTT GCTTATGTGC 540 

ACAGCCAGGG CTATGCTCAC AGGGACCTCA AGCCAGAAAA TTTGCTGTTT GATGAATATC 600 

ATAAATTAAA GCTGATTGAC TTTGGTCTCT GTGCAAAACC CAAGGGTAAC AAGGATTACC 660 

ATCTACAGAC ATGCTGTGGG AGTCTGGCTT ATGCAGCACC TGAGTTAATA CAAGGCAAAT 720 

CATATCTTGG ATCAGAGGCA GATGTTTGGA GCATGGGCAT ACTGTTATAT GTTCTTATGT 780 

GTGGATTTCT ACCATTTGAT GATGATAATG TAATGGCTTT ATACAAGAAG ATTATGAGAG 840 

GAAAATATGA TGTTCCCAAG TGGCTCTCTC CCAGTAGCAT TCTGCTTCTT CAACAAATGC 900 

TGCAGGTGGA CCCAAAGAAA CGGATTTCTA TGAAAAATCT ATTGAACCAT CCCTGGATCA 960 

TGCAAGATTA CAACTATCCT GTTGAGTGGC AAAGCAAGAA TCCTTTTATT CACCTCGATG 1020 

ATGATTGCGT AACAGAACTT TCTGTACATC ACAGAAACAA CAGGCAAACA ATGGAGGATT 1080 

TAATTTCACT GTGGCAGTAT GATCACCTCA CGGCTACCTA TCTTCTGCTT CTAGCCAAGA 1140 

AGGCTCGGGG AAAACCAGTT CGTTTAAGGC TTTCTTCTTT CTCCTGTGGA CAAGCCAGTG 1200 

CTACCCCATT CACAGACATC AAGTCAAATA ATTGGAGTCT GGAAGATGTG ACCGCAAGTG 1260 
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ATAAAAATTA TGTGGCGGGA TTAATAGACT ATGATTGGTG TGAAGATGAT TTATCAACAG 1320 

GTGCTGCTAC TCCCCGAACA TCACAGTTTA CCAAGTACTQ GACAQAATCA AATGGGGTGG 1380 

AATCTAAATC ATTAACTCCA GCCTTATGCA GAACACCTGC AAATAAATTA AAGAACAAAG 1440 

AAAATGTATA TACTCCTAAG TCTGCTGTAA AGAATGAAGA GTACTTTATG TTTCCTGAGC 1500 

CAAAGACTCC AGTTAATAAG AACCAGCATA AGAGAGAAAT ACTCACTACG CCAAATOGTT 1560 

ACACTACACC CTCAAAAGCT AGAAACCAGT GCCTGAAAGA AACTCCAATT AAAATACCAG 1620 

TAAATTCAAC AGGAACAGAC AAGTTAATGA CAGGTGTCAT TAGCCCTGAG AGGCGGTGCC 1680 

GCTCAGTGGA ATTGGATCTC AACCAAGCAC ATATGGAGGA GACTCCAAAA AGAAAGGGAG 1740 

CCAAAGTGTT TGGGAGCCTT GAAAGGGGGT TGGATAAGGT TATCACTGTG CTCACCAGGA 1800 

GCAAAAGGAA GGGTTCTGCC AGAGACGGGC CCAGAAGACT AAAGCTTCAC TATAATGTGA 1860 

CTACAACTAG ATTAGTGAAT CCAGATCAAC TGTTGAATGA AATAATGTCT ATTCTTCCAA 1920 

AGAAGCATGT TGACTTTGTA CAAAAGGGTT ATACACTGAA GTGTCAAACA CAGTCAGATT 1980 

TTGGGAAAGT GACAATGCAA TTTGAATTAG AAGTGTGCCA GCTTCAAAAA CCCGATGTGG 2040 

TGGGTATCAG GAGGCAGCGG CTTAAGGGCG ATGCCTGGGT TTACAAAAGA TTAGTGGAAG 2100 

ACATCCTATC TAGCTGCAAG GTATAATTGA TGGATTCTTC CATCCTGCCG GATGAGTGTG 2160 

GGTGTGATAC AGCCTACATA AAGACTGTTA TGATCGCTTT GATTTTAAAG TTCATTGGAA 2220 

CTACCAACTT GTTTCTAAAG AGCTATCTTA AGACCAATAT CTCTTTGTTT TTAAACAAAA 2280 

GATATTATTT TGTGTATGAA TCTAAATCAA GCCCATCTGT CATTATGTTA CTGTCTTTTT 2340 

TAATCATGTG GTTTTGTATA TTAATAATTG TTGACTTTCT TAGATTCACT TCCATATGTG 2400 

AATGTAAGCT CTTAACTATG TCTCTTTGTA ATGTGTAATT TCTTTCTGAA ATAAAACCAT 2460 
TTGTGAATAT 

Seq ID NO: 506 Protein sequence 
Protein Accession #: NP_055606.l 

1 11 21 31 41 51 

I I I I I I ' 

MKDYDELLKY YELHETIGTG GFAKVKLACH ILTGEMVAIK IMDKNTLGSD LPRIKTEIEA 60 

LKNLRHQHIC QLYHVLETAN KIFMVLEYCP GGELFDYIIS QDRLSEEETR WFRQIVSAV 120 

AYVHSQGYAH RDLKPENLLF DEYHKLKLID FGLCAKPKGN KDYHLQTCCG SIAYAAPELI 180 

QGKSYLGSEA DVWSMGILLY VLMCGFLPFD DDNVMALYKK IMRGKYDVPK WLSPSSILLL 240 

QQMLQVDPKK RISMKNLLNH PWIMQDYNYP VEWQSKNPFI HLDDDCVTEL SVHHRNNRQT 300 

MEDLISLWQY DHLTATYLLL LAKKARGKPV RLRLSSFSCG QASATPFTDI KSNNWSLEDV 360 

TASDKNYVAG LIDYDWCEDD LSTGAATPRT SQFTKYWTES NGVESKSLTP ALCRTPANKL 420 

KNKENVYTPK SAVKNEEYFM FPEPKTPVNK NQHKREILTT PNRYTTPSKA RNQCLKETPI 480 

KIPVNSTGTD KLMTGVISPE RRCRSVELDL NQAHMEETPK RKGAKVFGSL ERGLDKVITV 540 

LTRSKRKGSA RDGPRRLKLH YKVTTTRLVN PDQLLNEIMS ILPKKHVDFV QKGYTLKGQT 600 
QSDFGKVTMQ FELEVCQLQK PDWGIRRQR LKGDAWVYKR LVEDILSSCK V 

Seq ID NO: . 507 DNA sequence 
Kucleic Acid Accession ft: NM_0005B2 
Coding sequence: 88.. 990 

1 11 21 31 41 51 

1 I I I I I 

GCAGAGCACA GCATCGTCGG GACCAGACTC GTCTCAGGCC AGTTGCAGCC TTCTCAGCCA 60 

AACGCCGACC AAGGAAAACT CACTACCATG AGAATTGCAQ TGATTTGCTT TTGCCTCCTA 120 

GGCATCACCT GTGCCATACC AGTTAAACAG GCTGATTCTG GAAGTTCTGA GGAAAAGCAG 180 

CTTTACAACA AATACCCAGA TGCTGTGGCC ACATGGCTAA ACCCTGACCC ATCTCAGAAG 240 

CAGAATCTCC TAGCCCCACA GACCCTTCCA AGTAAGTCCA AGGAAAGCCA TGACCACATG 300 

GATGATATGG ATGATGAAGA TGATGATGAC CATGTGGACA GCCAGGACTC CATTGACTOG 360 

AACGACTCTG ATGATGTAGA TGACACTGAT GATTCTCACC AGTCTGATGA GTCTCACCAT 420 

TCTGATGAAT CTGATGAACT GGTCACTGAT TTTCCCACGG ACCTGCCAGC AACCGAAGTT 480 

TTCACTCCAG TTGTCCCCAC AGTAGACACA TATGATGGCC GAGGTGATAG TGTGGTTTAT 540 

GGACTGAGGT CAAAATCTAA GAAGTTTCGC AGACCTGACA TCCAGTACCC TGATGCTACA 600 

GACGAGGACA TCACCTCACA GATGGAAAGC GAGGAGTTGA ATGGTGCATA CAAGGCCATC 660 

CCCGTTGCCC AGGACCTGAA CGCGCCTTCT GATTGGGACA GCCGTGGGAA GGACAGTTAT 720 

GAAACGAGTC AGCTGGATGA CCAGAGTGCT GAAACCCACA GCCACAAGCA GTCCAGATTA 780 

TATAAGCGGA AAGCCAATGA TGAGAGCAAT GAGCATTCCG ATGTGATTGA TAGTCAGGAA 840 

CTTTCCAAAG TCAGCCGTGA ATTCCACAGC CATGAATTTC ACAGCCATGA AGATATGCTG 900 

GTTGTAGACC CCAAAAGTAA GGAAGAAGAT AAACACCTGA AATTTCGTAT TTCTCATGAA 960 

TTAGATAGTG CATCTTCTGA GGTCAATTAA AAGGAGAAAA AATACAATTT CTCACTTTGC 1020 

ATTTAGTCAA AAGAAAAAAT GCTTTATAGC AAAATGAAAG AGAACATGAA ATGCTTCTTT 1080 

CTCAGTTTAT TGGTTGAATG TGTATCTATT TGAGTCTGGA AATAACTAAT GTGTTTGATA 1140 

ATTAGTTTAG TTTGTGGCTT CATGGAAACT CCCTGTAAAC TAAAAGCTTC AGGGTTATGT 1200 

CTATGTTCAT TCTATAGAAG AAATGCAAAC TATCACTGTA TTTTAATATT TGTTATTCTC 1260 

TCATGAATAG AAATTTATGT AGAAGCAAAC AAAATACTTT TACCCACTTA AAAAGAGAAT 1320 

ATAACATTTT ATGTCACTAT AATCTTTTGT TTTTTAAGTT AGTGTATATT TTGTTGTGAT 1380 

TATCTTTTTG TGGTGTGAAT AAATCTTTTA TCTTGAATGT AATAAGAATT TGGTGGTGTC 1440 

AATTGCTTAT TTGTTTTCCC ACGGTTGTCC AGCAATTAAT AAAACATAAC CTTTTTTACT 1500 
GCCTAAAAAA AAAAAAAAAA AAAA 

Seq ID NO: 508 Protein sequence 
Protein Accession §: NP_000573 

1 11 21 31 41 51 

I I I I I I 

MRIAVICFCL LGITCAIPVK QADSGSSEEK QLYNKYPDAV ATWLNPDPSQ KQNLLAPQTL 60 
PSKSNESHDH MDDMDDEDDD DHVDSQDSID SNDSDDVDDT DDSHQSDESH HSDESDELVT 120 
DFPTDLPATE VFTPWPTVD TYDGRGDSW YGLRSKSKKF RRPDIQYPDA TDEDITSHME 180 
SEELNGAYKA IPVAQDLNAP SDWDSRGKDS YETSQLDDQS AETHSHKQSR LYKRKANDES 240 
NEHSDVIDSQ ELSKVSREFH SHEFHSHEDM LWDPKSKEE DKHLKFRISH ELDSASSEVN 
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Seq ID NO: 509 DNA sequence 

Nucleic Acid Accession 8: AB051390.1 

Coding sequence: 34.. 2457 

1 11 21 31 41 51 

I I I I I I 

AGCGGCCGCG GCACAAAGTT GGGGGCCGCG AAGATGAGGC TGTCCCCGGC GCCCCTGAAG 60 

CTGAGCCGGA CTCCGGCACT GCTGGCCCTG GCGCTGCCCC TGGCCGCGGC GCTGGCCTTC 120 

TCCGACGAGA CCCTGGACAA AGTGCCCAAG TCAGAGGGCT ACTGCAGCCG TATCCTGCGC 180 

GCCCAGGGCA CGCGGCGCGA GGGCTACACC GAGTTCAGCC TCCGCGTGGA GGGCGACCCC 240 

GACTTCTACA AGCCGGGAAC CAGCTACCGC GTAACACTTT CAGCTGCTCC TCCCTCCTAC 300 

TTCAGAGGAT TCACATTAAT TGCCCTCAGA GAGAACAGAG AGGGTGATAA GGAAGAAGAC 360 

CATGCTGGGA CCTTCCAGAT CATAGACGAA GAAGAAACTC AGTTTATGAG CAATTGCCCT 420 

GTTGCAGTCA CTGAAAGCAC TCCACGGAGG AGGACCCGGA TCCAGGTGTT TTGGATAGCA 480 

CCACCAGCGG GAACAGGCTG CGTGATTCTG AAGGCCAGCA TCGTACAAAA ACGCATTATT 540 

TATTTTCAAG ATGAGGGCTC TCTGACCAAG AAACTTTGTG AACAAGATTC CACATTTGAT 600 

GGGGTGACTG ACAAACCCAT CTTAGACTGC TGTGCCTGCG GAACTGCCAA GTACAGACTC 660 

ACATTTTATG GGAATTGGTC CGAGAAGACA CACCCAAAGG ATTACCCTCG TCGGGCCAAC 720 

CACTGGTCTG CGATCATCGG AGGATCCCAC TCCAAGAATT ATGTACTGTG GGAATATGGA 780 

GGATATGCCA GCGAAGGCGT CAAACAAGTT GCAGAATTGG GCTCACCCGT GAAAATGGAG 840 

GAAGAAATTC GACAACAGAG TGATGAGGTC CTCACCGTCA TCAAAGCCAA AGCCCAATGG 900 

CCAGCCTGGC AGCCTCTCAA CGTGAGAGCA GCACCTTCAG CTGAATTTTC CGTGGACAGA 960 

ACGCGCCATT TAATGTCCTT CCTGACCATG ATGGGCCCTA GTCCCGACTG GAACGTAGGC 1020 

TTATCTGCAG AAGATCTGTG CACCAAGGAA TGTGGCTGGG TCCAGAAGGT GGTGCAAGAC 1080 

CTGATTCCCT GGGACGCTGG CACCGACAGC GGGGTGACCT ATGAGTCACC CAACAAACCC 1140 

ACCATTCCCC AGGAGAAAAT CCGGCCCCTG ACCAGCCTGG ACCATCCTCA GAGTCCTTTC 1200 

TATGACCCAG AGGGTGGGTC CATCACTCAA GTAGCCAGAG TTGTCATCGA GAGAATCGCA 1260 

GGGAAGGGTG AACAATGCAA TATTGTACCT GACAATGTCG ATGATATTGT AGCTGACCTG 1320 

GCTCCAGAAG AGAAAGATGA AGATGACACC CCTGAAACCT GCATCTACTC CAACTGGTCC 1380 

CCATGGTCCG CCTGCAGCTC CTCCACCTGT GACAAAGGCA AGAGGAT6CG ACAGCGCATG 1440 

CTGAAAGCAC AGCTGGACCT CAGCGTCCCC TGCCCTGACA CCCAGGACTT CCAGCCCTGC 1500 

ATGGGCCCTG GCTGCAGTGA CGAAGACGGC TCCACCTGCA CCATGTCCGA GTGGATCACC 1560 

TGGTCGCCCT GCAGCATCTC CTGCGGCATG GGCATGAGGT CCCGGGAGAG GTATGTGAAG 1620 

CAGTTCCCGG AGGACGGCTC CGTGTGCACG CTGCCCACTG AGGAAACGGA GAAGTGCACG 1680 

GTCAACGAGG AGTGCTCTCC CAGCAGCTGC CTGATGACCG AGTGGGGCGA GTGGGACGAG 1740 

TGCAGCGCCA CCTGCGGCAT GGGCATGAAG AAGCGGCACC GCATGATCAA GATGAACCCC 1800 

GCAGATGGCT CCATGTGCAA AGCCGAGACA TCACAGGCAG AGAAGTGCAT GATGCCAGAG 1860 

TGCCACACCA TCCCATGCTT GCTGTCCCCA TGGTCCGAGT GGAGTGACTG CAGCGTGACC 1920 

TGCGGGAAGG GCATGCGAAC COGACAGCGG ATGCTCAAGT CTCTGGCAGA ACTTGGAGAC 1980 

TGCAATGAGG ATCTGGAGCA GGTGGAGAAG TGCATGCTCC CTGAATGCCC CATTGACTGT 2040 

GAGCTCACCG AGTGGTCCCA GTGGTCGGAA TGTAACAAGT CATGTGGGAA AGGCCACGTG 2100 

ATTCGAACCC GGATGATCCA AATGGAGCCT CAGTTTGGAG GTGCACCCTG CCCAGAGACT 2160 

GTGCAGCGAA AAAAGTGCCG CATCCGAAAA TGCCTTCGAA ATCCATCCAT CCAAAAGCTA 2220 

CGCTGGAGGG AGGCCCGAGA GAGCCGGCGG AGTGAGCAGC TGAAGGAAGA GTCTGAAGGG 22 B0 

GAGCAGTTCC CAGGTTGTAG GATGCGCCCA TGGACGGCCT GGTCAGAATG CACCAAACTG 2340 

TGCGGAGGTG GAATTCAGGA ACGTTACATG ACTGTAAAGA AGAGATTCAA AAGCTCCCAG 2400 

TTTACCAGCT GCAAAGACAA GAAGGAGATC AGAGCATGCA ATGTTCATOC TTGTTAGCAA 2460 

GGGTACGAGT TCCCCAGGGC TGCACTCTAG ATTCCAGAGT CACCAATGGC TGGATTATTT 2520 

GCTTGTTTAA GACAATTTAA ATTGTGTACG CTAGTTTTCA TTTTTGCAGT GTGGTTCGCC 2580 

CAGTAGTCTT GTGGATGCCA GAGACATCCT TTCTGAATAC TTCTTGATGG GTACAGGCTG 2640 

AGTGGGGCGC CCTCACCTCC AGCCAGCCTC TTCCTGCAGA GGAGTAGTGT CAGCCACCTT 2700 

GTACTAAGCT GAAACATGTC CCTCTGGAGC TTCCACCTGG CCAGGGAGGA CGGAGACTTT 2760 

GACCTACTCC ACATGGAGAG GCAACCATGT CTGGAAGTGA CTATGCCTGA GTCCCAGGGT 2820 

GCGGCAGGTA GGAAACATTC ACAGATGAAG ACAGCAGATT CCCCACATTC TCATCTTTGG 2880 

CCTGTTCAAT GAAACCATTG TTTGCCCATC TCTTCTTAGT GGAACTTTAG GTCTCTTTTC 2940 

AAGTCTCCTC AGTCATCAAT AGTTCCTGGG GAAAAACAGA GCTGGTAGAC TTGAAGAGGA 3000 

GCATTGATGT TGGGTGGCTT TTGTTCTTTC ACTGAGAAAT TCGGAATACA TTTGTCTCAC 3060 

CCCTGATATT GGTTCCTGAT GCCCCCCCAA CAAAAATAAA TAAATAAATT ATGGCTGCTT 3120 

TATTTAARTA TAAGGTAGCT AGTTTTTACA CCTGAGATAA ATAATAAGCT TAGAGTGTAT 3180 

TTTTCCC^TG CTTTTGGGGG TTCAGAGGAG TATGTACAAT TCTTCTGGGA AGCCAGCCTT 3240 

CTGAACTTTT TGGTACTAAA TCCTTATTGG AACCAAGACA AAGGAAGCAA AATTGGTCTC 3300 

TTTAGAGACC AATTTGCCTA AATTTTAAAA TCTTCCTACA CACATCTAGA CGTTCAAGTT 3360 

TGCAAATCAG TTTTTAGCAA GAAAACATTT TTGCTATACA AACATTTTGC TAAGTCTGCC 3420 

CAAAGCCOCC CCAATGCATT CCTTCAACAA AATACAATCT CTGTACTTTA AAGTTATTTT 3480 

AGTCATGAAA TTTTATATGC AGAGAGAAAA AGTTACCGAG ACAGAAAACA AATCTAAGGG 3540 

" AAAGGAATAT TATGGGATTA AGCTGAGCAA GCAATTCTGG TGGAAAGTCA AACCTGTCAG 3600 

TGCTCCACAC CAGGGCTGTG GTCCTCCCAG ACATGCATAG GAATGGCCAC AGGTTTACAC 3660 

TGCCTTCCCA GCAATTATAA GCACACCAGA TTCAGGGAGA CTGACCACCA AGGGATAGTG 3720 

TAAAAGGACA TTTTCTCAGT TGGGTCCATC AGCAGTTTTT CTTCCTGCAT TTATTGTTGA 3780 

AAACTATTGT TTCATTTCTT CTTTTATAGG CCTTATTACT GCTTAATCCA AATGTGTACC 3840 

ATTGGTGAGA CACATACAAT GCTCTGAATA CACTACGAAT TTGTATTAAA CACATCAGAA 3900 

TATTTCCAAA TACAACATAG TATAGTCCTG AATATGTACT TTTAACACAA GAGAGACTAT 3960 

TCAATAAAAA CTCACTGGGT CTTTCATGTC TTTAAGCTAA GTAAGTGTTC AGAAGGTTCT 4020 

TTTTTATATT GTCCTCCACC TCCATCATTT TCAATAAAAG ATAGGGCTTT TGCTCCCTTG 4080 

TTCTTGGAGG GACCATTATT ACATCTCTGA ACTACCTTTG TATCCAACAT GTTTTAAATC 4140 

CTTAAATGAA TTGCTTTCTC CCAAAAAAAG CACAATATAA AGAAACACAA GATTTAATTA 4200 

TTTTTCTACT TGGGGGGAAA AAAGTCCTCA TGTAGAAGCA CCCACTTTTG CAATGTTGTT 4260 

CTAAGCTATC TATCTAACTC TCAGCCCATG ATAAAGTTCC TTAAGCTGGT GATTCCTAAT 4320 
CAAGGACAAG CCACCCTAGT GTCTCATGTT TGTATTTGGT CCCAGTTGGG TACATTTTAA 4380 
AATCCTGATT TTGGAGACTT AAAACCAGGT TAATGGCTAA GAATGGGTAA CATGACTCTT 4440 
GTTGGATTGT TATTTTTTGT TTGCAATGGG GAATTTATAA GAAGCATCAA GTCTCTTTCT 4500 
TACCAAAGTC TTGTTAGGTG GTTTATAGTT CTTTTGGCTA ACAAATCATT TTGGAAATAA 4560 
AGATTTTTTA CTACAAAAAT G 
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Seq ID NO: 510 Protein sequence 
Protein Accession ft: BAB18461.1 
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MRLSPAPLKL 
FSLRVEGDPD 
ETQFMSNCPV 
LCEQDSTFDG 
KNYVLWEYGG 
PSAEFSVDRT 
VTYBSPNKPT 
NVDDIVADLA 
PDTQDFQPCM 
PTEETBKCTV 
QAEKCMMPEC 
MLPECPIDCE 
LRNPSIQKLR 
VKKRFKSSQP 



11 
I 

SRTPALLALA 
FYKPGTSYRV 
AVTESTPRRR 
VTDKPILDCC 
YASEGVKQVA 
RHLMSFLTMM 
IPQEKIRPLT 
PEEK!) ED DTP 
GPGCSDEDGS 
NEECSPSSCL 
HTIPCLLSPW 
LTEWSQWSEC 



21 
1 

LPLAAALAFS 
TLSAAPPSYF 
TRIQVFWIAP 
ACGTAKYRLT 
ELGSPVKMEE 
GPSPDWNVGL 
SLDHPQSPFY 
ETCIYSNWSP 
TCTMSEWITW 
MTENGEWDEC 
SEWSDCSVTC 
NKSCGKGHVI 



31 
I 

DETLDKVPKS 
RGFTLIALRE 
PAGTGCVILK 
PYGNWSEKTH 
EIRQQSDEVL 
SAEDLCTKEC 
DPEGGSITQV 
WSACSSSTCD 
SPCSISCGMG 
SATCGMGMKK 
GKGMRTRQRM 
RTRMIQMEPQ 
QFPGCRMRPW 



41 
I 

EGYCSRILRA 
NREGDKEEDH 
ASIVQKRIIY 
PKDYPRRANH 
TVIKAKAQWP 
GWVQKWQDL 
ARWIERIAR 
KGKRMRQRML 
MRSRERYVKQ 
RHRMIKMNPA 
LKSLAELGDC 
FGGAPCPETV 
TAWSECTKLC 



51 

1 

QGTRREGYTE 
AGTFQIIDEE 
FQDEGSLTKK 
WSAIIGGSHS 
AWQPLNVRAA 
IPWDAGTDSG 
KGEQCNIVPD 
KAQLDLSVPC 
FPEDGSVCTL 
DGSMCKAETS 
NEDLEQVEKC 
QRKKCRIRKC 
GGGIQERYMT 



TSCKDKKEIR ACNVHPC 



Seq ID NO: 511 DNA sequence 

Nucleic Acid Accession ft: NM_003108.1 

Coding sequence: 76.. 1401 
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1 11 21 31 41 51 

I i I I I I 

GGGGTGGGAG GGGGAGGGGG ACCTCCGCAC GAGACCCAGC GGCCCGGGTT GGAGCGTCCA 
GCCCTGCAAC GGATCATGGT GCAGCAGGCG GAGAGCTTGG AAGCGGAGAG CAACCTGCCC 
CGGGAGGCGC TGGACACGGA GGAGGGCGAA TTCATGGCTT GCAGCCCGGT GGCCCTGGAC 
GAGAGCGACC CAGACTGGTG CAAGAGGGGG TCGGGCCACA TCAAGCGGCC GATGAACGCG 
TTCATGGTAT GGTCCAAGAT CGAAGGCAGG AAGATCATGG AGCAGTCTCC GGACATGCAC 
AACGCCGAGA TCTCCAAGAG GCTGGGCAAG CGCTGGAAAA TGCTGAAGGA CAGCGAGAAG 
ATCCCGTTCA TCCGGGAGGC GGAGCGGCTG CGGCTCAAGC ACATGGCCGA CTACCCCGAC 
TACAAGTACC GGCCCCGGAA AAAGCCCAAA ATGGACCCCT CGGCCAAGCC CAGCGCCAGC 
CAGAGCCCAG AGAAGAGCGC GGCCGGCGGC GGCGGCGGGA GCGCGGGCGG AGGCGCGGGC 
GGTGCCAAGA CCTCCAAGGG CTCCAGCAAG AAATGCGGCA AGCTCAAGGC CCCCGCGGCC 
GCGGGCGCCA AGGCGGGCGC GGGCAAGGCG GCCCAGTCCG GGGACTACGG GGGCGCGGGC 
GACGACTACG TGCTGGGCAG CCTGCGCGTG AGCGGCTCGG GCGGCGGCGG CGCGGGCAAG 
ACGGTCAAGT GCGTGTTTCT GGATGAGGAC GACGACGACG ACGACGACGA CGACGAGCTG 
CAGCTGCAGA TCAAACAGGA GCCGGACGAG GAGGACGAGG AACCACCGCA CCAGCAGCTC 
CTGCAGCCGC CGGGGCAGCA GCCGTCGCAG CTGCTGAGAC GCTACAACGT CGCCAAAGTG 
CCCGCCAGCC CTACGCTGAG CAGCTOGGCG GAGTCCCCCG AGGGAGCGAG CCTCTACGAC 
GAGGTGCGGG CCGGCGCGAC CTCGGGCGCC GGGGGCGGCA GCCGCCTCTA CTACAGCTTC 
AAGAACATCA CCAAGCAGCA CCCGCCGCCG CTCGCGCAGC CCGCGCTGTC GCCCGCGTOC 
TCGCGCTCGG TGTCCACCTC CTCGTCCAGC AGCAGCGGCA GCAGCAGCGG CAGCAGOGGC 
GAGGACGCCG ACGACCTGAT GTTCGACCTG AGCTTGAATT TCTCTCAAAG CGOGCACAGC 
GCCAGCGAGC AGCAGCTGGG GGGCGGCGCG GCGGCCGGGA ACCTGTCCCT GTCGCTGGTG 
GATAAGGATT TGGATTCGTT CAGCGAGGGC AGCCTGGGCT CCCACTTCGA GTTCCCCGAC 
TACTGCACGC CGGAGCTGAG CGAGATGATC GCGGGGGACT GGCTGGAGGC GAACTTCTCC 
GACCTGGTGT TCACATATTG AAAGGCGCCC GCTGCTCGCT CTTTCTCTCG GAGGGTGCAG 
AGCTGGGTTC CTTGGGAGGA AGTTGTAGTG GTGATGATGA TGATGATGAT AATGATGATG 
ATGATGGTGG TGTTGATGGT GGCGGTGGTA GGGTGGAGGG GAGAGAAGAA GATGCTGATG 
ATATTGATAA GATGTCGTGA CGCAAAGAAA TTGGAAAACA TGATGAAAAT TTTGGTGGAG 
TTAAAGTGAA ATGAGTAGTT TTTAAACATT TTTCCTGTCC TTTTTTTGTC C CCCC TCCCT 
TCCTTTATCG TGTCTCAAGG TAGTTGCATA CCTAGTCTGG AGTTGTGATT ATTTTCCCAA 
AAAATGTGTT TTTGTAATTA CTATTTCTTT TTCCTGAAAT TCGTGATTGC AACAAAGGCA 
GAGGGGGCGG CGCGGCGGAG GGGAGGTAGG ACCCGCTCCG GAAGGCGCTG TTTGAAGCTT 
GTCGGTCTTT GAAGTCTGGA AGAOGTCTGC AGAGGACCCT TTTGGCAGCA CAACTGTTAC 
TCTAGGGAGT TGGTGGAGAT ATTTTTTTTT CTTAAGAGAA CTTAAAGAAC TGGTGATTTT 
TTTTTAACAA AAAAAGGG 

Seq ID NO: 512 Protein sequence 
Protein Accession ft: NP_003099.1 

1 11 21 31 41 51 

I I I I I I 

MVQQAESLEA ESNLPREALD TEEGEFMACS PVALDESDPD WCKTASGHIK RPHNAFMVWS 
KIERRKIMEQ SPDMHNAEIS KRLGKRWKML KDSEKIPFIR EAERLRLKHM ADYPDYKYRP 
RKKPKMDPSA KPSASQSPEK SAAGGGGGSA GGGAGGAKTS KGSSKKCGKL KAPAAAGAKA 
GAGKAAQSGD YGGAGDDYVL GSLRVSGSGG GGAGKTVKCV FLDEDDDDDD DDDELQLQIK 
QEPDEEDEEP PHQQLLQPPG QQPSQbLRRY NVAKVPASPT LSSSAESPEG ASLYDEVRAG 
ATSGAGGGSR LYYSFKNITK QHPPPLAQPA LSPASSRSVS TSSSSSSGSS SGSSGEDADD 
LMFDLSLNFS QSAHSASEQQ LGGGAAAGNL SLSLVDKDLD SFSEGSLGSH FEPPDYCTPE 
LSEMIAGDWL EANFSDLVFT Y 



Seq ID NO: 513 DNA sequence 

Nucleic Acid Accession ft: CAT Cluster 
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60 
120 
180 
240 
300 
360 
420 



1 11 21 31 

I I I I I I 

GGTCGACCTA AATCTGATAA CTGGCTTATT ATGTAATTTA TTGGTGTTAT TATAGTAGAG 
ATTGGTAATC TACAGTAAGA TTTTCAGTTA GGATTTGAGA TTATGATAAT AACTAATAGA 
ATATTTCTAA ATTGGAATTA GAAGATTGTT GTATGACAGA GAGTCAGGAC TTGCCATTTG 



60 
120 
180 
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GCAAACATCA AAGTCATTGT TTGGTGTGTA ATAGTACAAA ATCATCTTGC TTAACAGAGA 240 

AAGGATATCT GTTGCTCCCG AATGAAACAA TTTTTCTGAA ATAGAGGGCC CAGAATTGGT 300 

CTCTGACAAT TAATAAAGAC ATCAAAGATA GCAAAATGAT TTTTATATCT TAGGGCCAAT 360 

ACTACCAATT TAATAATTAA AACAAGTTCT GGTGAGCTCT GAACTTGGCA GAATTGGTGG 420 

CAACATAGAC TTTGGATTTT CCAAATTCCC CACATAAAAC AAAGGGGATC AACTAGATAG 480 
AAAAACCAGA AACCTTTGGA AATATCTGTT TAAAAAAAAA AAAAAGTCGA CGCGGGCC 



Seq ID NO: 514 DNA sequence 

Nucleic Acid Accession #: CAT cluster 



X 11 21 31 41 51 

I I I I I I 

GGAGCCACAG TGAAAGTCAA GAATGTCAGT GATTCCACAT TTAATATCTA CATTTTTGCA 60 

GGGCAGTTAC TCTTTTGTAG TATAACATTG AGCTGATAGC ACATAGTGTA GACAAGTGAA 120 

TACAGGATTC TCTGGGTTGT ATTCCCAGAA GTCTGGAGGT CATTTGGATA TTTGTGGGCC 180 

CTTGGCTTCA CTCTGACTTG TGTGACACAT AAAAATTGTG ATGAAATGTC CTATAGATGT 240 

CCTGCAGGTC TTAAAAGAAC CTTTCCAAAC TATGAAACAG CCCAGCAGCA CTGAGTTAGA 300 

GGTAAATTCT GAACCCTTGA ACACTAAAAC TATTCTAACT GCACATAGAA TTGGCAAGTA 360 

GCATTCTATG TCTATGAACA GTATGTCTTT TCTATATAAC AGAGAAAATC TTTTTAAGCA 420 

AACTACTCAG TTTAAAACCT AATTCTTCTC ATAATCTCAG TACTTTTGAA TGAAGACATA 480 

TCAATGCAAC AGTACACTCT TATTCAGGCA TTTGAAAGAA AGAATTCGAG ATCTAGTTTG 540 

TATCAGATAT TATAAATTAG TATGGTTTAG TCTTTGTCAT GAAATTCTAC TTAATTTTTG 600 

GACTATAGGT TTAAGAATGT AAGCAGAAGT TCTGCACCAA TCAGAATAAG CTACATTATG 660 

CTTGAGTGAC AACTACTGTA ATGACAAAAT ATCAGTGGCT TAATACAATG GTTTTTCTCT 720 

CATACTTGTT CATAAAGAGT CAGCAAGGAC CCTGCTCATT ATGGTCCCTC AGGGACCCAG 780 

GGTTGTTGGA AGCTCCACCA TTTTAGATAG CTCCCTTCAA AGTCAGCCAT CTTTGGCAGT 840 

CCATGTCCCC CAACAGGCTG GCAAAATTTG GCTCTGGATG GCTTCAAGGA TTGAGCATCG 900 

GGCAGTTTAA ATGCTTTCAA CATGGAAAGT GGACACCGGC CACTCCCACT CACATCCCTT 960 

GGGCCAGAAC TAGGTCACTG GGCCCGGACC TAACTTCGGA GGGTTGGGGA ATTGTAATTC 1020 

CTCCATGTAC CCAAAGTGGA GAGAAGCCAG ATACTGAGAA ACATCAATAA TGGCTAACAG 1080 

AAATCCATTC TACCATTCCC TTTGCCTAAA GTGAAAAGAT GAGTACTTTC ATCAATTTGT 1140 

AAACTGTACT TTTGAAGTAA ATCCTGGTAG CTTGCATGGG GGCTGGATTT CCAGAAAGCC 1200 

ATATGTAATT TGGGAATGAC ATTCACTTAA GCTCATAGAA TATCATTATT TGATGTAAAA 1260 

TGCCCTCATT TGCAATACAG GACCAAAATG CACTAACCAC AAAACCCCCC TCCCCACGGG 1320 

GCCCGGGGTC CCTATTCCCC TCCATCCCTT TAAATGAGGC ATTCTATGAT TTGGAATGGA 1380 

AGCCCAGTTG TAGTCGTAAG AATTTTACTT AATTCAAGAA TTATTCTCAC TGAATATGTG 1440 

CCAGTTCTGA AAGGAATGCA AAGTCAAATT TTGCATCTTC TTTGCTCAAG GGCCTTTAGA 1500 

TGTAACAACA CAGACATGAT ACAAGGCTGA CAATGACATT ATGATTTAAA TATGTTAAAC 1560 

AACTTATTAA ATTGTGAATC AACAAAAAAT TATGTTCTTT ATTTTATGGT TTTGCATAGT 1620 

CCTGACTCAC TGCCTACATA CCCCTCTTGT TCCTCAGTTC TTATCCCTGA TTTCTTACAG 1680 

GATGGCCTAA GACAGCTGTA GATGTTTTTA TTTAGCAAAA AAAAAAAAAA AAAAGTCGAC 1740 
GCGGCCGCGA ATTTAGTAG 



Seq ID NO: 515 DNA sequence 
Nucleic Acid Accession #: NM_012427 
Coding sequence : 43 . . 924 

1 11 21 31 41 SI 

I I I I I I 

CTTGTGGTTC CTCTCTACTT GGGGAAATCA GGTGCAGCGG CCATGGCTAC AGCAAGACCC 60 

CCCTGGATGT GGGTGCTCTG TGCTCTGATC ACAGCCTTGC TTCTGGGGGT CACAGAGCAT 120 

GTTCTCGCCA ACAATGATGT TTCCTGTGAC CACCCCTCTA ACACCGTGCC CTCTGGGAGC 180 

AACCAGGACC TGGGAGCTGG GGCCGGGGAA GACGCCCGGT CGGATGACAG CAGCAGCCGC 240 

ATCATCAATG GATCCGACTG CGATATGCAC ACCCAGCCGT GGCAGGCCGC GCTGTTGCTA 300 

AGGCCCAACC AGCTCTACTG CGGGGCGGTQ TTGGTGCATC CACAGTGGCT GCTCACGGCC 360 

GCCCACTGCA GGAAGAAAGT TTTCAGAGTC CGTCTOGGCC ACTACTCCCT GTCACCAGTT 420 

TATGAATCTG GGCAGCAGAT GTTCCAGGGG GTCAAATCCA TCCCCCACCC TGGCTACTCC 480 

CACCCTGGCC ACTCTAACGA CCTCATGCTC ATCAAACTGA ACAGAAGAAT TCGTCCCACT 540 

AAAGATGTCA GACCCATCAA CGTCTCCTCT CATTGTCCCT CTGCTGGGAC AAAGTGCTTG 600 

GTGTCTGGCT GGGGGACAAC CAAGAGCCCC CAAGTGCACT TCCCTAAGGT CCTCCAGTGC 660 

TTGAATATCA GCGTGCTAAG TCAGAAAAGG TGCGAGGATG CTTACCCGAG ACAGATAGAT 720 

GACACCATGT TCTGCGCCGG TGACAAAGCA GGTAGAGACT CCTGCCAGGG TGATTCTGGG 780 

GGGCCTGTGG TCTGCAATGG CTCCCTGCAG GGACTCGTGT CCTGGGGAGA TTACCCTTGT 840 

GCCCGGCCCA ACAGACCGGG TGTCTACACG AACCTCTGCA AGTTCACCAA GTGGATCCAG 900 

GAAACCATCC AGGCCAACTC CTGAGTCATC CCAGGACTCA GCACACCGGC ATCCCCACCT 960 

GCTGCAGGGA CAGCCCTGAC ACTCCTTTCA GACCCTCATT CCTTCCCAGA GATGTTGAGA 1020 

ATGTTCATCT CTCCAGCCCC TGACCCCATG TCTCCTGGAC TCAGGGTCTG CTTCCCCCAC 1080 

ATTGGGCTGA CCGTGTCTCT CTAGTTGAAC CCTGGGAACA ATTTCCAAAA CTGTCCAGGG 1140 

CGGGGGTTGC GTCTCAATCT CCCTGGGGCA CTTTCATCCT CAAGCTCAGG GCCCATCCCT 1200 
TCTCTGCAGC TCTGACCCAA ATTTAGTCCC AGAAATAAAC TGAGAAGTGG AAAAAAAAAA 

Seq ID NO: 516 Protein sequence 
Protein Accession #: NPJ)36559 

1 11 21 31 41 51 

I I I I i I 

MATARPPWMW VLCALITALL LGVTEHVLAN NDVSCDHPSN TVPSGSNQDL GAGAGEDARS 60 

DDSSSRIING SDCDMHTQPW QAALLLRPNQ LYCGAVLVHP QWLLTAAHCR KKVFRVRLGH 120 

YSLSPVYESG QOMFQGVKSI PHPGYSHPGH SNDLMLIKLN RRIRPTKDVR PINVSSHCPS 180 

AGTKCLVSGW GTTKSPQVHF PKVLQCLNIS VLSQKRCEDA YPRQIDDTMP CAGDKAGRDS 240 
CQGDSGGPW CNGSLQGLVS WGDYPCARPN RPGVYTNLCK FTKWIQETIQ ANS 
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WO 03/042661 



PCT/US02/36810 



Seq ID NO: 517 DNA sequence 
Nucleic Acid Accession #: NM_001719 
Coding sequence: 123.. 1418 



5 

10 

15 

20 

25 

30 

35 
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45 

50 

55 

60 

65 

70 

75 

80 



GGGCGCAGCQ 
CTGCCACCTG 
CGATGCACGT 
CCCTGTTCCT 
GCTTCATCCA 
CCATTTTGGG 
CCATGTTCAT 
GCCAGGGCTT 
GCCTGCAAGA 
TGGAACATGA 
TTTCCAAGAT 
ACATCCGGGA 
AGCACTTGGG 
AGGAGGGCTG 
GGCACAACCT 
AGTTGGCGGG 
TCTTCAAGGC 
GCCAGAACCG 
AGAACAQCAG 
GAGACCTGGG 
AGGGGGAGTG 
AGACGCTGGT 
AGCTCAATGC 
ACAGAAACAT 
TTGGGGCCAA 
CTGCCTTTTG 
AAACATGAGC 
TCCTACAAGC 
GCCGGGCCAG 
TTATGAGCGC 
GGGCACATTG 
CAATAAAACG 



11 
I 

GGGCCCGTCT 
GGGCGGTGCG 
GCGCTCACTG 
GCTGCGCTCC 
CCGGCGCCTC 
CTTGCCCCAC 
GCTGGACCTG 
CTCCTACCCC 
TAGCCATTTC 
CAAGGAATTC 
CCCAGAAGGG 
ACGCTTCGAC 
CAGGGAATCG 
GCTGGTGTTT 
GGGCCTGCAG 
CCTGATTGGG 
CACGGAGGTC 
CTCCAAGACG 
CAGCGACCAG 
CTGGCAGGAC 
TGCCTTCCCT 
CCACTTCATC 
CATCTCCGTC 
GGTGGTCCGG 
GTTTTTCTGG 
TGAGACCTTC 
AGCATATGGC 
TGTGCAGGCA 
GTCATTGGCT 
CTACCAGCCA 
GTGTCTGTGC 
AATGAATG 



21 31 

I I 
GCAGCAAGTG ACCGACGGCC 
GGCCCGGAGC CCGGAGCCCG 
CGAGCTGCGG CGCCGCACAG 
GCCCTGGCCG ACTTCAGCCT 
CGCAGCCAGG AGCGGCGGGA 
CGCCCGOGCC CGCACCTCCA 
TACAACGCCA TGGCGGTGGA 
TACAAGGCCG TCTTCAGTAC 
CTCACCGACG CCGACATGGT 
TTCCACCCAC GCTACCACCA 
GAAGCTGTCA CGGCAGCCGA 
AATGAGACGT TCCGGATCAG 
GATCTCTTCC TGCTCGACAG 
GACATCACAG CCACCAGCAA 
CTCTCGGTGG AGACGCTGGA 
CGGCACGGGC CCCAGAACAA 
CACTTCCGCA GCATCCGGTC 
CCCAAGAACC AGGAAGCCCT 
AGGCAGGCCT GTAAGAAGCA 
TGGATCATCG CGCCTGAAGG 
CTGAACTCCT ACATGAACGC 
AACCCGGAAA CGGTGCCCAA 
CTCTACTTCG ATGACAGCTC 
GCCTGTGGCT GCCACTAGCT 
ATCCTCCATT GCTCGCCTTG 
CCCTCCCTAT CCCCAACTTT 
TTTTGATCAG TTTTTCAGTG 
AAACCTAGCA GGAAAAAAAA 
GGGAAGTCTC AGCCATGCAC 
GGCCACCCAG CCGTGGGAGG 
GAAAGGAAAA TTGACCCGGA 



41 
I 

GGGACGGCCG 
GGTAGCGCGT 
CTTCGTGGCG 
GGACAACGAG 
GATGCAGCGC 
GGGCAAGCAC 
GGAGGGCGGC 
CCAGGGCCCC 
CATGAGCTTC 
TCGAGAGTTC 
ATTCCGGATC 
CGTTTATCAG 
CCGTACCCTC 
CCACTGGGTG 
TGGGCAGAGC 
GCAGCCCTTC 
CACGGGGAGC 
GCGGATGGCC 
CGAGCTGTAT 
CTACGCCGCC 
CACCAACCAC 
GCCCTGCTGT 
CAACGTCATC 
CCTCCGAGAA 
GCCAGGAACC 
AAAGGTGTGA 
GCAGCATCCA 
ACAACGCATA 
GGACTCGTTT 
AAGGGGGCGT 
AGTTCCTGTA 



51 
I 

CCTGCCCCCT 
AGAGCCGGCG 
CTCTGGGCAC 
GTGCACTCGA 
GAGATCCTCT 
AACTCGGCAC 
GGGCCCGGCG 
CCTCTGGCCA 
GTCAACCTCG 
CGGTTTGATC 
TACAAGGACT 
GTGCTCCAGG 
TGGGCCTCGG 
GTCAATCCGC 
ATCAACCCCA 
ATGGTGGCTT 
AAACAGCGCA 
AACGTGGCAG 
GTCAGCTTCC 
TACTACTGTG 
GCCATCGTGC 
GCGCCCACGC 
CTGAAGAAAT 
TTCAGACCCT 
AGCAGACCAA 
GAGTATTAGG 
ATGAACAAGA 
AAGAAAAATG 
CCAGAGGTAA 
GGCAAGGGGT 
ATAAATGTCA 



Seq ID NO: 51B Protein sequence 
Protein Accession #: NP 001710 



1 
I 

MHVRSLRAAA 
ILGLPHRPRP 
LQDSHFLTDA 
IRERFDNETP 



GECAFPLNSY 
RNMWRACGC 



11 
I 

PHSFVALWAP 
HLQGKHNSAP 
DMVMSFVNLV 
RISVYQVLQE 
TLDGQSINPK 
EALRMANVAE 
MNATNHAIVQ 
H 



21 
I 

LFLLRSALAD 
MFMLDLYNAM 
EHDKEFFHPR 
HLGRESDtiFL 
LAGLIGRHGP 
NSSSDQRQAC 
TLVHFINPET 



AVEEGGGPGG 
YHHREFRFDL 
LDSRTLWASE 
QNKQPPMVAF 
KKHELYVSFR 
VPKPCCAPTQ 



41 
I 

FIKRRLRSQE 
QGFSYPYKAV 
SKIPEGEAVT 
EGWLVFDITA 
FKATEVHFRS 
DLGWQDWIIA 
LNAISVLYFD 



51 
I 



Seq ID NO: 519 DNA sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding sequence: 264.. 782 



1 
I 

CCCTGCTCCA 
TCATCGCGGG 
TTCCTCAGAC 
TGCOGGTGGC 
AACTTATCAG 
TGATCGCGGT 
TCAGCAGGCA 
CCAGGCCCAG 
CAGAGAGAGA 
ATAGCTTGGA 
ACACACAAGT 
AG AA CAT AAA 
TCTGGTATTT 
GAATTCCAAA 
GGAAATGCCA 
CAAAAAGAAT 
CCTGGTCTGT 
AAAACAGGCC 
TAAAGTATGA 
AAATTTTAAA 
TGACTTCCCT 
AAAATAGCAA 



11 
I 

GTCACACCCG 
ACTAATTTTC 
TGAGAACTGT 
ACTACTGAGA 
CAAGGAGCTC 
GGTGTCCCTC 
CATGGCCCAC 
CCCTGGCCAC 
CATCCCAATG 
TAGCTCCTGC 
CGTCTTTTCT 
GGAAATCACA 
TGTCAACCCT 
TATTTTTAAT 
TTTTTCCCCC 
GTGGAGAAGA 
ACCCAAAAAA 
AATGCCCCGG 
GAACTATGGA 
AAGGTTGAAT 
TTGAGCTCTT 
CCACCACCA 



21 
I 

GAAGCTGACT 
CTTAAAATTT 
TTCCAGTATA 
GACGAGGTGC 
ATCATGCTGA 
TTCCTGCAGG 
AAGAGTGAAC 
CATCATCCAC 
TCTGATTCCC 
AGTTCGCCTC 
GACCCTGGAG 
GATTATGTCA 
GCTCTGTCTG 
GGGGTCCAGT 
TTAAACAAGG 
AAACTGATAA 
GCTGTTCGTT 
CAAGAAAGGT 
GTCCATCAGC 
CAGCTGTTGT 
TAATTATTGG 



31 
I 

GGTCCACGCA 
AGACTTGCAC 
TACATCAAGT 
CAGGGTGGTT 
CAGAAGTCAT 
CCTGCTTCCT 
AGATACTGAA 
CTGCTGTCAA 
TTTACAGGCA 
CTGCCTGCCA 
AACTAAAAAA 
ATGTCAATCC 
AGCCAGCGGA 
TCTCTATGGA 
CATGGGGCTC 
ATACACAGAG 
CCTCAAAAAC 
TGAGATCAGA 
AGAGATAGTA 
AGAGTTCTAT 
CAATAAACAA 



41 

I 

CAGCTGAAGC 
AGTAAGGACT 
CACTGAGATC 
CCTGAAAGTG 
GGAGGTCTGG 
CACCGCCATC 
AGCGGCCAGT 
AGAGATGAAG 
TGACAGCGAC 
GGCCACAGAG 
TGACTCCCCG 
AGAAAGACAC 
ATATGATCAA 
TTCTTACATT 
ACAAGTCTAT 
GTCCTCAAGA 
AAAAACAAGG 
TGTTAGGAAG 
GTGAAGTCTC 
TTGGCAATCT 
CTTCTTTAAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1360 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 



FSTQGPPLAS 
AAEFRIYKDY 
TSNHWWNPR 
IRSTGSKQRS 
PEGYAAYYCE 
DSSNVILKKY 



60 
120 
180 
240 
300 
360 
420 



51 
1 

ATGAGGAAAC 
TCAACTGACC 
TCCAGCACCC 
CCTGAGCCCC 
CATGGCTTAG 
AACTACCTGC 
CTCCAGGTTC 
GAGACTCAGA 
ACACCCTCAG 
GATGTGGATT 
CTGGACTATG 
AAGCCCAGTT 
GTGGCCATGT 
TAATTTGTAG 
GGAGACAGGC 
CCCATGGACT 
CTTGGCTGGG 
AACTTTCAGG 
TCCCCAGGGA 
CATGGTTAAA 
AGTTTTAAAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



Seq ID NO: 520 Protein sequence 
Protein Accession #: Eos sequence 



1154 



WO 03/042661 



1 11 21 31 41 51 

I I I 1 I I 

MLTEVMBVWH GLVIAWSLP LQACFLTAIN YLLSRHMAHK SEQILKAASL QVPRPSPGHH 60 

HPPAVKEMKE TQTERDIPMS DSLYRHDSDT PSDSLDSSCS SPPACQATED VDYTQWFSD 120 
PGELKNDSPL DYENIKEITD YVNVNPERHK PSFWYFVNPA LSEPAEYDQV AM 

Seq ID NO: 521 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 107.. 328 

1 11 21 31 41 51 

I ! I I I I 

CTGCTCTGTC TGAGCCAGCG GAATATGATC AAGTGGCCAT GTGAATTCCA AATATTTTTA 60 

ATGGGGTCCA GTTCTCTATG GATTCTTACA TTTAATTTGT AGGGAAATGC CATTTTTCCC 120 

CCTTAAACAA GGCATGGGGC TCACAAGTCT ATGGAGACAG GCCAAAAAGA ATGTGGAGAA 180 

GAAAACTGAT AAATACACAG AGGTCCTCAA GACCCATGGA CTCCTGGTCT GTACCCAAAA 240 

AAGCTGTTCX3 TTCCTCAAAA ACAAAAACAA GGCTTGGCTG GGAAAACAGG CCAATGCCCC 300 

GGCAAGAAAG GTTGAGATCA GATGTTAGGA AGAACTTTCA GGTAAAGTAT GAGAACTATG 360 

GAGTCCATCA GCAGAGATAG TAGTGAAGTC TCTCCCCAGG GAAAATTTTA AAAAGGTTGA 420 

ATCAGCTGTT GTAGAGTTCT ATTTGGCAAT CTCATGGTTA AATGACTTCC CTTTGAGCTC 480 

TTTAATTATT GGCAATAAAC AACTTCTTTA AAAGTTTTAA ATAAAATAGC AACCACCACC 540 
A 

Seq ID NO: 522 Protein sequence 
Protein Accession # : Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MPFFPLKQGM GLTSLWRQAK KNVEKKTDKY TEVLKTHGLIi VCTQKSCSPL KNKNKAWLGK 60 
QANAPARKVE IRC 

Seq ID NO: 523 DNA sequence 

Nucleic Acid Accession #i Eos sequence 

Coding sequence: 211.. 1895 

1 11 21 31 41 51 

I I I I I I 

GGATCTGAGG GGCGCCCAGT CACTTCCTCC ACGTTCTCGT GCTGGGCGGG AGGAGCGGAT 60 

GGGGCTTGGG AGGCAGCCTG CTCTCCAGTC CCTATCCACC CACAGGTTTT TTGGGTCGGA 120 

GAGGAATTAT CTGATAAAAT TCCTGGGTTA ATATTTTTAA AAACGGAGAG TTTTTAAAAA 180 

TGATTTTTTT CCCTCGAAAA TGACCTTTTT ATGCTTCGAA GCAGTTTGTC AACCAGCATA 240 

GTGCTTTTTC TTTTCTCTTC TTTTTCTACG ATAAATGAAA GCATTTCTTC AAGAAAAAGG 300 

CACAGGTTCC TTGAACAGCT GGATTCTGAT GGCACCATTA CTATAGAGGA GCAGATTGTC 360 

CTTGTGCTGA AAGCGAAAGT ACAATGTGAA CTCAACATCA CAGCTCAACT CCAGGAGGGA 420 

GAAGGTAATT GTTTCCCTGA ATGGGATGGA CTCATTTGTT GGCCCAGAGG AACAGTGGGG 480 

AAAATATCGG CTGTTCCATG CCCTCCTTAT ATTTATGACT TCAACCATAA AGGAGTTGCT 540 

TTCCGACACT GTAACCCCAA TGGAACATGG GATTTTATGC ACAGCTTAAA TAAAACATGG 600 

GCCAATTATT CAGACTGCCT TCGCTTTCTG CAGCCAGATA TCAGCATAGG AAAGCAAGAA 660 

TTCTTTGAAC GCCTCTATGT AATGTATACC GTTGGCTACT CCATCTCTTT TGGTTCCTTG 720 

GCTGTGGCTA TTCTCATCAT TGGTTACTTC AGAGGATTGC ATTGCACTAG GAACTATATC 780 

CACATGCACT TATTTGTGTC TTTCATGCTG AGAGCTACAA GCATCTTTGT CAAAGACAGA 840 

GTAGTCCATG CTCACATAGG AGTAAAGGAG CTGGAGTCCC TAATAATGCA GGATGACCCA 900 

CAAAATTCCA TTGAGGCAAC TTCTGTGGAC AAATCACAAT ATATCGGGTG CAAGATTGCT 960 

GTTGTGATGT TTATTTACTT CCTGGCTACA AATTATTATT GGATCCTGGT GGAAGGTCTC 1020 

TACCTGCATA ATCTCATCTT TGTGGCTTTC TTTTCGGACA CCAAATACCT GTGGGGCTTC 1080 

ATCTTGATAG GCTGGGGGTT TCCAGCAGCA TTTGTTGCAG CATGGGCTGT GGCACGAGCA 1140 

ACTCTGGCTG ATGCGAGGTG CTGGGAACTT AGTGCTGGAG ACATCAAGTG GATTTATCAA 1200 

GCACCGATCT TAGCAGCTAT TGGGCTGAAT TTTATTCTGT TTCTGAATAC GGTTAGAGTT 1260 

CTAGCTACCA AAATCTGGGA GACCAATGCA GTTGGGCATG ACACAAGGAA GCAATACAGG 1320 

AAACTGGCCA AATCGACACT GGTCCTGGTC CTAGTCTTTG GAGTGCATTA CATCGTGTTC 1380 

GTATGCCTGC CTCACTCCTT CACTGGGCTC GGGTGGGAGA TCCGCATGCA CTGTGAGCTC 1440 

TTCTTCAACT CCTTTCAGGG TTTCTTTGTG TCTATCATCT ACTGCTACTG CAATGGAGAG 1500 

GTTCAGGCAG AGGTGAAGAA GATGTGGAGT CGGTGGAATC TCTCCGTGGA CTGGAAAAGG 1560 

ACACCGCCAT GTGGCAGCCG CAGATGCGGC TCAGTGCTCA CCACCGTGAC GCACAGCACC 1620 

AGCAGCCAGT CACAGGTGGC GGCCAGCACA CGCATGGTGC TTATCTCTGG CAAAGCTGCC 1680 

AAGATCGCCA GCAGACAGCC TGACAGCCAC ATCACTTTAC CTGGCTATGT CTGGAGTAAC 1740 

TCAGAGCAGG ACTGCCTGCC ACACTCTTTC CACGAGGAGA CCAAGGAAGA TAGTGGGAGG 1800 

CAGGGAGATG ATATTCTAAT GGAGAAGCCT TCCAGGCCTA TGGAATCTAA CCCAGACACT I860 
GAAGGATGCC AAGGAGAAAC TGAGGATGTT CTCTGA 

Seq ID NO: 524 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I ! 

MLRSSLSTSI VLFLFSSFST INESISSRKR HRFLEQLDSD GTITIEEQIV LVLKAKVQCE 60 

LNITAQLQEG EGNCFPEWDG LICWPRGTVG KISAVPCPPY IYDFNHKGVA PRHCNPNGTW 120 

DFMHSLNKTW ANYSDCLRFL QPDISIGKQE FPERLYVMYT VGYSISFGSL AVAILIIGYF 180 

RRLHCTRNYI HMHLFVSFML RATSIFVKDR WHAHIGVKE LESLIMQDDP QNSIEATSVD 240 

KSQYIGCKIA WMFIYFLAT NYYWILVEGL YLHNLIFVAF FSDTKYLWGF ILIGWGFPAA 300 

FVAAWAVARA TLADARCWEL SAGDIKWIYQ APILAAIGLN FILFLNTVRV LATKIWETNA 360 

VGHDTRKQYR KLAKSTLVLV LVFGVHYIVF VCLPHSFTGL GWEIRMHCEL FFNSFQGFFV 420 

SIIYCYCNGE VQAEVKKMWS RWNLSVDWKR TPPCGSRRCG SVLTTVTHST SSQSQVAAST 480 



1155 



WO 03/042661 



PCTAJS02/36810 
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RMVLISGKAA KIASRQPDSH ITLPGYVWSN SEQDCLPHSF HEETKBDSGR QGDDILMEKP 
SRPMESNPDT EGCQGETEDV L 

Seq ID NO: S2S DNA sequence 
Nucleic Acid Accession #: NM_00SO48 
Coding sequence: 143.. 1795 



540 



11 



51 



21 31 41 

I I I I I I 

GGCCGGTGGC CCGGGCCCGA CCACCCCAGC TGCGCGTCGT TACTGGCCAC AAGTTTGCTC 
TGGGCCAGCC AAGTTGGCAA CTTGGAAGCT TCTCCCGGGC TCTGGAGGAG GGTCCCTGCT 
TCTTCCTACA GCCGTTCCGG GCATGGCCGG GCTGGGGGCG TCGCTCCACG TCTGGGGTTG 
GCTAATGCTC GGCAGCTGCC TCCTGGCCAG AGCCCAGCTG GATTCTGATG GCACCATTAC 
TATAGAGGAG CAGATTGTCC TTGTGCTGAA AGCGAAAGTA CAATGTGAAC TCAACATCAC 
AGCTCAACTC CAGGAGGGAG AAGGTAATTG TTTCCCTGAA TGGGATGGAC TCATTTGTTG 
GCCCAGAGGA ACAGTGGGGA AAATATCGGC TGTTCCATGC CCTCCTTATA TTTATGACTT 
CAACCATAAA GGAGTTGCTT TCCGACACTG TAACCCCAAT GGAACATGGG ATTTTATGCA 
CAGCTTAAAT AAAACATGGG CCAATTATTC AGACTGCCTT CGCTTTCTGC AGCCAGATAT 
CAGCATAGGA AAGCAAGAAT TCTTTGAACG CCTCTATGTA ATGTATACCG TTGGCTACTC 
CATCTCTTTT GGTTCCTTGG CTGTGGCTAT TCTCATCATT GGTTACTTCA GACGATTGCA 
TTGCACTAGG AACTATATCC ACATGCACTT ATTTGTGTCT TTCATGCTGA GAGCTACAAG 
CATCTTTGTC AAAGACAGAG TAGTCCATGC TCACATAGGA GTAAAGGAGC TGGAGTCCCT 
AATAATGCAG GATGACCCAC AAAATTCCAT TGAGGCAACT TCTGTGGACA AATCACAATA 
TATCGGGTGC AAGATTGCTG TTGTGATGTT TATTTACTTC CTGGCTACAA ATTATTATTG 
GATCCTGGTG GAAGGTCTCT ACCTGCATAA TCTCATCTTT GTGGCTTTCT TTTCGGACAC 
CAAATACCTG TGGGGCTTCA TCTTGATAGG CTGGGGGTTT CCAGCAGCAT TTGTTGCAGC 
ATGGGCTGTG GCACGAGCAA CTCTGGCTGA TGCGAGGTGC TGGGAACTTA GTGCTGGAGA 
CATCAAGTGG ATTTATCAAG CACCGATCTT AGCAGCTATT GGGCTGAATT TTATTCTGTT 
TCTGAATACG GTTAGAGTTC TAGCTACCAA AATCTGGGAG ACCAATGCAG TTGGGCATGA 
CACAAGGAAG CAATACAGGA AACTGGCCAA ATCGACACTG GTCCTGGTCC TAGTCTTTGG 
AGTGCATTAC ATCGTGTTCG TATGCCTGCC TCACTCCTTC ACTGGGCTCG GGTGGGAGAT 
CCGCATGCAC TGTGAGCTCT TCTTCAACTC CTTTCAGGGT TTCTTTGTGT CTATCATCTA 
CTGCTACTGC AATGGAGAGG TTCAGGCAGA GGTGAAGAAG ATGTGGAGTC GGTGGAATCT 
CTCCGTGGAC TGGAAAAGGA CACCGCCATG TGGCAGCCGC AGATGCGGCT CAGTGCTCAC 
CACCGTGACG CACAGCACCA GCAGCCAGTC ACAGGTGGCG GCCAGCACAC GCATGGTGCT 
TATCTCTGGC AAAGCTGCCA AGATCGCCAG CAGACAGCCT GACAGCCACA TCACTTTACC 
TGGCTATGTC TGGAGTAACT CAGAGCAGGA CTGCCTGCCA CACTCTTTCC ACGAGGAGAC 
CAAGGAAGAT AGTGGGAGGC AGGGAGATGA TATTCTAATG GAGAAGCCTT CCAGGCCTAT 
GGAATCTAAC CCAGACACTG AAGGATGCCA AGGAGAAACT GAGGATGTTC TCTGAATGGA 
CATTTGTGGC TGACTTTCAT GGGCTGGTCC AATGGCTGGT TGTGTGAGAG GGCTTGGCTG 
ATACTCCTAT GCTTGAGTTC AAAGGCTGAA AATTCAGTTA AGGTGTTACT TAATAATAGT 
TTTTAGGCTC CATGAATTGG CTCCTGTAAA TACTAACGAC ATGAAAATGC AAGTGTCAAT 
GGAGTAGTTT ATTACCTTCT ATTGGCATCA AGTTTTCCTC TAAATTAATG TATGGTATTT 
GCTCTGTGAT TGTTCATTTT TTTCTGCTAC TTTTGGGTAG AAAAAAGATT CAATTGCTTG 
GCTGTAGCTT TCTCTCATAT ATATCACCCT AAATATAATG AAGATCTTTT AGTGTGTATC 
ATTTTCCTTT TAGAAACTAG TATTCTCTTA TTTCTTACTT TAATGTACTT CTATCACTGC 
ATTTATTTTG CCTGTGCATA GGAGCAATTA GGATCTAAAA AAATATATGG GAAGATAAAA 
GATCTAAGAA CAAGTACTTG CTGGAAAATT AGTTGGCTGG ACATTGATAA AATAATGCAT 
TTATAACAAT TACATGTGTT TTTGGGAACA AGGAAAATTT CTCAAAAAAG AATATTTCAC 
ACATCCCTTC TTTTGAATGG CCTCTTTGTG ACCAGCCAGA CCTCAGGTCT TCACTCTTTC 
TTCTTTGTAA ACCATGTCAT GTGGAAAGAT TTCCTCAGTT AGTGAGCTTG TGTCTGCAAA 
TTGATTTTGT TTGTAATGTA TTTTGATAGC AAATCATGCT GCATCTATAT CTTTTTCTTG 
TTTGAGCTGT TACTACATTG TACATGGCAT GTGGGATCAA TTAAAAATTT GTTTTAAAAA 
T 

Seq ID NO: 526 Protein sequence 
Protein Accession #: NP_005039 



1 
1 

MAGLGASLHV 
GNCFPEWDGL 
NYSDCLRFLQ 
MHLPVSFMLR 
VMPIYFLATN 
LADARCWELS 
LAKSTLVLVL 
QAEVKKMWSR 
IASRQPDSHI 
GCQGETEDVL 



11 
I 

WGWLMLGSCL 
ICWPRGTVGK 
PDISIGKQEP 
ATSIPVKDRV 
YYWIIiVEGLY 
AGDIKWIYQA 
VFGVHYIVFV 
WNLSVDWKRT 
TLPGYVWSNS 



21 
I 

LARAQItDSDG 
ISAVPCPPYI 
FERLYVMYTV 
VHAHIGVKEL 
LHNLIFVAFF 
PILAAIGLNP 
CLPHSFTGLG 
PPCGSRRCGS 
EQDCLPHSFH 



31 

! 

TITIEEQIVL 
YDFNHKGVAP 
GYSISFGSLA 
BSLIMQDDPQ 
SDTKYLNGFI 
ILFLNTVRVL 
WEIRMHCELF 
VLTTVTHSTS 
EETKEDSGRQ 



41 
I 

VLKAKVQCEL 
RHCNPNGTWD 
VAILIIGYFR 
NSIEATSVDK 
LIGWGFPAAP 
ATKIWETNAV 
FNSFQGFFV9 
SQSQVAASTR 
GDDILMEKP3 



Seq ID NO: 527 DNA sequence 
Nucleic Acid Accession &: XM_036683 
Coding sequence: 38.. 3655 



60 
120 
1B0 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1660 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 



51 
I 

NITAQLQEGE 
FMHSLNKTWA 
RLHCTRNYIH 
SQYIGCKIAV 
VAAWAVARAT 
GHDTRKQYRK 
IIYCYCNGEV 
MVLISGKAAK 
RPMESNPDTE 



60 
120 
180 
240 
300 
360 
420 
480 
540 



1 
I 

GCTTTGCCCA 
GATAGCAGCC 
AATGGTGCAA 
CAGCACAAAT 
GTCAGCGAGG 
AAAAGATTTT 
GGAGTGGCAT 



11 
I 

GTAGTTGGAA 
GCTCTGGTAG 
ATAGATTTAC 
CTAGAAGGAC 
GACGTGTCTT 
CATCTGCGAC 
GAGACATCTC 



21 
I 

AGTGAACTCG 
AGGTTAGGAC 
CAATAAAGAG 
GCTATCTCTC 
CCAACCCTGA 
TAAAGCCCAA 
TGGTGCCTGG 



31 
I 

ACTCGTGATG 
TTCAGCTGAT 
ATATAGAGAG 
CCATACTCTT 
GCAGTTGTTC 
CACTCAACTA 
GAATATAACC 



41 
I 

GTTCTCCTGT 
GGACAAGCTG 
TATGAGCTGG 
TCTGCGAGTC 
TTTAACATCA 
GTAGCTCCTG 
GATCCCATTA 



51 
I 

CACTTTGGTT 60 

GTAATGAAGA 120 

TGACTCCAGT 180 

ACAAAAAGAG 240 

CGGCATTTGG 300 

GGGCTGTTGT 360 

ACAACCATCA 420 



1156 



WO 03/042661 



PCTAJS02/36810 



ACCAGGAAGT GCTACGTATA GAATCCGGAA AACAGAGCCT TTGCAGACTA ACTGTGCTTA 480 
TGTTGGTGAC ATCX5TGGACA TTCCAGGAAC CTCTGTTGCC ATCAGCAACT GTGATGGTCT 540 
GGCTGGAATG ATAAAAAGTG ATAATGAAGA GTATTTCATT GAACCCTTGG AAAGAGGTAA 600 
ACAGATGGAG GAAGAAAAAG GAAGGATTCA TGTTGTCTAC AAGAGATCAG CTGTAGAACA 660 
5 GGCTCCCATA GACATGTCCA AAGACTTCCA CTACAGAGAG TCGGACCTGG AAGGCCTTGA 720 
TGATCTAGGT ACTGTTTATG GCAACATCCA CCAGCAGCTG AATGAAACAA TGAGACGCCG 780 
CAGACACGCG GGAGAAAACG ATTACAATAT CGAGGTACTG CTGGGAGTGG ATGACTCTGT 840 
GGTCCGTTTC CATGGCAAAG AGCACGTCCA AAACTACCTC CTGACCCTAA TGAACATTGT 900 
GAATGAAATT TACCATGATG AGTCCCTCGG AGTGCATATA AATGTGGTCC TGGTGCGCAT 960 
10 GATAATGCTG GGATATGCAA AGTCCATCAG CCTCATAGAA AGGGGAAACC CATCCAGAAG 1020 
CTTGGAGAAT GTGTGTCGCT GGGCGTCCCA ACAGCAAAGA TCTGATCTCA ACCACTCTGA 1080 
ACACCATGAC CATGCAATTT TTTTAACCAG GCAAGACTTT GGACCTGCTG GAATGCAAGG 1140 
ATATGCTCCA GTCACCGGCA TGTGTCATCC AGTGAGAAGT TGTACCCTGA ATCATGAGGA 1200 
TGGTTTTTCA TCTGCTTTTG TAGTAGCCCA TGAAACGGGC CATGTGTTGG GAATGGAGCA 1260 
15 TGATGGACAA GGCAACAGGT GTGGTGATGA GACTGCTATG GGAAGTGTCA TGGCTCCCTT 1320 
GGTACAAGCA GCATTCCATC GTTACCACTG GTCCCGATGC AGTGGTCAAG AACTGAAAAG 1380 
ATATATCCAT TCCTATGACT GTCTCCTTGA TGACCCTTTT GATCATGATT GGCCTAAACT 1440 
CCCAGAACTT CCTGGAATCA ATT ATT CT AT GGATGAGCAA TGTCGTTTTG ATTTTGGTGT 1500 
TGGCTATAAA ATGTGCACCG CQTTCCGAAC CTTTGACCCA TGTAAACAGC TGTGGTGTAG 1560 
20 CCATCCTGAT AATCCCTACT TTTGTAAGAC TAAAAAGGGA CCTCCACTTG ATGGGACTGA 1620 
ATGTGCTGCT GGAAAATGGT GCTATAAGGG TCATTGCATG TGGAAGAATG CTAATCAGCA 1680 
AAAACAAGAT GGCAATTGGG GGTCATGGAC TAAATTTGGC TCCTGTTCTC GGACATGTGG 1740 
AACTGGTGTT CGTTTCAGAA CACGCCAGTG CAATAATCCC ATGCCCATCA ATGGTGGTCA 1800 
GGATTGTCCT GGTGTTAATT TTGAGTACCA GCTTTGTAAC ACAGAAGAAT GCCAAAAACA 1860 
25 CTTTGAGGAC TTCAGAGCAC AGCAGTGTCA GCAGCGAAAC TCCCACTTTG AATACCAGAA 1920 
TACCAAACAC CACTGGTTGC CATATGAACA TCCTGACCCC AAGAAAAGAT GCCACCTTTA 1980 
CTGTCAGTCC AAGGAGACTG GAGATGTTGC TTACATGAAA CAACTGGTGC ATGATGGAAC 2040 
GCACTGTTCT TACAAAGATC CATATAGCAT ATGTGTGCGA GGAGAGTGTG TGAAAGTGGG 2100 
CTGTGATAAA GAAATTGGTT CTAATAAGGT TGAGGATAAG TGTGGTGTCT GTGGAGGAGA 2160 
30 TAATTCCCAC TGCCGAACCG TGAAGGGGAC ATTTACCAGA ACTCCCAGGA AGCTTGGGTA 2220 
CCTTAAGATG TTTGATATAC CCCCTGGGGC TAGACATGTG TTAATCCAAG AAGACGAGGC 2200 
TTCTCCTCAT ATTCTTGCTA TTAAGAACCA GGCTACAGGC CATTATATTT TAAATGGCAA 2340 
AGGGGAGGAA GCCAAGTCGC GGACCTTCAT AGATCTTGGT GTGGAGTGGG ATTATAACAT 2400 
TGAAGATGAC ATTGAAAGTC TTCACACCGA TGGACCTTTA CATGATCCTG TTATTGTTTT 2460 
35 GATTATACCT CAAGAAAATG ATACCCGCTC TAGCCTGACA TATAAGTACA TCATCCATGA 2520 
AGACTCTGTA CCTACAATCA ACAGCAACAA TGTCATCCAG GAAGAATTAG ATACTTTTGA 2580 
GTGGGCTTTG AAGAGCTGGT CTCAGTGTTC CAAACCCTGT GGTGGAGGTT TCCAGTACAC 2640 
TAAATATGGA TGCCGTAGGA AAAGTGATAA TAAAATGGTC CATCGCAGCT TCTGTGAGGC 2700 
CAACAAAAAG CCGAAACCTA TTAGACGAAT GTGCAATATT CAAGAGTGTA CACATCCACT 2760 
40 CTGGGTAGCA GAAGAATGGG AACACTGCAC CAAAACCTGT GGAAGTTCTG GCTATCAGCT 2820 
TCGCACTGTA CGCTGCCTTC AGCCACTCCT TGATGGCACC AACCGCTCTG TGCACAGCAA 2880 
ATACTGCATG GGTGACCGTC CCGAGAGCCG CCGGCCCTGT AACAGAGTGC CCTGCCCTGC 2940 
ACAGTGGAAA ACAGGACCCT GGAGTGAGTG TTCAGTGACC TGCGGTGAAG GAACGGAGGT 3000 
GAGGCAGGTC CTCTGCAGGG CTGGGGACCA CTGTGATGGT GAAAAGCCTG AGTCGGTCAG 3060 
45 AGCCTGTCAA CTGCCTCCTT GTAATGATGA ACCATGTTTG GGAGACAAGT CCATATTCTG 3120 
TCAAATGGAA GTGTTGGCAC GATACTGCTC CATACCAGGT TATAACAAGT TATGTTGTGA 3180 
GTCCTGCAGC AAGCGCAGTA GCACCCTGCC ACCACCATAC CTTCTAGAAG CTGCTGAAAC 3240 
TCATGATGAT GTCATCTCTA ACCCTAGTGA CCTCCCTAGA TCTCTAGTGA TGCCTACATC 3300 
TTTGGTTCCT TATCATTCAG AGACCCCTGC AAAGAAGATG TCTTTGAGTA GCATCTCTTC 3360 
50 AGTGGGAGGT CCAAATGCAT ATGCTGCTTT CAGGCCAAAC AGTAAACCTG ATGGTGCTAA 3420 
TTTACGCCAG AGGAGTGCTC AGCAAGCAGG AAGTAAGACT GTGAGACTGG TCACCGTACC 3480 
ATCCTCOCCA CCCACCAAGA GGGTCCACCT CAGTTCAGCT TCACAAATGG CTGCTGCTTC 3540 
CTTCTTTGCA GCCAGTGATT CAATAGGTGC TTCTTCTCAG GCAAGAACCT CAAAGAAAGA 3600 
TGGAAAGATC ATTGACAACA GACGTCCGAC AAGATCATCC ACCTTAGAAA GATGAGAAAG 3660 
55 TGAACCAAAA AGGCTAGAAA CCAGAGGAAA ACCTGGACAA CCTCTCTCTT CCCATGGTGC 3720 
ATATGCTTGT TTAAAGTGGA AATCTCTATA GATCGTCAGC TCATTTTATC TGTAATTGGA 3780 
AGAACAGAAA GTGCTGGCTC ACTTTCTAGT TGCTTTCATC CTCCTTTTGT TCTGCATTGA 3840 
CTCATTTACC AGAATTCATT GGAAGAAATC ACCAAAGATT ATTACAAAAG AAAAATATGT 3900 
TGCTAAGATT GTGTTGGTCX3 CTCTCTGAAG CAGAAAAGGG ACTGGAACCA ATTGTGCATA 3960 
60 TCAGCTGACT TTTTGTTTGT TTTAGAAAAG TTACAGTAAA AATTAAAAAG AGATACCAAT 4020 
GGTTTACACT TTAACAAGAA ATTTTGGATA TGGAACAAAG AATTCTTAGA CTTGTATTCC 4080 
TATTTATCTA TATTAGAAAT ATTGTATGAG CAAATTTGCA GCTGTTGTGT AAATACTGTA 4140 
TATTGCAAAA ATCAGTATTA TTTTAAGAGA TGTGTTCTCA AATGATTGTT TACTATATTA 4200 
CATTTCTGGA TGTTCTAGGT GCCTGTCGTT GAGTATTGCC TTGTTTGACA TTCTATAGGT 4260 
65 TAATTTTCAA AGCAGAGTAT TACAAAAGAG AAGTTAGAAT TACAGCTACT GACAATATAA 4320 
AGGGTTTTGT TGAATCAACA ATGTGATACG TAAATTATAG AAAAAGAAAA GAAACACAAA 4380 
AGCTATAGAT ATACAGATAT CAGCTTACCT ATTGCCTTCT ATACTTATAA TTTAAAGGAT 4440 
TGGTGTCTTA GTACACTTGT GGTCACAGGG ATCAACGAAT AGTAAATAAT GAACTCGTGC 4500 
AAGACAAAAC TGAAACCCTC TTTCCAGGAC CTCAGTAGGC ACCGTTGAGG TGTCCTTTGT 4560 
70 TTTTGTGTGT GTGTGTTCTT TTTTAATTTT CGCATTGTTG ACAGATACAA ACAGTTATAC 4620 
TCAATGTACT GTAATAATCG CAAAGGAAAA AGTTTTGGGA TAACTTATTT GTATGTTGGT 4680 
AGCTGAGAAA AATATCATCA GTCTAGAATT GATATTTGAG TATAGTAGAG CTTTGGGGCT 4740 
TTGAAGGCAG GTTCAAGAAA GCATATGTCG ATGGTTGAGA TATTTATTTT CCATATGGTT 4800 
CATGTTCAAA TGTTCACAAC CACAATGCAT CTGACTGCAA TAATGTGCTA ATAATTTATG 4860 
75 TCAGTAGTCA . CCTTGCTCAC AGCAAAGCCA GAAATGCTCT CTCCAGGGAG TAGATGTAAA 4920 
GTACTTGTAC ATAGAATTCA GAACTGAAGA TATTTATTAA AAGTTGATTT TTTTTTCTTG 4980 
ATAGTATTTT TATGTACTAA ATATTTACAC TAATATCAAT TACATATTTT GGTAAACTAG 5040 
AGAGACATAA TTAGAGATGC ATGCTTTGTT CTGTGCATAG AGACCTTTAA GCAAACTACT 5100 
D _ ACAGCCAACT CAAAAGCTAA AACTGAACAA ATTTGATGTT ATGCAAACAT CTTGCATTTT 5160 
80 TAGTAGTTGA TATTAAGTTG ATGACTTGTT TCCCTTCAAG GAAACATTAA ATTGTATGGA 5220 
CTCAGCTAGC TGTTCAATGA AATTGTGAAT TAGAAACATT TTTAAAAGTT TTTGAAAGAG 5280 
ATAAGTGCAT CATGAATTAC ATGTACATGA GAGGAGATAG TGATATCAGC ATAATGATTT 5340 
TGAGGTCAGT ACCTGAGCTG TCTAAAAATA TATTATACAA ACTAAAATGT AGATGAATTA 5400 
ACCTCTCAAA GCACAGAATG TGCAAGAACT TTTGCATTTT AATCGTTGTA AACTAACAGC 5460 
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TTAAACTATT 
GTCAAATTAG 
CCTAAAAATT 
CCACAGGGGA 
ATACTTCTGA 
TACACTACTG 
AT 



GACTCTATAC CTCTAAAGAA TTGCTGCTAC TTTGTGCAAG AACTTTGAAG 5520 

GCAAATTCCA GATAGTAAAA CAATCCCTAA GCCTTAAGTC TTTTTTTTTT 5580 

CCCATAGAAT AAAATTCTCT CTAGTTTACT TGTGTGTGCA TACATCTCAT 5640 

AGATAAAGAT GGTCACACAA ACAGTTTCCA TAAAGATGTA CATATTCATT 5700 

CCTTTGGGCT TTCTTTTCTA CTAAGCTAAA AATTCCTTTT TATCAAAGTG 5760 

ATGCTGTTTG TTGTACTGAG AGCACGTACC AATAAAAATG TTAACAAAAT 5820 



Seq ID NO: 528 Protein sequence 
Protein Accession fl: XPJJ36683 



1 
1 

MVLLSLWLIA 
LSASHKKRSA 
TDPINNHQPG 
IEPLERGKQM 
LNETMRRRRH 
INWLVRMIM 
FGPAGMQGYA 
MGSVMAPLVQ 
QCRFDFGVGY 
MWKNANQQKQ 
NTEECQKHFE 
KQLVHDGTHC 
RTPRKLGYLK 
GVEWDYNIED 
QEELDTFEWA 
IQECTHPLWV 
OJRVPCPAQW 
LGDKSIFCQM 
RSLVMPTSLV 
TVRLVTVPSS 
STLER 



11 
I 

AALVEVRTSA 
RDVSSNPEQL 
SATYRIRKTE 
EEEKGRIHW 
AGENDYNIEV 
LGYAKSISLI 
PVTGMCHPVR 
AAFHRYHWSR 
KMCTAFRTFD 
DGNWGSWTKF 
DFRAQQOQQR 
SYKDPYSICV 
MFDIPPGARH 
DIESLHTDGP 
LKSWSQCSKP 
AEEWEHCTRT 
KTGPWSECSV 
EVIARYCSIP 
PYHSETPAKK 
PPTKRVHLSS 



21 
I 

OGQAGNEEMV 
FFNITAFGKD 
PLQTNCAYVG 
YKRSAVEQAP 
LLGVDDSWR 
ERGNPSRSLE 
SCTLNHEDGF 
CSGQELKRYI 
PCKQLWCSHP 
GSCSRTCGTG 
NSHFEYQNTK 
RGECVKVGCD 
VLIQEDEASP 
LHDPVIVLI1 
CGGGFQYTKY 
CGSSGYQLRT 
TCGEGTEVRQ 
GYNKLCCESC 
MSLSSISSVG 
ASQMAAASFF 



31 
I 

QIDLPIKRYR 
FHLRLKPNTQ 
DIVDIPGTSV 
IDMSKDFHYR 
FHGKEHVQNY 
NVCRWASQQQ 
SSAFWAHET 
HSYDCLLDDP 
DNPYFCKTKK 
VRFRTRQCNN 
HHWLPYEHPD 
KEIGSNKVED 
HILAIKNQAT 
PQENDTRSSL 
GCRRKSDNKM 
VRCLQPLLDG 
VLCRAGDHCD 
SKRSSTLPPP 
GPNAYAAFRP 
AASDSIGASS 



41 
I 

EYELVTPVST 
LVAPGAWEW 
AISNCDGLAG 
ESDIiEGLDDL 
LLTLMNIVNE 
RSDLNHSEHH 
GHVLGMEHDG 
FDHDWPKLPE 
GPPLDGTECA 
PMPINGGQDC 
PKKRCKLYCQ 
KCGVCGGDNS 
GHYILNGKGE 
TYKYIIHEDS 
VHRSFCEANK 
TNRSVHSKYC 
GEKPESVRAC 
YLLEAAETHD 
NSKPDGANLR 
QARTSKKDGK 



51 
I 

KLEGRYLSHT 
HETSLVPGNI 
MIKSDNEEYF 
GTVYGNIHQQ 
IYHDESLGVH 
DHAIFLTRQD 
QGNRCGDETA 
LPGINYSMDE 
AGKWCYKGHC 
PGVNFEYQLC 
SKETGDVAYM 
HCRTVKGTFT 
EAKSRTFIDL 
VPTINSNNVI 
KPKPIRRMCN 
MGDRPESRRP 
QLPPCNDEPC 
DVISNPSDLP 
QRSAQQAGSK 
I IDNRRPTRS 



Seq ID NO: 529 DNA sequence 
Nucleic Acid Accession #: NM_002774 
Coding sequence: 246.. 980 



1 
I 

AGGCGGACAA 
AGCCCGGGGC 
GCTGGCTCGC 
GGCAGCTGTT 
CGGCCATGAA 
AGAATAAGTT 
TCTACACCTC 
CAGCTGOCCA 
AAAGGGAGAG 
ATGCCGCCAG 
CTGAACTCAT 
ACATCCTGGG 
ACATCCACCT 
ACATGTTGTG 
GTCCGCTGGT 
GATCAAAGGA 
AAACCATTCA 
TGGCTGGTTC 
TGACCCTGAT 
CAGCTCCATC 
TACCCCCACC 
TTCCACACGT 
CTGCTTACTG 
AAGATGAAGA 
TTTCACACCT 
TATTTT 



11 

I 

AGCCCGATTG 
AGGGGCGGGG 
TCTCTCCTGG 
CCTTCCCCCG 
GAAGCTGATG 
GGTGCATGGC 
GGGCCACTTG 
CTGCAAAAAA 
TTCCCAGGAG 
CCATGACCAG 
CCAGCCCCTT 
CTGGGGCAAG 
GGTGTCCCGT 
TGCTGGGGAT 
ATGTGGAGAC 
GAAGCCAGGA 
GGCCAAGTGA 
CAGAACGTCT 
GCTTAATAAA 
CTTGCATCAC 
ACTAAGAGAA 
TTGATTTCTT 
TCGGTGTCCC 
TAAGGATGAT 
ATGACATACA 



21 
I 

TTCCTGGGCC 
GCCAGTGTGG 
GGACACAGAG 
ACTCAAGAAT 
GTGGTGCTGA 
GGACCCTGCG 
CTCTGTGGTG 
CCGAATCTTC 
CAGAGTTCTG 
GACATCATGC 
CCCCTGGAGA 
ACAGCAGATG 
GAGGAGTGTG 
GAGAAGTACG 
CACCTCCGAG 
GTCTACACCA 
CCCTGACATG 
CTCACCTAGA 
CGCAGCGACG 
TGGGGAGGAC 
TACAGGAAAA 
CCTGCAGAGG 
CTTGGGATGT 
ACAGTCTCCA 
TGGGATAGCA 



31 

! 

CTTTCCCCAT 
TGACACACGC 
GTCGGCAGGC 
CCCCGGAGGC 
GTCTGATTGC 
ACAAGACATC 
GGGTCCTTAT 
AGGTCTTCCT 
TTGTCCGGGC 
TGTTGCGCCT 
GGGACTGCTC 
GTGATTTCCC 
AGCATGCCTA 
GGAAGGATTC 
GCCTTGTGTC 
ACGTCTGCAG 
TGACATCTAC 
CCTTGCCTCC 
TGAGGGTCCT 
GTGATGAGTG 
TCCCTTCTAG 
CCCAGCCACG 
ACCTTTCTTC 
TCAGGCAGTG 
CCTGGGCCGC 



41 
I 

CGCGCCTGGG 
TGTAGCTGTC 
AGCACAGAGA 
CCGGAGGCCT 
TGCAGCCTGG 
TCACCCCTAC 
CCATCCACTG 
GGGGAAGCAT 
TGTGATCCAC 
GGCACGCCCA 
AGCCAACACC 
TGACACCATC 
CCCTGGCCAG 
CTGCCAGGGT 
ATGGGGTAAC 
ATACACGAAC 
CTCCOGACCT 
CCTCCTCTCC 
GATTCTCCCT 
AGGACTTGGG 
GCATCTCCTC 
TGTCTGGAAT 
ACTGCAGATT 
GCTGTTGGAA 
CATGCACTCA 




GGGACCTACG 
GCAGCAGGAG 
GCAGAGGAGC 
CAAGCTGCCC 
TGGGTCCTCA 
AACCTTCGGC 
CCTGACTATG 
GCCAAACTCT 
ACCAGCTGCC 
CAGTGTGCAT 
ATCACCCAGA 
GATTCTGGGG 

TGGATCCAAA 
ACCACCCCAC 
TGCCCAGCTC 
GGTTTTACCC 
TCCTCGGTCT 
TCCCCAACCC 
CCCAGCTCCG 
TCTCACCTGT 
AGATTTAAGA 
ATAAAGAATG 



Seq ID NO: 530 Protein sequence 
Protein Accession #: NP 002765 
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60 
120 
180 
240 
300 
360 
420 
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600 
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1020 
1080 
1140 
1200 
1260 
1320 
1380 
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1500 



1 11 21 31 41 51 

I I I I I I 

MKKLMWLSL IAAAWAEEQN KLVHGGPCDK TSHPYQAALY TSGHLLCGGV LIHPLWVLTA 
AHCKKPNIjQV FLGKHNLRQR ESSQEQSSW RAVIHPDYDA ASKDQDIMLL RLARPAKLSE 
LIQPLPLERD CSANTTSCHI LGWGKTADGD FPDTIOCAYI HLVSREECEH AYPGQITQNM 
LCAGDEKYGK DSCQGDSGGP LVCGDHLRGL VSWGNIPCGS KEKPGVYTNV CRYTNWIQKT 
IQAK 

Seq ID NO: 531 DNA sequence 
Nucleic Acid Accession ft: NM_012152 
Coding sequence: 43.. 1104 
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CTTCTTTAAA 
GACAAGCACA 
GGAACAAAGC 
TCTAATTCTC 
TACCTGTTGG 
ATGTTTAACA 
GGGCTTCTGG 
AGGCACATGT 
CTGCTCATTT 
TGGAATTGCC 
TACCTTGTTT 
CTGCGGATCT 
TCCATCAGCC 
GCGTTTOTGQ 
AGGCAGTGTG 
GTCGTGAACC 
ATGATCTGCT 
GTCCTCAGCA 
GTCTGCAATA 
GTCTTAGG 



TTTCTTTCTA 
TGGACTTTTT 
TTGTGATTGT 
TGGTCATCGC 
CTAATTTAGC 
CAGGCCCAGT 
ACAGTAGCTT 
CAATCATGAG 

TCTGCAACAT 
TCTGGACAGT 
ACGTGTACGT 
GCCGGAGGAC 
TATGCTGGAC 
GCGTGCAGCA 
CCATCATCTA 
GCTTCTCTCA 
GGAGTGACAC 
AAAGCACTTC 



GGATGTTCAC 
TTATAATAGG 
TTTGTGTGTT 
GGCAGTGATC 
TGCTGCCGAT 
TTCAAAAACT 
GACTGCTTCC 
GATGCGGGTC 
GGCCATCGCC 
CTCTGCCTGC 
GTCCAACCTC 
CAAGAGGAAA 
ACCCATGAAG 
CCCGGGCCTG 
TGTGAAAAGG 
CTCCTACAAG 
GGAGAACCCA 
AGGCAGCCAG 
CTAAACTCTG 



TTCTTCTCCA 
AGCAACACTG 
GGGACGTTTT 
AAAAACAGAA 
TTCTTCGCTG 
TTGACTGTCA 
CTCACCAACT 
CATAGCAACC 
ATTTTTATGG 
TCTTCCCTGG 
ATGGCCTTCC 
ACCAACGTCT 
CTAATGAAGA 
GTGGTTCTGC 
TGGTTCCTGC 
GACGAGGACA 
GAGAGGCGTC 
TACATAGAGG 
GATGCCTCTC 



CAATGAATGA 
ATACTGTCGA 
TCTGCCTGTT 
AATTTCATTT 
GAATTGCCTA 
ACCGCTGGTT 
TGCTGGTTAT 
TGACCAAAAA 
GGGCGGTCCC 
CCCCCATTTA 
TCATCATGGT 
TGTCTCCGCA 
CGGTGATGAC 
TCCTCGACGG 
TGCTGGCGCT 
TGTATGGCAC 
CCTCTCGCAT 
ATAGTATTAG 
GGCCCACCCA 



I 

GTGTCACTAT 
TGACTGGACA 
TATTTTTTTT 
CCCCTTCTAC 
TGTATTCCTG 
TCTCCGTCAG 
CGCCGTGGAG 
GAGGGTGACA 
CACACTGGGC 
CAGCAGGAGT 
TGTGGTGTAC 
TACAAGTGGG 
TGTCTTAGGG 
CCTGAACTGC 
GCTCAACTCC 
CATGAAGAAG 
CCCCTCCACA 
CCAAGGTGCA 
GGTGATGACT 



60 
120 
180 
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Seq ID NO: 532 Protein sequence 
Protein Accession #: NP 036284 



1 
I 

KNECHYDKHM 
FHFPFYYLLA 
LVIAVERHMS 
PIYSRSYLVF 
VMTVLGAFW 
YGTMKKMICC 



11 
I 

DFFYNRSNTD 
NLAAADFFAG 
IMRMRVHSNL 
WTVSNLMAFL 
CWTPGLWLL 
FSQENPERRP 



21 
1 

TVDDWTGTKL 
IAYVFLMFNT 
TKKRVTLLIL 
IMVWYLRIY 
LDGLNCRQCG 
SRIPSTVLSR 



31 

1 

VrVLCVGTFF 
GPVSKTLTVN 
LVWAIAIFMG 
VYVKRKTNVL 
VQHVKRWFLL 
SDTGSQYIED 



41 



51 



CLFIFFSNSL VIAAVIKNRK 
RWFLRQGLLD SSLTASLTNL 
AVPTLGWNCL CNISACSSLA 
SPHTSGSISR RRTPMKLMKT 
LALLNSWNP IIYSYKDEDM 
SISQGAVCNK STS 



Seq ID NO: S33 DNA sequence 
Nucleic Acid Accession #: NM_002821 
Coding sequence: 150.. 3362 

1 11 21 31 41 51 

I I 1 I I I 

AACTCCCGCC TCGGGACGCC TOGGGGTCGG GCTCCGGCTG CGGCTGCTGC TGCGGCGCCC 
GCGCTCCGGT GCGTCCGCCT CCTGTGCCCG CCGCGGAGCA GTCTGCGGCC CGCCGTGCGC 
CCTCAGCTCC TTTTCCTGAG CCCGCCGCGA TGGGAGCTGC GCGGGGATCC CCGGCCAGAC 
CCCGCCGGTT GCCTCTGCTC AGCGTCCTGC TGCTGCCGCT GCTGGGCGGT ACCCAGACAG 
CCATTGTCTT CATCAAGCAG CCGTCCTCCC AGGATGCACT GCAGGGGCGC CGGGCGCTGC 
TTCGCTGTGA GGTTGAGGCT CCGGGCCCGG TACATGTGTA CTGGCTGCTC GATGGGGCCC 
CTGTCCAGGA CACGGAGCGG CGTTTCGCCC AGGGCAGCAG CCTGAGCTTT GCAGCTGTGG 
ACCGGCTGCA GGACTCTGGC ACCTTCCAGT GTGTGGCTCG GGATGATGTC ACTGGAGAAG 
AAGCCCGCAG TGCCAACGCC TCCTTCAACA TCAAATGGAT TGAGGCAGGT CCTGTGGTCC 
TGAAGCATCC AGCCTCGGAA GCTGAGATCC AGCCACAGAC CCAGGTCACA CTTCGTTGCC 
ACATTGATGG GCACCCTCGG CCCACCTACC AATGGTTCCG AGATGGGACC CCCCTTTCTG 
ATGGTCAGAG CAACCACACA GTCAGCAGCA AGGAGCGGAA CCTGACGCTC CGGCCAGCTG 
GTCCTGAGCA TAGTGGGCTG TATTCCTGCT GCGCCCACAG TGCTTTTGGC CAGGCTTGCA 
GCAGCCAGAA CTTCACCTTG AGCATTGCTG ATGAAAGCTT TGCCAGGGTG GTGCTGGCAC 
CCCAGGACGT GGTAGTAGCG AGGTATGAGG AGGCCATGTT CCATTGCCAG TTCTCAGCCC 
AGCCACCCCC GAGCCTGCAG TGGCTCTTTG AGGATGAGAC TCCCATCACT AACCGCAGTC 
GCCCCCCACA CCTCCGCAGA GCCACAGTGT TTGCCAACGG GTCTCTGCTG CTGACCCAGG 
TCCGGCCACG CAATGCAGGG ATCTACCGCT GCATTGGCCA GGGGCAGAGG GGCCCACCCA 
TCATCCTGGA AGCCACACTT CACCTAGCAG AGATTGAAGA CATGCCGCTA TTTGAGCCAC 
GGGTGTTTAC AGCTGGCAGC GAGGAGCGTG TGACCTGCCT TCCCCCCAAG GGTCTGCCAG 
AGCCCAGCGT GTGGTGGGAG CACGCGGGAG TCCGGCTGCC CACCCATGGC AGGGTCTACC 
AGAAGGGCCA GGAGCTGGTG TTGGCCAATA TTGCTGAAAG TGATGCTGGT GTCTACACCT 
GCCACGCGGC CAACCTGGCT GGTCAGCGGA GACAGGATGT CAACATCACT GTGGCCACTG 
TGCCCTCCTG GCTGAAGAAG CCCCAAGACA GCCAGCTGGA GGAGGGCAAA CCCGGCTACT 
TGGATTGCCT GACCCAGGCC ACACCAAAAC CTACAGTTGT CTGGTACAGA AACCAGATGC 
TCATCTCAGA GGACTCACGG TTCGAGGTCT TCAAGAATGG GACCTTGCGC ATCAACAGCG 
TGGAGGTGTA TGATGGGACA TGGTACCGTT GTATGAGCAG CACCCCAGCC GGCAGCATCG 
AGGCGCAAGC CCGTGTCCAA GTGCTGGAAA AGCTCAAGTT CACACCACCA CCCCAGCCAC 
AGCAGTGCAT GGAGTTTGAC AAGGAGGCCA CGGTGCCCTG TTCAGCCACA GGCCGAGAGA 
AGCCCACTAT TAAGTGGGAA CGGGCAGATG GGAGCAGCCT CCCAGAGTGG GTGACAGACA 
ACGCTGGGAC CCTGCATTTT GCCCGGGTGA CTCGAGATGA CGCTGGCAAC TACACTTGCA 
TTGCCTCCAA CGGGCCGCAG GGCCAGATTC GTGCCCATGT CCAGCTCACT GTGGCAGTTT 
TTATCACCTT CAAAGTGGAA CCAGAGCGTA CGACTGTGTA CCAGGGCCAC ACAGCCCTAC 
TGCAGTGCGA GGCCCAGGGG GACCCCAAGC CGCTGATTCA GTGGAAAGGC AAGGACCGCA 
TCCTGGACCC CACCAAGCTG GGACCCAGGA TGCACATCTT CCAGAATGGC TCCCTGGTGA 
TCCATGACGT GGCCCCTGAG GACTCAGGCC GCTACACCTG CATTGCAGGC AACAGCTGCA 
ACATCAAGCA CACGGAGGCC CCCCTCTATG TCGTGGACAA GCCTGTGCCG GAGGAGTCGG 
AGGGCCCTGG CAGCCCTCCC CCCTACAAGA TGATCCAGAC CATTGGGTTG TCGGTGGGTG 
CCGCTGTGGC CTACATCATT GCCGTGCTGG GCCTCATGTT CTACTGCAAG AAGCGCTGCA 
AAGCCAAGCG GCTGCAGAAG CAGCCCGAGG GCGAGGAGCC AGAGATGGAA TGCCTCAACG 
GAGGGCCTTT GCAGAACGGG CAGCCCTCAG CAGAGATCCA AGAAGAAGTG GCCTTGACCA 
GCTTGGGCTC CGGCCCCGCG GCCACCAACA AACGCCACAG CACAAGTGAT AAGATGCACT 
TCCCACGGTC TAGCCTGCAG CCCATCACCA CGCTGGGGAA GAGTGAGTTT GGGGAGGTGT 
TCCTGGCAAA GGCTCAGGGC TTGGAGGAGG GAGTGGCAGA GACCCTGGTA CTTGTGAAGA 

1159 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 



WO 03/042661 



PCT/US02/36810 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



GCCTGCAGAC 
GGAAGCTGAA 
ACTACATGGT 
AGAGCAAGGA 
GCACCCAGGT 
TGGCTGCGCG 
TCAGCAAGGA 
GCTGGATGTC 
CCTTCGGTGT 
CAGATGATGA 
GCTGCCCTTC 
GGCCCTCCTT 
GAGGAGGGAG 
CAGCATGATG 
TTGCTGAGGT 
GGCTGACTTG 
CTCTTCCTCT 
TTCTCCCCTT 
AGGCTTGGGA 
AGGGTTAATG 
ACACAGCAAG 
CCCCACCCTT 
CTTTTGACAC 
TGCAGCGTGG 
GCCATCCTTA 
TGTTTTTGTT 



GAAGGATGAG 
CCACGCCAAC 
GCTGGAATAT 
TGAAAAATTG 
AGCCCTGGGC 
TAACTGCCTG 
TGTGTACAAC 
CCCCGAGGCC 
GCTGATGTGG 
AGTACTGGCA 
CAAACTCTAT 
CAGTGAGATT 
CCCGCTCAGG 
GGCAAGATCC 
CTGAGCAGGG 
GACCCAAACT 
ATCAGGGACA 
GACCGGGTCC 
TGAGCTGGGT 
AGTCTCTTGC 
TGAGTCCTCC 
CTCTCCTTTC 
TATATAAACC 
GGTGGGTGGG 
CCCCACACTT 
TTTACACTCG 



CAGCAGCAGC 
GTGGTGOGGC 
GTGGATCTGG 
AAGTCACAGC 
ATGGAGCACC 
GTCAGTGCCC 
AGTGAGTACT 
ATCCTGGAGG 
GAAGTGTTTA 
GATTTGCAGG 
CGGCTGATGC 
GCCAGCGCCC 
ATGGCCTGGG 

CCTGGCCTTT 
GGGCGACTAG 
GTGTGGGTGC 
AACTCTGCCA 
TTGTGGGGAQ 
CCACTGGTCC 
CCACTCTGGG 
CTCATCCTAA 
GCCCTTTTTG 
CATGGGAGGT 
TTATTGTTGT 
CTGCTCTCAA 



TGGACTTCCG 
TCCTGGGGCT 
GAGACCTCAA 
CCCTCAGCAC 
TGTCCAACAA 
AGAGACAAGT 
ACCACTTCCG 
GTGACTTCTC 
CACATGGAGA 
CTGGGAAGGC 
AGCGCTGCTG 
TGGGAGACAG 
CAGGGGAGGA 
GGGCCCTGAG 
CCTCCTCTTC 
GGCTTTGAGC 
CACAGGTAAC 
CTCATCTGCC 
TTOCTTAATA 
ACTTGGGGGT 
CTTGTGCACA 
GTGCCTGGCA 
TATGCACCAC 
AGGGGTGGGC 
CGTTTTTTGT 
TAAATAAGCC 



GAGGGAGTTG 
GTGCOGGGAG 
GCAGTTCCTG 
CAAGCAGAAG 



GAAGGTGTCT 
CCAGGCCTGG 
TACCAAGTCT 
GATGCCCCAT 
TAGACTTCCT 
GGCCCTCAGC 
CACCGTGGAC 
CATCTCTAGA 
GTGCCCTAGT 
CTCACCCTCA 
TGGGCAGTTT 
CCCAATTTCT 
AACTTTGCCT 
TTCTCAAGTT 
CTAGACCAGG 
CTGACCCAGA 
GATGAAGGAG 
GGGCGGCTTT 
CCTGGAGATG 
TTGTTTTGTT 
TTTTTTA 



GAGATGTTTG 
GCTGAGCCCC 
AGGATTTCCA 
GTGGCCCTAT 
CATAAGGACT 
GCCCTGGGCC 
GTGCCGCTGC 
GATGTCTGGG 
GGTGGGCAGG 
CAGCCCGAGG 
CCCAAGGACC 
AGCAAGCCGT 
GGGAAGCTCA 
GCAACAGGCA 
TCCTTTGGGA 
CCCCTGCCAC 
GGCCTTCAAC 
GGGGAGGGCT 
CTGGGCACAC 
ATTATAGAGG 
CCCACGTCTT 
TTTTCAGGAG 
TATATGTAAT 
AGGAGGGTGG 
TTTTTGTTTT 



Seq ID NO: 534 Protein sequence 
Protein Accession #: NP 002812 



MGAARGSPAR 
VHVYWLLDGA 
IKWIEAGPW 
KERNLTLRPA 
BAMFHCOFSA 
CIGQGQRGPP 
VRIiPTHGRVY 
SQLEEGKPGY 
CMSSTPAGSI 
GSSLPEWVTD 
TTVYQGHTAL 
RYTCIAGNSC 
GLMFYCKKRC 
KRHSTSDKMH 
LDFRRELEMF 
PLSTKQKVAli 
YHFRQAWVPL 
AGKARLPQPE 



n 
I 

PRRLPLLSVL 
PVQDTERRPA 
LKHPASEAEI 
GPEHSGLYSC 
QPPPSLQWLF 
IILEATLHLA 
QKGHELVLAN 
LDCLTQATPK 
EAQARVQVLE 
NAGTLHFARV 
LQCEAQGDPK 
NIKHTEAPLY 
KAKRLQKQPE 
FPRSSLQPIT 
GKLNHANWR 
CTQVALGMEH 
RWMSPEAILE 
GCPSKLYRLM 



21 
I 

LLPLLGGTQT 
QGSSLSFAAV 
QPQTQVTLRC 
CAHSAFGQAC 
EDETPITNRS 
EIEDMPLPEP 
IAESDAGVYT 
PTWWYRNQM 
KLKFTPPPQP 
TRDDAGNYTC 
PLIQWKGXDR 
WDKPVPEES 



TLGKSEFGEV 
LLGLCREAEP 
LSNNRFVHKD 
GDFSTKSDVW 
QRCWALSPKD 



31 

1 

AIVFIKQPSS 
DRLQDSGTFQ 
HIDGHPRPTY 
SSQNFTLSIA 
RPPHLRRATV 
RVFTAGSEER 
CHAANLAGQR 
LISEDSRFEV 
QQCMEFDKEA 
IASNGPQGQI 
ILDPTKLGPR 
EGPGSPPPYK 
GGPLQNGQPS 
FLAKAQGLEE 
HYMVLEYVDL 
LAARNCLVSA 
AFGVLMWEVF 
RPSFSEIASA 



41 
I 

QDALQGRRAL 
CVARDDVTGE 
QWFRDGTPLS 
DESFARWLA 
FANGSLLLTQ 
VTCLPPKGLP 
RQDVNITVAT 
FKNGTLRINS 
TVPCSATGRE 
RAHVQLTVAV 
MHIFQKGSLV 
MIQTIGLSVG 
AEIQEEVALT 
GVAETLVLVK 
GDLKQFLRIS 
QRQVKVSALG 



Seq ID NO: S35 DNA sequence 
Nucleic Acid Accession #: NM_013952 
Coding sequence: 161.. 1357 



1 
I 

TTCAGAAGGA 
TGAGGGCCTG 
AGCCCCGAGC 
ATCTGGCCAT 
GGAAGTGGTC 
CTCTCGCCAG 
GACTGGCAGC 
GGTGGTGGAG 
CCGAGACCGG 
CATTAATAGA 
CGTGGCCACC 
CCCGGAGTCA 
CATCGCTCAG 
ACTAAGCATT 
CTTCAGCCAG 
GGCCTATGCC 
CAACAGCACC 
CAACCTCTCG 
CAAGTCGGCT 
CAGTTCACGG 
TACCCACCCC 
ATGGTGGCAG 
AGCGAGGCCT 
ACATCAAGGC 
TGGGGACAGT 




2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
34B0 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 



51 
I 

LRCEVEAPGP 
EARS ANAS FN 
DGQSNHTVSS 
PQDVWARYE 
VRPRNAGIYR 
EPSVWWEHAG 
VPSWLKKPQD 
VEVYDGTWYR 
KPTIKWERAD 
FITFKVEPER 
IHDVAPEDSG 
AAVAYIIAVL 
SLGSGPAATN 
SLQTKDEOOO 
KSKDEKLKSQ 
LSKDVYNSEY 
ADDEVLADLQ 



CGAGTGCACC 
G 



CCTACCCCGT 
GCCCTTCAAT 
CCTCTCAGGG 
CAGCGGACAG 
CTCTGGCAAT 
CAACTCCAGC 
GCCCACCACT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



ACCCAAGCAG 
TACGGGAGGA 
ACTCCATCAG 
GACCTCTGCC 
CCTGCGACAT 
GGTACTACGA 
CCACCCCCAA 
CCTGGGAGAT 
GTGTCAGCTC 
TGGACAGCTG 
CTGTAACTCC 
GGCTCCTGGG 
ATAGCTGCCG 
GCACGGATGC 
ACTACCCAGA 
TGCCCTTGCT 
CACTGGGGCG 
GGATCTGCAG 
CGTGTACGGG 
GCTGCCCGGA 
CATCGCAGGC 
CTCCTCCTAC 
TTACAGTTCC 
GTAGTTGCCA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
13B0 
1440 
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Seq ID NO: 536 Protein sequence 
Protein Accession ft: NP_039246 

_ 1 11 21 31 41 51 

5 I | | | I I 

MPHNSIRSGH GGLNQLGGAF VNGRPLPEW RQRIVDLAHQ GVRPCDISRQ LRVSHGCVSK 60 

ILGRYYETGS IRPGVIGGSK PKVATPKWE KIGDYKRQNP TMFAWEIRDR LLAEGVCDND 120 

TVPSVSSINR IIRTKVQQPF NLPMDSCVAT KSLSPGHTLI PSSAVTPPES PQSDSLGSTY 180 

SINGLLGIAQ PGSDKRKMDD SDQDSCRLSI DSQSSSSGPR KHLRTDAFSQ HHLEPLECPF 240 

10 ERQHYPEAYA SPSHTKGEQG LYPLPLLNST LDDGKATLTP SNTPLGRNLS THQTYPWAA 300 

PPFWICSKSA PGSRPSMPFP MLPPCTGSSR ARPSSQGERW WGPRCPDTHP TSPPADRAAM 360 
PPLPSQAWWQ EVNTLAMPMA TPPTPPTARP GASPTPAC 

Seq ID NO: 537 DNA sequence 
15 Nucleic Acid Accession #: NM_003466.1 
Coding sequence: 11..1363 

1 11 21 31 41 51 

ZU GAATTCGGCG ATGCCTCACA ACTCCATCAG ATCTGGCCAT GGAGGGCTGA ACCAGCTGGG 60 

AGGGGCCTTT GTGAATGGCA GACCTCTGCC GGAAGTGGTC CGCCAGCGCA TCGTAGACCT 120 

GGCCCACCAG GGTGTAAGGC CCTGCGACAT CTCTCGCCAG CTCCGCGTCA GCCATGGTTG 180 

CGTCAGCAAG ATCCTTGGCA GGTACTACGA GACTGGCAGC ATCCGGCCTG GAGTGATAGG 240 

GGGCTCCAAG CCCAAGGTGG CCACCCCCAA GGTGGTGGAG AAGATTGGGG ACTACAAACG 300 

25 CCAGAACCCT ACCATGTTTG CCTGGGAGAT CCGAGACCGG CTCCTGGCTG AGGGCGTCTG 360 

TGACAATGAC ACTGTGCCCA GTGTCAGCTC CATTAATAGA ATCATCCGGA CCAAAGTGCA 420 

GCAACCATTC AACCTCCCTA TGGACAGCTG CGTGGCCACC AAGTCCCTGA GTCGCGGACA 480 

CACGCTGATC CCCAGCTCAG CTGTAACTCC CCCGGAGTCA CCCCAGTCGG ATTCCCTGGG 540 

CTCCACCTAC TCCATCAATG GGCTCCTGGG CATCGCTCAG CCTGGCAGCG ACAAGAGGAA 600 

30 AATGGATGAC AGTGATCAGG ATAGCTGCCG ACTAAGCATT GACTCACAGA GCAGCAGCAG 660 

CGGACCCCGA AAGCACCTTC GCACGGATGC CTTCAGCCAG CACCACCTCG AGCCGCTCGA 720 

GTGCCCATTT GAGCGGCAGC ACTACCCAGA GGCCTATGCC TCCCCCAGCC ACACCAAAGG 780 

CGAGCAGGGC CTCTACCCGC TGCCCTTGCT CAACAGCACC CTGGACGACG GGAAGGCCAC 840 

CCTGACCCCT TCCAACACGC CACTGGGGCG CAACCTCTCG ACTCACCAGA CCTACCCCGT 900 

35 GGTGGCAGAT CCTCACTCAC CCTTCGCCAT AAAGCAGGAA ACCCCCGAGG TGTCCAGTTC 960 

TAGCTCCACC CCTTCCTCTT TATCTAGCTC CGCCTTTTTG GATCTGCAGC AAGTCGGCTC 1020 

CGGGGTCCCO CCCTTCAATG CCTTTCCCCA TGCTGCCTCC GTGTACGGGC AGTTCACGGG 1080 

CCAGGCCCTC CTCTCAGGGC GAGAGATGGT GGGGCCCACG CTGCCCGGAT ACCCACCCCA 1140 

CATCCCCACC AGCGGACAGG GCAGCTATGC CTCCTCTGCC ATCGCAGGCA TGGTGGCAGG 1200 

40 AAGTGAATAC TCTGGCAATG CCTATGGCCA CACCCCCTAC TCCTCCTACA GOGAGGCCTG 1260 

GCGCTTCCCC AACTCCAGCT TGCTGAGTTC CCCATATTAT TACAGTTCCA CATCAAGGCC 1320 
GAGTGCACCG CCCACCACTG CCACGGCCTT TGACCATCTG TAGTTGAAGC TT 

Seq ID NO: 538 Protein sequence 
45 Protein Accession # : NP_0034S7 

1 11 21 31 41 51 

I I I I I I 

MPHNSIRSGH GGLNQLGGAF VNGRPLPEW RQRIVDLAHQ GVRPCDISRQ LRVSHGCVSK 60 

50 ILGRYYETGS IRPGVIGGSK PKVATPKWE KIGDYKRQNP TMFAWEIRDR LLAEGVCDND 120 

TVPSVSSINR IIRTKVQQPF NLPMDSCVAT KSLSPGHTLI PSSAVTPPES PQSDSLGSTY 180 

SINGLLGIAQ PGSDKRKMDD SDQDSCRLSI DSQSSSSGPR KHLRTDAFSQ HHLEPLECPF 240 

ERQHYPEAYA SPSHTKGEQG LYPLPLLNST LDDGKATLTP SNTPLGRNLS THQTYPWAD 300 

PHSPFAIKQE TPEVSSSSST PSSLSSSAFL DLQQVGSGVP PFNAFPHAAS VYGQFTGQAL 360 

55 LSGREMVGPT LPGYPPHIPT SGQGSYASSA IAGMVAGSEY SGNAYGHTPY SSYSEAWRFP 420 
NSSLLSSPYY YSSTSRPSAP PTTATAFDHL 

Seq ID NO: 539 DNA sequence 
Nucleic Acid Accession #: NMJ)06799 
60 Coding sequence: 19.. 963 

1 11 21 31 41 51 

I I I I I I 

GCCGCGGGAG AGGAGGCCAT GGGCGCGCGC GGGGCGCTGC TGCTGGCGCT GCTGCTGGCT 60 

65 CGGGCTGGAC TCAGGAAGCC GGAGTCGCAG GAGGCGGCGC CGTTATCAGG ACCATGCGGC 120 

CGACGGGTCA TCACGTCGCG CATCGTGGGT GGAGAGGACG CCGAACTCGG GCGTTGGCCG 180 

TGGCAGGGGA GCCTGCGCCT GTGGGATTCC CACGTATGCG GAGTGAGCCT GCTCAGCCAC 240 

CGCTGGGCAC TCACGGCGGC GCACTGCTTT GAAACCTATA GTGACCTTAG TGATCCCTCC 300 

GGGTGGATGG TCCAGTTTGG CCAGCTGACT TCCATGCCAT CCTTCTGGAG CCTGCAGGCC 360 

70 TACTACACCC GTTACTTCGT ATCQAATATC TATCTGAGCC CTCGCTACCT GGGGAATTCA 420 

CCCTATGACA TTGCCTTGGT GAAGCTGTCT GCACCTGTCA CCTACACTAA ACACATCCAG 480 

CCCATCTGTC TCCAGGCCTC CACATTTGAG TTTGAGAACC GGACAGACTG CTGGGTGACT 540 

GGCTGGGGGT ACATCAAAGA GGATGAGGCA CTGCCATCTC CCCACACCCT CCAGGAAGTT 600 

CAGGTCGCCA TCATAAACAA CTCTATGTGC AACCACCTCT TCCTCAAGTA CAGTTTCCGC 660 

75 AAGGACATCT TTGGAGACAT GGTTTGTGCT GGCAATGCCC AAGGCGGGAA GGATGCCTGC 720 

TTCGGTGACT CAGGTGGACC CTTGGCCTGT AACAAGAATG GACTGTGGTA TCAGATTGGA 780 

GTCGTGAGCT GGGGAGTGGG CTGTGGTCGG CCCAATCGGC CCGGTGTCTA CACCAATATC 840 

AGCCACCACT TTGAGTGGAT CCAGAAGCTG ATGGCCCAGA GTGGCATGTC CCAGCCAGAC 900 

on CCCTCCTGGC CGCTACTCTT TTTCCCTCTT CTCTGGGCTC TCCCACTCCT GGGGCCGGTC 960 

O0 TGAGCCTACC TGAGCCCATG CAGCCTGGGG CCACTGCCAA GTCAGGCCCT GGTTCTCTTC 1020 
TGTCTTGTTT GGTAATAAAC ACATTCCAGT TGATGCCTTG CAGGGCATTC TTCAAAA 

Seq ID NO: 540 Protein sequence 
Protein Accession #: NP 006790 
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1 11 21 31 41 51 

I I I I I I 

KGARGALLLA LLLARAGLRK PESQEAAPLS GPCGRRVITS RIVGGEDAKL GRWPWQGSLR 60 
LWDSHVCGVS LLSHRWALTA AHCFETYSDL SDPSGWMVQF GQLTSMPSFW SLQAYYTRYF 120 
VSNIYLSPRY LGNSPYDIAL VKLSAPVTYT KHIQPICLQA STFEPENRTO CWVTGWGYIK 180 
EDEALPSPHT LQEVQVAI IN NSMCNHLFLK YSFRKDIFGD MVCAGNAQGG KDACFGDSGG 240 
PLACNKNGLW YQIGWSWGV GCGRPNRPGV YTNISHHFEW IQKLMAQSGM SQPDPSWPLL 300 
FFPLLWALPL LGPV 

Seq ID NO: 541 DNA sequence 
Nucleic Acid Accession U: NM_014344 
Coding sequence: 131.. 1444 



1 11 21 31 41 51 

I I I I I I 

GGGGCCGCGA TGGGGCCGAA GCGCCCGAAG CCCCGGAGCC CACAAACTGC CGGGCCCGCC 60 

TCGCCGCCGG GACCCGGGTG CCTGGGCTCG GCTTGAAGCG GCGGCGGCGC ACOGGCACAG 120 

CCGCGGGAGC ATGGGCAGGA GGATGCGGGG CGCCGCCGCC ACCGCGGGGC TCTGGCTGCT 180 

20 GGCGCTGGGC TCGCTGCTGG CGCTGTGGGG AGGGCTCCTG CCGCCGCGGA CCGAGCTGCC 240 

CGCCTCCCGG CCGCCCGAAG ACCGACTCCC ACGGCGCCCG GCCCGGAGCG GCGGCCCCGC 300 

GCCCGCGCCT CGCTTCCCTC TGCCCCCGCC CCTGGCGTGG GACGCCCGCG GCGGCTCCCT 360 

GAAAACTTTC CGGGOGCTGC TCACCCTGGC GGCCGGCGCG GACGGCCCGC CCCGGCAGTC 420 

CCGGAGCGAG CCCAGGTGGC ACGTGTCAGC CAGGCAGCCC CGGCCGGAGG AGAGCGCCGC 480 

25 GGTGCAOGGG GGCGTCTTCT GGAGCCGCGG CCTGGAGGAG CAGGTGCCCC CGGGCTTTTC 540 

GGAGGCCCAG GCGGCGGCGT GGCTGGAGGC GGCTCGCGGC GCCCGGATGG TGGCCCTGGA 600 

GCGCGGGGGT TGCGGGCGCA GCTCCAACCG ACTGGCCCGT TTTGCCGACG GCACCCGCGC 660 

CTGCGTGCGC TACGGCATCA ACCCGGAGCA GATTCAGGGC GAGGCCCTGT CTTACTATCT 720 

GGCGCGCCTG CTGGGCCTCC AGCGCCACGT GCCGCCGCTG GCACTGGCTC GGGTGGAGGC 780 

30 TCGGGGOGCG CAGTGGGCGC AGGTGCAGGA GGAGCTGCGC GCTGCGCACT GGACCGAGGG 840 

CAGCGTGGTG AGCCTGACAC GCTGGCTGCC CAACCTCACG GACGTGGTGG TGCCCGCGCC 900 

CTGGCGCTCG GAGGACGGCC GTCTGCGCCC CCTCCGGGAT GCCGGGGGTG AGCTGGCCAA 960 

CCTCAGCCAG GCGGAGCTGG TGGACCTAGT ACAATGGACC GACTTAATCC TTTTCGACTA 1020 

CCTGACGGCC AACTTCGACC GGCTCGTAAG CAACCTCTTC AGCCTGCAGT GGGACCCGCG 1080 

35 CGTCATGCAG CGTGCCACCA GCAACCTGCA CCGCGGTCCG GGCGGGGCGC TGGTCTTTCT 1140 

GGACAATGAG GGGGGCTTGG TGCACGGCTA CCGGGTAGCA GGCATGTGGG ACAAGTATAA 1200 

CGAGCOGCTG TTGCAGTCAG TGTGCGTGTT CCGCGAGCGG ACCGCGCGGC GCGTCCTGGA 1260 

GCTGCACCGC GGACAGGACG CCGCGGCCCG GCTGCTGCGC CTCTACCGGC GCCACGAGCC 1320 

TCGCTTCCCC GAGCTGGCCG CCCTTGCAGA CCCCCACGCT CAGCTGCTAC AGCGCCGCCT 1380 

40 CGACTTCCTC GCCAAGCACA TTTTGCACTG TAAGGCCAAG TACGGCCGCC GGTCTGGGAC 1440 

TTAGTGTCAC CGGGAGGAAA AGAGAGAGAT CTGGGGCTGG GGTATGGATG ATGGGGGGAA 1500 

GGGCGGTCGC CTCTGCCACT GTCAGGGACC AGCCGGCCAA CGCCCACCGG CAAAGGTGTC 1560 

TAAAAACTTC AGCTTTTCAC CCACCTGCCC CTTTCTTTCA ATCCCACGCT GTTTCCTTTC 1620 

AAAGTTCTGG GAGGACGAAC TCACCGAGGC GAGAAGTGTA ACATTCTCTC CACCCAGCTT 1680 

ATAAAAGGAT TCTTTACTGT GCCAGCACGG GGATTGGATC CGAAGAAACT GGCTACTGGG 1740 

GTTTGGCCCC CGAGTGGCCG TCCCTGTGGG AGATGCACCC CATTCTTGGG CCCCCCTCAT 1800 

TCCCTTTCCG AAAAAGGAAA ACTTGCGTTT GAGCCGTTGA GCTAATTCTG CAATTTTCTA 1860 

CCAAACAGAG CGCTGGTGGC CCCGGAGCAG GGCTGTGACA TTGGCTGGTG GAGCCCCTTC 1920 

CTGTGTTCTC CCTTTGTTCC AGCGCCGCGA TGGTGAGATC ACTGTTCCAA GCAGGGGGAC 1980 

GGCTCGCGAT AGGACAAAGA GAGCAGGACC TCCAGACTCT GGGGAGCCCT GCAGACCTTG 2040 

ACAATTTGCC TGACTCATTC CTGACCTCTT GTCATTTTGG CCTGAAGGCT ACAAATTGAG 2100 

GGTCAGCTGT ATGCACTAAG TCAAATAATG AATTTCTTCC TCCCTCTCGC AACCGACCAA 2160 

AATTTTGACA ACGATGATGT TCACCAGAAG GAAAAAAAAA TCAGTTTTAT GCACTTTATT 2220 

TTGTTTTGAT TTTCATTTTT TATTAAGAAA AAATTTTATT TTACAGAATT TACCTTCTCT 2280 

GTATATATGT GCATAAAGTG TGGTGTAAAT ATACTAAACA AACTTATATT TCAATAAAAG 2340 
GGAGTTTAAA ATTTAAAAAA AAAAAAA 



Seq ID NO: 542 Protein sequence 
Protein Accession ft: NP_055159 



1 11 21 31 41 51 

I I I I I 1 

MGRRMRGAAA TAGLWLLALG SLLALWGGLL PPRTELPASR PPEDRLPRRP ARSGGPAPAP 60 

RFPLPPPLAW DARGGSLKTF RALLTLAAGA DGPPRQSRSE PRWHVSARQP RPEESAAVHG 120 

65 GVFWSRGLEE QVPPGFSEAQ AAAWLEAARG ARMVALERGG CGRSSNRIAR FADGTRACVR 180 

YGINPEQIQG EALSYYLARL LGLQRHVPPL ALARVEARGA QWAQVQEELR AAHWTEGSW 240 

SLTRWLPNLT DVWPAPWRS EDGRLRPLRD AGGELANLSQ AELVDLVQWT DLILFDYLTA 300 

NFDRLVSNLF SLQWDPRVMQ RATSNLHRGP GGALVFLDNE AGLVHGYRVA GMWDKYNBPL 360 

LQSVCVFRER TARRVLELHR GQDAAARLLR LYRRHEPRFP ELAALADPHA QLLQRRLDFL 420 
AKHIZjHCKAK YGRRSGT 



Seq ID NO: 543 DNA sequence 

Nucleic Acid Accession #: XMJ)07652.4 

Coding sequence : 1 . . 12 90 

1 11 21 31 41 51 

! I I I I I 

ATGGCCGGCT CTGGCGCGTG GAAGCGCCTC AAATCTATGC TAAGGAAGGA TGATGCGCCG 60 
CTGTTTTTAA ATGACACCAG CGCCTTTGAC TTCTCGGATG AGGCGGGGGA CGAGGGGCTT 120 
TCTCGGTTCA ACAAACTTCG AGTTGTGGTG GCCGATGACG GTTCCGAAGC CCCGGAAAGG 180 
CCTGTTAACG GGGCGCACCC GACCCTCCAG GCCGACGATG ATTCCTTACT GGACCAAGAC 240 
TTACCTTTGA CCAACAGTCA GCTGAGTTTG AAGGTGGACT CCTGTGACAA CTGCAGCAAA 300 
CAGAGAGAGA TACTGAAGCA GAGAAAGGTG AAAGCCAGGT TGACCATTGC TGCCGTTCTG 360 
TACTTGCTTT TCATGATTGG AGAACTTGTA GGTGGATACA TTGCAAATAG CCTAGCAATC 420 

1162 
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ATGACAGATG CACTTCATAT GTTAACTGAC CTAAGCGCCA TCATACTCAC CCTGCTTGCT 480 

TTGTGGCTAT CATCAAAATC ACCAACCAAA AGATTCACCT TTGGATTTCA TCGCTTAGAG 540 

GTTTTGTCAG CTATGATTAG TGTGCTGTTG GTGTATATAC TTATGGGATT CCTCTTATAT 600 

GAAGCTGTGC AAAGAACTAT CCATATGAAC TATGAAATAA ATGGAGATAT AATGCTCATC 660 

ACCGCAGCTG TTGGAGTTGC AGTTAATGTA ATAATGGGGT TTCTGTTGAA CCAGTCTGGT 720 

CACCGTCACT CCCATTCCCA CTCCCTGCCT TCAAATTCCC CTACCAGAGG TTCTGGGTGT 780 

GAACGTAACC ATGGGCAGGA TAGCCTGGCA GTGAGAGCTG CATTTGTACA TGCTTTGGGA 840 

GATTTGGTAC AGAGTGTTGG TGTGCTAATA GCTGCATACA TCATACGATT CAAGCCAGAA 900 

TACAAGATTG CTGATCCCAT CTGTACATAC GTATTTTCAT TACTTGTGGC TTTTACAACA 960 

TTTCGAATCA TATGGGATAC AGTAGTTATA ATACTAGAAG GTGTGCCAAG CCATTTGAAT 1020 

GTAGACTATA TCAAAGAAGC CTTGATGAAA ATAGAAGATG TATATTCAGT CGAAGATTTA 1080 

AATATCTGGT CTCTCACTTC AGGAAAATCT ACTGCCATAG TTCACATACA GCTAATTCCT 1140 

GGAAGTTCAT CTAAATGGGA GGAAGTACAG TCCAAAGCAA ACCATTTATT ATTGAACACA 1200 

TTTGGCATGT ATAGATGTAC TATTCAGCTT CAGAGTTACA GGCAAGAAGT GGACAGAACT 1260 
TGTGCAAATT GTCAGAGTTC TAGTCCCTGA 

Seq ID NO: 544 Protein sequence 
Protein Accession 8: XP_007652.1 

1 11 21 31 41 51 

I I I I I I 

MAGSGAWKRL KSMLRKDDAP LFLNDTSAFD FSDEAGDEGL SRFNKLRWV ADDGSEAPER 60 

PVNGAHPTLQ ADDDSLLDQD LPLTNSQLSL KVDSCDNCSK QREILKQRKV KARLTIAAVL 120 

YLLFMIGELV GGYIANSLAI MTDALHMLTD LSAIILTLIiA LWLSSKSPTK RFTFGFHRLE 180 

VLSAMISVLL VYILMGFLLY EAVQRTIHMN YEINGDIMLI TAAVGVAVNV IMGFLLNQSG 240 

HRHSHSHSIiP SNSPTRGSGC ERNHGQDSLA VRAAFVHALG DLVQSVGVLI AAYIIRFKPE 300 

YKIADPICTY VFSLLVAFTT FRIIWDTWI ILEGVPSHLN VDYIKEALMK IEDVYSVEDL 360 

NIWSLTSGKS TAIVHIQLIP GSSSKWEEVQ SKANHLLLNT FGWYRCTIQL QSYRQEVDRT 420 
CANCQSSSP 

Seq ID NO: 545 DNA sequence 
Nucleic Acid Accession #: AB037765.1 
Coding sequence: 1..2478 

1 11 21 31 41 51 

I 1 I I I I 

ATGTTTTCCG GCTTCAATGT CTTTAGAGTT GGGATCTCTT TTGTCATAAT GTGCATTTTT 60 

TACATGCCAA CAGTAAACTC TTTACCAGAA CTGAGTCCTC AGAAATATTT TAGTACATTG 120 

CAACCAGGAA AAGCCTCTTT AGCTTATTTT TGTCAAGCTG ATTCCCCAAG AACATCTGTA 180 

TTTCTTGAAG AACTGAATGA GGCTGTTAGA CCTCTGCAGG ACTATGGAAT TTCAGTTGCC 240 

AAGGTTAATT GTGTCAAAGA AGAAATATCA AGATACTGTG GAAAAGAAAA GGATTTGATG 300 

AAAGCATATT TATTCAAGGG CAACATATTG CTCAGAGAAT TCCCTACTGA CACCTTGTTT 360 

GATGTGAATG CCATTGTCGC CCATGTTCTC TTTGCTCTTC TTTTTAGTGA AGTGAAATAT 420 

ATTACCAACC TGGAAGACCT TCAGAACATA GAAAATGCTC TGAAAGGAAA AGCAAATATT 480 

ATATTCTCAT ATGTAAGAGC CATTGGAATA CCAGAGCACA GAGCAGTCAT GGAAGCCGCT 540 

TTTGTGTATG GGACTACATA CCAATTTGTC TTAACCACAG AAATTGCCCT TTTGGAAAGT 600 

ATTGGCTCTG AGGATGTGGA ATATGCACAT CTCTACTTTT TTCATTGTAA ACTAGTCTTG 660 

GACTTGACCC AGCAATGTAG AAGAACACTA ATGGAACAGC CATTGACTAC ACTGAACATT 720 

CACCTGTTTA TTAAGACAAT GAAAGCACCT CTGTTGACTG AAGTTGCTGA AGATCCTCAA 780 

CAAGTTTCAA CTGTCCATCT CCAACTGGGC TTACCACTGG TTTTTATTGT TAGCCAACAG 840 

GCTACTTATG AAGCTGATAG AAGAACTGCA GAATGGGTTG CTTGGCGTCT TCTGGGAAAA 900 

GCAGGAGTTC TACTCTTGTT AAGGGACTCT TTGGAAGTGA ACATTCCTCA AGATGCTAAT 960 

GTGGTCTTCA AAAGAGCAGA AGAGGGAGTT CCAGTGGAAT TTTTGGTATT ACATGATGTT 1020 

GATTTAATAA TATCTCATGT GGAAAATAAT ATGCACATTG AGGAAATACA AGAAGATGAA 1080 

GACAATGACA TGGAAGGTCC AGATATAGAT GTTCAGGATG ATGAAGTGGC AGAAACTGTT 1140 

TTCAGAGATA GGAAGAGAAA ATTACCTTTG GAACTTACAG TGGAACTAAC AGAAGAAACA 1200 

TTTAATGCAA CAGTGATGGC TTCTGACAGC ATAGTACTCT TCTATGCTGG TTGGCAAGCA 1260 

GTATCCATGG CATTTTTGCA ATCCTATATT GATGTGGCAG TTAAACTGAA AGGCACATCT 1320 

ACTATGCTTC TTACTAGAAT AAACTGTGCA GATTGGTCTG ATGTATGTAC TAAGCAAAAT 1380 

GTTACTGAAT TTCCTATCAT AAAGATGTAC AAGAAAGGCG AGAACCCAGT ATCTTATGCT 1440 

GGAATGTTAG GAACCGAAGA TCTCCTAAAA TTTATCCAGC TCAACAGGAT TTCATATCCA 1500 

GTGAATATAA CATOGATCCA AGAAGCAGAA GAATATTTAA GTGGGGAATT ATATAAAGAC 1560 

CTCATCTTGT ATTCTAGTGT GTCAGTATTG GGACTATTTA GTCCAACCAT GAAAACAGCA 1620 

AAAGAAGATT TTAGTGAAGC AGGAAACTAC CTAAAAGGAT ATGTTATCAC TGGAATTTAT 1680 

TCTGAAGAAG ATGTTTTGCT ACTGTCAACC AAATATGCTG CAAGTCTTCC AGCCCTGCTG 1740 

CTTGCCAGAC ACACAGAAGG CAAAATAGAG AGCATCCCAC TAGCTAGCAC ACATGCACAA 1800 

GACATAGTTC AAATAATAAC AGATGCACTA CTGGAAATGT TTCCGGAAAT CACTGTGGAA 1860 

AATCTTCCCA GTTATTTCAG ACTTCAGAAA CCATTATTGA TTTTGTTCAG TGATGGCACT 1920 

GTAAATCCTC AGTATAAAAA AGCAATATTG ACACTGGTAA AGCAGAAATA CTTGGATTCA 1980 

TTTACTCCAT GCTGGTTAAA TCTAAAGAAT ACTCCAGTGG GGAGAGGAAT CTTGAGGGCA 2040 

TATTTTGATC CTCTGCCTCC CCTTCCTCTT CTTGTTTTGG TGAATCTGCA TTCAGGTGGC 2100 

CAAGTATTTG CATTTCCTTC AGACCAGGCT ATAATTGAAG AAAACCTTGT ATTGTGGCTG 2160 

AAGAAATTAG AAGCAGGACT AGAAAATCAT ATCACAATTT TACCTGCTCA AGAATGGAAA 2220 

CCTCCTCTTC CAGCTTATGA TTTTCTAAGT ATGATAGATG CCGCAACATC TCAACGTGGC 2280 
ACTAGGAAAG TTCCCAAGTG TATGAAAGAA ACAGATGTGC AGGAGAATGA TAAGGAACAA 2340 

CATGAAGATA AATCGGCAGT CAGAAAAGAA CCGATTGAAA CTCTGAGAAT AAAGCATTGG 2400 
AATAGAAGTA ATTGGTTTAA AGAAGCAGAA AAATCATTTA GACGTGATAA AGAGTTAGGA 2460 
TGCTCAAAAG TGAACTAA 

Seq ID NO: 546 Protein sequence 
Protein Accession #: BAA92582.1 

1 11 21 31 41 SI 

I I I I I I 



1163 



WO 03/042661 



MFSGFNVFRV GISFVIMCIP YMPTVNSLPE LSPQKYFSTL QPGKASLAYF CQADSPRTSV 60 

FLEELNEAVR PLQDYGISVA KVNCVKEEIS RYCGKEKDLM KAYLFKGNIL LREFPTDTLF 12 0 

DVNAIVAHVL FALLFSEVKY ITNLEDLQNI EN AL KG KAN I IFSYVRAIGI PBHRAVMEAA 180 

FVYGTTYQFV LTTEIALLES IGSEDVEYAH LYFFHCKLVL DLTQQCRRTL MBQPLTTLNI 240 

HLFIKTMKAP LLTEVAEDPQ QVSTVHLQLG LPLVFIVSQQ ATYEADRRTA EWVAVfRLLGK 300 

AGVLLLLRDS LEVNIPQDAN WFKRAESGV PVEFLVLHDV DLIISHVENN MHIEEIQEDE 360 

DNDMEGPDIO VQDDEVAETV FRDRKRKLPL ELTVELTEET FNATVMASDS IVLFYAGWQA 420 

VSMAFLQSYI DVAVKLKGTS TMLLTRINCA DWSDVCTKQN VTEFPIIKMY KKGENPVSYA 480 

GMLGTEDLLK FIQLNRISYP VNITSIQEAE EYLSGELYKD LILYSSVSVL GLFSPTMKTA S40 

KEDFSEAGNY LKGYVITGIY SEEDVLLLST KYAASLPALL LARHTEGKIE SIPLASTHAQ 600 

DIVQIITDAL LEMFPEITVE NLPSYFRLQK PLLILFSDGT VNPQYKKAIL TLVKQKYLDS 660 

FTPCWLNLKN TPVGRGILRA YFDPLPPLPL LVLVNLHSGG QVFAFPSDQA IIEENLVLWL 720 

KKLEAGLENH ITILPAQEWK PPLPAYDFLS MIDAATSQRG TRKVPKCMKE TDVQENDKEQ 780 
HEDKSAVRKE PIETLRIKHW NRSNWFKEAE KSFRRDKELG CSKVN 

Seq ID NO: 547 DMA sequence 

Nucleic Acid Accession #: NM_033102.1 

Coding sequence: 1. .1662 

1 11 21 31 41 51 

I I I 'I I I 

ATGGTCCAGA GGCTGTGGGT GAGCCGCCTG CTGCGGCACC GGAAAGCCCA GCTCTTGCTG 60 

GTCAACCTGC TAACCTTTGG CCTGGAGGTG TGTTTGGCCG CAGGCATCAC CTATGTGCCG 120 

CCTCTGCTGC TGGAAGTGGG GGTAGAGGAG AAGTTCATGA CCATGGTGCT GGGCATTGGT 180 

CCAGTGCTGG GCCTGGTCTG TGTCCCGCTC CTAGGCTCAG CCAGTGACCA CTGGCGTGGA 240 

CGCTATGGCC GCCGCCGGCC CTTCATCTGG GCACTGTCCT TGGGCATCCT GCTGAGCCTC 300 

TTTCTCATCC CAAGGGCCGG CTGGCTAGCA GGGCTGCTGT GCCCGGATCC CAGGCCCCTG 360 

GAGCTGGCAC TGCTCATCCT GGGCGTGGGG CTGCTGGACT TCTGTGGCCA GGTGTGCTTC 420 

ACTCCACTGG AGGCCCTGCT CTCTGACCTC TTCCGGGACC CGGACCACTG TCGCCAGGCC 480 

TACTCTGTCT ATGCCTTCAT GATCAGTCTT GGGGGCTGCC TGGGCTACCT CCTGCCTGCC 540 

ATTGACTGGG ACACCAGTGC CCTGGCCCCC TACCTGGGCA CCCAGGAGGA GTGCCTCTTT 600 

GGCCTGCTCA CCCTCATCTT CCTCACCTGC GTAGCAGCCA CACTGCTGGT GGCTGAGGAG 660 

GCAGCGCTGG GCCCCACCGA GCCAGCAGAA GGGCTGTCGG CCCCCTCCTT GTCGCCCCAC 720 

TGCTGTCCAT GCCGGGCCCG CTTGGCTTTC CGGAACCTGG GCGCCCTGCT TCCCCGGCTG 780 

CACCAGCTGT GCTGCCGCAT GCCCCGCACC CTGCGCCGGC TCTTCGTGGC TGAGCTGTGC 840 

AGCTGGATGG CACTCATGAC CTTCACGCTG TTTTACACGG ATTTCGTGGG CGAGGGGCTG 900 

TACCAGGGCG TGCCCAGAGC TGAGCCGGGC ACCGAGGCCC GGAGACACTA TGATGAAGGC 960 

GTTCGGATGG GCAGCCTGGG GCTGTTCCTG CAGTGCGCCA TCTCCCTGGT CTTCTCTCTG 1020 

GTCATGGACC GGCTGGTGCA GCGATTCGGC ACTCGAGCAG TCTATTTGGC CAGTGTGGCA 1080 

GCTTTCCCTG TGGCTGCCGG TGCCACATGC CTGTCCCACA GTGTGGCOGT GGTGACAGCT 1140 

TCAGCCGCCC TCACCGGGTT CACCTTCTCA GCCCTGCAGA TCCTGCCCTA CACACTGGCC 1200 

TCCCTCTACC ACCGGGAGAA GCAGGTGTTC CTGCCCAAAT ACCGAGGGGA CACTGGAGGT 1260 

GCTAGCAGTG AGGACAGCCT GATGACCAGC TTCCTGCCAG GCCCTAAGCC TGGAGCTCCC 1320 

TTCCCTAATG GACACGTGGG TGCTGGAGGC AGTGGCCTGC TCCCACCTCC ACCCGCGCTC 13 B0 

TGCGGGGCCT CTGCCTGTGA TGTCTCCGTA CGTQTGGTGG TGGGTGAGCC CACCGAGGCC 1440 

AGGGTGGTTC CGGGCCGGGG CATCTGCCTG GACCTCGCCA TCCTGGATAG TGCCTTCCTG 1500 

CTGTCCCAGG TGGCCCCATC CCTGTTTATG GGCTCCATTG TCCAGCTCAG CCAGTCTGTC 1560 

ACTGCCTATA TGGTGTCTGC CGCAGGCCTG GGTCTGGTCG GCATTTACTT TGCTACACAG 1620 
GTAGTATTTG ACAAGAGCGA CTTGGCCAAA TACTCAGCGT GA 

Seq ID NO i 548 Protein sequence 
Protein Accession fti NP_149093.1 

1 11 21 31 41 51 

I I I I I I 

MVQRLWVSRL LRHRKAQLLL VNLLTFGLEV CLAAGITYVP PLLLEVGVEE KFMTMVLGIG 60 

PVLGIiVCVPL LGSASDHWRG RYGRRRPFIW ALSLGILLSL PLIPRAGWLA GLLCPDPRPL 120 

ELALLILGVG LLDFCGQVCF TPLEALLSDL FRDPDHCRQA YSVYAFMISL GGCLGYLLPA 180 

IDWDTSAIAP YLGTQEECLF GLLTLIFLTC VAATLLVAEE AALGPTEPAE GLSAPSLSPH 240 

CCPCRARIiAF RNLGAUjPRL HQLCCRMPRT LRRLFVAELC SWMMHTFTL FYTDFVGEGL 300 

YQGVPRAEPG TEARRHYDEG VRMGSLGIiFIi QCAISLVFSL VMDRLVQRFG TRAVYLASVA 360 

AFPVAAGATC LSHSVAWTA SAALTGFTFS ALQILPYTLA SLYHREKQVF LPKYRGDTGG 420 

ASSEDSLMTS FLPGPKPGAP PPNGHVGAGG SGLLPPPPAL CGASACDVSV RVWGEPTBA 480 

RWPGRGICL DLAILDSAFL LSQVAPSLFM GSIVQLSQSV TAYMVSAAGL GLVAIYFATQ 540 
WFDKSDLAK YSA 

Seq ID NO: 549 DNA sequence 

Nucleic Acid Accession #i Eos sequence 

Coding sequence : 1 . . 13 89 

1 11 21 31 41 51 

I I I I I I 

ATGGGCTACC AGAGGCAGGA GCCTGTCATC CCGCCGCAGA GAGATTTAGA TGACAGAGAA 60 

ACCCTTGTTT CTGAACATGA GTATAAAGAG AAAACCTGTC AGTCTGCTGC TCTTTTTAAT 120 

GTTGTCAACT CGATTATAGG ATCTGGTATA ATAGGATTGC CTTATTCAAT GAAGCAAGCT 180 

GGGTTTCCTT TGGGAATATT GCTTTTATTC TGGGTTTCAT ATGTTACGGA CTTTTCCCTT 240 

GTTTTATTGA TAAAAGGAGG GGCCCTCTCT GGAACAGATA CCTACCAGTC TTTGGTCAAT 300 

AAAACTTTCG GCTTTCCAGG GTATCTGCTC CTCTCTGTTC TTCAGTTTTT GTATCCTTTT 360 

ATAGCAATGA TAAGTTACAA TATAATAGCT GGAGATACTT TGAGCAAAGT TTTTCAAAGA 420 

ATCCCAGGAG TTGATCCTGA AAACGTGTTT ATTGGTCGCC ACTTCATTAT TGGACTTTCC 480 

ACAGTTACCT TTACTCTGCC TTTATCCTTG TACOGAAATA TAGCAAAGCT TGGAAAGGTC 540 

TCCCTCATCT CTACAGGTTT AACAACTCTG ATTCTTGGAA TTGTAATGGC AAGGGCAATT 600 

TCACTGGGTC CACACATACC AAAAACAGAA GACGCTTGGG TATTTGCAAA GCCCAATGCC 660 

ATTCAAGCGG TCGGGGTTAT GTCTTTTGCA TTTATTTGCC ACCATAACTC CTTCTTAGTT 720 



1164 
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TACAGTTCTC 
GTGATTTCTG 
TTCACCCAAG 
AGATTTTGTT 
GAGGTAATTG 
ACAGTGATGG 
GTTCTAGAAC 
TGTTATCTGA 
ATGCTTCCCA 
CAAGACTGCA 
AATACCTCAG 
TTTCAATGA 



TAGAAGAACC 
TATTTATCTG 
GGGACTTATT 
ATGGTGTCAC 
CCAATGTGTT 
TCATCACTGT 
TCAATGGTGT 
AACTGTCTGA 
TTGGTGCTGT 
CCCATGGGCA 
AGTCTCATGT 



CACAGTAGCT 
TATATTCTTT 
TGAAAATTAC 
TGTCATTTTG 
TTTTGGTGGG 
AGCCACGCTT 
GCTCTGTGCA 
AGAACCAAGG 
GGTGATGGTT 
GGAAATGTTC 
TCAGCAGACA 



AAGTGGTCCC 
GCTACATGTG 
TGCAGAAATG 
ACATACCCTA 
AATCTTTCAT 
GTGTCATTGC 
ACTCCCCTCA 
ACACACTCCG 
TTTGGATTCG 
TACTGCTTTC 
ACACAACTTT 



GCCTTATCCA 
GATACTTGAC 
ATGACCTGGT 
TGGAATGCTT 
CGGTTTTCCA 
TGATTGATTG 
TTTTTATCAT 
ATAAGATTAT 
TCATGGCTAT 
CTGACAATTT 
CTACTTTAAA 



TATGTCCATC 
ATTTACTGGC 
AACATTTGGA 
TGTGACAAGA 
CATTGTTGTA 
CCTCGGGATA 
TCCATCAGCC 
GTCTTGTGTC 
TACAAATACT 
CTCTCTCACA 
TATTAGTATC 



Seq ID NO: 550 Protein sequence 
Protein Accession 8: Boa sequence 



MGYQRQEPVI 
GPPLGILLLF 
IAMISYNIIA 
SLISTGLTTL 
YSSLEEPTVA 
RFCYGVTVIL 
VLELNGVLCA 
QDCTHGQEMF 



11 

I 

PPQRDLDDRE 
WVSYVTDFSL 
GDTLSKVFQR 
ILGIVMARAI 
KWSRLIHMSI 
TYPMECFVTR 
TPLIFIIPSA 
YCFPDNFSLT 



21 
I 

TLVSEHEYKE 
VLLIKGGALS 
IPGVDPENVF 
SLGPHIPKTE 
VISVFICIFF 
EVIANVPFGG 
CYLKLSEEPR 
NTSESHVQQT 



31 
I 

KTCQSAALFN 
GTDTYQSLVN 
IGRHFIIGLS 
DAWVFAKPNA 
ATCGYLTFTG 
NLSSVFHIW 
THSDKIMSCV 
TQLSTLNISI 



41 

I 

WNSIIGSGI 
KTFGFPGYLL 
TVTFTLPLSL 
IQAVGVMSFA 
FTOGDLFENY 
TVMVITVATL 
MLPIGAWMV 
FQ 



51 
I 

IGLPYSMKQA 
LSVLQFLYPF 
YRNIAKLGKV 
FICHHNSFLV 
CRNDDLVTFG 
VSLLIDCLGI 
FGFVMAITNT 



Seq ID HO: 551 DNA sequence 

Nucleic Acid Accession tf: Eos sequence 

Coding sequence: 1..1284 



1 
I 

ATGGGCTACC 
AAGCAAGCTG 
TTTTCCCTTG 
TTGGTCAATA 
TATCCTTTTA 
TTTCAAAGAA 
GGACTTTCCA 
GGAAAGGTCT 
AGGGCAATTT 
CCCAATGCCA 
TTCTTAGTTT 
ATGTCCATCG 
TTTACTGGCT 
ACATTTGGAA 
GTGACAAGAG 
ATTGTTGTAA 
CTCGGGATAG 
CCATCAGCCT 
TCTTGTGTCA 
ACAAATACTC 
TCTCTCACAA 
ATTAGTATCT 



11 
I 

AGAGGCAGGA 
GGTTTCCTTT 
TTTTATTGAT 
AAACTTTCGG 
TAGCAATGAT 
TCCCAGGAGT 
CAGTTACCTT 
CCCTCATCTC 
CACTGGGTCC 
TTCAAGCGGT 
ACAGTTCTCT 
TGATTTCTGT 
TCACCCAAGG 
GATTTTGTTA 
AGGTAATTGC 
CAGTGATGGT 
TTCTAGAACT 
GTTATCTGAA 
TGCTTCCCAT 
AAGACTGCAC 
ATACCTCAGA 
TTCAACTCGA 



21 
I 

GCCTGTCATC 
GGGAATATTG 
AAAAGGAGGG 
CTTTCCAGGG 
AAGTTACAAT 
TGATCCTGAA 
TACTCTGCCT 
TACAGGTTTA 
ACACATACCA 
CGGGGTTATG 
AGAAGAACCC 
ATTTATCTGT 
GGACTTATTT 
TGGTGTCACT 
CAATGTGTTT 
CATCACTGTA 
CAATGGTGTG 
ACTGTCTGAA 
TGGTGCTGTG 
CCATGGGCAG 
GTCTCATGTT 
GTAA 



31 
I 

CCGCCGCAGA 
CTTTTATTCT 
GCCCTCTCTG 
TATCTGCTCC 
ATAATAGCTG 
AACGTGTTTA 
TTATCCTTGT 
ACAACTCTGA 
AAAACAGAAG 
TCTTTTGCAT 
ACAGTAGCTA 
ATATTCTTTG 
GAAAATTACT 
GTCATTTTGA 
TTTGGTGGGA 
GCCACGCTTG 
CTCTGTGCAA 
GAACCAAGGA 
GTGATGGTTT 
GAAATGTTCT 
CAGCAGACAA 



41 
I 

GAGGATTGCC 
GGGTTTCATA 
GAACAGATAC 
TCTCTGTTCT 
GAGATACTTT 
TTGGTCGCCA 
ACCGAAATAT 
TTCTTGGAAT 
ACGCTTGGGT 
TTATTTGCCA 
AGTGGTCCCG 
CTACATGTGG 
GCAGAAATGA 
CATACCCTAT 
ATCTTTCATC 
TGTCATTGCT 
CTCCCCTCAT 
CACACTCCGA 
TTGGATTCGT 
ACTGCTTTOC 
CACAACTTTC 



Seq ID NO: 552 Protein sequence 
Protein Accession Eos sequence 



780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



51 
1 

TTATTCAATG 
TGTTACAGAC 
CTACCAGTCT 
TCAGTTTTTG 
GAGCAAAGTT 
CTTCATTATT 
AGCAAAGCTT 
TGTAATGGCA 
ATTTGCAAAG 
CCATAACTCC 
CCTTATCCAT 
ATACTTGACA 
TGACCTGGTA 
GGAATGCTTT 
GGTTTTCCAC 
GATTGATTGC 
TTTTATCATT 
TAAGATTATG 
CATGGCTATT 
TGACAATTTC 
TACTTTAAAT 



1 
I 

MGYQRQEPVI 
LVNKTFGFPG 
GLSTVTFTLP 
PNAIQAVGVM 
FTGFTQGDLF 
IWTVMVITV 
SCVMLPIGAV 
ISIFQLE 



11 
I 

PPQRGLPYSM 
YLLLSVLQFL 
LSLYRNIAKL 
SFAFICHHNS 
ENYCRNDDLV 
ATLVSLLIDC 
VMVFGPVMAI 



21 
I 

KQAGFPLGIL 
YPFIAMISYN 
GKVSLISTGL 
FLVYSSLEEP 
TFGRFCYGVT 
LGIVLELNGV 
TNTQDCTHGQ 



31 
I 

LLFWVSYVTD 
IIAGDTLSKV 
TTLILGIVMA 
TVAKWSRLIH 
VILTYPMECF 
LCATPLIFII 
EMFYCFPDNF 



41 

I 

FSLVLLIKGG 
FQRIPGVDPE 
RAISLGPHIP 
MSIVISVFIC 
VTREVIANVF 
PSACYLKLSE 
SLTNTSESHV 



Seq ID NO: 553 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..1203 



1 
I 

ATGGGCTACC 
AAAGGAGGGG 
TTTCCAGGGT 
AGTTACAATA 
GATCCTGAAA 
ACTCTGCCTT 
ACAGGTTTAA 
CACATACCAA 



11 
I 

AGAGGCAGGA 
CCCTCTCTGG 
ATCTGCTCCT 
TAATAGCTGG 
ACGTGTTTAT 
TATCCTTGTA 
CAACTCTGAT 
AAACAGAAGA 



21 
I 

GCCTGTCATC 
AACAGATACC 
CTCTGTTCTT 
AGATACTTTG 
TGGTCGCCAC 
CCGAAATATA 
TCTTGGAATT 
CGCTTGGGTA 



31 
I 

CCGCCGCAGT 
TACCAGTCTT 
CAGTTTTTGT 
AGCAAAGTTT 
TTCATTATTG 
GCAAAGCTTG 
GTAATGGCAA 
TTTGCAAAGC 



41 
I 

TTTCCCTTGT 
TGGTCAATAA 
ATCCTTTTAT 
TTCAAAGAAT 
GACTTTCCAC 
GAAAGGTCTC 
GGGCAATTTC 
CCAATGCCAT 



60 
120 
180 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



51 
I 

ALSGTDTYQS 
NVFIGRHFII 
KTEDAWVPAK 
IFFATCGYLT 
FGGNLSSVFH 
EPRTHSDKIM 
QQTTQLSTLN 



60 
120 
180 
240 
300 
360 
420 



51 
1 

TTTATTGATA 
AACTTTOGGC 
AGCAATGATA 
CCCAGGAGTT 
AGTTACCTTT 
CCTCATCTCT 
ACTGGGTCCA 
TCAAGCGGTC 



60 
120 
180 
240 
300 
360 
420 
480 
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WO 03/042661 



GGGGTTATGT CTTTTGCATT TATTTGCCAC CATAACTCCT TCTTAGTTTA CAGTTCTCTA 540 

GAAGAACCCA CAGTAGCTAA GTGGTCCCGC CTTATCCATA TGTCCATCGT GATTTCTGTA 600 

TTTATCTGTA TATTCTTTGC TACATGTGGA TACTTGACAT TTACTGGCTT CACCCAAGGG 660 

GACTTATTTG AAAATTACTG CAGAAATGAT GACCTGGTAA CATTTGGAAG ATTTTGTTAT 720 

GGTGTCACTG TCATTTTGAC ATACCCTATG GAATGCTTTG TGACAAGAGA GGTAATTGCC 780 

AATGTGTTTT TTGGTGGGAA TCTTTCATCG GTTTTCCACA TTGTTGTAAC AGTGATGGTC 840 

ATCACTGTAG CCACGCTTGT GTCATTGCTG ATTGATTGCC TCGGGATAGT TCTAGAACTC 900 

AATGGTGTGC TCTGTGCAAC TCCCCTCATT TTTATCATTC CATCAGCCTG TTATCTGAAA 960 

CTGTCTGAAG AACCAAGGAC ACACTCCGAT AAGATTATGT CTTGTGTCAT GCTTCCCATT 1020 

GGTGCTGTGG TGATGGTTTT TGGATTCGTC ATGGCTATTA CAAATACTCA AGACTGCACC 1080 

CATGGGCAGQ AAATGTTCTA CTGCTTTCCT GACAATTTCT CTCTCACAAA TACCTCAGAG 1140 

TCTCATGTTC AGCAGACAAC ACAACTTTCT ACTTTAAATA TTAGTATCTT TCAACTCGAG 1200 
TAA 



Seq ID NO i 554 Protein sequence 
Protein Accession #: Eos sequence 



1 11 21 31 41 51 

I I I I I I 

MGYQRQEPVI PPQFSLVLLI KGGALSGTDT YQSLVNKTPG FPGYLLLSVIi QFLYPFIAMI 60 

SYNIIAGDTL SKVFQRIPGV DPENVFIGRH FIIGLSTVTF TLPLSLYRNI AKLGKVSLIS 120 

TGLTTLILGI VMARAISLGP HIPKTEDAWV FAKPNAIQAV GVMSFAPICH HNSFLVYSSL 180 

EEPTVAKWSR LIHMSIVISV FICIFFATCG YLTFTGFTQG DLFENYCRND DLVTFGRFCY 240 

GVTVILTYPM ECFVTREVIA NVFPGGNLSS VFHIWTVMV ITVATIiVSLL IDCLGIVLEL 300 

NGVLCATPLI FIIPSACYLK LSEEPRTHSD KIMSCVMLPI GAWMVFGFV MAITNTQDCT 360 
HGQEMFYCFP DNFSLTNTSE SHVQQTTQLS TLNISIFQLE 



Seq ID NO: 55S DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1.-1140 

1 11 21 31 41 51 

I I I I I I 

ATGGGCTACC AGAGGCAGGA GCCTGTCATC CCGCCGCAGG TCAATAAAAC TTTCGGCTTT 60 

CCAGGGTATC TGCTCCTCTC TGTTCTTCAG TTTTTGTATC CTTTTATAGC AATGATAAGT 120 

TACAATATAA TAGCTGGAGA TACTTTGAGC AAAGTTTTTC AAAGAATCCC AGGAGTTGAT 180 

CCTGAAAAOG TGTTTATTGG TCGCCACTTC ATTATTGGAC TTTCCACAGT TACCTTTACT 240 

CTGCCTTTAT CCTTGTACCG AAATATAGCA AAGCTTGGAA AGGTCTCCCT CATCTCTACA 300 

GGTTTAACAA CTCTGATTCT TGGAATTGTA ATGGCAAGGG CAATTTCACT GGGTCCACAC 360 

ATACCAAAAA CAGAAGACGC TTGGGTATTT GCAAAGCCCA ATGCCATTCA AGCGGTCGGG 420 

GTTATGTCTT TTGCATTTAT TTGCCACCAT AACTCCTTCT TAGTTTACAG TTCTCTAGAA 480 

GAACCCACAG TAGCTAAGTG GTCCCGCCTT ATCCATATGT CCATCGTGAT TTCTGTATTT 540 

ATCTGTATAT TCTTTGCTAC ATGTGGATAC TTGACATTTA CTGGCTTCAC CCAAGGGGAC 600 

TTATTTGAAA ATTACTGCAG AAATGATGAC CTGGTAACAT TTGGAAGATT TTGTTATGGT 660 

GTCACTGTCA TTTTGACATA CCCTATGGAA TGCTTTGTGA CAAGAGAGGT AATTGCCAAT 720 

GTGTTTTTTG GTGGGAATCT TTCATCGGTT TTCCACATTG TTGTAACAGT GATGGTCATC 780 

ACTGTAGCCA CGCTTGTGTC ATTGCTGATT GATTGCCTCG GGATAGTTCT AGAACTCAAT 840 

GGTGTGCTCT GTGCAACTCC CCTCATTTTT ATCATTCCAT CAGCCTGTTA TCTGAAACTG 900 

TCTGAAGAAC CAAGGACACA CTCCGATAAG ATTATGTCTT GTGTCATGCT TCCCATTGGT 960 

GCTGTGGTGA TGGTTTTTGG ATTCGTCATG GCTATTACAA ATACTCAAGA CTGCACCCAT 1020 

GGGCAGGAAA TGTTCTACTG CTTTCCTGAC AATTTCTCTC TCACAAATAC CTCAGAGTCT 1080 
CATGTTCAGC AGACAACACA ACTTTCTACT TTAAATATTA GTATCTTTCA ACTCGAGTAA 

Seq ID NO: 556 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I ! I I I 

MGYQRQEPVI PPQVNKTFGF PGYLLLSVLQ FLYPFIAMIS YNIIAGDTLS KVFQRIPGVD 60 

PENVFIGRHF IIGLSTVTFT LPLSLYRNIA KLGKVSLIST GLTTLILGIV MARAISLGPH 120 

IPKTEDAWVF AKPNAIQAVG VMSFAFICHH NSFLVYSSLE EPTVAKWSRL IHMSIVISVF 180 

ICIFFATCGY LTFTGFTQGD LFENYCRNDD LVTFGRFCYG VTVILTYPME CFVTREVIAN 240 

VFFGGNLSSV FHIWTVMVI TVATLVSLLI DCLGIVLEU* GVLCATPLIF IIPSACYLKL 300 

SEEPRTHSDK IMSCVMIiPIG AWMVFGFVM AITNTQDCTH GQEMFYCFPD NFSLTNTSES 360 
HVQQTTQLST LNISIFQLE 

Seq ID NO: 557 DNA sequence 
Nucleic Acid Accession #» XMJ)57188.1 
Coding sequence: 769.. 4269 

1 11 21 31 41 51 

I I I 1 I I 

ATGGGATTGC CTCTCCCTCT CACTCTGGGC TTCTGTCCCA CTCTTATCTT AGTGTCAGTC 60 

CTCCCCCAAG TCTGTGTCCC TCTCTCTCCC CTAAATCTCT GGCCCCTCCT TTCTGAGTTC 120 

CTGCCCTTCC CCCAATTCTT TGGTTTTTGC ATCCCCCTCT GCCCCTTGCC TCAGTCAAGT 180 

CTCCCCTGTG TGTCTCTCTC CCCCCGGCCC GGACCTCTGC ACCCCCCAGG TCGCTGTCCC 240 

TCTGTCCCCT TATCGCGGCC TGGGACCCGC CCTCTCCCCG CCTCCCGCTT TGGCGTCTCC 300 

AAGACTCCCC GCCCCCCAGA CCTCGCCCCG CCCCAGGCTA GGCTGGAAAG TGGAGGATCC 360 

GGTTTGCTCT GGGCGGGTCT GGAAGCAGAG CCGGCGGAGG GAGCGCCGGG GCCCTGGGCT 420 

GCAGGAGGTT GCGGCGGCCG CGGCAGCATG GTGGTGCCGG AGAAGGAGCA GAGCTGGATC 480 

CCCAAGATCT TCAAGAAGAA GACCTGCACG ACGTTCATAG TTGACTCCAC AGATCCGGGG 540 

GATGGGGTCT CGCTCTATTG CCCAGGCTGT TCTCAAACTC CTGGGCTCAA GCAGTCCTTC 600 

TGCCTTGACC TCCCAAAGTG CTTGGATTGT GCCCAGCCGT CCTTGAAGTT TTGCTCAGAA 660 

GAGCAAACTT TCTGGGAAGT AGCTGCAGGT GTTGGAAGTA GCTGCAGGGG AACTAGGGGA 720 
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TTCAAGGATG GAGCTGAAAT GGGTGAACGG ACAAAGTCGG TAAACTGAAT GGAGGATGCC 780 

TTCGGGGCAG CCGTGGTGAC CGTGTGGGAC AGCGATGCAC ACACCACGGA GAAGCCCACC 840 

GATGCCTACG GAGAGCTGGA CTTCACGGGG GCCGGCCGCA AGCACAGCAA TTTCCTCCGG 900 

CTCTCTGACC GAACGGATCC AGCTGCAGTT TATAGTCTGG TCACACGCAC ATGGGGCTTC 960 

CGTGCCCCGA ACCTGGTGGT GTCAGTGCTG GGGGGATCGG GGGGCCCCGT CCTCCAGACC X020 

TGGCTGCAGG ACCTGCTGOG TCGTGGGCTG GTGCGGGCTG CCCAGAGCAC AGGAGCCTGG 1080 

ATTGTCACTG GGGGTCTGCA CACGGGCATC GGCCGGCATG TTGGTGTGGC TGTACGGGAC 1140 

CATCAGATGG CCAGCACTGG GGGCACCAAG GTGGTGGCCA TGGGTGTGGC CCCCTGGGGT 1200 

GTGGTCCGGA ATAGAGACAC CCTCATCAAC CCCAAGGGCT CGTTCCCTGC GAGGTACCGG 1260 

TGGCGCGGTG ACCCGGAGGA CGGGGTCCAG TTTCCCCTGG ACTACAACTA CTCGGCCTTC 1320 

TTCCTGGTGG ACGACGGCAC ACAOGGCTGC CTGGGGGGCG AGAACCGCTT CCGCTTGCGC 1380 

CTGGAGTCCT ACATCTCACA GCAGAAGACG GGCGTGGGAG GGACTGGAAT TGACATCCCT 1440 

GTCCTGCTCC TCCTGATTGA TGGTGATGAG AAGATGTTGA CGCGAATAGA GAACGCCACC 1500 

CAGGCTCAGC TCCCATGTCT CCTCGTGGCT GGCTCAGGGG GAGCTGCGGA CTGCCTGGCG 1560 

GAGACCCTGG AAGACACTCT GGCCCCAGGG AGTGGGGGAG CCAGGCAAGG CGAAGCCCGA 1620 

GATCGAATCA GGCGTTTCTT TCCCAAAGGG GACCTTGAGG TCCTGCAGGC CCAGGTGGAG 1680 

AGGATTATGA CCCGGAAGGA GCTCCTGACA GTCTATTCTT CTGAGGATGG GTCTGAGGAA 1740 

TTCGAGACCA TAGTTTTGAA GGCCCTTGTG AAGGCCTGTG GGAGCTCGGA GGCCTCAGCC 1800 

TACCTGGATG AGCTGGGTTT GGCTGTGGCT TGGAACCGCG TGGACATTGC CCAGAGTGAA I860 

CTCTTTCGGG GGGACATCCA ATGGCGGTCC TTCCATCTCG AAGCTTCCCT CATGGACGCC 1920 

CTGCTGAATG ACCGGCCTGA GTTCGTGCGC TTGCTCATTT CCCACGGCCT CAGCCTGGGC 1980 

CACTTCCTGA CCCCGATGCG CCTGGCCCAA CTCTACAGCG CGGCGCCCTC CAACTCGCTC 2040 

ATCCGCAACC TTTTGGACCA GGCGTCCCAC AGCGCAGGCA CCAAAGCCCC AGCCCJAAAA 2100 

GGGGGAGCTG CGGAGCTCCG GCCCCCTGAC GTGGGGCATG TGCTGAGGAT GCTGCTGGGG 2160 

AAGATGTGCG CGCCGAGGTA CCCCTCCGGG GGCGCCTGGG ACCCTCACCC AGGCCAGGGC 2220 

TTCGGGGAGA GCATGTATCT GCTCTCGGAC AAGGCCACCT CGCCGCTCTC GCTGGATGCT 2280 

GGCCTCGGGC AGGCCCCCTG GAGCGACCTG CTTCTTTGGG CACTGTTGCT GAACAGGGCA 2340 

CAGATGGCCA TGTACTTCTG GGAGATGGGT TCCAATGCAG TTTCCTCAGC TCTTGGGGCC 2400 

TGTTTGCTGC TCCGGGTGAT GGCAOGCCTG GAGCCTGACG CTGAGGAGGC AGCACGGAGG 2460 

AAAGACCTGG CGTTCAAGTT TGAGGGGATG GGCGTTGACC TCTTTGGCGA GTGCTATCGC 2520 

AGCAGTGAGG TGAGGGCTGC CCGCCTCCTC CTCCGTCGCT GCCCGCTCTG GGGGGATGCC 2580 

ACTTGCCTCC AGCTGGCCAT GCAAGCTGAC GCCCGTGCCT TCTTTGCCCA GGATGGGGTA 2640 

CAGTCTCTGC TGACACAGAA GTGGTGGGGA GATATGGCCA GCACTACACC CATCTGGGCC 2700 

CTGGTTCTCG CCTTCTTTTG CCCTCCACTC ATCTACACCC GCCTCATCAC CTTCAGGAAA 2760 

TCAGAAGAGG AGCCCACACG GGAGGAGCTA GAGTTTGACA TGGATAGTGT CATTAATGGG 2820 

GAAGGGCCTG TCGGGACGGC GGACCCAGCC GAGAAGACGC CGCTGGGGGT CCCGCGCCAG 2880 

TCGGGCCGTC CGGGTTGCTG CGGGGGCGGC TGCGGGGGGC GCCGGTGCCT ACGCCGCTGG 2940 

TTCCACTTCT GGGGCGGGCC GGTGACCATC TTCATGGGCA ACGTGGTCAG CTACCTGCTG 3000 

TTCCTGCTGC TTTTCTCGCG GGTGCTGCTC GTGGATTTCC AGCCGGCGCC GCCCGGCTCC 3060 

CTGGAGCTGC TGCTCTATTT CTGGGCTTTC ACGCTGCTGT GCGAGGAACT GCGCCAGGGC 3120 

CTGAGCGGAG GCGGGGGCAG CCTCGCCAGC GGGGGCCCCG GGCCTGGCCA TGCCTCACTG 3180 

AGCCAGCGCC TGCGCCTCTA CCTCGCCGAC AGCTGGAACC AGTGCGACCT AGTGGCTCTC 3240 

ACCTGCTTCC TCCTGGGCGT GGGCTGCOGG CTGACCCCGG GTTTGTACCA CCTGGGCCGC 3300 

ACTGTCCTCT GCATCGACTT CATGGTTTTC ACGGTGCGGC TGCTTCACAT CTTCACGGTC 3360 

AACAAACAGC TGGGGCCCAA GATCGTCATC GTGAGCAAGA TGATGAAGGA CGTGTTCTTC 3420 

TTCCTCTTCT TCCTCGGCGT GTGGCTGGTA GCCTATGGCG TGGCCACGGA GGGGCTCCTG 3480 

AGGCCACGGG ACAGTGACTT CCCAAGTATC CTGCGCCGCG TCTTCTACCG TCCCTACCTG 3540 

CAGATCTTCG GGCAGATTCC CCAGGAGGAC ATGGAOGTGG CCCTCATGGA GCACAGCAAC 3600 

TGCTCGTOGG AGCCCGGCTT CTGGGCACAC CCTCCTGGGG CCCAGGCGGG CACCTGCGTC 3660 

TCCGAGTATG CCAACTGGCT GGTGGTGCTG CTCCTCGTCA TCTTCCTGCT CGTGGCCAAC 3720 

ATCCTGCTGG TCAACTTGCT CATTGCCATG TTCAGTTACA CATTCGGCAA AGTACAGGGC 3780 

AACAGCGATC TCTACTGGAA GGCGCAGCGT TACCGCCTCA TCGGGGAATT CCACTCTOGG 3840 

CCCGCGCTGG CCCCGCCCTT TATCGTCATC TCCCACTTGC GCCTCCTGCT CAGGCAATTG 3900 

TGCAGGCGAC CCCGGAGCCC CCAGCCGTCC TCCCCGGCCC TCGAGCATTT CCGGGTTTAC 3960 

CTTTCTAAGG AAGCCGAGCG GAAGCTGCTA ACGTGGGAAT CGGTGCATAA GGAGAACTTT 4020 

CTGCTGGCAC GCGCTAGGGA CAAGCGGGAG AGCGACTCCG AGCGTCTGAA GCGCACGTCC 4080 

CAGAAGGTGG ACTTGGCACT GAAACAGCTG GGAGACATCC GCGAGTACGA ACAGCGCCTG 4140 

AAAGTGCTGG AGCGGGAGGT CCAGCAGTGT AGCCGCGTCC TGGGGTGGGT GGCCGAGGCC 4200 

CTGAGCCGCT CTGCCTTGCT GCCCCCAGGT GGGCOGCCAC CCCCTGACCT GCCTGGGTCC 4260 

AAAGACTGAG CCCTGCTGGC GGACTTCAAG GAGAAGCCCC CACAGGGGAT TTTGCTCCTA 4320 

GAGTAAGGCT CATCTGGGCC TCGGCCCCCG CACCTGGTGG CCTTGTCCTT GAGGTGAGCC 4380 

CCATGTCCAT CTGGGCCACT GTCAGGACCA CCTTTGGGAG TGTCATCCTT ACAAACCACA 4440 

GCATGCCCGG CTCCTCCCAG AACCAGTCCC AGCCTGGGAG GATCAAGGCC TGGATCCCGG 4500 

GCCGTTATCC ATCTGGAGGC TGCAGGGTCC TTGGGGTAAC AGGGACCACA GACCCCTCAC 4560 

CACTCACAGA TTCCTCACAC TGGGGAAATA AAGCCATTTC AGAGGAAAAA AAAAAAAAAA 4620 
AAAAAAAAAA AAAAAAAAAA A 

Seq ID NO: 558 Protein sequence 
Protein Accession #: XP_057188.l 



l 
I 

MEDAFGAAW 
TWGFRAPNLV 
AVRDHQMAST 
YSAPFLVDDG 
ENATQAQLPC 
AQVERIMTRK 
AQSBLFRGDI 
SNSLIRNLLD 
PGQGFGESMY 
ALGACLLLRV 
WGDATCLQLA 
TFRKSEEEPT 



11 
I 

TVWDSDAHTT 
VSVLGGSGGP 
GGTKWAMGV 
THGCLGGENR 
LLVAGSGGAA 
ELLTVYSSED 
QWRSFHLEAS 
QASHSAGTKA 
LLSDKATSPL 
MARLEPDAEE 
MQADARAFFA 



21 
I 

EKPTDAYGEL 
VLQTWLQDLL 
APWGWRNRD 
FRLRLESYIS 
DCLAETLEDT 
GSEEFETIVL 
LMDALLNDRP 
PALKGGAAEL 
SLDAGLGQAP 
AARRKDLAFK 
QDGVQSLLTQ 
VINGEGPVGT 



31 
I 

DFTGAGRKHS 
RRGLVRAAQS 
TLINPKGSFP 
QQKTGVGGTG 
LAPGSGGARQ 
KALVKACGSS 
EFVRXiLISHG 
RPPDVGHVLR 
WSDLLLWALL 
FEGMGVDLFG 
KWWGDMASTT 
ADPAEKTPLG 



41 
I 

NFLRLSDRTD 
TGAWIVTGGL 
ARYRWRGDPE 
IDIPVLLLLI 
GEARDRIRRF 
EASAYLDELR 
LSLGHFLTPM 
MLLGKMCAPR 
LNRAQMAMYF 
ECYRSSEVRA 
PIWALVLAFF 
VPRQSGRPGC 



51 
I 

PAAVYSLVTR 
HTGI GRHVGV 
DGVQFPLDYN 
DGDEKMLTRI 
FPKGDLEVLQ 
LAVAWNRVDI 
RliAQLYSAAP 
YPSGGAHDPH 
WEKGSNAVSS 
ARLLJjRRCPL 
CPPLIYTRLI 
CGGRCGGRRC 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
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LRRWFHFWGA PVTIFMGNW SYLLFLLLFS RVLLVDFQPA PPGSLELLLY FWAFTLLCEE 780 

LRQGLSGGGG SLASGGPGPG HASLSQRLRL Y1ADSWNQCD LVALTCFLLG VGCRLTPGLY 840 

HLGRTVLCID FMVFTVRLLH IFTVNKQLGP KIVIVSKMMK DVFFFLFFLG VWLVAYGVAT 900 

EGLLRPRDSD FPSILRRVFY RPYLQIFGQI PQEDMDVALM EHSNCSSEPG FWAHPPGAQA 960 

GTCVSQYANW LWLLLVIFL LVANILLVNL LIAMFSYTFG KVQGNSDLYW KAQRYRLIRE 1020 

FHSRPALAPP FIVISHLRLL LRQLCRRPRS PQPSSPALEH FRVYLSKEAE RKLLTWESVH 1080 

KENFLLARAR DKRESDSERL KRTSQKVDLA LKQLGHIREY EQRLKVLERE VQQCSRVLGW 1140 
VAEALSRSAL LPPGGPPPPD LPGSKD 

Seq ID NO: 559 DNA sequence 

Nucleic Acid Accession #: NM_006853.1 

Coding sequence: 26.. 874 



„ _ 1 11 21 31 41 51 

15 | | | | I I 

AGGAATCTGC GCTCGGGTTC CGCAGATGCA GAGGTTGAGG TGGCTGCGGG ACTGGAAGTC 60 

ATCGGGCAGA GGTCTCACAG CAGCCAAGGA ACCTGGGGCC CGCTCCTCCC CCCTCCAGGC 120 

CATGAGGATT CTGCAGTTAA TCCTGCTTGC TCTGGCAACA GGGCTTGTAG GGGGAGAGAC 180 

CAGGATCATC AAGGGGTTCG AGTGCAAGCC TCACTCCCAG CCCTGGCAGG CAGCCCTGTT 240 

CGAGAAGACG CGGCTACTCT GTGGGGCGAC GCTCATCGCC CCCAGATGGC TCCTGACAGC 300 

AGCCCACTGC CTCAAGCCCC GCTACATAGT TCACCTGGGG CAGCACAACC TCCAGAAGGA 360 

GGAGGGCTGT GAGCAGACCC GGACAGCCAC TGAGTCCTTC CCCCACCCCG GCTTCAACAA 420 

CAGCCTCCCC AACAAAGACC ACCGCAATGA CATCATGCTG GTGAAGATGG CATCG CCAGT 4 80 

CTCCATCACC TGGGCTGTGC GACCCCTCAC CCTCTCCTCA CGCTGTGTCA CTGCTGGCAC 540 

CAGCTGCCTC ATTTCCGGCT GGGGCAGCAC GTCCAGCCCC CAGTTACGCC TGCCTCACAC 600 

CTTGCGATGC GCCAACATCA CCATCATTGA GCACCAGAAG TGTGAGAACG CCTACCCCGG 660 

CAACATCACA GACACCATGG TGTGTGCCAG CGTGCAGGAA GGGGGCAAGG ACTCCTGCCA 720 

GGGTGACTCC GGGGGCCCTC TGGTCTGTAA CCAGTCTCTT CAAGGCATTA TCTCCTGGGG 780 

CCAGGATCCG TGTGCGATCA CCOGAAAGCC TGGTGTCTAC ACGAAAGTCT GCAAATATGT 840 

GGACTGGATC CAGGAGACGA TGAAGAACAA TTAGACTGGA CCCACCCACC ACAGCCCATC 900 

ACCCTCCATT TCCACTTGGT GTTTGGTTCC TGTTCACTCT GTTAATAAGA AACCCTAAGC 960 

CAAGACCCTC TACGAACATT CTTTGGGCCT CCTGGACTAC AGGAGATGCT GTCACTTAAT 1020 

AATCAACCTG GGGTTCGAAA TCAGTGAGAC CTGGATTCAA ATTCTGCCTT GAAATATTGT 1080 

GACTCTGGGA ATGACAACAC CTGGTTTGTT CTCTGTTGTA TCCCCAGCCC CAAAGACAGC 1140 
TCCTGGCCAT ATATCAAGGT TTGAATAAAT ATTTGCTAAA TGAGTG 



Seq ID NO: 560 Protein sequence 
Protein Accession 8 : NP_006844 . l 

1 11 21 31 41 51 

I I I I I I 

MRILQLILLA IATGLVGGET RIIKGFECXP HSQPWQAALF EKTRLLCGAT LIAPRWLLTA 60 

AHCLKPRYIV HLGQHNLQKE EGCEQTRTAT ESFPHPGFNN SLPNKDHRND IMLVKMASPV 120 

SITWAVRPLT LSSRCVTAGT SCLISGWGST SSPQLRLPHT LRCANITIIE HQKCENAYPG 180 

NITDTMVCAS VQEGGKDSCQ GDSGGPLVCN QSLQGIISWG QDPCAITRKP GVYTKVCKYV 240 
DWIQETMKNN 

Seq ID NO: 561 DNA sequence 

Nucleic Acid Accession #: AY046419.1 

Coding sequence: 1..1743 

1 11 21 31 41 51 

I I 'I I I I 

ATGTTTACCT TCCTGTCATC TGTCACTGCT GCTGTCAGTG GCCTCCTGGT GGGTTATGAA 60 

CTTGGGATCA TCTCTGGGGC TCTTCTTCAG ATCAAAACCT TATTAGCCCT GAGCTGCCAT 120 

GAGCAGGAAA TGGTTGTGAG CTCCCTCGTC ATTGGAGCCC TCCTTGCCTC ACTCACCGGA 180 

GGGGTCCTGA TAGACAGATA TGGAAGAAGG ACAGCAATCA TCTTGTCATC CTGCCTGCTT 240 

GGACTCGGAA GCTTAGTCTT GATCCTCAGT TTATCCTACA CGGTTCTTAT AGTGGGACGC 300 

ATTGCCATAG GGGTTTCCAT CTCCCTCTCT TCCATTGCCA CTTGTGTTTA CATCGCAGAG 360 

ATTGCTCCTC AACACAGAAG AGGCCTTCTT GTGTCACTGA ATGAGCTGAT GATTGTCATC 420 

GGCATTCTTT CTGCCTATAT TTCAAATTAC GCATTTGCCA ATGTTTTCCA TGGCTGGAAG 480 

TACATGTTTG GTCTTGTGAT TCCCTTGGGA GTTTTGCAAG CAATTGCAAT GTATTTTCTT 540 

CCTCCAAGCC CTCGGTTTCT GGTGATGAAA GGACAAGAGG GAGCTGCTAG CAAGGTTCTT 600 

GGAAGGTTAA GAGCACTCTC AGATACAACT GAGGAACTCA CTGTGATCAA ATCCTCCCTG 660 

AAAGATGAAT ATCAGTACAG TTTTTGGGAT CTGTTTCGTT CAAAAGACAA CATGCGGACC 720 

CGAATAATGA TAGGACTAAC ACTAGTATTT TTTGTACAAA TCACTGGCCA ACCAAACATA 780 

TTGTTCTATG CATCAACTGT TTTGAAGTCA GTTGGATTTC AAAGCAATGA GGCAGCTAGC 840 

CTCGCCTCCA CTGGGGTTGG AGTCGTCAAG GTCATTAGCA CCATCCCTGC CACTCTTCTT 900 

GTAGACCATG TCGGCAGCAA AACATTCCTC TGCATTGGCT CCTCTGTGAT GGCAGCTTCG 960 

TTGGTGACCA TGGGCATCGT AAATCTCAAC ATCCACATGA ACTTCACCCA TATCTGCAGA 1020 

AGCCACAATT CTATCAACCA GTCCTTGGAT GAGTCTGTGA TTTATGGACC AGGAAACCTG 1080 

TCAACCAACA ACAATACTCT CAGAGACCAC TTCAAAGGGA TTTCTTCCCA TAGCAGAAGC 1140 

TCACTCATGC CCCTGAGAAA TGATGTGGAT AAGAGAGGGG AGACGACCTC AGCATCCTTG 1200 

CTAAATGCTG GATTAAGCCA CACTGAATAC CAGATAGTCA CAGACCCTGG GGACGTCCCA 1260 

GCTTTTTTGA AATGGCTGTC CTTAGCCAGC TTGCTTGTTT ATGTTGCTGC TTTTTCAATT 1320 

GGTCTAGGAC CAATGCCCTG GCTGGTGCTC AGCGAGATCT TTCCTGGTGG GATCAGAGGA 1380 

OGAGCCATGG CTTTAACTTC TAGCATGAAC TGGGGCATCA ATCTCCTCAT CTCGCTGACA 1440 

TTTTTGACTG TAACT GATCT TATTGGCCTG CCATGGGTGT GCTTTATATA TACAATCATG 1500 

AGTCTAGCAT CCCTGCTTTT TGTTGTTATG TTTATACCTG AGACAAAGGG ATGCTCTTTG 1560 

GAACAAATAT CAATGGAGCT AGCAAAAGTG AACTATGTGA AAAACAACAT TTGTTTTATG 1620 

AGTCATCACC AAGAAGAATT AGTGCCAAAA CAGCCTCAAA AAAGAAAACC CCAGGAGCAG 1680 

CTCTTGGAGT GTAACAAGCT GTGTGGTAGG GGCCAATCCA GGCAGCTTTC TCCAGAGACC 1740 
TAA 
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Seq ID NO: 562 Protein sequence 
Protein Accession ft: AAL02327.1 

1 11 21 31 41 SI 

5 i t | l 1 l 

MPTFLSSVTA AVSGLLVGYE LGIISGALLQ IKTLLALSCH EQEMWSSLV IGALLASLTG 60 

GVLIDRYGRR TAIILSSCLL GLGSbVLILS LSYTVLIVGR IAIGVSISLS SIATCVYIAE 120 

IAPQHRRGLL VSLNELMIVI GILSAYISNY AFANVFHGWK YMFGbVIPLG VLQAIAMYFL 1B0 

PPSPRFLVMK GQEGAASKVL GRLRALSDTT EELTVIKSSL KDEYQYSFWD LFRSKDNMRT 240 

10 RIMIGLTLVP PVQITGQPNI LFYASTVLKS VGFQSKEAAS LASTGVGWK VISTIPATLL 300 

VDHVGSKTFL CIGSSVMAAS LVTMGIVNLN IHMNFTHICR SHNSINQSLD ESVIYGPGNL 360 

STNNNTLRDH FKGISSHSRS SLMPLRNDVD KRGETTSASL LNAGLSHTEY QIVTDPGDVP 420 

AFLKWLSLAS LLVYVAAFSI GLGPMPWLVL SEIFPGGIRG RAMALTSSMN WGINLLISLT 480 

FLTVTDLIGL PWVCFIYTIM SLASLLFWM FIPETKGCSL EQISMELAKV NYVKNNICFM 540 

15 SHHQEELVPK QPQKRKPQEQ LLECNKLCGR GQSRQLSPET 

Seq ID NO; S63 DNA sequence 

Nucleic Acid Accession #: XM_059466.1 

Coding sequence: 1..894 

1 11 21 31 41 51 

I I I I I I 

ATGGAGCCGC GGGCGCTCGT CACGGCGCTC AGCCTCGGCC TCAGCCTGTG CTCCCTGGGG 60 

CTGCTCGTCA CGGCCATCTT CACCGACCAC TGGTACGAGA CCGACCCCCG GCGCCACAAG 120 

25 GAGAGCTGCG AGCGCAGCCG CGCGGGCGCC GACCCCCCGG ACCAGAAGAA CCGCCTGATG 180 

CCGCTGTCGC ACCTGCCGCT GCGGGACTCG CGCCCGCTGG GGCGCCGGCT GCTCCGGGGC 240 

GGCCCGGGGC GCGCCGACCC CGAGTCCTGG CX3CTCGCTCC TGGGGCTCGG CGGGCTGGAC 300 

GCCGAGTGCG GCGGGCCCCT CTTCGCCACC TACTCGGGCC TCTGGAGGAA GTGCTACTTC 360 

CTGGGCATCG ACCGGGACAT CGACACCCTC ATCCTGAAAG GTATTGCGCA GCGATGCACG 420 

30 GCCATCAAGT ACCACTTTTC TCAGCCCATC CGCTTGCGAA ACATTCCTTT TAATTTAACC 480 

AAGACCATAC AGCAAGATGA GTGGCACCTG CTTCATTTAA GAAGAATCAC TGCTGGCTTC 540 

CTCGGCATGG CCGTAGCCGT CCTTCTCTGC GGCTGCATTG TGGCCACAGT CAGTTTCTTC 600 

TGGGAGGAGA GCTTGACCCA GCACGTGGCT GGACTCCTGT TCCTCATGAC AGGGATATTT 660 

TGCACCATTT CCCTCTGTAC TTATGCCGCC AGTATCTCGT ATGATTTGAA CCGGCTCCCA 720 

35 AAGCTAATTT ATAGCCTGCC TGCTGATGTG GAACATGGTT ACAGCTGGTC CATCTTTTGC 780 

GCCTGGTGCA GTTTAGGCTT TATTGTGGCA GCTGGAGGTC TCTGCATCGC TTATCCGTTT 840 
ATTAGCCGGA CCAAGATTGC ACAGCTAAAG TCTGGCAGAG ACTCCACGGT ATGA 



40 



Seq ID NO: 564 Protein sequence 
Protein Accession #; XP 059466.1 



1 11 21 31 41 51 

I I I I I I 

MEPRALVTAL SLGLSLCSLG LLVTAIFTDH WYETDPRRHK ESCERSRAGA DPPDQKNRLM 60 

45 PLSHLPLRDS PPLGRRLLPG GPGRADPESW RSLLGLGGLD AECGRPLFAT YSGLWRKCYF 120 

LGIDRDIDTL ILKGLAQRCT AIKYHFSQPI RLRNIPFNLT KTIQQDEWHIi LHLRRITAGF 180 

LGMAVAVLLC GCIVATVSFF WEESLTQHVA GLLFLMTGIF CTISLCTYAA SISYDLNRLP 240 
KLIYSLPADV EHGYSWSIFC AWCSLGF I VA AGGLCIAYPF ISRTKIAQLK SGRDSTV 

50 Seq ID NO: 565 DNA sequence 

Nucleic Acid Accession #; Eos sequence 
Coding sequence; 1..3315 

1 11 21 31 41 51 

55 | | | | | | 

ATGTCCTTTC GGGCAGCCAG GCTCAGCATG AGGAACAGAA GGAATGACAC TCTGGACAGC 60 

ACCCGGACCC TGTACTCCAG CGCGTCTCGG AGCACAGACT TGTCTTACAG TGAAAGCGAC 120 

TTGGTGAATT TTATTCAAGC AAATTTTAAG AAACGAGAAT GTGTCTTCTT TACCAAAGAT 180 

TCCAAGGCCA CGGAGAATGT GTGCAAGTGT GGCTATGCCC AGAGCCAGCA CATGGAAGGC 240 

60 ACCCAGATCA ACCAAAGTGA GAAATGGAAC TACAAGAAAC ACACCAAGGA ATTTCCTACC 300 

GACGCCTTTG GGGATATTCA GTTTGAGACA CTGGGGAAGA AAGGGAAGTA TATACGTCTG 360 

TCCTGCGACA CGGACGCGGA AATCCTTTAC GAGCTGCTGA CCCAGCACTG GCACCTGAAA 420 

ACACCCAACC TGGTCATTTC TGTGACOGGG GGCGCCAAGA ACTTCGCCCT GAAGCCGCGC 480 

ATGCGCAAGA TCTTCAGCOG GCTCATCTAC ATCGCGCAGT OCAAAGGTGC TTGGATTCTC 540 

65 ACGGGAGGCA CCCATTATGG CCTGATGAAG TACATCGGGG AGGTGGTGAG AGATAACACC 600 

ATCAGCAGGA GTTCAGAGGA GAATATTGTG GCCATTGGCA TAGCAGCTTG GGGCATGGTC 660 

TCCAACCGGG ACACCCTCAT CAGGAATTGC GATGCTGAGG GCTATTTTTT AGCCCAGTAC 720 

CTTATGGATG ACTTCACAAG AGATCCACTG TATATCCTGG ACAACAACCA CACACATTTG 780 

CTGCTCGTGG ACAATGGCTG TCATGGACAT CCCACTGTCG AAGCAAAGCT CCGGAATCAO 840 

70 CTAGAGAAGT ATATCTCTGA GCGCACTATT CAAGATTCCA ACTATGGTGG CAAGATCCCC 900 

ATTGTGTGTT TTGCCCAAGG AGGTGGAAAA GAGACTTTGA AAGCCATCAA TACCTCCATC 960 

AAAAATAAAA TTCCTTGTGT GGTGGTGGAA GGCTCGGGCC AGATCGCTGA TGTGATCGCT 1020 

AGCCTGGTGG AGGTGGAGGA TGCCCTGACA TCTTCTGCCG TCAAGGAGAA GCTGGTGCGC 1080 

TTTTTACCCC GCACGGTGTC CCGGCTGCCT GAGGAGGAGA CTGAGAGTTG GATCAAATGG 1140 

75 CTCAAAGAAA TTCTCGAATG TTCTCACCTA TTAACAGTTA TTAAAATGGA AGAAGCTGGG 1200 

GATGAAATTG TGAGCAATGC CATCTCCTAC GCTCTATACA AAGCCTTCAG CACCAGTGAG 1260 

CAAGACAAGG ATAACTGGAA TGGGCAGCTG AAGCTTCTGC TGGAGTGGAA CCAGCTGGAC 1320 

TTAGCCAATG ATGAGATTTT CACCAATGAC CGCCGATGGG AGTCTGCTGA CCTTCAAGAA 1380 

GTCATGTTTA CGGCTCTCAT AAAGGACAGA CCCAAGTTTG TCCGCCTCTT TCTGGAGAAT 1440 

80 GGCTTGAACC TACGGAAGTT TCTCACCCAT GATGTCCTCA CTGAACTCTT CTCCAACCAC 1500 

TTCAGCACGC TTGTGTACCG GAATCTGCAG ATCGCCAAGA ATTCCTATAA TGATGCCCTC 1560 

CTCACGTTTG TCTGGAAACT GGTTGCGAAC TTCCGAAGAG GCTTCCGGAA GGAAGACAGA 1620 

AATGGCCGGG ACGAGATGGA CATAGAACTC CACGACGTGT CTCCTATTAC TCGGCACCCC 1680 

CTGCAAGCTC TCTTCATCTG GGCCATTCTT CAGAATAAGA AGGAACTCTC CAAAGTCATT 1740 
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TGGGAGCAGA 
CTGGCCAAAG 
TACGAGACCC 
GAACAGCTGC 
GTGGAGGCCA 
CAATGGTATG 
ATTATACCCT 
AAGAAGCTGC 
AATGTGGTCT 
CATTCGGTGC 
GATGAAGTGA 
ATGGACACGC 
AATAAAAGCT 
CTAAGATTGA 
CAGAGGATGC 
TTTGGCGTGG 
CGTTCGGTCA 
GGTACCACGT 
GTGGAGCTGG 
TGCATCTACA 
TACACGGTGG 
CTGGTGCAGG 
TTCTACATGG 
TCTGTCTGCT 
GAAAACTACC 
CGATTTAGAC 
AATAAAATCA 



CCAGGGGCTG 
TGAAGAACGA 
GGGCTGTTGA 
TGGTCTATTC 
CAGACCAGCA 
GAGAGATTTC 
TGGTGGGCTG 
TTTGGTACTA 
TCTACATCGC 
CACACCCCCC 
GACAGTGGTA 
TGGGGCTTTT 
CTTTGTATTC 
TCCACATTTT 
TGATCGATGT 
CCAGGCAAGG 
TCTACGAGCC 
ATGACTTTGC 
ATGAGCACAA 
TGTTATCCAC 
GCACCGTCCA 
AGTACTGCAG 
TGGTGAAGAA 
GTTTCAAAAA 
TTGTCAAGAT 
AACTGGATAC 
AATGA 



CACTCTGGCA 
CATCAATGCT 
GCTGTTCACT 
CTGTGAAGCT 
TTTCATCGCC 
CCGAGACACC 
TGGCTTTGTA 
TGTGGCGTTC 
CTTCCTCCTG 
CGAGCTGGTC 
CGTAAATGGG 
TTACTTCATA 
TGGACGAGTC 
TACTGTAAGC 
GTTCTTCTTC 
GATCCTTAGG 
CTACCTGGCC 
CCACTGCACC 
CCTGCCCCGG 
CAACATCCTG 
GGAGAACAAT 
CCGCCTCAAT 
GTGCTTCAAG 
TGAAGACAAT 
CAACACAAAA 
AAAGCTTAAT 



GCCCTGGGAG 
GCTGGGGAGT 
GAGTGTTACA 
TGGGGTGGAA 
CAGCCTGGGG 
AAGAACTGGA 
TCATTTAGGA 
TTCACCTCCC 
CTGTTTGCCT 
CTGTACTCGC 
GTGAATTATT 
GCAGGAATTG 
ATTTTCTGTC 
AGAAACTTAG 
CTGTTCCTCT 
CAGAATGAGC 
ATGTTCGGCC 
TTCACTGGGA 
TTCCCCGAGT 
CTGGTCAACC 
GACCAGGTCT 
ATCCCCTTCC 
TGTTGCTGCA 
GAGACTCTGG 
GCCAACGACA 
GATCTCAAGG 



CCAGCAAGCT 
CCGAGGAGCT 
GCAGCGATGA 
GCAACTGTCT 
TCCAGAATTT 
AGATTATCCT 
AGAAACCTGT 
CCTTCGTGGT 
ACGTGCTGCT 
TGGTCTTTGT 
TTACTGACCT 
TATTTCGGCT 
TGGACTACAT 
GACCCAAGAT 
TTGCGGTGTG 
AGCGCTGGAG 
AGGTGCCCAG 
ATGAGTCCAA 
GGATCACCAT 
TGCTGGTCGC 
GGAAGTTCCA 
CCTTCATCGT 
AGGAGAAAAA 
CATGGGAGGG 
CCTCAGAGGA 
GTCTTCTGAA 



TCTGAAGACT 
GGCTAATGAG 
AGACTTGGCA 
GGAGCTGGCG 
TCTTTCTAAG 
GTGTCTGTTT 
CGACAAGCAC 
CTTCTCCTGG 
CATGGATTTC 
CCTCTTCTGT 
GTGGAATGTG 
CCACTCTTCT 
TATTTTCACT 
TATAATGCTG 
GATGGTGGCC 
GTGGATATTC 
TGACGTGGAT 
GCCACTGTGT 
CCCCCTGGTG 
CATGTTTGGC 
GAGGTACTTC 
CTTCGCTTAC 
CATGGAGTCT 
TGTCATGAAG 
AATGAGGCAT 
AGAGATTGCT 



Seq ID NO: 566 Protein sequence 
Protein Accession #: Eos sequence 



MSFRAARLSM 
SKATENVCKC 
SCDTDAEILY 
TGGTHYGIWK 
LMDDFTRDPL 
IVCFAQGGGK 
FLPRTVSRLP 
QDKDNWNGQL 
GLNLRKFLTH 
NGRDEMDIEL 
LAKVKNDINA 
VEATDQHFIA 
KKLLWYYVAF 
DBVRQWYVNG 
LRLIHIFTVS 
RSVIYEPYLA 
CIYMLSTNIL 
FYMWKKCFK 
RFRQLDTKLN 



11 

I 

RNRRNDTLDS 
GYAQSQHMEG 
ELLTQHWHLK 
YIGEWRDNT 
YILDNNHTHL 
BTLKAINTSI 
EEETESWIKW 
KLLLEWNQLD 
DVLTELFSNH 
HDVSPITRHP 
AGESEELANE 
QPGVQNFLSK 
FTSPFWFSW 
VNYFTDLWNV 
RNLGPKIIML 
MFGQVPSDVD 
IiVNLIiVAMFG 
CCCKEKNMES 
DLKGLLKEIA 



21 
I 

TRTLYSSASR 
TQINQSEKWN 
TPNLVISVTG 
ISRSSEENIV 
LLVDNGCHGH 
KNKIPCWVE 
LKEILECSHL 
LANDEIFTND 
FSTLVYRHLQ 
LQALFIWAIL 
YETRAVELFT 
QWYGEISRDT 
NWFYIAFLL 
MDTLGLFYFI 
QRMLIDVFFF 
GTTYDFAHCT 
YTVGTVQENN 
SVCCFKNEDN 
NKIK 



31 
I 

STDLSYSESD 
YKKHTKEFPT 
GAKHFALKPR 
AIGIAAWGMV 
PTVEAKLRNQ 
GSGQIADVIA 
LTVIKMEEAG 
RRWESADLQE 
IAKNSYNDAL 
QNKKBLSKVI 
ECYSSDEDLA 
KNWKIILCLF 
LFAYVIiIKDF 
AGIVFRLHSS 
LFLFAVWMVA 
FTGNESKPLC 
DQVWKFQRYF 
ETLAWEGVMK 



41 
I 

LVNFIQANFK 
DAFGDIQFET 
MRKIFSRLIY 
SNRDTLIRNC 
LEKYISERTI 
SLVEVEDALT 
DEIVSNAISY 
VMFTALIKDR 
LTFVWKLVAN 
WEQTRGCTLA 
EQLLVYSCEA 
IIPLVGCGFV 
HSVPHPPELV 
NKSSLYSGRV 
FGVARQGILR 
VELDEHNLPR 
LVQEYCSRLN 
ENYLVKINTK 



Seq ID NO: 567 DNA sequence 

Nucleic Acid Accession #: NM_0 06 911.1 

Coding sequence: 1..558 



1800 
1B60 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 



51 
I 

KRECVFFTKD 
LGKKGXYIRIi 
IAQSKGAWIL 
DAEGYFLAQY 
QDSNYGGKIP 
SSAVKEKLVR 
ALYKAFSTSE 
PKFVRLFLEN 
FRRGFRKEDR 
ALGASKLLKT 
WGGSNCLELA 
SFRKKPVDKH 
LYSLVFVLFC 
IFCLDYIIFT 
QNEQRWRWIF 
FPEWITIPLV 
IPFPFIVFAY 



ATGCCTCGCC 
AGAGCAGTCG 
CGCGCGCAGA 
GATGCTCCTC 
ACAGAAACTA 
GCCCTATCTG 
GATTCCAATC 
GCAGACAGCA 
AGACGACCCT 
CTTGCTAAAT 



11 
I 

TGTTCTTGTT 
CGGCCAAATG 
TTGCCATTTG 
AGACACCTAG 
TAATTATCAT 
AGAGGCAACC 
TTAGCTTTGA 
ATCCTTCAGA 
ACGTGGCACT 
ATTGCTGA 



21 
I 

CCACCTGCTA 
GAAGGACGAT 
CGGCATGAGC 
ACCAGTGGCA 
GTTGGAATTC 
ATCATTACCA 
AGAATTTAAG 
ATTAAAATAC 
GTTTGAGAAA 



31 
I 

GAATTCTGTT 
GTTATTAAAT 
ACCTGGAGCA 
GAAATTGTAC 
ATTGCTAATT 
GAGCTACAGC 
AAACTTATTC 
TTAGGCTTGG 
TGTTGCCTAA 



41 
I 

TACTACTGAA 
TATGCGGCCG 
AAAGGTCTCT 
CATCCTTCAT 
TGCCACCGGA 
AGTATGTACC 
GCAATAGGCA 
ATACTCATTC 
TTGGTTGTAC 



Seq ID NO: 568 Protein sequence 
Protein Accession #: NP_008842.1 



51 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



51 
I 

CCAATTTTCC 
CGAATTAGTT 
GAGCCAGGAA 
CAACAAAGAT 
GCTGAAGGCA 
TGCATTAAAG 
AAGTGAAGCC 
TCAAAAAAAG 
CAAAAGGTCT 



60 
120 
180 
240 
300 
360 
420 
480 
540 



1 11 21 31 41 

I I I I.I 1 

MPRLFLFHLL BFCLLLNQFS RAVAAKWKDD VIKLCGRELV RAQIAICGMS TWSKRSLSQE 
DAPQTPRPVA EIVPSF1NKD TETIIIKLEF IANLPPELKA ALSERQPSLP ELQQYVPALK 
DSNLSFEEFK KLIRNRQSEA ADSNPSBLKY LGLDTHSQKK RRPYVALFEK CCLIGCTKRS 
LAKYC 



60 
120 
180 



Seq ID NO: 569 DNA sequence 

Nucleic Acid Accession ft: XM_0364S3.1 



1170 



WO 03/042661 



Coding sequence: 1..3978 

1 11 21 31 41 51 

I I I I I I 

ATGCTGCCCG TGTACCAGGA GGTGAAGCCC AACCCGCTGC AGGACGCGAA CCTCTGCTCA 60 

CGCGTGTTCT TCTGGTGGCT CAATCCCTTG TTTAAAATTG GCCATAAACG GAGATTAGAG 120 

GAAGATGATA TGTATTCAGT GCTGCCAGAA GACCGCTCAC AGCACCTTGG AGAGGAGTTG 180 

CAAGGGTTCT GGGATAAAGA AGTTTTAAGA GCTGAGAATG AOGCACAGAA GCCTTCTTTA 240 

ACAAGAGCAA TCATAAAGTG TTACTGGAAA TCTTATTTAG TTTTGGGAAT TTTTACGTTA 300 

ATTGAGGAAA GTGCCAAAGT AATCCAGCCC ATATTTTTGG GAAAAATTAT TAATTATTTT 360 

GAAAATTATG ATCCCATGGA TTCTGTGGCT TTGAACACAG CGTACGCCTA TGCCACGGTG 420 

CTGACTTTTT GCACGCTCAT TTTGGCTATA CTGCATCACT TATATTTTTA TCACGTTCAG 480 

TGTGCTGGGA TGAGGTTACG AGTAGCCATG TGCCATATGA TTTATCGGAA GGCACTTCGT 540 

CTTAGTAACA TGGCCATGGG GAAGACAACC ACAGGCCAGA TAGTCAATCT GCTGTCCAAT 600 

GATGTGAACA AGTTTGATCA GGTGACAGTG TTCTTACACT TCCTGTGGGC AGGACCACTG 660 

CAGGGGATCG CAGTGACTGC CCTACTCTGG ATGGAGATAG GAATATCGTG CCTTGCTGQG 720 

ATGGCAGTTC TAATCATTCT CCTGCCCTTG CAAAGCTGTT TTGGGAAGTT GTTCTCATCA 780 

CTGAGGAGTA AAACTGCAAC TTTCACGGAT GCCAGGATCA GGACCATGAA TGAAGTTATA 840 

ACTGGTATAA GGATAATAAA AATGTACGCC TGGGAAAAGT CATTTTCAAA TCTTATTACC 900 

AATTTGAGAA AGAAGGAGAT TTCCAAGATT CTGAGAAGTT CCTGCCTCAG GGGGATGAAT 960 

TTGGCTTCGT TTTTCAGTGC AAGCAAAATC ATCGTGTTTG TGACCTTCAC CACCTACGTG 1020 

CTCCTCGGCA GTGTGATCAC AGCCAGCCGC GTGTTOGTGG CAGTGACGCT GTATGGGGCT 1080 

GTGCGGCTGA CGGTTACCCT CTTCTTCCCC TCAGCCATTG AGAGGGTGTC AGAGGCAATC 1140 

GTCAGCATCC GAAGAATCCA GACCTTTTTG CTACTTGATG AGATATCACA GCGCAACCGT 1200 

CAGCTGCCGT CAGATGGTAA AAAGATGGTG CATGTGCAGG ATTTTACTGC TTTTTGGGAT 1260 

AAGGCATCAG AGACCCCAAC TCTACAAGGC CTTTCCTTTA CTGTCAGACC TGGCGAATTG 1320 

TTAGCTGTGG TCGGCCCCGT GGGAGCAGGG AAGTCATCAC TGTTAAGTGC CGTGCTCGGG 1380 

GAATTGGCCC CAAGTCACGG GCTGGTCAGC GTGCATGGAA GAATTGCCTA TGTGTCTCAG 1440 

CAGCCCTGGG TGTTCTCGGG AACTCTGAGG AGTAATATTT TATTTGGGAA GAAATACGAA 1500 

AAGGAACGAT ATGAAAAAGT CATAAAGGCT TGTGCTCTGA AAAAGGATTT ACAGCTGTTG 1560 

GAGGATGGTG ATCTGACTGT GATAGGAGAT CGGGGAACCA CGCTGAGTGG AGGGCAGAAA 1620 

GCACGGGTAA ACCTTGCAAG AGCAGTGTAT CAAGATGCTG ACATCTATCT CCTGGACGAT 1680 

CCTCTCAGTG CAGTAGATGC GGAAGTTAGC AGACACTTGT TCGAACTGTG TATTTGTCAA 1740 

ATTTTGCATG AGAAGATCAC AATTTTAGTG ACTCATCAGT TGCAGTACCT CAAAGCTGCA 1800 

AGTCAGATTC TGATATTGAA AGATGGTAAA ATGGTGCAGA AGGGGACTTA CACTGAGTTC 1860 

CTAAAATCTG GTATAGATTT TGGCTCCCTT TTAAAGAAGG ATAATGAGGA AAGTGAACAA 1920 

CCTCCAGTTC CAGGAACTCC CACACTAAGG AATCGTACCT TCTCAGAGTC TTCGGTTTGG 1980 

TCTCAACAAT CTTCTAGACC CTCCTTGAAA GATGGTGCTC TGGAGAGCCA AGATACAGAG 2040 

AATGTCCCAG TTACACTATC AGAGGAGAAC CGTTCTGAAG GAAAAGTTGG TTTTCAGGCC 2100 

TATAAGAATT ACTTCAGAGC TGGTGCTCAC TGGATTGTCT TCATTTTCCT TATTCTCCTA 2160 

AACACTGCAG CTCAGGTTGC CTATGTGCTT CAAGATTGGT GGCTTTCATA CTGGGCAAAC 2220 

AAACAAAGTA TGCTAAATGT CACTGTAAAT GGAGGAGGAA ATGTAACCGA GAAGCTAGAT 2280 

CTTAACTGGT ACTTAGGAAT TTATTCAGGT TTAACTGTAG CTACCGTTCT TTTTGGCATA 2340 

GCAAGATCTC TATTGGTATT CTACGTCCTT GTTAACTCTT CACAAACTTT GCACAACAAA 2400 

ATGTTTGAGT CAATTCTGAA AGCTCCGGTA TTATTCTTTG ATAGAAATCC AATAGGAAGA 2460 

ATTTTAAATC GTTTCTCCAA AGACATTGGA CACTTGGATG ATTTGCTGCC GCTGACGTTT 2520 

TTAGATTTCA TCCAGACATT GCTACAAGTG ■GTTGGTGTGG TCTCTGTGGC TGTGGCCGTG 2580 

ATTCCTTGGA TCGCAATACC CTTGGTTCCC CTTGGAATCA TTTTCATTTT TCTTCGGCGA 2640 

TATTTTTTGG AAACGTCAAG AGATGTGAAG CGCCTGGAAT CTACAACTCG GAGTCCAGTG 2700 

TTTTCCCACT TGTCATCTTC TCTCCAGGGG CTCTGGACCA TCCGGGCATA CAAAGCAGAA 2760 

GAGAGGTGTC AGGAACTGTT TGATGCACAC CAGGATTTAC ATTCAGAGGC TTGGTTCTTG 2820 

TTTTTGACAA CGTCCCGCTG GTTCGCCGTC CGTCTGGATG CCATCTGTGC CATGTTTGTC 2880 

ATCATCGTTG CCTTTGGGTC CCTGATTCTG GCAAAAACTC TGGATGCCGG GCAGGTTGGT 2940 

TTGGCACTGT CCTATGCCCT CACGCTCATG GGGATGTTTC AGTGGTGTGT TCGACAAAGT 3000 

GCTGAAGTTG AGAATATGAT GATCTCAGTA GAAAGGGTCA TTGAATACAC AGACCTTGAA 3060 

AAAGAAGCAC CTTGGGAATA TCAGAAACGC CCACCACCAG CCTGGCCCCA TGAAGGAGTG 3120 

ATAATCTTTG ACAATGTGAA CTTCATGTAC AGTCCAGGTG GGCCTCTGGT ACTGAAGCAT 3180 

CTGACAGCAC TCATTAAATC ACAAGAAAAG GTTGGCATTG TGGGAAGAAC CGGAGCTGGA 3240 

AAAAGTTCCC TCATCTCAGC CCTTTTTAGA TTGTCAGAAC CCGAAGGTAA AATTTGGATT 3300 

GATAAGATCT TGACAACTGA AATTGGACTT CACGATTTAA GGAAGAAAAT GTCAATCATA 3360 

CCTCAGGAAC CTGTTTTGTT CACTGGAACA ATGAGGAAAA ACCTGGATCC CTTTAATGAG 3420 

CACACGGATG AGGAACTGTG GAATGCCTTA CAAGAGGTAC AACTTAAAGA AACCATTGAA 3480 

GATCTTCCTG GTAAAATGGA TACTGAATTA GCAGAATCAG GATCCAATTT TAGTGTTGGA 3540 

CAAAGACAAC TGGTGTGCCT TGCCAGGGCA ATTCTCAGGA AAAATCAGAT ATTGATTATT 3600 

GATGAAGCGA CGGCAAATGT GGATCCAAGA ACTGATGAGT TAATACAAAA AAAAATCCGG 3660 

GAGAAATTTG CCCACTGCAC CGTGCTAACC ATTGCACACA GATTGAACAC CATTATTGAC 3720 

AGCGACAAGA TAATGGTTTT AGATTCAGGA AGACTGAAAG AATATGATGA GCCGTATGTT 3780 

TTGCTGCAAA ATAAAGAGAG CCTATTTTAC AAGATGGTGC AACAACTGGG CAAGGCAGAA 3840 

GCCGCTGCCC TCACTGAAAC AGCAAAACAG GTATACTTCA AAAGAAATTA TCCACATATT 3900 

GGTCACACTG ACCACATGGT TACAAACACT TCCAATGGAC AGCCCTCGAC CTTAACTATT 3960 
TTCGAGACAG CACTGTGA 



Seq ID NO: 570 Protein sequence 
Protein Accession #: XP_036453.l 

1 11 21 31 41 51 

I I I I I I 

MLPVYQEVKP NPLQDANLCS RVFFWWLNPL FKIGHKRRLE EDDMYSVLPE DRSQHLGEEL 60 
QGFWDKEVLR AENDAQKPSL TRAIIKCYWK SYLVLGIFTL IEESAKVIQP IFLGKIINYF 120 
ENYDPMDSVA LNTAYAYATV LTFCTLILAI LHHLYFYHVQ CAGMRLRVAM CHMIYRKALR 180 
LSNMAMGKTT TGQIVNLLSN DVNKFDQVTV FLHFLWAGPL QAIAVTALLW MEIGISCLAG 240 
MAVLIILLPL QSCFGKLFSS LRSKTATFTD ARIRTMNEVI TGIRIIKMYA WEKSFSNLIT 300 
NLRKKEISKI LRSSCLRGMN LASFFSASKI IVFVTFTTYV LLGSVITASR VFVAVTLYGA 360 
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VRLTVTLFFP 
KASETPTLQG 
QPWVFSGTLR 
ARVNLARAVY 
SQILILKDGK 
SQQSSRPSLK 
KTAAQVAYVL 
ARSLLVFYVL 
LDFIQTLLQV 
FSHLSSSLQG 
IIVAFGSLIL 
KEAPWEYQKR 
KSSLISALFR 
HTDEELWNAI* 
DEATANVDPR 
LLQNKESLFY 
FETAL 



SAIERVSEAI 
LSFTVRPGEL 
SNILFGKKYE 
QDADIYLLDD 
MVQKGTYTEF 
DGALESQDTE 
QDWWLSYWAN 
VNSSQTLHNK 
VGWSVAVAV 
LWTIRAYKAE 
AKTLDAGQVG 
PPPAWPHEGV 
LSEPEGKIWI 
QEVQLKETIE 
TDELIQKKIR 
KMVQQLGKAE 



VSIRRIQTFL 
LAWGPVGAG 
KERYEKVIKA 
PLSAVDAEVS 
LKSGIDFGSL 
NVPVTLSEEN 
KQSMLNVTVN 
MFESILKAPV 
IPWIAIPLVP 
ERCQELFDAH 
LALSYALTLM 
IIFDNVNFMY 
DKILTTEIGL 
DLPGKMDTEL 
EKFAHCTVLT 
AAALTETAKQ 



LLDEISQRNR 
KSSLLSAVLG 
CALKKDLQLL 
RKLFELCICQ 



RSEGKVGFQA 
GGGNVTEKLD 
LFFDRNPIGR 
LGIIFIFLRR 
QDLHSEAWFL 
GMFQWCVRQS 
SPGGPLVLKH 
HDLRKKMSII 
AESGSNFSVG 
IAHRLNTIID 
VYFKRNYPHI 



QLPSDGKKMV 
ELAPSHGLVS 
EDGDLTVIGD 
ILHEKITILV 
PPVPGTPTLR 
YKNYFRAGAH 
LNWYLGIYSO 
ILNRFSKDIG 
YFLETSRDVK 
FLTTSRWFAV 
AEVENMMISV 
LTALIKSQEK 
PQEPVLFTGT 
QRQLVCLARA 
SDKIMVLDSG 
GHTDHMVTNT 



HVQDFTAFWD 
VHGRIAYVSQ 
RGTTLSGGQK 
THQLQYLKAA 
NRTFSESSVW 
WIVFIFLILL 
LTVATVLFGI 
HLDDLLPLTF 
RLESTTRSPV 
RLDAICAMFV 
ERVIEYTDLE 
VGIVGRTGAG 
MRKNLDPFNB 
ILRKNQILII 
RLKEYDEPYV 
SNGQPSTLTI 



420 
4B0 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



Seq ID NO: 571 DNA sequence 
Nucleic Acid Accession #: AF07120 
Coding sequence: 116.. 4093 

11 



51 



1 11 21 31 41 

I I 1 I I 1 

GGACAGGCGT GGCGGCCGGA GCCCCAGCAT CCCTGCTTGA GGTCCAGGAG CGGAGCCCGC 60 

GGCCACCGCC GCCTGATCAG CGCGACCCCG GCCCGCGCCC GCCCCGCCCG GCAAGATGCT 120 

GCCCGTGTAC CAGGAGGTGA AGCCCAACCC GCTGCAGGAC GCGAACATCT GCTCACGCGT 180 

GTTCTTCTGG TGGCTCAATC CCTTGTTTAA AATTGGCCAT AAACGGAGAT TAGAGGAAGA 240 

TGATATGTAT TCAGTGCTGC CAGAAGACCG CTCACAGCAC CTTGGAGAGG AGTTGCAAGG 300 

GTTCTGGGAT AAAGAAGTTT TAAGAGCTGA GAATGACGCA CAGAAGCCTT CTTTAACAAG 360 

AGCAATCATA AAGTGTTACT GGAAATCTTA TTTAGTTTTG GGAATTTTTA CGTTAATTGA 420 

GGAAAGTGCC AAAGTAATCC AGCCCATATT TTTGGGAAAA ATTATTAATT ATTTTGAAAA 480 

TTATGATCCC ATGGATTCTG TGGCTTTGAA CACAGCGTAC GCCTATGCCA CGGTGCTGAC 540 

TTTTTGCACG CTCATTTTGG CTATACTGCA TCACTTATAT TTTTATCACG TTCAGTGTGC 600 

TGGGATGAGG TTACGAGTAG CCATGTGCCA TATGATTTAT CGGAAGGCAC TTCGTCTTAG 660 

TAACATGGCC ATGGGGAAGA CAACCACAGG CCAGATAGTC AATCTGCTGT CCAATGATGT 720 

GAACAAGTTT GATCAGGTGA CAGTGTTCTT ACACTTCCTG TGGGCAGGAC CACTGCAGGC 780 

GATCGCAGTG ACTGCCCTAC TCTGGATGGA GATAGGAATA TCGTGCCTTG CTGGGATGGC 840 

AGTTCTAATC ATTCTCCTGC CCTTGCAAAG CTGTTTTGGG AAGTTGTTCT CATCACTGAG 900 

GAGTAAAACT GCAACTTTCA CGGATGCCAG GATCAGGACC ATGAATGAAG TTATAACTGG 960 

TATAAGGATA ATAAAAATGT ACGCCTGGGA AAAGTCATTT TCAAATCTTA TTACCAATTT 1020 

GAGAAAGAAG GAGATTTCCA AGATTCTGAG AAGTTCCTGC CTCAGGGGGA TGAATTTGGC 1080 

TTCGTTTTTC AGTGCAAGCA AAATCATCGT GTTTGTGACC TTCACCACCT ACGTGCTCCT 1140 

CGGCAGTGTG ATCACAGCCA GCCGCGTGTT CGTGGCAGTG ACGCTGTATG GGGCTGTGCG 1200 

GCTGACGGTT ACCCTCTTCT TCCCCTCAGC CATTGAGAGG GTGTCAGAGG CAATCGTCAG 1260 

CATCCGAAGA ATCCAGACCT TTTTGCTACT TGATGAGATA TCACAGCGCA ACCGTCAGCT 1320 

GCCGTCAGAT GGTAAAAAGA TGGTGCATGT GCAGGATTTT ACTGCTTTTT GGGATAAGGC 1380 

ATCAGAGACC CCAACTCTAC AAGGCCTTTC CTTTACTGTC AGACCTGGCG AATTGTTAGC 1440 

TGTGGTCGGC CCCGTGGGAG CAGGGAAGTC ATCACTGTTA AGTGCCGTGC TCGGGGAATT 1500 

GGCCCCAAGT CACGGGCTGG TCAGCGTGCA TGGAAGAATT GCCTATGTGT CTCAGCAGCC 1560 

CTGGGTGTTC TCGGGAACTC TGAGGAGTAA TATTTTATTT GGGAAGAAAT ATGAAAAGGA 1620 

ACGATATGAA AAAGTCATAA AGGCTTGTGC TCTGAAAAAG GATTTACAGC TGTTGGAGGA 1680 

TGGTGATCTG ACTGTGATAG GAGATCGGGG AACCACGCTG AGTGGAGGGC AGAAAGCACG 1740 

GGTAAACCTT GCAAGAGCAG TGTATCAAGA TGCTGACATC TATCTCCTGG ACGATCCTCT 1800 

CAGTGCAGTA GATGCGGAAG TTAGCAGACA CTTGTTCGAA CTGTGTATTT GTCAAATTTT 1860 

GCATGAGAAG ATCACAATTT TAGTGACTCA TCAGTTGCAO TACCTCAAAG CTGCAAGTCA 1920 

GATTCTGATA TTGAAAGATG GTAAAATGGT GCAGAAGGGG ACTTACACTG AGTTCCTAAA 1980 

ATCTGGTATA GATTTTGGCT CCCTTTTAAA GAAGGATAAT GAGGAAAGTG AACAACCTCC 2040 

AGTTCCAGGA ACTCCCACAC TAAGGAATCG TACCTTCTCA GAGTCTTCGG TTTGGTCTCA 2100 

ACAATCTTCT AGACCCTCCT TGAAAGATGG TGCTCTGGAG AGCCAAGATA CAGAGAATGT 2160 

CCCAGTTACA CTATCAGAGQ AGAACCGTTC TGAAGGAAAA GTTGGTTTTC AGGCCTATAA 2220 

GAATTACTTC AGAGCTGGTG CTCACTGGAT TGTCTTCATT TTCCTTATTC TCCTAAACAC 2280 

TGCAGCTCAG GTTGCCTATG TGCTTCAAGA TTGGTGGCTT TCATACTGGG CAAACAAACA 2340 

AAGTATGCTA AATGTCACTG TAAATGGAGG AGGAAATGTA ACCGAGAAGC TAGATCTTAA 2400 

CTGGTACTTA GGAATTTATT CAGGTTTAAC TGTAGCTACC GTTCTTTTTG GCATAGCAAG 2460 

ATCTCTATTG GTATTCTACG TCCTTGTTAA CTCTTCACAA ACTTTGCACA ACAAAATGTT 2520 

TGAGTCAATT CTGAAAGCTC CGGTATTATT CTTTGATAGA AATCCAATAG GAAGAATTTT 2580 

AAATCGTTTC TCCAAAGACA TTGGACACTT GGATGATTTG CTGCCGCTGA CGTTTTTAGA 2640 

TTTCATCCAG ACATTGCTAC AAGTGGTTGG TGTGGTCTCT GTGGCTGTGG CCGTGATTCC 2700 

TTGGATCGCA ATACCCTTGG TTCCCCTTGG AATCATTTTC ATTTTTCTTC GGCGATATTT 2760 

TTTGGAAACG TCAAGAGATG TGAAGCGCCT GGAATCTACA ACTCGGAGTC CAGTGTTTTC 2820 

CCACTTGTCA TCTTCTCTCC AGGGGCTCTG GACCATCCGG GCATACAAAG CAGAAGAGAG 2880 

GTGTCAGGAA CTGTTTGATG CACACCAGGA TTTACATTCA GAGGCTTGGT TCTTGTTTTT 2940 

GACAACGTCC CGCTGGTTCG CCGTCCGTCT GGATGCCATC TGTGCCATGT TTGTCATCAT 3000 

OGTTGCCTTT GGGTCCCTGA TTCTGGCAAA AACTCTGGAT GCCGGGCAGG TTGGTTTGGC 3060 

ACTGTCCTAT GCCCTCACGC TCATGGGGAT GTTTCAGTGG TGTGTTCGAC AAAGTGCTGA 3120 

AGTTGAGAAT ATGATGATCT CAGTAGAAAG GGTCATTGAA TACACAGACC TTGAAAAAGA 3180 

AGCACCTTGG GAATATCAGA AACGCCCACC ACCAGCCTGG CCCCATGAAG GAGTGATAAT 3240 

CTTTGACAAT GTGAACTTCA TGTACAGTCC AGGTGGGCCT CTGGTACTGA AGCATCTGAC 3300 

AGCACTCATT AAATCACAAG AAAAGGTTGG CATTGTGGGA AGAACCGGAG CTGGAAAAAG 3360 

TTCCCTCATC TCAGCCCTTT TTAGATTGTC AGAACCCGAA GGTAAAATTT GGATTGATAA 3420 

GATCTTGACA ACTGAAATTG GACTTCACGA TTTAAGGAAG AAAATGTCAA TCATACCTCA 3480 

GGAACCTGTT TTGTTCACTG GAACAATGAG GAAAAACCTG GATCCCTTTA AGGAGCACAC 3540 

GGATGAGGAA CTGTGGAATG CCTTACAAGA GGTACAACTT AAAGAAACCA TTGAAGATCT 3600 

1172 



WO 03/042661 



PCT/US02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



TCCTGGTAAA 
ACAACTGGTG 
AGCGACGGCA 
ATTTGCCCAC 
CAAGATAATG 
GCAAAATAAA 
TGCCCTCACT 
CACTGACCAC 
GACAGCACTG 
TTTGGACTAT 
CAAGATGCTA 



ATGGATACTG 
TGCCTTGCCA 
AATGTGGATC 
TGCACCGTGC 
GTTTTAGATT 
GAGAGCCTAT 
GAAACAGCAA 
ATGGTTACAA 
TGAATCCAAC 
GTAAACCACA 
GTTCATTTGA 



AATTAGCAGA 
GGGCAATTCT 
CAAGAACTGA 
TAACCATTGC 
CAGGAAGACT 
TTTACAAGAT 
AACAGGTATA 
ACACTTCCAA 
CAAAATGTCA 
TTGTACTTTT 
ATATTTCTCC 



ATCAGGATCC 
CAGGAAAAAT 
TGAGTTAATA 
ACACAGATTG 
GAAAGAATAT 
GGTGCAACAA 
CTTCAAAAGA 
TGGACAGCCC 
AGTCOGTTCC 
TTTTACTTTG 
C 



AATTTTAGTG 
CAGATATTGA 
CAAAAAAAAA 
AACACCATTA 
GATGAGCCGT 
CTGGGCAAGG 
AATTATCCAC 
TCGACCTTAA 
GAAGGCATTT 
GCAACAAATA 



TTGGACAAAG 
TTATTGATGA 
TCCGGGAGAA 
TTGACAGCGA 
ATGTTTTGCT 
CAGAAGCCGC 
ATATTGGTCA 
CTATTTTCGA 
TCCACTAGTT 
TTTATACATA 



3660 
3720 
37B0 
3840 
3900 
3960 
4020 
4080 
4140 
4200 



Seq ID NO: 572 Protein sequence 
Protein Accession #: AAC27076.1 



1 
1 

MLPVYQKVKP 
QGFWDKEVLR 
ENYDPMDSVA 
LSNMAMGKTT 
MAVLIILLPL 
NLRKKEISKI 
VRLTVTLFFP 
KASETPTLQG 
QPWVFSGTLR 
ARVNLARAVY 
SQILILKDGK 



NTAAQVAYVL 
ARSLLVFYVL 
LDFIQTIiLQV 



IIVAFGSLIL 
KEAPWEYQKR 
KSSLISALFR 
HTDEELWNAL 
DBATANVDPR 
LLQNKESLFY 
FETAL 



11 
I 

KPLQDANICS 
AENDAQKPSL 
LNTAYAYATV 
TGQIVNliLSN 
QSCFGKLFSS 
LRSSCLRGMN 
SAIERVSEAI 
LSFTVRPGEL 
SNILFGKKYB 
QDADIYLLDD 
MVQKGTYTEF 
DGALESQDTE 
QDWWLSYWAN 
VNSSQTLHNK 
VGWSVAVAV 
LWTIRAYKAE 
AKTLDAGQVG 
PPPAWPHEGV 
LSEPEGKIWI 
QEVQLKETIE 
TDELIQKKIR 
KMVQQLGKAE 



21 
I 

RVFFWWLNPL 
TRAIIKCYWK 
IiTFCTLILAI 
DVNKFDQVTV 
LRSKTATFTD 
LASFFSASKI 
VSIRRIQTFL 
LAWGPVGAG 
KERYEKVIKA 
PLSAVDAEVS 
LKSGIDFGSL 
NVPVTLSEEN 
KQSMLNVTVN 
MFESILKAPV 
IPWIAIPLVP 
ERCQELFDAH 
LALSYALTLM 
IIFDNVNFMY 
DKILTTEIGL 
DLPGKMDTEL 
EKFAHCTVLT 
AAALTETAKQ 



31 
I 

FKIGHKRRLE 
SYLVLGIFTL 
LHHLYFYHVQ 
FLHFLWAGPL 
ARIRTMNEVI 
IVFVTFTTYV 
LLDEISQRNR 
KSSLLSAVXiG 
CALKKDLQLL 
RHLFELCICQ 
LKKDNEESEQ 
RSEGKVGFQA 
GGGNVTEKLD 
LFFDRNPIGR 
LGIIFIFLRR 
QDLHSEAWFL 
GMFQWCVRQS 
SPGGPLVLKH 
HDLRKKMSI I 
AESGSNFSVG 
IAHRLNTIID 
VYFKRNYPHI 



41 
1 

EDDMYSVLPE 
IEESAKVIQP 
CAGMRLRVAM 
QAIAVTALLW 
TGIRIIKMYA 
LLGSVITASR 
QLPSDGKKMV 
ELAPSHGLVS 
EDGDLTVIGD 
IIJ1EKITILV 
PPVPGTPTLR 
YKNYFRAGAH 
LNWYLGIYSG 
ILKRFSKDIG 
YFLETSRDVK 
FLTTSRWFAV 
AEVENMMISV 
LTALIKSQEK 
PQEPVLFTGT 
QRQLVCLARA 
SDKIMVUDSG 
GHTDHMVTNT 



51 
I 

DRSQHLGEEL 
IFLGKIINYF 
CHMIYRKALR 
MEIGISCLAG 
WEKSFSNLIT 
VFVAVTLYGA 
HVQDFTAFWD 
VHGR2AYVSQ 
RGTTLSGGQK 
THQLQYLKAA 
NRTFSESSVW 
WIVFIFLILIi 
LTVATVLFGI 
HLDDLLPLTF 
RLESTTRSPV 
RLDAICAMFV 
ERVIEYTDLB 
VGIVGRTGAG 
MRKNLDPFKE 
ILRKNQXLII 
RLKEYDEPYV 
SNGQPSTLTI 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



Seq ID NO: 573 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..1365 



ATGGAATCAA 
GGCATAAATG 
TTTGCCAAAT 
AGAAATCCTA 
GATGCTCTCA 
CTGTGGGACC 
AGGATAAACC 
TTGATTGTCA 
GCCAGCGGGC 
CTTGCCCGCC 
ATTGAAAATT 
AGCTTGGCCA 
AGAAACCAAC 
ATAGTTGCCA 
CAACTTTATT 
TGTAGAAAAC 
CTCTGCTTAC 
GTTCATGCAA 
ATCTCCTTTG 
TCAGTGAGCA 
GTCGCTCTGC 
GAAGAGTACT 
ATTGTAATTC 



11 
I 

TCTCTATGAT 
GTATCAAAGA 
CCTTGACCAT 
AGTTTGCTTC 
CAAAAACAAA 
TGAGACATCT 
AGTACCCAGA 
AAGGATTTAA 
AGGTTTATAT 
AGTTGAATTT 
TACCCCTACG 
CATTTTTTTT 
AGAGTGACTT 
TTACTTTGCT 
ACGGCACCAA 
AGCTTGGATT 
CGATGAGAAG 
ATATTGAAAA 
GCATAATGAG 
ATGCTTTAAA 
TCATAAGTAC 
ACAGATTTTA 
TGGATCTTTT 



21 
I 

GGGAAGGCCT 
TGCAAGGAAG 
TCGACTTATT 
TGAATTTTTT 
TATAATATTT 
GCTTGTGGGT 
ATCCAATGCT 
TGTTGTCTCA 
ATGCAGCAAC 
CATTCCCATT 
ACTCTTTACT 
CCTTTATTCC 
TTACAAAATT 
CTCCCTAGTA 
GTATAGGAGA 
ACTAAGTTTT 
GTCAGAGAGA 
CTCTTGGAAT 
CCTTGGCTTA 
CTGGAGAGAA 
TTTCCATGTT 
TACACCACCA 
GCAGCTTTGC 



31 
I 

AAGAGCCTTA 
GTCACTGTAG 
AGATGCGGCT 
CCTCATGTGG 
GTTGCTATAC 
AAAATCCTGA 
GAATATTTGG 
GCTTGGGCAC 
AATATTCAAG 
GACTTGGGAT 
CTCTGGAGAG 
TTTGTCAGAG 
CCTATAGAGA 
TACCTCGCAG 
TTTCCACCTT 
TXCTTCGCTA 
TATTTGTTTC 
GAGGAAGAAG 
CTTTCCCTCC 
TTCAGTTTTA 
TTAATTTATG 
AACTTTGTTC 
AGATACCCAG 



41 
I 

GTGAAACTTG 
GTGTGATTGG 
ATCATGTGGT 
TAGATGTCAC 
ACAGAGAACA 
TTGATGTGAG 
CTTCATTATT 
TTCAGTTAGG 
CGCGACAACA 
CCTTATCATC 
GGCCAGTGGT 
ATGTGATTCA 
TTGTGAATAA 
GTCTTCTGGC 
GGTTGGAAAC 
TGGTCCATGT 
TCAACATGGC 
TTTGGAGAAT 
TGGCAGTCAC 
TTCAGTCTAC 
GATGGAAACG 
TTGCTCTTGT 
ACTGA 



51 
I 

TTTACCTAAT 
AAGTGGAGAT 
CATAGGAAGT 
TCATCATGAA 
TTATACCTCC 
CAATAACATG 
CCCAGATTCT 
ACCTAAGGAT 
GGTTATTGAA 
AGCCAGAGAG 
GGTAGCTATA 
TCCATATGCT 
AACCTTACCT 
AGCTGCTTAT 
CTGGTTACAG 
TGCCTACAGC 
TTATCAGCAG 
TGAAATGTAT 
TTCTATCCCT 
ACTTGGATAT 
AGCTTTTGAG 
TTTGCCCTCA 



Seq ID NO: 574 Protein sequence 
Protein Accession #: Eos sequence 



MESISMMGSP 
RNPKFASEFF 
RINQYPESNA 
LARQLNFIPI 
RNQQSDFYKI 
CRKQLGLLSF 
ISFGIMSLGL 
EEYYRFYTPP 



11 
I 

KSLSETCLPN 
PHWDVTHHE 
EYLASLFPDS 
DLGSLSSARE 
PIEIVNKTLP 
FFAMVHVAYS 
LSLLAVTSIP 
NFVLALVLPS 



21 
I 

GINGIKDARK 
DALTKTNIIF 
LIVKGFNWS 
IENLPLRLFT 
IVAITLLSIiV 
LCLPMRRSER 
SVSNALNWRE 
IVILDLLQLC 



31 
I 

VTVGVIGSGD 
VAIHRBHYTS 
AWALQLGPKD 
LWRGPVWAI 
YLAGLLAAAY 
YLFLNMAYQQ 
FSFIQSTLGY 
RYPD 



41 
I 

FAKSLTIRLI 
LWDLRHltLVG 
ASRQVYICSN 
SLATFFFLYS 
QLYYGTKYRR 
VHANIENSWN 
VALLISTFHV 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



51 
I 

RCGYHWIGS 
KILIDVSNNM 
NIQARQQVIE 
FVRDVIHPYA 
FPPWLETWLQ 
EEEVWRIEMY 
LIYGWKRAFE 



60 
120 
180 
240 
300 
360 
420 
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Seq ID NO: 575 DNA sequence 

Nucleic Acid Accession ft: NM_001873.1 

Coding sequence; 3.. 1721 

I 11 21 31 41 51 

I I I i I I 

AAATGGCGTG CCCGTCTCTC CGCCGGCCCC CTGCCTCGCA GTGGTTTCTC CTGCAGCTCC 60 

CCTGGGCTCC GCGGCCAGTA GTGCAGCCCG TGGAGCCGCG GCTTTGCCCG TCTCCTCTGG 120 

GTGGCCCCAG TGCGCGGGCT GACACTCATT CAGCCGGGGA AGGTGAGGCG AGTAGAGGCT 180 

GGTGCGGAAC TTGCCGCCCC CAGCAGCGCC GGCGGGCTAA GCCCAGGGCC GGGCAGACAA 240 

AAGAGGCCGC CCGCGTAGGA AGGCACGGCC GGCGGCGGCG GAGCGCAGCG ATGGCCGGGC 300 

GAGGGGGCAG CGCGCTGCTG GCTCTGTGCG GGGCACTGGC TGCCTGCGGG TGGCTCCTGG 360 

GCGCCGAAGC CCAGGAGCCC GGGGCGCCCG CGGCGGGCAT GAGGCGGCGC CGGCGGCTGC 420 

AGCAAGAGGA CGGCATCTCC TTCGAGTACC ACCGCTACCC CGAGCTGCGC GAGGCGCTCG 480 

TGTCCGTGTG GCTGCAGTGC ACCGCCATCA GCAGGATTTA CACGGTGGGG CGCAGCTTCG 540 

AGGGCCGGGA GCTCCTGGTC ATCGAGCTGT CCGACAACCC TGGCGTCCAT GAGCCTGGTG GOO 

AGCCTGAATT TAAATACATT GGGAATATGC ATGGGAATGA GGCTGTTGGA CGAGAACTGC 660 

TCATTTTCTT GGCCCAGTAC CTATGCAACG AATACCAGAA GGGGAACGAG ACAATTGTCA 720 

ACCTGATCCA CAGTACCCGC ATTCACATCA TGCCTTCCCT GAACCCAGAT GGCTTTGAGA 780 

AGGCAGCGTC TCAGCCTGGT GAACTCAAGG ACTGGTTTGT GGGTCGAAGC AATGCCCAGG 840 

GAATAGATCT GAACCGGAAC TTTCCAGACC TGGATAGGAT AGTGTACGTG AATGAGAAAG 900 

AAGGTGGTCC AAATAATCAT CTGTTGAAAA ATATGAAGAA AATTGTGGAT CAAAACACAA 960 

AGCTTGCTCC TGAGACCAAG GCTGTCATTC ATTGGATTAT GGATATTCCT TTTGTGCTTT 1020 

CTGCCAATCT CCATGGAGGA GACCTTGTGG CCAATTATCC ATATGATGAG ACGCGGAGTG 1080 

GTAGTGCTCA CGAATACAGC TCCTCCCCAG ATGACGCCAT TTTCCAAAGC TTGGCCCGGG 1140 

CATACTCTTC TTTCAACCCG GCCATGTCTG ACCCCAATCG GCCACCATGT CGCAAGAATG 1200 

ATGATGACAG CAGCTTTGTA GATGGAACCA CCAACGGTGG TGCTTGGTAC AGCGTACCTG 1260 

GAGGGATGCA AGACTTCAAT TACCTTAGCA GCAACTGTTT TGAGATCACC GTGGAGCTTA 1320 

GCTGTGAGAA GTTCCCACCT GAAGAGACTC TGAAGACCTA CTGGGAGGAT AACAAAAACT 1380 

CCCTCATTAG CTACCTTGAG CAGATACACC GAGGAGTTAA AGGATTTGTC CGAGACCTTC 1440 

AAGGTAACCC AATTGCGAAT GCCACCATCT CCGTGGAAGG AATAGACCAC GATGTTACAT 1500 

CCGCAAAGGA TGGTGATTAC TGGAGATTGC TTATACCTGG AAACTATAAA CTTACAGCCT 1560 

CAGCTCCAGG CTATCTGGCA ATAACAAAGA AAGTGGCAGT TCCTTACAGC CCTGCTGCTG 1620 

GGGTTGATTT TGAACTGGAG TCATTTTCTG AAAGGAAAGA AGAGGAGAAG GAAGAATTGA 1680 

TGGAATGGTG GAAAATGATG TCAGAAACTT TAAATTTTTA AAAAGGCTTC TAGTTAGCTG 1740 

CTTTAAATCT ATCTATATAA TGTAGTATGA TGTAATGTGG TCTTTTTTTT AGATTTTGTG 1800 

CAGTTAATAC TTAACATTGA TTTATTTTTT AATCATTTAA ATATTAATCA ACTTTCCTTA 1860 

AAATAAATAG CCTCTTAGGT AAAAATATAA GAACTTGATA TATTTCATTC TCTTATATAG 1920 

TATTCATTTT CCTACCTATA TTACACAAAA AAGTATAGAA AAGATTTAAG TAATTTTGCC 1980 

ATCCTAGGCT TAAATGCAAT ATTCCTGGTA TTATTTACAA TGCAGAATTT TTTGAGTAAT 2040 

TCTAGCTTTC AAAAATTAGT GAAGTTCTTT TACTGTAATT GGTGACAATG TCACATAATG 2100 

AATGCTATTG AAAAGGTTAA CAGATACAGC TCGGAGTTGT GAGCACTCTA CTGCAAGACT 2160 

TAAATAGTTC AGTATAAATT GTCGTTTTTT TCTTGTGCTG ACTAACTATA AGCATGATCT 2220 

TGTTAATGCA TTTTTGATGG GAAGAAAAGG TACATGTTTA CAAAGAGGTT TTATGAAAAG 2280 

AATAAAAATT GACTTCTTGC TTGTACATAT AGGAGCAATA CTATTATATT ATGTAGTCCG 2340 

TTAACACTAC TTAAAAGTTT AGGGTTTTCT CTTGGTTGTA GAGTGGCCCA GAATTGCATT 2400 
CTGAATGAAT AAAGGTTAAA AAAAAATCOC CAGTGAAAAA AAA 



Seq ID NO: 576 Protein sequence 
Protein Accession ft: NP__001864.1 



1 
I 

MAGRGGSALL 
EALVSVWLQC 
RELLIFLAQY 
NAQGIDLNRN 
FVLSANLHGG 



NKNSLISYLE 
LTASAPGYLA 



11 
I 

ALCGAliAACG 
TAISRIYTVG 
LCNEYQKGNE 
FPDLDRIVYV 
DLVANYPYDB 
DGTTNGGAWY 
QIHRGVKGFV 
ITKKVAVPYS 



21 
I 

WLLGAEAQEP 
RSFEGRELLV 
TIVNLIHSTR 



TRSGSAHEYS 
SVPGGMQDFN 
RDIOGNPIAN 
PAAGVDFELE 



31 
I 

GAPAAGMRRR 
IELSDNPGVH 
IHIMPSLNPD 
LLKNMKKIVD 
SSPDDAIFQS 
YLSSNCFEIT 
ATISVEGIDH 



41 
I 

RRLQQEDGIS 
EPGEPEPKYI 
GFEKAASQPG 
QNTKLAPETK 
LARAYSSFNP 
VELSCEKFPP 
DVTSAKDGDY 



51 
I 

FEYHRYPELR 
GNMHGNEAVG 
ELKDWFVGRS 
AVIHWIMDIP 
AMSDPNRPPC 
EETLKTYHED 
WRLLIPGNYK 



60 
120 
180 
240 
300 
360 
420 



Seq ID NO: 577 DNA sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding sequence: 1..933 



1 
I 

ATGTGCAGCA 
TTCGACAAGA 
TTCCCCTGTG 
GACTGTCCOG 
GCCCGCTACC 
AATAACTGTC 
GGGCAGGTGT 
ATCATCGGCA 
CACCAGCGGA 
CTGCTGTCCC 
AATAATGGCA 
CCACCCTCCT 
CCGCCCTACT 
CGGTCCGGGA 
GACACCAGCC 



11 
I 

ATGGACGGTG 
GTGATGAGAA 
CCAGCGGCAT 
ATGGCAGCGA 
ACTGCAAGAA 
AAGACAACAG 
TTGTGACTTC 
GCTCCGTCAT 
AGCGGAACAA 

TCCAGTATGT 
ACTCCGAGGC 
CTTCTGACAC 
GTGCCAACAG 
ACAGCCCGGG 



21 
I 

CATCCCGGGC 
GGAGTGCCCC 
CCATTGCATC 
TGAAGAGAAC 
CGGCCTCTGT 
TGATGAGGAA 
AGAGAACCAA 
TTTTGTGCTG 
CCTCATGACG 
CCTGGACCAC 
GGCCAGCCAG 
CTTGCTGGAC 
GGAATCTCTG 
TGCCAGCTCC 
GCAGCCTGGC 



31 
I 

GCCTGGCAGT 
AAGGCTAAGT 
ATTGGTCGCT 
TGCACAGCAA 
ATTGACAAGA 
AGCTGTGAAA 
CTTGTGTATT 
GTGGTGGCCC 
CTGCCCGTGC 
CCCCACCACT 
GCGGAGCAGA 
CAGAGGCCTG 
AACCAAGCCG 
CAGGCAGCCA 
CCCCAGGAGG 



41 

i 

GTGACGGGCT 
CGAAATGTGG 
TCCGGTGCAA 
ACCCTCTGCT 
GCTTCATCTG 
GTTCTCAAGA 
ACCCCAGCAT 
TGCTGGCACT 
ACCGGCTGCA 
GCAACGTCAC 
ATGCGTCGGA 
CGTGGTATGA 
ACCTGCCCCC 
GCAGCCTCCT 
GCACTGCTGA 



51 
I 

GCCTGACTGC 
CCCGACCTTC 
TGGGTTTGAG 
TTGCTCCACC 
CGATGGACAG 
ACCCGGCAGT 
CACCTATGCC 
GGTCTTGCAC 
GCACCCTGTG 
CTACAACGTC 
AGTAGGCTCC 
CCTTCCTCCA 
CTACCGCTCC 
GAGCGTGGAA 
GCCCAGGGAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



1174 



WO 03/042661 



TCTGAGCCCA GCCAGGGCAC TGAAGAAGTA TAA 

Seq ID NO: 578 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MCSNGRCIPG AWQCDGLFDC FDKSDEKECP KAKSKCGPTP FPCASGIHCI IGRFRCNGFE 60 

DCPDGSDEEN CTANPLLCST ARYHCKNGLC IDKSFICDGQ NNCQDNSDEE SCESSQEPGS 120 

GQVFVTSENQ LVYYPSITYA IIGSSVIFVL WALLALVLH HQRKRNNLMT LPVHRLQHPV 180 

LLSRLWLDH PHHCNVTYNV NNGIQYVASQ AEQNASEVGS PPSYSEALLD QRPAWYDLPP 240 

PPYSSDTESL NQADLPPYRS RSGSANSASS QAASSLLSVE DTSHSPGQPG PQEGTAEPRD 300 
SEPSQGTEEV 

Seq ID NO: 579 DNA sequence 
Nucleic Acid Accession #: AF179274.1 
Coding sequence: 1..1125 

1 11 21 31 41 51 

I I I I I I 

ATGGTGCTGT GGGAGTCCCC GCGGCAGTGC AGCAGCTGGA CACTTTGCGA GGGCTTTTGC 60 

TGGCTGCTGC TGCTGCCCGT CATGCTACTC ATCGTAGCCC GCCCGGTGAA GCTCGCTGCT 120 

TTCCCTACCT CCTTAAGTGA CTGCCAAACG CCCACCGGCT GGAATTGCTC TGGTTATGAT 180 

GACAGAGAAA ATGATCTCTT CCTCTGTGAC ACCAACACCT GTAAATTTGA TGGGGAATGT 240 

TTAAGAATTG GAGACACTGT GACTTGCGTC TGTCAGTTCA AGTGCAACAA TGACTATGTG 300 

CCTGTGTGTG GCTCCAATGG GGAGAGCTAC CAGAATGAGT GTTACCTGCG ACAGGCTGCA 360 

TGCAAACAGC AGAGTGAGAT ACTTGTGGTG TCAGAAGGAT CATGTGCCAC AGATGCAGGA 420 

TCAGOATCTG GAGATGGAGT CCATGAAGGC TCTGGAGAAA CTAGTCAAAA GGAGACATCC 480 

ACCTGTGATA TTTGCCAGTT TGGTGCAGAA TGTGACGAAG ATGCCGAGGA TGTCTGGTGT 540 

GTGTGTAATA TTGACTGTTC TCAAACCAAC TTCAATCCCC TCTGCGCTTC TGATGGGAAA 600 

TCTTATGATA ATGCATGCCA AATCAAAGAA GCATCGTGTC AGAAACAGGA GAAAATTGAA 660 

GTCATGTCTT TGGGTCGATG TCAAGATAAC ACAACTACAA CTACTAAGTC TGAAGATGGG 720 

CATTATGCAA GAACAGATTA TGCAGAGAAT GCTAACAAAT TAGAAGAAAG TGCCAGAGAA 780 

CACCACATAC CTTGTCCGGA ACATTACAAT GGCTTCTGCA TGCATGGGAA GTGTGAGCAT 840 

TCTATCAATA TGCAGGAGCC ATCTTGCAGG TGTGATGCTG GTTATACTGG ACAACACTGT 900 

GAAAAAAAGG ACTACAGTGT TCTATACGTT GTTCCCGGTC CTGTACGATT TCAGTATGTC 960 

TTAATCGCAG CTGTGATTGG AACAATTCAG ATTGCTGTCA TCTGTGTGGT GGTCCTCTGC 1020 

ATCACAAGGA AATGCCCCAG AAGCAACAGA ATTCACAGAC AGAAGCAAAA TACAGGGCAC 1080 
TACAGTTCAG ACAATACAAC AAGAGCGTCC ACGAGGTTAA TCTGA 

Seq ID NO: 580 Protein sequence 
Protein Accession #: NP_057276.2 

1 11 21 31 41 51 

I I I I I I 

MVLWESPRQC SSWTLCEGFC WLLLLPVMLL IVARPVKLAA FPTSLSDCQT PTGWNCSGYD 60 

DRENDLFLCD TNTCKFDGEC LRIGDTVTCV CQFKCNNDYV PVCGSNGESY QNECYLRQAA 120 

CKQQSEILW SEGSCATDAG SGSGDGVHEG SGETSQKETS TCDICQFGAE CDEDAEDVWC 180 

VCNIDCSQTN FNPLCASDGK SYDNACQIKE ASCQKQEKIE VMSLGRCQDN TTTTTKSEDG 240 

HYARTDYAEN ANKLEESARE HHIPCPEHYN GFCMHGKCEH SINMQEPSCR CDAGYTGQHC 300 

EKKDYSVLYV VPGPVRFQYV LIAAVIGTIQ IAVICVWLC ITRKCPRSNR IHRQKQNTGH 360 
YSSDNTTRAS TRLI 

Seq ID NO: 581 DNA sequence 
Nucleic Acid Accession #i S78203.1 
Coding sequence i 1 . . 2 1 90 

1 11 21 31 41 51 

I I I I I I 

ATGAATCCTT TCCAGAAAAA TGAGTCCAAG GAAACTCTTT TTTCACCTGT CTCCATTGAA 60 

GAGGTACCAC CTCGACCACC TAGCCCTCCA AAGAAGCCAT CTCCGACAAT CTGTGGCTCC 120 

AACTATCCAC TGAGCATTGC CTTCATTGTG GTGAATGAAT TCTGCGAGCG CTTTTCCTAT 180 

TATGGAATGA AAGCTGTGCT GATCCTGTAT TTCCTGTATT TCCTGCACTG GAATGAAGAT 240 

ACCTCCACAT CTATATACCA TGCCTTCAGC AGCCTCTGTT ATTTTACTCC CATCCTGGGA 300 

GCAGCCATTG CTGACTOGTG GTTGGGAAAA TTCAAGACAA TCATCTATCT CTCCTTGGTG 360 

TATGTGCTTG GCCATGTGAT CAAGTCCTTG GGTGCCTTAC CAATACTGGG AGGACAAGTG 420 

GTACACACAG TCCTATCATT GATCGGCCTG AGTCTAATAG CTTTGGGGAC AGGAGGCATC 480 

AAACCCTGTG TGGCAGCTTT TGGTGGAGAC CAGTTTGAAG AAAAACATGC AGAGGAACGG 540 

ACTAGATACT TCTCAGTCTT CTACCTGTCC ATCAATGCAG GGAGCTTGAT TTCTACATTT 600 

ATCACACCCA TGCTGAGAGG AGATGTGCAA TGTTTTGGAG AAGACTGCTA TGCATTGGCT 660 

TTTGGAGTTC CAGGACTGCT CATGGTAATT GCACTTGTTG TGTTTGCAAT GGGAAGCAAA 720 

ATATACAATA AACCACCCCC TGAAGGAAAC ATAGTGGCTC AAGTTTTCAA ATGTATCTGG 780 

TTTGCTATTT CCAATCGTTT CAAGAACCGT TCTGGAGACA TTCCAAAGCG ACAGCACTGG 840 

CTAGACTGGG CAGCTGAGAA ATATCCAAAG CAGCTCATTA TGGATGTAAA GGCACTGACC 900 

AGGGTACTAT TCCTTTATAT CCCATTGCCC ATGTTCTGGG CTCTTTTGGA TCAGCAGGGT 960 

TCACGATGGA CTTTGCAAGC CATCAGGATG AATAGGAATT TGGGGTTTTT TGTGCTTCAG 1020 

CCGGACCAGA TGCAGGTTCT AAATCCCTTT CTGGTTCTTA TCTTCATCCC GTTGTTTGAC 1080 

TTTGTCATTT ATCGTCTGGT CTCCAAGTGT GGAATTAACT TCTCATCACT TAGGAAAATG 1140 

GCTGTTGGTA TGATCCTAGC GTGCCTGGCA TTTGCAGTTG CGGCAGCTGT AGAGATAAAA 1200 

ATAAATGAAA TGGCCCCAGC CCAGTCAGGT CCCCAGGAGG TTTTCCTACA AGTCTTGAAT 1260 

CTGGCAGATG ATGAGGTGAA GGTGACAGTG GTGGGAAATG AAAACAATTC TCTGTTGATA 1320 

GAGTCCATCA AATCCTTTCA GAAAACACCA CACTATTCCA AACTGCACCT GAAAACAAAA 1380 

AGCCAGGATT TTCACTTCCA CCTGAAATAT CACAATTTGT CTCTCTACAC TGAGCATTCT 1440 

GTGCAGGAGA AGAACTGGTA CAGTCTTGTC ATTCGTGAAG ATGGGAACAG TATCTCCAGC 1500 
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ATGATGGTAA AGGATACAGA AAGCAAAACA ACCAATGGGA TGACAACCGT GAGGTTTGTT 1560 

AACACTTTGC ATAAAGATGT CAACATCTCC CTGAGTACAG ATACCTCTCT CAATGTTGGT 1620 

GAAGACTATG GTGTGTCTGC TTATAGAACT GTGCAAAGAG GAGAATACCC TGCAGTGCAC 1680 

TGTAGAACAG AAGATAAGAA CTTTTCTCTG AATTTGGGTC TTCTAGACTT TGGTGCAGCA 1740 

3 TATCTGTTTG TTATTACTAA TAACACCAAT CAGGGTCTTC AGGCCTGGAA GATTGAAGAC 1800 

ATTCCAGCCA ACAAAATGTC CATTGCGTGG CAGCTACCAC AATATGCCCT GGTTACAGCT 1860 

GGGGAGGTCA TGTTCTCTGT CACAGGTCTT GAGTTTTCTT ATTCTCAGGC TCCCTCTAGC 1920 

ATGAAATCTG TGCTCCAGGC AGCTTGGCTA TTGACAATTG CAGTTGGGAA TATCATCGTG 1980 

CTTGTTGTGG CACAGTTCAG TGGCCTGGTA CAGTGGGCCG AATTCATTTT GTTTTCCTGC 2040 

CTCCTGCTGG TGATCTGCCT GATCTTCTCC ATCATGGGCT ACTACTATGT TCCTGTAAAG 2100 

ACAGAGGATA TGCGGGGTCC AGCAGATAAG CACATTCCTC ACATCCAGGG GAACATGATC 2160 
AAACTAGAGA CCAAGAAGAC AAAACTCTGA 



Seq ID NO: 582 Protein sequence 
15 Protein Accession #i AAB34388-1 

1 11 21 31 41 51 

I I I I I i 

MNPFQKNESK ETLFSFVSIE EVPPRPPSPP KKPSPTICGS NYPLSIAFIV VNEFCERFSY 60 

Z\J YGMKAVLILY FLYFLHWNED TSTSIYHAFS SLCYFTPILG AAIADSWLGK FKTI I YLSLV 120 

YVLGHVIKSL GALPILGGQV VHTVLSLIGL SLIALGTGGI KPCVAAFGGD QFEEKHABER 180 

TRYFSVFYIiS INAGSLISTF ITPMLRGDVQ CFG ED CY ALA FGVPGLLMVI ALWFAMGSK 240 

IYNKPPPEGN IVAQVFKCIW FAISNRFKNR SGDIPKRQHW LDWAAEKYPK QLIMDVKALT 300 

RVLFLYIPLP MFWALLDQQG SRWTLQAIRM NRNLGPFVLQ PDQMQVLNPF LVLIFIPLFD 360 

2.3 FVIYRLVSKC GINFSSLRKM AVGMILACLA FAVAAAVEIK INEMAPAQSG PQEVFLQVLN 420 

LADDEVKVTV VGNENNSLLI ESIKSFQKTP HYSKLHLKTK SQDFHFHLKY HNLSLYTEHS 480 

VQEKNWYSLV IREDGNSISS MMVKDTESKT TNGMTTVRFV NTLHKDVNIS LSTDTSLNVG 540 

EDYGVSAYRT VQRGEYPAVH CRTEDKNFSL NLGLLDFGAA YLFVITNNTN QGLQAWKIED 600 

IPANKMSIAW QLPQYALVTA GBVMFSVTGL EFSYSQAPSS MKSVLQAAWL LTIAVGNIIV 660 

LVVAQFSGLV QWAEFILFSC LLLVICLIFS IMGYYYVPVK TEDMRGPADK HIPHIQGNMI 720 
KLETKKTKL 
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Seq ID NOs 583 DMA sequence 
Nucleic Acid Accession #: NM_032642.1 
3 J Coding sequence: 184.. 1263 
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1 11 21 31 41 51 

I I I I I I 

GACCATTAGC AGGCACCCAG GCCTGTCTTT GGCTCGGAAA CGGTGGCCCC CAATGTAGCC 60 

TAGTTTGAAC CTAGGAACTG CAGGACCAGA GAGATTCCAC TGGAGCCTGA TGGACGGGTG 120 

ACAGAGGGAA CCCTACTCTG GAAACTGTCA GTCCCAGGGC ACTGGGGAGG GCTGAGGCCG 180 

ACCATGCCCA GCCTGCTGCT GCTGTTCACG GCTGCTCTGC TGTCCAGCTG GGCTCAGCTT 240 

CTGACAGACG CCAACTCCTG GTGGTCATTA GCTTTGAACC CGGTGCAGAG ACCCGAGATG 300 

TTTATCATCG GTGCCCAGCC CGTGTGCAGT CAGCTTCCCG GGCTCTCCCC TGGCCAGAGG 360 

AAGCTGTGCC AATTGTACCA GGAGCACATG GCCTACATAG GGGAGGGAGC CAAGACTGGC 420 

ATCAAGGAAT GCCAGCACCA GTTCCGGCAG CGGCGGTGGA ATTGCAGCAC AGCGGACAAC 480 

GCATCTGTCT TTGGGAGAGT CATGCAGATA GGCAGCCGAG AGACCGCCTT CACCCACGCG 540 

GTGAGCGCCG CGGGCGTGGT CAACGCCATC AGCCGGGCCT GCCGOGAGGG CGAGCTCTCC 600 

ACCTGCGGCT GCAGCCGGAC GGCGCGGCCC AAGGACCTGC CCCGGGACTG GCTGTGGGGC 660 

GGCTGTGGGG ACAACGTGGA GTACGGCTAC CGCTTCGCCA AGGAGTTTGT GGATGCCCGG 720 

GAGCGAGAGA AGAACTTTGC CAAAGGATCA GAGGAGCAGG GCCGGGTGCT CATGAACCTG 780 

CAAAACAACG AGGCCGGTCG CAGGGCTGTG TATAAGATGG CAGACGTAGC CTGCAAATGC 840 

CACGGCGTCT CGGGGTCCTG CAGCCTCAAG ACCTGCTGGC TGCAGCTGGC CGAGTTCCGC 900 

AAGGTCGGGG ACCGGCTGAA GGAGAAGTAC GACAGCGCGG CCGCCATGCG CGTCACCCGC 960 

AAGGGCCGGC TGGAGCTGGT CAACAGCCGC TTCACCCAGC CCACCCCGGA GGACCTGGTC 1020 

TATGTGGACC CCAGCCCCGA CTACTGCCTG CGCAACGAGA GCACGGGCTC CCTGGGCACG 1080 

CAGGGCCGCC TCTGCAACAA GACCTCGGAG GGCATGGATG GCTGTGAGCT CATGTGCTGC 1140 

GGGCGTGGCT ACAACCAGTT CAAGAGCGTG CAGGTGGAGC GCTGCCACTG CAAGTTCCAC 1200 

TGGTGCTGCT TCGTCAGGTG TAAGAAGTGC ACGGAGATCG TGGACCAGTA CATCTGTAAA 1260 

TAGCCCGGAG GGCCTGCTCC CGGCCCCCCC TGCACTCTGC CTCACAAAGG TCTATATTAT 1320 

ATAAATCTAT ATAAATCTAT TTTATATTTG TATAAGTAAA TGGGTGGGTG CTATACAATG 1380 

GAAAGATGAA AATGGAAAGG AAGAGCTTAT TTAAGAGACG CTGGAGATCT CTGAGGAGTG 1440 

GACTTTGCTG GTTCTCTCCT CTTGGTGGGT GGGAGACAGG GCTTTTTCTC TCCCTCTGGC 1S0O 

GAGGACTCTC AGGATGTAGG GACTTGGAAA TATTTACTGT CTGTCCACCA CGGCCTGGAG 1560 

GAGGGAGGTT GTGGTTGGAT GGAGGAGATG ATCTTGTCTG GAAGTCTAGA GTCTTTGTTG 1620 

GTTAGAGGAC TGCCTGTGAT CCTGGCCACT AGGCCAAGAG GCCCTATGAA GGTGGCGGGA 1680 

ACTCAGCTTC AACCTCGATG TCTTCAGGGT CTTGTCCAGA ATGTAGATGG GTTCCGTAAG 1740 

AGGCCTGGTG CTCTCTTACT CTTTCATCCA CGTGCACTTG TGCGGCATCT GCAGTTTACA 1800 

GGAACGGCTC CTTCCCTAAA ATGAGAAGTC CAAGGTCATC TCTGGCCCAG TGACCACAGA I860 

GAGATCTGCA CCTCCCGGAC TTCAGGCCTG CCTTTCCAGC GAGAATTCTT CATCCTCCAC 1920 

GGTTCACTAG CTCCTACCTG AAGAGGAAAG GGGGCCATTT GACCTGACAT GTCAGGAAAG 1980 

CCCTAAACTG AATGTTTGCG CCTGGGCTGC AGAAGCCAGG GTGCATGACC AGGCTGCGTG 2040 

GACGTTATAC TGTCTTCCCC CACCCCCGGG GAGGGGAAGC TTGAGCTGCT GCTGTCACTC 2100 

CTCCACCGAG GGAGGCCTCA CAAACCACAG GACGCTGCAA CGGGTCAGGC TGGCGGGCCC 2160 

GGCGTGCTCA TCATCTCTGC CCCAGGTGTA CGGTTTCTCT CTGACATTAA ATGCCCTTCA 2220 
TGGAAAAAAA AAAAAGAAAA AAAAAAAAAA AA 

Seq ID NO: 584 Protein sequence 
Protein Accession gt NP_116031.1 

1 11 21 31 41 51 

I I I I I I 

MPSLLLLFTA ALLSSWAQLL TDANSWWSLA LNPVQRPEMF IIGAQPVCSQ LPGLSPGQRK 60 

LCQLYQEHMA YIGEGAKTGI KECQHQFRQR RWNCSTADNA SVFGRVMQIG SRETAFTHAV 120 
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SAAGWNAIS RACREGELST CGCSRTARPK DLPRDWLWGG CGDNVEYGYR PAKE FVD ARE 180 

REKNPAKGSE EQGRVLMNLQ NNBAGRRAVY KMADVACKCH GVSGSCSLKT CWLQLAEFRK 240 

VGORLKEKYD SAAAMRVTRK GRLELVNSRF TQPTPEDLVY VDPSPDYCLR NESTGSLGTQ 300 
GRLCNKTSEG MDGCELMCCG RGYNQFKSVQ VERCHCKFHW CCFVRCKKCT EIVDQYICK 

Seq ID NO: 585 DMA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..1479 

1 11 21 31 41 51 

I I I I I I 

ATGGCTTTGA ACTCAGGGTC ACCACCAGCT ATTGGACCTT ACTATGAAAA CCATGGATAC 60 

CAACCGGAAA ACCCCTATCC CGCACAGCCC ACTGTGGTCC CCACTGTCTA GGAGGTGCAT 120 

CCGGCTCAGT ACTACCCGTC CCCCGTGCCC CAGTACGCCC CGAGGGTCCT GACGCAGGCT 180 

TCCAACCCCG TCGTCTGCAC GCAGCCCAAA TCCCCATCCG GGACAGTGTG CACCTCAAAG 240 

ACTAAGAAAG CACTGTGCAT CACCTTGACC CTGGGGACCT TCCTCGTGGG AGCTGCGCTG 300 

GCCGCTGGCC TACTCTGGAA GTTCATGGGC AGCAAGTGCT CCAACTCTGG GATAGAGTGC 360 

GACTCCTCAG GTACCTGCAT CAACCCCTCT AACTGGTGTG ATGGCGTGTC ACACTGCCCC 420 

GGCGGGGAGG ACGAGAATCG GTGTGTTCGC CTCTACGGAC CAAACTTCAT CCTTCAGGTG 480 

TACTCATCTC AGAGGAAGTC CTGGCACCCT GTGTGCCAAG ACGACTGGAA CGAGAACTAC 540 

GGGCGGGCGG CCTGCAGGGA CATGGGCTAT AAGAATAATT TTTACTCTAG CCAAGGAATA 600 

GTGGATGACA GCGGATCCAC CAGCTTTATG AAACTGAACA CAAGTGCCGG CAATGTCGAT 660 

ATCTATAAAA AACTGTACCA CAGTGATGCC TGTTCTTCAA AAGCAGTGGT TTCTTTACGC 720 

TGTATAGCCT GCGGGGTCAA CTTGAACTCA AGCCGCCAGA GCAGGATCGT GGGCGGCGAG 780 

AGCGCGCTCC CGGGGGCCTG GCCCTGGCAG GTCAGCCTGC ACGTCCAGAA CGTCCACGTG 840 

TGCGGAGGCT CCATCATCAC CCCCGAGTGG ATCGTGACAG CCGCCCACTG CGTGGAAAAA 900 

CCTCTTAACA ATCCATGGCA TTGGACGGCA TTTGCGGGGA TTTTGAGACA ATCTTTCATG 960 

TTCTATGGAG CCGGATACCA AGTAGAAAAA GTGATTTCTC ATCCAAATTA TGACTCCAAG 1020 

ACCAAGAACA ATGACATTGC GCTGATGAAG CTGCAGAAGC CTCTGACTTT CAACGACCTA 1080 

GTGAAACCAG TGTGTCTGCC CAACCCAGGC ATGATGCTGC AGCCAGAACA GCTCTGCTGG 1140 

ATTTCCGGGT GGGGGGCCAC CGAGGAGAAA GGGAAGACCT CAGAAGTGCT GAACGCTGCC 1200 

AAGGTGCTTC TCATTGAGAC ACAGAGATGC AACAGCAGAT ATGTCTATGA CAACCTGATC 1260 

ACACCAGCCA TGATCTGTGC CGGCTTCCTG CAGGGGAACG TCGATTCTTG CCAGGGTGAC 1320 

AGTGGAGGGC CTCTGGTCAC TTCGAAGAAC AATATCTGGT GGCTGATAGG GGATACAAGC 1380 

TGGGGTTCTG GCTGTGCCAA AGCTTACAGA CCAGGAGTGT ACGGGAATGT GATGGTATTC 1440 
ACGGACTGGA TTTATCGACA AATGAGGGCA GACGGCTAA 

Seq ID NO: 586 Protein sequence 
Protein Accession #r Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MALNSGSPPA IGPYYENHGY QPENPYPAQP TWPTVYEVH PAQYYPSPVP QYAPRVLTQA 60 

SNPWCTQPK SPSGTVCTSK TKKALCITLT LGTPLVGAAL AAGLLMKFMG SKCSNSGIEC 120 

DSSGTCINPS NWCDGVSHCP GGEDENRCVR LYGPNPILQV YSSQRKSWHP VCQDDWNENY 180 

GRAACRDMGY KNNFYSSQGI VDDSGSTSFM KLNTSAGNVD IYKKLYHSDA CSSKAWSLR 240 

CIACGVNLNS SRQSRIVGGE SALPGAWPWQ VSLHVQNVHV CGGSIITPEW IVTAAHCVEK 300 

PLNNPWHWTA FAGILRQSFM FYGAGYQVEK VISHPNYDSK TKNNDIALMK LQKPLTFNDL 360 

VKPVCLPNPG MMLQPEQLCW ISGWGATEEK GKTSEVLNAA KVLLIETQRC NSRYVYDNLI 420 

TPAMICAGPL QGNVDSCQGD SGGPLVTSKN NIWWLIGDTS WGSGCAKAYR PGVYGNVMVF 480 
TDWIYRQMRA DG 



Seq ID NO: 587 DNA sequence 
Nucleic Acid Accession S: NM_005656.1 
J J Coding sequences 57.. 1535 

1 11 21 31 41 51 

I I I I I I 

<fl GTCATATTGA ACATTCCAGA TACCTATCAT TACTCGATGC TGTTGATAAC AGCAAGATGG 60 

OU CTTTGAACTC AGGGTCACCA CCAGCTATTG GACCTTACTA TGAAAACCAT GGATACCAAC 120 

CGGAAAACCC CTATCCCGCA CAGCCCACTG TGGTCCCCAC TGTCTACGAG GTGCATCCGG 180 

CTCAGTACTA CCCGTCCCCC GTGCCCCAGT ACGCCCCGAG GGTCCTGACG CAGGCTTCCA 240 

ACCCCGTCGT CTGCACGCAG CCCAAATCCC CATCCGGGAC AGTGTGCACC TCAAAGACTA 300 

- AGAAAGCACT GTGCATCACC TTGACCCTGG GGACCTTCCT CGTGGGAGCT GCGCTGGCOG 360 

OD CTGGCCTACT CTGGAAGTTC ATGGGCAGCA AGTGCTCCAA CTCTGGGATA GAGTGOGACT 420 

CCTCAGGTAC CTGCATCAAC CCCTCTAACT GGTGTGATGG CGTGTCACAC TGCCCCGGCG 480 

GGGAGGACGA GAATCGGTGT GTTCGCCTCT ACGGACCAAA CTTCATCCTT CAGATGTACT 540 

CATCTCAGAG GAAGTCCTGG CACCCTGTGT GCCAAGACGA CTGGAACGAG AACTACGGGC 600 

„ GGGCGGCCTG CAGGGACATG GGCTATAAGA ATAATTTTTA CTCTAGCCAA GGAATAGTGG 660 

/U ATGACAGCGG ATCCACCAGC TTTATGAAAC TGAACACAAG TGCCGGCAAT GTCGATATCT 720 

ATAAAAAACT GTACCACAGT GATGCCTGTT CTTCAAAAGC AGTGGTTTCT TTACGCTGTT 780 

TAGCCTGCGG GGTCAACTTG AACTCAAGCC GCCAGAGCAG GATCGTGGGC GGTGAGAGCG 840 

CGCTCCCGGG GGCCTGGCCC TGGCAGGTCA GCCTGCACGT CCAGAACGTC CAOGTGTGCG 900 

rjc GAGGCTCCAT CATCACCCCC GAGTGGATCG TGACAGCCGC CCACTGCGTG GAAAAACCTC 960 

/J TTAACAATCC ATGGCATTGG ACGGCATTTG CGGGGATTTT GAGACAATCT TTCATGTTCT 1020 

ATGGAGCCGG ATACCAAGTA CAAAAAGTGA TTTCTCATCC AAATTATGAC TCCAAGACCA 1080 

AGAACAATGA CATTGCGCTG ATGAAGCTGC AGAAGCCTCT GACTTTCAAC GACCTAGTGA 1140 

AACCAGTGTG TCTGCCCAAC CCAGGCATGA TGCTGCAGCC AGAACAGCTC TGCTGGATTT 1200 

cn CCGGGTGGGG GGCCACCGAG GAGAAAGGGA AGACCTCAGA AGTGCTGAAC GCTGCCAAGG 1260 

OU TGCTTCTCAT TGAGACACAG AGATGCAACA GCAGATATGT CTATGACAAC CTGATCACAC 1320 

CAGCCATGAT CTGTGCCGGC TTCCTGCAGG GGAACGTCGA TTCTTGCCAG GGTGACAGTG 1380 

GAGGGCCTCT GGTCACTTCG AACAACAATA TCTGGTGGCT GATAGGGGAT ACAAGCTGGG 1440 

GTTCTGGCTG TGCCAAAGCT TACAGACCAG GAGTGTACGG GAATGTGATG GTATTCACGG 1500 

ACTGGATTTA TCGACAAATG AAGGCAAACG GCTAATCCAC ATGGTCTTCG TCCTTGACGT 1560 
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CGTTTTACAA 
ATGATTCAGA 
TGCCATACTG 
CCGCAAGGGG 
GTTGGAGGCT 
T6AGCATGQA 
GGAAAGGGAG 
TCCCCAGCCT 
GATGGTGGCC 
AAGGGGAACA 
GAGGGAAGCA 
CATTGGGTGG 
TCCTAGCACC 
ATGTCGGCCT 
ATGCTCAGTT 
CTGAGTTCAA 



GAAAACAATG 
GGTCACTTCA 
TGCAGGCTGC 
TGATGGCCGG 
GCCCCCATTG 
GCTGTCACTT 
ACAGCCAGGT 
ACTTCACAAG 
AGAAATAAAG 
GAAACATTTT 
ATTGAAAAGG 
GGCTCCTGGG 
CTGGAGAGTG 
CTTCAGGCCT 
TAAGGTACAC 
AGCCATCTT 



GGGCTGGTTT 
TTTTTATTAA 
AGTGGCTCCC 
CTGGTTGTGG 
AGATCTTCCT 
CTCAGCTGCT 
GGCACCTGCA 
GGGATTTTGC 
GGACCAGCCC 
TGTTCTTATG 
AACTTGCCCT 
AGGGAGACTC 
AATGCCCCTT 
GATAGTCATT 
TGTTTCCATG 



TGCTTCCCCG 
ACAGTGAACT 
CTGCCCAGCC 
GCACTGGCGG 
GCTGAGTCCT 
GGATGACTTG 
GCGGCTGCCC 
TGATGGGTTC 
TTCATGGGTG 
GGGTGAGAAT 
GAGCACTCCT 
AGCCTTCCTC 
GGTCCCTGGC 
GGAAATTGAG 
TTATGTTTCT 



TGCATGATTT 
TGTCTGGCTT 
TGCTCTCCCT 
TCAATTGTGG 
TTCCAGGGGC 
AGATGAAAAA 
TCTGGGGCCA 
TTAGAGCCTT 
GTGACGTGGT 
ATAGACAGTG 
GGTGCAGGTC 
CTCATCCTCC 
AGGGCGCCAA 
GTCCATGGGG 
ACACATTGAT 



Seq ID NO: SBB Protein sequence 
Protein Accession #: NP_005647.1 



MALNSGSPPA 
SNPWCTQPK 
DSSGTCINPS 
GRAACRDKGY 
CLACGVNLNS 
PLNNPWHVITA 
VKPVCLPNPG 
TPAMICAGPL 
TDWIYRQMKA 



11 
I 

IGPYYENHGY 
SPSGTVCTSK 
NWCDGVSHCP 
KNNFYSSQGI 
SRQSRIVGGE 
PAGILRQSFM 
MMLQPEQLCH 
QGNVDSCQGD 
NG 



21 

I 

QPENPYPAQP 
TKKALCITLT 
GGEDENRCVR 
VDDSGSTSFM 
SALPGAWPWQ 
FYGAGYQVQK 
ISGWGATBEK 
SGGPLVTSNN 



31 
I 

TWPTVYEVH 
LGTFIiVGAAL 
LYGPNFILQM 
KLNTSAGNVD 
VSLHVQNVHV 
VISHPNYDSK 
GKT9EVLNAA 
NIWWLIGDTS 



ACTCTTAGAG 
TGGCACTCTC 
AACCCCTTGT 
AAGGAAGAGG 
CAATTTTGGA 
GGAGAGACAT 
CTTGGTAGTG 
AGCAGCCCTG 
AGTCACTTGT 
CCCTTGGTGC 
TCCACCTGCA 
CTGACCCTGC 
GTTTGGCACC 
GAAATCAAGG 
GGTGGTGACC 



51 
I 



1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 



PAQYYPSPVP QYAPRVLTQA 
AAGLLWKFMG SKCSNSGIEC 



IYKKLYHSDA CSSKAWSLR 
CGGSIITPEW IVTAAHCVEK 
TKNNDIALMK LQKPLTFNDL 
KVLLIETQRC NSRYVYDNL1 
WGSGCAKAYR PGVYGNVMVF 



60 
120 
180 
240 
300 
360 
420 
480 



Seq ID NO: 589 DNA sequence 
Nucleic Acid Accession #: NMJ)01935.1 
Coding sequence: 1..2301 



1 11 21 31 41 51 

I I I ■ I I I 

ATGAAGACAC CGTGGAAGAT TCTTCTGGGA CTGCTGGGTG CTGCTGCGCT TGTCACCATC 60 

ATCACCGTGC CCGTGGTTCT GCTGAACAAA GGCACAGATG ATGCTACAGC TGACAGTCGC 120 

AAAACTTACA CTCTAACTGA TTACTTAAAA AATACTTATA GACTGAAGTT ATACTCCTTA 180 

AGATGGATTT CAGATCATGA ATATCTCTAC AAACAAGAAA ATAATATCTT GGTATTCAAT 240 

GCTGAATATG GAAACAGCTC AGTTTTCTTG GAGAACAGTA CATTTGATGA GTTTGGACAT 300 

TCTATCAATG ATTATTCAAT ATCTCCTGAT GGGCAGTTTA TTCTCTTAGA ATACAACTAC 360 

GTGAAGCAAT GGAGGCATTC CTACACAGCT TCATATGACA TTTATGATTT AAATAAAAGG 420 

CAGCTGATTA CAGAAGAGAG GATTCCAAAC AACACACAGT GGGTCACATG GTCACCAGTG 480 

GGTCATAAAT TGGCATATGT TTGGAACAAT GACATTTATG TTAAAATTGA ACCAAATTTA 540 

CCAAGTTACA GAATCACATG GACGGGGAAA GAAGATATAA TATATAATGG AATAACTGAC 600 

TGGGTTTATG AAGAGGAAGT CTTCAGTGCC TACTCTGCTC TGTGGTGGTC TCCAAACGGC 660 

ACTTTTTTAG CATATGCCCA ATTTAAGGAC ACAGAAGTCC CACTTATTGA ATACTCCTTC 720 

TACTCTGATG AGTCACTGCA GTACCCAAAG ACTGTACGGG TTCCATATCC AAAGGCAGGA 780 

GCTGTGAATC CAACTGTAAA UrrClTTG ' JLT GTAAATACAG ACTCTCTCAG CTCAGTCACC 840 

AATGCAACTT CCATACAAAT CACTGCTCCT GCTTCTATGT TGATAGGGGA TCACTACTTG 900 

TGTGATGTGA CATGGGCAAC ACAAGAAAGA ATTTCTTTGC AGTGGCTCAG GAGGATTCAG 960 

AACTATTCGG TCATGGATAT TTGTGACTAT GATGAATCCA GTGGAAGATG GAACTGCTTA 1020 

GTGGCACGGC AACACATTGA AATGAGTACT ACTGGCTGGG TTGGAAGATT TAGGCCTTCA 1080 

GAACCTCATT TTACCCTTGA TGGTAATAGC TTCTACAAGA TCATCAGCAA TGAAGAAGGT 1140 

TACAGACACA TTTGCTATTT CCAAATAGAT AAAAAAGACT GCACATTTAT TACAAAAGGC 1200 

ACCTGGGAAG TCATCGGGAT AGAAGCTCTA ACCAGTGATT ATCTATACTA CATTAGTAAT 1260 

GAATATAAAG GAATGCCAGG AGGAAGGAAT CTTTATAAAA TCCAACTTAG TGACTATACA 1320 

AAAGTGACAT GCCTCAGTTG TGAGCTGAAT CCGGAAAGGT GTCAGTACTA TTCTGTGTCA 1380 

TTCAGTAAAG AGGCGAAGTA TTATCAGCTG AGATGTTCCG GTCCTGGTCT GCCCCTCTAT 1440 

ACTCTACACA GCAGCGTGAA TGATAAAGGG CTGAGAGTCC TGGAAGACAA TTCAGCTTTG 1500 

GATAAAATGC TGCAGAATGT CCAGATGCCC TCCAAAAAAC TGGACTTCAT TATTTTGAAT 1560 

GAAACAAAAT TTTGGTATCA GATGATCTTG CCTCCTCATT TTGATAAATC CAAGAAATAT 1620 

CCTCTACTAT TAGATGTGTA TGCAGGCCCA TGTAGTCAAA AAGCAGACAC TGTCTTCAGA 1680 

CTGAACTGGG CCACTTACCT TGCAAGCACA GAAAACATTA TAGTAGCTAG CTTTGATGGC 1740 

AGAGGAAGTG GTTACCAAGG AGATAAGATC ATGCATGCAA TCAACAGAAG ACTGGGAACA 1800 

TTTGAAGTTG AAGATCAAAT TGAAGCAGCC AGACAATTTT CAAAAATGGG ATTTGTGGAC 1860 

AACAAACGAA TTGCAATTTG GGGCTGGTCA TATGGAGGGT ACGTAACCTC AATGGTCCTG 1920 

GGATOGGGAA GTGGCGTGTT CAAGTGTGGA ATAGCOGTGG CGCCTGTATC COGGTGGGAG 1980 

TACTATGACT CAGTGTACAC AGAACGTTAC ATGGGTCTCC CAACTCCAGA AGACAACCTT 2040 

GACCATTACA GAAATTCAAC AGTCATGAGC AGAGCTGAAA ATTTTAAACA AGTTGAGTAC 2100 

CTCCTTATTC ATGGAACAGC AGATGATAAC GTTCACTTTC AGCAGTCAGC TCAGATCTCC 2160 

AAAGCCCTGG TCGATGTTGG AGTGGATTTC CAGGCAATGT GGTATACTGA TGAAGACCAT 2220 

GGAATAGCTA GCAGCACAGC ACACCAACAT ATATATACCC ACATGAGCCA CTTCATAAAA 2280 
CAATGTTTCT CTTTACCTTA G 

Seq ID NO: S90 Protein sequence 
Protein Accession #: NP_001926.1 

1 11 21 31 41 51 

I I I I I I 

MKTPWKILLG LLGAAALVTI ITVPWLLNK GTDDATADSR KTYTLTDYLK NTYRLKLYSL 60 
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RWISDHEYLY KQENNILVFN AEYGNSSVFL ENSTFDEFGH SINDYSISPD GQFILLEYNY 120 

VKQWRHSYTA SYDIYDLNKR QLITEERIPN NTQWVTWSPV GHKLAYVWNN DIYVKIEPNL. 180 

PSYRITWTGK EDIIYNGITD WVYEEEVPSA YSALWWSPNG TFLAYAQFND TEVPLIEYSF 240 

YSDESLQYPK TVRVPYPKAG AVNPTVKFFV VNTDSLSSVT NATS IQ I TAP ASMLIGDHYL 300 

5 CDVTWATQER ISLQWLRRIQ NYSVMDICDY DESSGRWNCL VARQHIEMST TGWVGRFRPS 360 

EPHFTLDGNS FYKIISNEEG YRHICYPQID KKDCTFITKG TWEVIGIEAL TSDYLYYISN 420 

EYKGMPGGRN LYKIQLSDYT KVTCLSCELN PERCQYYSVS FSKEAKYYQL RCSGPGLPLY 480 

TLHSSVNDKG LRVLEDNSAL DKMLQNVQMP SKKLDFIILN ETKFWYQMIL PPHFDKSKKY 540 

PLLLDVYAGP CSQKADTVFR LNWATYLAST ENIIVASFDG RGSGYQGDKI MHAINRRLGT 600 

10 FEVEDQIEAA RQFSKMGFVD NKRIAIWGWS YGQYVTSMVL GSGSGVFKCG IAVAPVSRWE 660 

YYDSVYTERY MGLPTPEDNL DHYRNSTVMS RAENFKQVEY LLIHGTADDN VHFQQSAQIS 720 
KALVDVGVDF QAMWYTDEDH GIASSTAHQH IYTHMSHFIK QCFSLP 



Seq ID NO: 591 DMA sequence 
15 Nucleic Acid Accession ft: NM_016077.1 
Coding sequence : 128 . - 667 

1 11 21 31 41 51 

20 TCGCTTTGTG ATTCTTGATC CGGAACTTTG TCACCCAGGA ACCCCGGAAG AGGTAGCTCA 60 

CGCGATAGAA ACGTGTTCGC TTGCCCAGAA GAAGGGAAGG CGCGAGTGAG GAAAGGAGGT 120 

ACTGTAGATG CCCTCCAAAT CCTTGGTTAT GGAATATTTG GCTCATCCCA GTACACTCGG 180 

CTTGGCTGTT GGAGTTGCTT GTGGCATGTG CCTGGGCTGG AGCCTTCGAG TATGCTTTGG 240 

GATGCTCCCC AAAAGCAAGA CGAGCAAGAC ACACACAGAT ACTGAAAGTG AAGCAAGCAT 300 

25 CTTGGGAGAC AGCGGGGAGT ACAAGATGAT TCTTGTGGTT CGAAATGACT TAAAGATGGG 360 

AAAAGGGAAA GTGGCTGCCC AGTGCTCTCA TGCTGCTGTT TCAGCCTACA AGCAGATTCA 420 

AAGAAGAAAT CCTGAAATGC TCAAACAATG GGAATACTGT GGCCAGCCCA AGGTGGTGGT 480 

CAAAGCTCCT GATGAAGAAA CCCTGATTGC ATTATTGGCC CATGCAAAAA TGCTGG QACT 540 

GACTGTAAGT TTAATTCAAG ATGCTGGACG TACTCAGATT GCACCAGGCT CTCAAACTGT 600 

30 CCTAGGGATT GGGCCAGGAC CAGCAGACCT AATTGACAAA GTCACTGGTC ACCTAAAACT 660 

TTACTAGGTG GACTTTGATA TGACAACAAC CCCTCCATCA CAAGTGTTTG AAGCCTGTCA 720 

GATTCTAACA ACAAAAGCTG AATTTCTTCA CCCAACTTAA ATGTTCTTGA GATGAAAATA 780 
AAACCTATTC CCATGTTCTA AAAAAA 

35 Seq ID NO: 592 Protein sequence 
Protein Accession ft: NP_0S7161.1 

1 11 21 31 41 51 

M(\ 1 I I I I I 

40 MPSKSLVMEY LAHPSTLGLA VGVACGMCLG WSLRVCFGML PKSKTSKTHT DTESEASILG 60 
DSGEYKMILV VRNDLKMGKG RVAAQCSHAA VSAYKQIQRR NPEMLKQWEY CGQPKVWKA 120 
PDEETLIALL AHAKMLGLTV SLIQDAGRTQ IAPGSQTVLG IGPGPADLID KVTGHLKLY 



Seq ID NO: 593 DNA sequence 
45 Nucleic Acid Accession ft: FGENESH predicted 
Coding sequence: 1 . .1896 

1 11 21 31 41 51 

50 ATGCGCGCCG TGCCGCTGCC CGCCCCGCTC CTGCCGCTGC TGCTGCTCGC GCTCCTGGCC 60 

GCTCCCGCCG CCCGCGCCAG CAGAGCCGAG TCCGTCTCCG CGCCGTGGCC CGAACCCGAG 120 

CGCGAGTCGC GGCCACCGCC CGGCCCGGGG CCCGGGAACA CCACCCGGTT TGGGTCTGGG 180 

GCGGCGGGCG GCAGCGGCAG CTCCAGCTCC AACAGCAGTG GCGACGCCTT GGTGACCCGC 240 

ATTTCCATCC TCCTCCGCGA CCTACCCACC CTCAAGGCAG CCGTGATCGT GGCGTTCGCC 300 

55 TTTACCACCC TCCTCATCGC CTGCCTGCTG CTGCGCGTCT TCAGGTCGGG AAAGAGGTTA 360 

AAGAAGACAC GCAAGTATGA TATCATCACC ACTCCAGCAG AGCGAGTGGA AATGGCGCCA 420 

CTAAATGAAG AGGATGATGA AGATGAGGAC TCCACAGTAT TCGACATCAA ATACAGAGTG 480 

TCCTTGCCGG CTGCACTGAG ACGTCAGCTG CCAGGGTGCC AGACGCTACT GACAGTTCCT 540 

GTGCCCCCAC CCTTCATCCT CGACATTGAC CTTCCAGCAA GATGCAGTGG AAGGCCTGAT 600 

60 GGTGGAATCA GACCTGGTAA AACCTGTTTC CCAGCCTGGT GGCATCCTGT GGAAAGTTGG 660 

TCAGCTGCAA CCTGGGGTGT GAAGGACTGG ACCTGGAAGC CCTCTTGCGT CGGAGGTGTT 720 

GAAACCAAAA CGAACGTTAT GTATAAAACC CCAGCTCCAT CGTGOGTGTC AGGCATCTGC 780 

TCAGACTGTC ACTGGCAAGC TCGTTTCCAC GTCACCACAA TGGAGTTGCT TCTGCCACCC 840 

TTTGGGCATC CCTTTAAAGT GCCCCCTACT TCTACTCCCC ATGGTTTTCG ACAACTGCAG 900 

65 CTGAATCTCA TGGAAAAGCT GGATTCCTCT GCCTTACGCA GAAACACCCG GGCTCCATCT 960 

GCCAGGTGCT TGCCACTGGT CCTGGCAGAA ATGGCGGCTG CTGAAAGTGA CCTTCCAAAT 1020 

CCTTGGTGGC ACTTCAGCGC CACAGGCTCT CCAATAAAAA CCCTTTACAC ACAAACCATG 1080 

AGTACCTTGG GCTTGGATGT TTTCTGTGGT GCCGGCCAGC GGGGCACCTT TTGTGAAGAC 1140 

_ A AGAGCAGTGA CTAAGGTTCT CCAGGGTAGC TCTTTCTCCA AACAGCTGCG CTGGAAGCCA 1200 

70 GCCCTAGAGA GTGGGTTTCC CCATCATCTC AGGCTTCTCA GAGAGTGTCC TCCGCTGAGC 1260 

ACCCATCCTG TCAGGTTGGC TCGTTCAGAT GCCCGGGGAC AAGCCAGCCT GACGGGGAGG 1320 

AGGGTGTTTC GGCGTCCGCG GCAGTCTCTG CATGGCGGAG GGTCAGCGGG TACCGCAACT 1380 

TGCCTTTTGG TTTTGAAGAT TCTGTTGAGG CGCCATCCTC ACCTTGACCT CTTCTACAAA 1440 

ATCTGTCTOC CCTGCTGTGC OGTGGAACAC CTACGGGAAG CCAAGAGAAG CTCAGTGACT 1500 

75 GTCCTTGCGT CATTTGAGCA GAGCCCACAA AAGGCAGCTG CTGCCCACGG GGAGCCTGTC 1560 

AAACGAGGGC CCAGTGGGCA ATTGACCAGA CACACATGCC CTGGCTGGGG GATCACACAT 1620 

GCGAACCTGC AGACAATTCC AGATACCCAA GGCCAGGAAG GCCCACGTGA GGATGTCACT 1660 

CACCCTGGAG GAGACTTGGA TGGGGTGGCA AATTTCTATT TGGAGGAAGA GGGTTTCCAG 1740 

_ _ GATGGCAGAT GCCAGAAGAT GGTCCTGATG TCTGAGGAAG GGCCACCTAG TTTGACAGGA 1800 

80 TGTGAGAGGC TCACAGGTTC CCATCACTTC TCCAGCCATT CCAAGTCTTG GTCCTTCCTT 1860 
TCCCCCCGAC AGCCCCTGTT TCTGTCCAGG CCCTGA 



Seq ID NO: 594 Protein sequence 
Protein Accession ft: FGENESH predicted 
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1 11 21 31 41 51 

I I I I I I 

MRAVPLPAPL LPLLLLALLA APAARASRAE SVSAPWPEPE RESRPPPGPG PGNTTRPGSG 60 

AAGGSGSSSS NSSGDALVTR ISILLRDLPT LKAAVIVAFA FTTLLIACLL LRVFRSGKRL 120 

KKTRKYDIIT TPAERVEMAP LNEEDDEDED STVFDIKYRV SLPAALRRQL PGCQTLLTVP 180 

VPPPFILDID LPARCSGRPD GGIRPGKTCF PAWWHPVESW SAATWGVKDW TWKPSCVGGV 240 

ETKTMVMYKT PAPSCVSGIC SDCHWQARFH VTTMELLLPP FGHPFKVPPT STPHGFRQLQ 300 

LNLMEKLDSS ALRRNTRAPS ARCLPLVLAE MAAAESDLPN PWWHFSATGS PIKTLYTQTM 360 

STLGLDVFCG AGQRGTFCED RAVTKVLQGS SFSKQLRWKP ALESGFPHHL RLLRECPPIiS 420 

THPVRLARSD ARGQASLTGR RVFRRPRQSL HGGGSAGTAT CLLVLKILLR RHPHLDLFYK 480 

ICLPCCAVEH LREAKRSSVT VLASFEQSPQ KAAAAHGEPV KRGPSGQLTR HTCPGWGITH 540 

ANLQTIPDTQ GQEGPREDVT HPGGDLDGVA NFYLEEEGFQ DGRCQKMVLM SEEGPPSLTG 600 
CERLTGSHHF SSHSKSWSFL SPRQPLFLSR P 

Seq ID NO: 595 DMA sequence 

Nucleic Acid Accession ft: NM_021614.1 

Coding sequence: 1..1740 

1 11 21 31 41 51 

I I I I I I 

ATGAGCAGCT GCAGGTACAA CGGGGGCGTC ATGCGGCCGC TCAGCAACTT GAGCGCGTCC 60 

CGCCGGAACC TGCACQAGAT GGACTCAGAG GCGCAGCCCC TGCAGCCCCC CGCGTCTGTC 120 

GGAGGAGGTG GCGGCGCGTC CTCCCCGTCT GCAGCCGCTG CCGCCGCCGC CGCTGTTTCG 180 

TCCTCAGCCC CCGAGATCGT GGTGTCTAAG CCCGAGCACA ACAACTCCAA CAACCTGGCG 240 

CTCTATGGAA CCGGCGGCGG AGGCAGCACT GGAGGAGGCG GCGGCGGTGG CGGGAGCGGG 300 

CACGGCAGCA GCAGTGGCAC CAAGTCCAGC AAAAAGAAAA ACCAGAACAT CGGCTACAAG 360 

CTGGGCCACC GGCGCGCCCT GTTCGAAAAG CGCAAGCGGC TCAGCGACTA CGCGCTCATC 420 

TTCGGCATGT TCGGCATCGT GGTCATGGTC ATCGAGACCG AGCTGTCGTG GGGCGCCTAC 480 

GACAAGGCGT CGCTGTATTC CTTAGCTCTG AAATGCCTTA TCAGTCTCTC CACGATCATC 540 

CTGCTCGGTC TGATCATCGT GTACCACGCC AGGGAAATAC AGTTGTTCAT GGTGGACAAT 600 

GGAGCAGATG ACTGGAGAAT AGCCATGACT TATGAGCGTA TTTTCTTCAT CTGCTTGGAA 660 

ATACTGGTGT GTGCTATTCA TCCCATACCT GGGAATTATA CATTCACATG GACGGCCCGG 720 

CTTGCCTTCT CCTATGCCCC ATCCACAACC ACCGCTGATG TGGATATTAT TTTATCTATA 780 

CCAATGTTCT TAAGACTCTA TCTGATTGCC AGAGTCATGC TTTTACATAG CAAACTTTTC 840 

ACTGATGCCT CCTCTAGAAG CATTGGAGCA CTTAATAAGA TAAACTTCAA TACACGTTTT 900 

GTTATGAAGA CTTTAATGAC TATATGCCCA GGAACTGTAC TCTTGGTTTT TAGTATCTCA 960 

TTATGGATAA TTGCCGCATG GACTGTCCGA GCTTGTGAAA GGTACCATGA TCAACAGGAT 1020 

GTTACTAGCA ACTTCCTTGG AGCGATGTGG TTGATATCAA TAACTTTTCT CTCCATTGGT 1080 

TATGGTGACA TGGTACCTAA CACATACTGT GGAAAAGGAG TCTGCTTACT TACTGGAATT 1140 

ATGGGTGCTG GTTGCACAGC CCTGGTGGTA GCTGTAGTGG CAAGGAAGCT AGAACTTACC 1200 

AAAGCAGAAA AACACGTGCA CAATTTCATG ATGGATACTC AGCTGACTAA AAGAGTAAAA 1260 

AATGCAGCTG CCAATGTACT CAGGGAAACA TGGCTAATTT ACAAAAATAC AAAGCTAGTG 1320 

AAAAAGATAG ATCATGCAAA AGTAAGAAAA CATCAACGAA AATTCCTGCA AGCTATTCAT 1380 

CAATTAAGAA GTGTAAAAAT GGAGCAGAGG AAACTGAATG ACCAAGCAAA CACTTTGGTG 1440 

GACTTGGCAA AGACCCAGAA CATCATGTAT GATATGATTT CTGACTTAAA CGAAAGGAGT 1500 

GAAGACTTCG AGAAGAGGAT TGTTACCCTG GAAACAAAAC TAGAGACTTT GATTGGTAGC 1560 

ATCCACGCCC TCCCTGGGCT CATAAGCCAG ACCATCAGGC AGCAGCAGAG AGATTTCATT 1620 

GAGGCTCAGA TGGAGAGCTA CX3ACAAGCAC GTCACTTACA ATGCTGAGCG GTCCCGGTCC 1680 
TCGTCCAGGA GGCGGCGGTC CTCTTCCACA GCACCACCAA CTTCATCAGA GAGTAGCTAG 

Seq ID NO: 596 Protein sequence 
Protein Accession #: NP_067627.1 

1 11 21 31 41 51 

I I I 1 I I 

MSSCRYNGGV MRPLSNLSAS RRNLHEMDSE AQPLQPPASV GGGGGASSPS AAAAAAAAVS 60 

SSAPEIWSK PEHNN8NNLA LYGTGGGGST GGGGGGGGSG HGSSSGTKSS KKKNQNIGYK 120 

LGHRRALFEK RKRLSDYALI FGMFGIWMV IETELSWGAY DKASLYSLAL KCLISLSTII 180 

LLGLIIVYHA REIQLFMVDN GADDWRIAMT YERIFFICLE ILVCAIHPIP GNYTFTWTAR 240 

LAFSYAPSTT TAD VD II LSI PMFLRLYLIA RVMLLHSKLP TDASSRSIGA LNKINFNTRF 300 

VMKTLMTICP GTVLLVFSIS LWIIAAWTVR ACERYHDQQD VTSNFLGAKW LISITFLSIG 360 

YGDMVPNTYC GKGVCLLTGI MGAGCTALW AWARKLELT KAEKHVHNFM MDTQLTKRVK 420 

NAAANVLRET WLIYKNTKLV KKIDHAKVRK HQRKFLQAIH QLRSVKMEQR KLNDQANTLV 480 

DLAKTQNIMY DMISDLNERS EDFEXRIVTL ETKLETLIGS IHAUPGLISQ TIRQQQRDFI 540 
EAQMESYDKH VTYNAERSRS SSRRRRSSST APPTSSESS 



Seq ID NO: 597 DNA sequence 
Nucleic Acid Accession #: NM_016029.1 
/U Coding sequence: 228.. 1097 



1 11 21 31 41 51 

I I I I I I 

CTGOGATCCC GCAGGGCAGC GACGCGACTC TGGTGCGGGC CGTCTTCTTC CCCCCGAGCT 60 

GGGCGTGCGC GGCCGCAATG AACTGGGAGC TGCTGCTGTG GCTGCTGGTG CTGTGCGCGC 120 

TGCTCCTGCT CTTGGTGCAG CTGCTGCGCT TCCTGAGGGC TGACGGCGAC CTGACGCTAC 180 

TATGGGCCGA GTGGCAGGGA CGACGCCCAO AATGGGAGCT GACTGATATG GTGGTGTGGG 240 

TGACTGGAGC CTCGAGTGGA ATTGGTGAGG AGCTGGCTTA CCAGTTGTCT AAACTAGGAG 300 

TTTCTCTTGT GCTGTCAGCC AGAAGAGTGC ATGAGCTGGA AAGGGTGAAA AGAAGATGCC 360 

TAGAGAATGG CAATTTAAAA GAAAAAGATA TACTTGTTTT GCCCCTTGAC CTGACCGACA 420 

CTGGTTCCCA TGAAGCGGCT ACCAAAGCTG TTCTCCAGGA GTTTGGTAGA ATCGACATTC 480 

TGGTCAACAA TGGTGGAATG TCCCAGCGTT CTCTGTGCAT GGATACCAGC TTGGATGTCT 540 

ACAGAAAGCT AATAGAGCTT AACTACTTAG GGACGGTGTC CTTGACAAAA TGTGTTCTGC 600 

CTCACATGAT CGAGAGGAAG CAAGGAAAGA TTGTTACTGT GAATAGCATC CTGGGTATCA 660 
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TATCTGTACC TCTTTCCATT GGATACTGTG CTAGCAAGCA TGCTCTCCGG GGTTTTTTTA 720 

ATGGCCTTOG AACAGAACTT GCCACATACC CAGGTATAAT AGTTTCTAAC ATTTGCCCAG 780 

GACCTGTGCA ATCAAATAtT GTGGAGAATT CCCTAGCTGG AGAAGTCACA AAGACTATAG 840 

GCAATAATGG AGACCAGTCC CACAAGATGA CAACCAGTCG TTGTGTGCGG CTGATGTTAA 900 

TCAGCATGGC CAATGATTTG AAAGAAGTTT GGATCTCAGA ACAACCTTTC TTGTTAGTAA 960 

CATATTTGTG GCAATACATG CCAACCTGGG CCTGGTGGAT AACCAACAAG ATGGGGAAGA 1020 

AAAGGATTGA GAACTTTAAG AGTGGTGTGG ATGCAGACTC TTCTTATTTT AAAATCTTTA 1080 

AGACAAAACA TGACTGAAAA GAGCACCTGT ACTTTTCAAG CCACTGGAGG GAGAAATGGA 1140 

AAACATGAAA ACAGCAATCT TCTTATGCTT CTGAATAATC AAAGACTAAT TTGTGATTTT 1200 

ACTTTTTAAT AGATATGACT TTGCTTCCAA CATGGAATGA AATAAAAAAT AAATAATAAA 1260 
AGATTGCCAT GAATCTTGCA AA 

Seq ID NO: 598 Protein sequence 
Protein Accession fl: NP_0S7113.1 

1 11 21 31 41 51 

I ) II I I 

MNWELLLWLL VLCALLLLLV QLLRFLRADG DLTLLWAEWQ GRRPEWELTD MWWVTGASS 60 
GIGEELAYQL SKLGVSLVLS ARRVHELERV KRRCLENGNL KEKDILVLPL DLTDTGSHEA 120 
ATKAVLQEFG RIDIliVNNGG MSQRSLCMDT SLDVYRKLIE LNYLGTVSLT KCVLPHMIER 180 
KQGKIVTVNS ILGIISVPLS IGYCASKHAL RGFFNGLRTE LATYPGIIVS NICPGPVQSN 240 
IVENSLAGEV TKTIGNNGDQ SHKMTTSRCV RLMLISMAND LKEVWISEQP FLLVTYLWQY 300 
MPTWAWWITN KMGKKRIENF KSGVDADSSY FKIFKTKHD 

Seq ID NO: 599 DNA sequence 

Nucleic Acid Accession ft: NMJJ00793.2 

Coding sequence: 401.. 1222 



1 11 21 31 41 51 

30 | | | | | | 

GCCTGCAGAG AGAGGCACTT TGCACCACAG ACAGATAGCA AGAAGGGAAA GACAGAGAGT 60 

GAGAAAAAAG AGGAGTCAGT CGCTCCTGGG GAAGGGAGAG AGTGAGACTG GGAGAAAGAG 120 

AAGCACAGAA AGTGTGTGTA AAACGGAGTA AAGAAAGAAA AAAAAAAAAC TACCCTTAAA ■ 180 

GCACATTTAA AAAAAAAAAA CTCTGGCAAT TCAAGAAAGA AACAGGCTAC GTTTAAAGAG 240 

CATAGAGACA ATGAAAGGCT AAAGAAAATT TTAAAATCTC TGCCACAGTC TCATAGGTGC 300 

TTGGAAATGA AAGTAGAACT GCCTGTCTTT AACGGACTCT GACAGAGAGG GTGAAGGGGA 360 

ACCAGAGOGC ACAAGGGAAC TGACTCAGGA GGCAGAGAAG ATGGGCATCC TCAGCGTAGA 420 

CTTGCTGATC ACACTGCAAA TTCTGCCAGT TTTTTTCTCC AACTGCCTCT TCCTGGCTCT 480 

CTATGACTCG GTCATTCTGC TCAAGCACGT GGTGCTGCTG TTGAGCCGCT CCAAGTCCAC 540 

TCGCGGAGAG TGGCGGCGCA TGCTGACCTC AGAGGGACTG CGCTGCGTCT GGAAGAGCTT 600 

CCTCCTCGAT GCCTACAAAC AGGTGAAATT GGGTGAGGAT GCCCCCAATT CCAGTGTGGT 660 

GCATGTCTCC AGTACAGAAG GAGGTGACAA CAGTGGCAAT GGTACCCAGG AGAAGATAGC 720 

TGAGGGAGCC ACATGCCACC TTCTTGACTT TGCCAGCCCT GAGCGCCCAC TAGTGGTCAA 780 

CTTTGGCTCA GCCACTTGAC CTCCTTTCAC GAGCCAGCTG CCAGCCTTCC GCAAACTGGT 840 

GGAAGAGTTC TCCTCAGTGG CTGACTTCCT GCTGGTCTAC ATTGATGAGG CTCATCCATC 900 

AGATGGCTGG GCGATACCGG GGGACTCCTC T T TGTCTTTT GAGGTGAAGA AGCACCAGAA 960 

CCAGGAAGAT CGATGTGCAG CAGCCCAGCA GCTTCTGGAG CGTTTCTCCT TGCCGCCCCA 1020 

GTGCCGAGTT GTGGCTGACC GCATGGACAA TAACGCCAAC ATAGCTTACG GGGTAGCCTT 1080 

TGAACGTGTG TGCATTGTGC AGAGACAGAA AATTGCTTAT CTGGGAGGAA AGGGCCCCTT 1140 

CTCCTACAAC CTTCAAGAAG TCCGGCATTG GCTGGAGAAG AATTTCAGCA AGAGATGAAA 1200 

GAAAACTAGA TTAGCTGGTT AAAGGTATGA TTATAAGAGA GCTTATTGTT TTAAAAAGTT 1260 

ATATAAAGGC AAGGAAATTA AGAACTGAAT CCATATTTCA ACAGAGCCCT ATTGGCTTAC 1320 

TGAAAGACAG GAGTTTATCT ATCGGAAGAA CATGAATCTC TAACAGCTCC ATACTTCTTT 1380 

CACTACTCAA ATGGCATTGG GCTGAGTAAG TAACCATATC ACCTCTCTTC TTAGTAAAAA 1440 

GCCCTATGTG AAAAGATCCC AAGATGGAGA GGAAGAAACG CTAATTCAGC ATGTGTTCAT 1500 

TCTGCATTGA GAAGGAACTG ATACATCTGA TGCATGCTTT GAGACCAGAA GAAAAGACTT 1560 

ACCTGAATAA TTACTACATT AGGGAAGCTA CTGTCTACGT TAAGATAAAG GGTATTGCCT 1620 

TGGCTCTATT TGGCATGGAT GGAGCCCAGT TGGAAAATTC CCAAATATTA CAACAAGTCC 1680 

TTGAACCCAG GCCATGTGGT TAGACGTTGG TGTTAAGGTT AGACCTTATG TTAGAGTCAT 1740 

TTCTGATGTT CCAGCTTCTA GCCATGTAGT GCTCTCAGTC TTCATACCCC AGAAATTATT 1800 

GGTATATTTG TAGATACCGA GAATGATCCC TCAGTCTGAG AGGTTAGAAT GATCATCTGT 1860 

AATCTGAGGG TTAATTTCTA GGCAGGTGGA GAGAGTGGTA AAAAAGAAAT GAAATTGACA 1920 

AGCTAGGAAA GAGGAGGCAG AAAGATTTGG AAAATTCACA GAGTTTCACC CTTAAGCTGT 1980 

AGAGAGTGGG TCACATTTGT TAGCCACGGA AACATAGAAA CATACACAAG GCCAGAAAAA 2040 

GAAGAAGGAG CTCAACTAAA AGTGGCATAG AGAATACACA TATAAAAACA ATATATTTGT 2100 

CATATGCTCC TAGAGAGGAG AAAGGGGTGA TTGAAAGAAA AAAAAATACT TAAATATTTG 2160 

TAATTGTGAG GGGTTTCTTT TGGAAATAAT TACTTTTGAA CCATGTATGT GGTATGTATA 2220 

TTTTCAGTGG GTTAATTATA CCCCATGATA CCTATTAAAG GAAAACCAGT GGGTCTGGTG 2280 

GTGCTGGTCT TTTCCTCCCC ATTCCTACAA TTTCTATGTG GCCCAAGTCA TTCCTAATCT 2340 

TGGTCTCTAT AGCAGTGTTC TCTCTGAATG CTGAGCTGAA GAAATTATAC GTACATACAC 2400 

ACATACATAC ATACATACAA ATATATGTAT ATATATTCTC AGCTGCTGCG GGAGGTAGGT 2460 

ACCATGGCCA TTCAGCACAG CCTTGATTTC CTCCCAAAGT AGGTGAGCTA TAGTGAAGAA 2520 

TAGGTGCAAA CAAACAAGCT TACTTCCATT GCAAAATAGA AGAAGAGGAA GTTAGAGATA 2580 

ATTCTGATCA ATCATTTTGG AGGCTTTGTT ATAAGGCAAC CCCCGGTATA TCATGGAATT 2640 

TCCATTGACA TTTGAATTTG GACTTGGATC TTCCCTTGGT GCCATTAGCT GAGGTTTAGT 2700 

AATCTAAAGT CCCTATAGTA TATGATTATA ATGCTATTTT AAAAAATATA TATATAAAAT 2760 

ATTTTTTTCT TTTTAAAATA GACACTATAG TTTTACCCAT AAGTAATATT TAAAGATTAT 2820 

AGCTCCCAAA AGAATGGACC AACCACTTTC GTATCATAAT TTCTTTTTGG TAAATATGAG 2880 

ACTATTATGA AATCATAGTA TATGATTGTA TTTAAAGGTA CAATCAAAGG ATCTTTTGTC 2940 

CATTCCATTA ATAACTGAAT AAAAAATAAA TAAAATGGAT AGAAAAAAAC TAAAGTTGAA 3000 

AATACATTCT TAAACTAGTT GTCTGAAATG AGAAAAGAGT GAGAACTAGG TGTGCAAGAA 3060 

CCAAACGTAT TTTATTTTAT TTTTTAAATG GGAGCAACAT ATCAGTCGTG TCACCAGCTG 3120 

GTATATTGTG TAAATATTAA AGCTCCATTG GGACTGATTT TTCATGGCAA CATCAGCTTT 3180 

CTAATGTTCT AAATTCTATA AAAACCACCC ACAAAGAAAC AAAGCAAATT TCATTATCTA 3240 
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ATGAGTTQCT GGAAAATCAT ATTGAGAATA ATTATTTCAG ATTCCTCAGT TGTTAACTTC 3300 
TACATTCAAG GGCTTATCTC TGCCCCCATT GATTTTTAAC CTCAAAATGG TGTGAGATTT 3360 
ACTGTGGAAC CCTAAAGCAG TAAAATAAAA AACCTGGTTG CAGCACATTC ACACTGTTGT 3420 
CCTTAAAATT CCCCTTTTTT CTCTATGTAC GATAAAGTAA CAGTATGTCA GATAAGCCGG 3480 
TGGGGGGATG AGATTAGGCT GAGGCAGTGC TAGTCAACTG GGGGAAAAGG ATGATGGAAA 3S40 
AATCACCCAG TTGTGCTATA TTTTTAAAGA AGGAGGTCGT TTATGTGTGC AGACAATTCT 3600 
CCCTGAGGTT AGCCCAATGG AGAAATGAAG CAGAGGAAGG AAACATAGAA AGACATGGGC 3660 
TATCAGGGAG GAAGATGTTC AATAGAACAT GCAAGAATTT CTGGAAGAAA GGCTGTGGAA 3720 
GGGCCAATGG AGAAAATGAA TGGACAAAGC TCAGGAATCC CTACGCTATG TAGAATGTTC 3780 
TTGGTGTTAT CAGGGTTAAG CCCTCTAATT ATGTAACCTA TTTATCGCAA CATGAATTTT 3840 
TATGATTTCT TGTGATGTAT TCTTTTATGA AATTAACAAG AACTCATTAT TTTGAGGTAG 3900 
AGGAAAATCA ATGCTTTATC TGATATGCTG AGAAATTATT AGATTGCCAA TACTCATGTG 3960 
CGTTTCATGT GTTTTATAAG GTTTGTTCCT TTGAAGAATT GTAGTTCTTA GTCCCACAGG 4020 
GAAATGTGTA TCTATTTATA TATCATAGTA TAAATCTATG ATATATTTAT ATCATATATA 4080 
AAAGTCTGAG TTCTCTTTCT TAGTCCCTAA TCATGTTTCT CCCATAGGCT GTGTTTACAT 4140 
GGAGCTATCG GTTTAGCCTT TTAAGCTTCA TTAGCTTGTC TATTATTGAA ATAGTTTCCA 4200 
AGAAATTTTA GATATTATCA TAACATCTGG GTCTACTCAA ACACTTATTG TTTGAAAGAC 4260 
TTATGTCTTG GACCTATCAA AAACTGACTT TATTTATTGC TTAGTGAAAA TACTAGTGGG 4320 
ATCAACAATG ATTTTCTTGA ATGGGGATGA ATGGAGATGC CCGCACAGTA ATGTAGAAAT 4380 
GTTTCATACA GCTATTAAAA TGTAACTGAC CTCCTTAGAG GCAGATTAGT AACTGTTCCT 4440 
ACTTTGTATA GCTAAGTGAC AGTCACTTAA CTTACATGAC TTTCTTTTTT CACATTGGGT 4500 
CTCTGGTCCT GTGTCTTCAC CTCATTTATA GCAOGTCTCC TTGATTTTTG GTAGTATCAA 4560 
CTTCCCAGTG ATCTGTTCAG TTAAGTTCTT CTCCCGTTAA CCAGGAAGTG CTTATTCTCT 4620 
CATCACAGTG GGAAGAATAG CCTATTGTCT TTCATTTTGC CTGAGTGTAT TTTACTATTT 4680 
GGGCTCTGAA ATAAAAATTA TGAAATATGG TGAGGTCACA TGTTGGTGCT GCCTTGCTGC 4740 
ATAAAATTCT AGGAGGGCAG GTTAGGAGAC AGTTATGTAT GGCCTTTCGG GAAAATTCAA 4800 
AGGGTGGGAT TACAAGGGTG TTCCTCAGGC ATGCCCCTAT GGGCCCTATG TGGAAGCAAG 4860 
AAGAATTGAC TGATTTACAG GACTTCTCTT TATGTCAATC TTAAGAGGAT GGATGAATCT 4920 
GGACATTTGT TCCACCCGAC CTCTGACTGA TGGTTTGGAA AATAACTTTA ATTAGGATCA 4980 
TATGACCATT GAAAAAGGAA AAATGTAGAC TCTGACTTCC GTCCCACTGA AGGATTAATG 5040 
AAAACCTTTA CTAGCATTTA GAGCTTTTCA GAACATCCCC ACTGTCATGT GTCTCAGCAG 5100 
TGGAGACTGC AAGTAAGGCT TTTAATTTTA GGAGGTTTTT TTTTTTTTTT TTTTTTCCCC 5160 
TAAATGGTAT GGCCAAAAGT CAGAGTTAAA ATATATATAG TTAGATTCCA ACTTCCTCCT 5220 
TCACTCTAAA AATAGAATCC AAACCCACTC TTCATATATG CTTCCAGAAT GGGGCTTAAG 5280 
TACCAATCTC TGCTTTGCAA TGGGCACAAT CTTGGTCATG TCCTGAGGCT CTCTAAGAAA 5340 
AGAGAGGATC TAGGATGGGA GAGCTAGAAA GTTGCTAACT GGGAAGAACA AGGCCCTGAG 5400 
GGGTTGGTCT ACCAATCTGG GAAGATTTGA AAACAAACTT CTCGCAACTG AAGGAAGGCT 5460 
GAAGGCTGCT GCAAGTCATT GAGTGACTTT AGGATGAGCA AAACATTGGG CCACTTCCTA 5520 
ATGCCCTATG TGTATAGTAC CAGAAGCAAG GTCTCAGACT TAACAGACCC AGCTCTGTTC 5580 
CAAGGTGAGT CTGAACCAAT AGAAAGCAAA CATGTGCAGA TATCCAAACA AGACTGCTCA 5640 
TGCAAGTCGG GGCTGGCTAC CCGTCTTAGG CAGCAACAGC AGAGCTCCAG GGAGCTTATT 5700 
CAATATTTAC TGAGACTTCG AAGACCCAGC AGATGTTTAA TGAAGTCACT ATTTTGGCTC 5760 
AAACCCTCCA CTTCTCCCCC TCCCCTCAAA AAGCCAACAG GTAAACACAT AAATGAAAGA 5820 
AACCCACAGA AGGGGATGGG AAATAAAGAA AA7TCTCTCA AGACTTCTCC AGGCCCATGT 5880 
CACTGGTCAG CGTGGTTTTT ATGTGTATTA GGATTGGGGG ATGTGAAGAA ATAAGTATCC 5940 
AGTACTTTAT AACCAAAGCA ATTAAATGAT ATTGGGGTAG GGAATGTTGG CCAGTTTTGT 6000 
TTAGTTTTGC CATCACATTQ TCACCCAGAC CTCACCTAGC CCCAAGTAAT CGGGCGCCCC 6060 
GAAGAGGGAG ACAGAGATGT GCCAGAGTTG ACCCAGTGTG CGGATGATAA CTACTGACGA 6120 
AAGAGTCATC GACCTCAGTT AGTGGTTGGA TGTAGTCACA TTAGTTTGCC TCTCCCCATC 6180 
TTTGTCTCCC TGGCAAGGAG AATATGCGGG ACATGATGCT AAGAGCCCTG GGTAAATGTG 6240 
GTGAGAATGC ACGCGTGCAT ATGCTACACA TATGTGCTTC TCAGTTGCAG AAAATGAACT 6300 
GCTTTGGGAG ATTATCAGTA GAAAGAGTGT TATCATATTG GTGCTGAGTG CTATGTGTGC 6360 
TTATACAATT TGTTCTTGTA TTTTAATAAA CTTTGAATAA AAGAATAAAA AAAAAAAAAA 6420 
AAAAAAAAA 

Seq ID NO: 600 Protein sequence 
Protein Accession #: NP_000784.2 

1 11 21 31 41 51 

1 I I I I I 

MGILSVDLLI TLQILPVFFS NCLPLALYDS VILLKHWLL LSRSKSTRGE WRRMLTSEGL 
RCVWKSFLLD AYKQVKLGED APNSSWHVS STEGGDNSGN GTQEKIAEGA TCHLLDPASP 
ERPLWNPGS ATUPPPTSQL PAFRKLVEEF SSVADPLLVY IDEAHPSDGW AIPGDSSLSF 
EVKKHQNQED RCAAAQQLLE RFSLPPQCRV VADRMDNNAN IAYGVAFERV CXVQRQKIAY 
LGGKGPFSYN LQEVRBWLEK NFSKRUKKTR LAG 

Seq ID NO: 601 DMA sequence 
Nucleic Acid Accession 8: NM_0OS233.1 
Coding sequence: 101.. 3052 



60 
120 
180 
240 



1 
I 

CCATGGATGG 
ATGGAGATAT 
CCTCCTCCTT 
CAATGAAGTC 
TCCATCACAT 
TTACCAGGTG 
CCCCAGGAAC 
TAGCATTCCA 
TGATGATGAT 
AGCTGATGAA 
GATTAGAGAA 
TGCTTGTGTT 



11 
I 

TAACTTCTCC 
GCTCCTCTCA 
CTCAGCTGCT 
AATCTACTGG 
GGGTGGGAAG 
TGCAATGTCA 
TCAGCTCAGA 
TTGGTTTTAG 
CATGGGGTGA 
AGTTTCACTC 
GTAGGTCCTG 
GCCTTGGTGT 



21 

1 

AGCAATCAGA 
CTGCCCTCTG 
CTGTTCTCGA 
ATTCAAAAAC 
AGATCAGTGG 
TGGACCACAG 
AGATTTATGT 
GAACTTGCAA 
AATTTCGAGA 
AAATGGATCT 
TCAACAAGAA 
CTGTGAGAGT 



31 
I 

GCGCTCCCCC 
CACCAGCAAC 
CAGCTTCGGG 
AATTCAAGGG 
TGTGGATGAA 
TCAAAACAAT 
GGAGCTCAAG 
GGAGACATTC 
GCATCAGTTT 
TGGGGACCGT 
GGGATTTTAT 
ATACTTCAAA 



41 

I 

TCACATCAGT 
ATGGATTGTC 
GAACTGATTC 
GAGCTGGGCT 
CATTACACAC 
TGGCTGAGAA 
TTCACTCTAC 
AACCTGTACT 
ACAAAGATTG 
ATTCTGAAGC 
TTGGCATTTC 
AAGTGCCCAT 



51 
I 

GGCATGCTTC 
AGCTCTCCAT 
CGCAGCCTTC 
GGATCTCTTA 
CCATCAGGAC 
CAAACTGGGT 
GAGACTGCAA 
ACATGGAGTC 
ACACCATTGC 
TCAACACTGA 
AAGATGTTGG 
TTACAGTGAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



1182 



WO 03/042661 



PCT7US02/36810 



5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



GAATCTGGCT ATGTTTCCAG ACACGGTACC CATGGACTCC CAGTCCCTGG TGGAGGTTAG 780 

AGGGTCTTGT GTCAACAATT CTAAGGAGGA AGATCCTCCA AGGATGTACT GCAGTACAGA 840 

AGGCGAATGG CTTGTACCCA TTGGCAAGTG TTCCTGCAAT GCTGGCTATG AAGAAAGAGG 900 

TTTTATGTGC CAAGCTTGTC GACCAGGTTT CTACAAGGCA TTGGATGGTA ATATGAAGTG 960 

TGCTAAGTGC CCGCCTCACA GTTCTACTCA GGAAGATGOT TCAATGAACT GCAGGTGTGA 1020 

GAATAATTAC TTCCGGGCAG ACAAAGACCC TCCATCCATG GCTTGTACCC GACCTCCATC 1080 

TTCACCAAGA AATGTTATCT CTAATATAAA CGAGACCTCA GTTATCCTGG ACTGGAGTTG 1140 

GCCCCTGGAC ACAGGAGGCC GGAAAGATGT TACCTTCAAC ATCATATGTA AAAAATGTGG 1200 

GTGGAATATA AAACAGTGTG AGCCATGCAG CCCAAATGTC CGCTTCCTCC CTCGACAGTT 1260 

TGGACTCACC AACACCACGG TGACAGTGAC AGACCTTCTG GCACATACTA ACTACACCTT 1320 

TGAGATTGAT GCCGTTAATG GGGTGTCAGA GCTGAGCTCC CCACCAAGAC AGTTTGCTGC 1380 

GGTCAGCATC ACAACTAATC AGGCTGCTCC ATCACCTGTC CTGACGATTA AGAAAGATCG 1440 

GACCTCCAGA AATAGCATCT CTTTGTCCTG GCAAGAACCT GAACATCCTA ATGGGATCAT 1500 

ATTGGACTAC GAGGTCAAAT ACTATGAAAA GCAGGAACAA GAAACAAGTT ATACCATTCT 1560 

GAGGGCAAGA GGCACAAATG TTACCATCAG TAGCCTCAAG CCTGACACTA TATACGTATT 1620 

CCAAATCCGA GCCCGAACAG CCGCTGGATA TGGGACGAAC AGCCGCAAGT TTGAGTTTGA 1680 

AACTAGTCCA GACTCTTTCT CCATCTCTGG TGAAAGTAGC CAAGTGGTCA TGATCGCCAT 1740 

TTCAGCGGCA GTAGCAATTA TTCTCCTCAC TGTTGTCATC TATGTTTTGA TTGGGAGGTT 1800 

CTGTGGCTAT AAGTCAAAAC ATGGGGCAGA TGAAAAAAGA CTTCATTTTG GCAATGGGCA 1860 

TTTAAAACTT CCAGGTCTCA GGACTTATGT TGACCCACAT ACATATGAAG ACCCTACCCA 1920 

AGCTGTTCAT GAGTTTGCCA AGGAATTGGA TGCCACCAAC ATATCCATTG ATAAAGTTGT 1980 

TGGAGCAGGT GAATTTGGAG AGGTGTGCAG TGGTCGCTTA AAACTTCCTT CAAAAAAAGA 2040 

GATTTCAGTG GCCATTAAAA CCCTGAAAGT TGGCTACACA GAAAAGCAGA GGAGAGACTT 2100 

CCTGGGAGAA GCAAGCATTA TGGGACAGTT TGACCACCCC AATATCATTC GACTGGAAGG 2160 

AGTTGTTACC AAAAGTAAGC CAGTTATGAT TGTCACAGAA TACATGGAGA ATGGTTCCTT 2220 

GGATAGTTTC CTACGTAAAC ACGATGCCCA GTTTACTGTC ATTCAGCTAG TGGGGATGCT 2280 

TCGAGGGATA GCATCTGGCA TGAAGTACCT GTCAGACATG GGCTATGTTC ACCGAGACCT 2340 

CGCTGCTCGG AACATCTTGA TCAACAGTAA CTTGGTGTGT AAGGTTTCTG ATTTCGGACT 2400 

TTCGCGTGTC CTGGAGGATG ACCCAGAAGC TGCTTATACA ACAAGAGGAG GGAAGATCCC 2460 

AATCAGGTGG ACATCACCAG AAGCTATAGC CTACCGCAAG TTCAOGTCAG CCAGCGATGT 2520 

ATGGAGTTAT GGGATTGTTC TCTGGGAGGT GATGTCTTAT GGAGAGAGAC CATACTGGGA 2580 

GATGTCCAAT CAGGATGTAA TTAAAGCTGT AGATGAGGGC TATCGACTGC CACCCCCCAT 2640 

GGACTGCCCA GCTGCCTTGT ATCAGCTGAT GCTGGACTGC TGGCAGAAAG ACAGGAACAA 2700 

CAGACCCAAG TTTGAGCAGA TTGTTAGTAT TCTGGACAAG CTTATCCGGA ATCCCGGCAG 2760 

CCTGAAGATC ATCACCAGTG CAGCCGCAAG GCCATCAAAC CTTCTTCTGG ACCAAAGCAA 2820 

TGTGGATATC TCTACCTTCC GCACAACAGG TGACTGGCTT AATGGTGTCC GGACAGCACA 2880 

CTGCAAGGAA ATCTTCACGG GCGTGGAGTA CAGTTCTTGT GACACAATAG CCAAGATTTC 2940 

CACAGATGAC ATGAAAAAGG TTGGTGTCAC CGTGGTTGGG CCACAGAAGA AGATCATCAG 3000 

TAGCATTAAA GCTCTAGAAA CGCAATCAAA GAATGGCCCA GTTCCCGTGT AAAGCACGAC 3060 

GGAAGTGCTT CTGGACGGAA GTGGTGGCTG TGGAAGGCGT CAAGTCATCC TGCAGACAGA 3120 
CAATAATTCT GGAGATACTG GTGGAAGTT 



Seq ID NO: 602 Protein sequence 
Protein Accession #: NP_005224.1 



1 
I 

MDCQLSILLL 
HYTPIRTYQV 
NLYYMESDDD 
LAFODVGACV 
RMYCSTEGEW 
SMNCRCENNY 
IICKKCGWNI 
PPRQPAAVSI 
ETSYTILRAR 
QWMIAISAA 
TYEDPTQAVH 



11 
I 

LSCSVLDSFG 
CNVMDHSQNN 



IQLVGMLRGI 
TRGGKIPIRW 
YRLPPPMDCP 
LUliDQSNVDI 
PQKKIISSIK 



ALVSVRVYFK 
LVPIGKCSCN 
FRADKDPPSM 
KQCEPCSPNV 
TTNQAAPSPV 
GTNVTISSLK 
VAIILLTWI 
EFAKELDATN 
ASIMGQFDHP 
ASGMKYLSDM 
TSPBAIAYRK 
AALYQLMLDC 
STFRTTGDWL 
ALETQSKNGP 



21 
I 

ELIPQPSNEV 
WLRTNWVPRN 
TKIDTIAADE 
KCPFTVKNLA 
AGYEERGFMC 
ACTRPPSSPR 
RFLPRQFGLT 
LTIKKDRTSR 
PDTIYVFQIR 
YVLIGRFCGY 
ISIDKWGAG 
NIIRLEGWT 
GYVHRDLAAR 
FTSASDVWSY 
WQKDRNNRPK 
NGVRTAHCKE 
VPV 



31 
I 

NLLDSKTIQG 
SAQKIYVELK 
SFTQMDLGDR 
MFPDTVPMDS 
QACRPGFYKA 
NVISNINETS 
NTTVTVTDLL 



ARTAAGYGTN 
KSKHGADEKR 
BFGEVCSGRL 
KSKPVMIVTE 
NILINSNLVC 
GIVLWEVMSY 
FEQIVSILDK 
IFTGVEYSSC 



41 
I 

ELGWISYPSH 
FTLRDCNSIP 
ILKLNTEIRE 
QSLVEVRGSC 
LDGNMKCAKC 
VILDWSWPLD 
AHTNYTFEID 
EHPNGIILDY 
SRKFEFETSP 
LHFGNGHLKL 
KLPSKKEISV 



51 
I 

GWEEISGVDE 
LVLGTCKETF 

VNNSKEEDPP 
PPHSSTQEDG 
TGGRKDVTFN 
AVNGVSELSS 
EVKYYEKQEQ 



KVSDFGLSRV 
GERPYWENSN 
LIRNPGSLKI 
DTIAKISTDD 



Seq ID NO: 603 DNA sequence 

Nucleic Acid Accession #: NMJ>05727.1 

Coding sequence : 122 . . 847 



GCCAGGCGTC 
TGGAGCCTCA 
CATGCAGTGC 
GTGTGGTGCA 
GAAGATCTTC 
CATCGCAGCC 
TGAGAGCAAG 
GGTTGCAGCT 
GCTGGTAGTG 
GAACACCACC 
CTCACCCTAC 
CAACACAGCC 
CTTCAATCAG 




GGGCCACTGT 
GGCGTTGTGG 
TGTGCCCTCG 

CCTGCCATCA 
ATGAAAGGGC 
TTCAAAGAGA 
AATGAAACCT 
CTTTTGTATG 



21 
I 

CCCACTCAGT 
CTTTCAGAAC 
TTAAGACCAT 
CAGTGGGCAT 
CGTCCAGTGC 
TCTTTGCTCT 
TGACGTTCTT 
CCTTGGTGTA 
AGAAAGATTA 
TCAAGTGCTG 
ACAGTGCCTT 
GCACCAAGCA 
ACATCCGAAC 



31 

I 

GGCAACACCC 
TCACTGCCAA 
GATGATCCTC 
CTGGGTGTCA 
CATGCAGTTT 
TGGTTTCCTG 
CTTCATCCTC 
CACCACAATG 
TGGTTCCCAG 
TGGCTTCACC 
TCCCCCATTC 
AAAGGCTCAC 
TAATGCAGTC 



41 
I 

GGGAGCTGTT 
GAGCCCTGAA 
TTCAATTTGC 
ATCGATGGGG 
GTCAACGTGG 
GGCTGCTATG 
CTCCTCATCT 
GCTGAGCACT 
GAAGACTTCA 
AACTATACGG 
TGTTGCAATG 
GACCAAAAAG 
ACCGTGGGTG 



PGLRTYVDPH 
AIKTLKVGYT 
LRKHDAQFTV 
LEDDPEAAYT 
QDVIRAVDEG 
ITSAAARPSN 
MKKVGVTWG 



51 
I 

TTGTCCTTTG 
CAGGAGCCAC 
TCATCTTTCT 
CATCCTTTCT 
GCTACTTCCT 
GTGCTAAGAC 
TCATTGCTGA 
TCCTGACGTT 
CTCAAGTGTG 
ATTTTGAGGA 
ACAACGTCAC 
TAGAGGGTTG 
GTGTGGCAGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



1183 



WO 03/042661 



PCT/US02/36810 
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TGGAATTGGG GGCCTCGAGC TGGCTGCCAT GATTGTGTCC ATGTATCTGT ACTGCAATCT 840 

ACAATAAGTC CACTTCTGCC TCTGCCACTA CTGCTGCCAC ATGGGAACTG TGAAGAGGCA 900 

CCCTGGCAAG CAGCAGTGAT TGGGGGAGGG GACAGGATCT AACAATGTCA CTTGGGCCAG 960 

AATGGACCTG CCCTTTCTGC TCCAGACTTG GGGCTAGATA GGGACCACTC CTTTTAGGCG 1020 

ATGCCTGACT TTCCTTCCAT TGGTGGGTGG ATGGGTGGGG GGCATTCCAG AGCCTCTAAG 1080 

GTAGCCAGTT CTGTTGCCCA TTCCCCCAGT CTATTAAACC CTTGATATGC CCCCTAGGCC 1140 

TAGTGGTGAT CCCAGTGCTC TACTGGGGGA TGAGAGAAAG GCATTTTATA GCCTGGGCAT 1200 

AAGTGAAATC AGCAGAGCCT CTGGGTGGAT GTGTAGAAGG CACTTCAAAA TGCATAAACC 1260 
TGTTACAATG TTAAAAAA 

Seq ID NO: 604 Protein sequence 
Protein Accession #: NP 005718.1 



1 11 21 31 41 SI 

15 } | | | | | 

MQCFSFIKTM MILFNLLIFL CGAALIAVGI WVSIDGASFL KIFGPLSSSA MQFVNVGYFL 60 
IAAGWVFAL GFLGCYGAKT ESKCALVTFP FILLLIFIAE VAAAWAIiVY TTMAEHFLTL 120 
LWPAIKKDY GSQEDFTQVW NTTMKGLKCC GFTNYTDFED SPYFKENSAF PPFCCNDNVT 180 
NTANBTCTEQ KAHDQKVEGC FNQLLYDIRT NAVTVGGVAA GIGGLE1AAM IVSMYLYCNL 240 
Q 



Seq ID NO: 605 DNA sequence 

Nucleic Acid Accession #: NM_000729.2 

Coding sequence: 

1 11 21 31 41 51 

I I I 1 I I 

GGCTCAGCTG CCGGGCTGCT CCGGTTGGAA ACGCCAAGCC AGCTGCCGTC CTAATCCAAA 60 
AGCCATGAAC AGCGGCGTGT GCCTGTGCGT GCTGATGGCG GTACTGGCGG CTGGCGCCCT 120 
GACGCAGCCG GTGCCTCCCG CAGATCCCGC GGGCTCCGGG CTGCAGCGGG CAGAGGAGGC 180 
GCCCCGTAGG CAGCTGAGGG TATCGGAGAG AACGGATGGC GAGTCCCGAG CGCACCTGGG 240 
CGCCCTGCTG GCAAGATACA TCCAGCAGGC CCGGAAAGCT CCTTCTGGAC GAATGTCCAT 300 
CGTTAAGAAC CTGCAGAACC TGGACCCCAG CCACAGGATA AGTGACCGGG ACTACATGGG 360 
CTGGATGGAT TTTGGCCGTC GCAGTGCCGA GGAGTATGAG TACCCCTCCT AGAGGACCCA 420 
GCCGCCATCA GCCCAACGGA AGCAACCTCC CAACCCAGAG GAGGCAGAAT AAGACAACAA 480 
TCACACTCAT AACTCATTGT CTGTGGAGTT TGACATTGAA TGTATCTATT TATTAAGTTC 540 
TCAATGTGAA AATTGTGTCT GTAAGATTGT CCAGTGCAAC CACACACGCT CACCAGAAGT 600 
TGTGCAAACT GAAGACAAAA CTGTTTTCTT CATCTGTGAC TCCTGTTCTG AAAATGTTGT 660 
TATGCTATTA AAGTGATTTC ATTCTGCC 

Seq ID NO: 606 Protein sequence 
Protein Accession #: NP_000720.l 



A - 1 11 21 31 41 51 

45 | i | | | | 

MNSGVCLCVL MAVLAAGALT QPVPPADPAG SGLQRAEEAP RRQLRVSQRT DGESRAHLGA 60 
LLARYIQQAR KAPSGRMSIV KNLQNLDPSH RISDRDYMGW MDFGRRSAEE YEYPS 



Seq ID NO: 607 DNA sequence 

Nucleic Acid Accession #: NM_001423.1 

Coding sequence : 219 . . 692 

1 11 21 31 41 51 

I I I 1 I I 

AGCACTCTCC AGCCTCTCAC CGCAAAATTA CACACCCCAG TACACCAGCA GAGGAAACTT 60 

ATAACCTCGG GAGGCGGGTC CTTCCCCTCA GTGCGGTCAC ATACTTCCAG AAGAGCGGAC 120 

CAGGGCTGCT GCCAGCACCT GCCACTCAGA GCGCCTCTGT CGCTGGGACC CTTCAGAACT 180 

CTCTTTGCTC ACAAGTTACC AAAAAAAAAA GAGCCAACAT GTTGQTATTG CTGGCTGGTA 240 

TCTTTGTGGT CCACATCGCT ACTGTTATTA TGCTATTTGT TAGCACCATT GCCAATGTCT 300 

GGTTGGTTTC CAATACGGTA GATGCATCAG TAGGTCTTTG GAAAAACTGT ACCAACATTA 360 

GCTGCAGTGA CAGCCTGTCA TATGCCAGTG AAGATGCCCT CAAGACAGTG CAGGCCTTCA 420 

TGATTCTCTC TATCATCTTC TGTGTCATTG CCCTCCTGGT CTTCGTGTTC CAGCTCTTCA 480 

CCATGGAGAA GGGAAACOGG TTCTTCCTCT CAGGGGCCAC CACACTGGTG TGCTGGCTGT 540 

GCATTCTTGT GGGGGTGTCC ATCTACACTA GTCATTATGC GAATCGTGAT GGAACGCAGT 600 

ATCACCACGG CTATTCCTAC ATCCTGGGCT GGATCTGCTT CTGCTTCAGC TTCATCATCG 660 

GCGTTCTCTA TCTGGTCCTG AGAAAGAAAT AAGGCCGGAC GAGTTCATGG GGATCTGGGG 720 

GGTGGGGAGG AGGAAGCCGT TGAATCTGGG AGGGAAGTGG AGGTTGCTGT ACAGGAAAAA 780 

CCGAGATAGG GGAGGGGGGA GGGGGAAGCA AAGGGGGGAG GTCAAATCCC AAACCATTAC 840 

TGAGGGGATT CTCTACTGCC AAGCCCCTGC CCTGGGGAGA AAGTAGTTGG CTAGTACTTT 900 

GATGCTCCCT TGATGGGGTC CAGAGAGCCT CCCTGCAGCC ACCAGACTTG GCCTCCAGCT 960 

GTTCTTAGTG ACACACACTG TCTGGGGCCC CATCAGCTGC CACAACACCA GCCCCACTTC 1020 

TGGGTCATGC ACTGAGGTCC ACAGACCTAC TGCACTGAGT TAAAATAGCG GTACAAGTTC 1080 

TGGCAAGAGC AGATACTGTC TTTGTGCTGA ATACGCTAAG CCTGGAAGCC ATCCTGCCCT 1140 

TCTGACCCAA AGCAAAACAT CACATTCCAG TCTGAAGTGC CTACTGGGGG GCTTTGGCCT 1200 

GTGAGCCATT GTCCCTCTTT GGAACAGATA TTTAGCTCTG TGGAATTCAG TGACAAAATG 1260 

GGAGGAGGAA AGAGAGTTTG TAAGGTCATG CTGGTGGGTT AGCTAAACCA AGAAGGAGAC 1320 

CTTTTCACAA TGGAAAACCT GGGGGATGGT CAGAGCCCAG TCGAGACCTC ACACACGGCT 1380 

GTCCCTCATG GAGACCTCAT GCCATGGTCT TTGCTAGGCC TCTTGCTGAA AGCCAAGGCA 1440 

GCTCTTCTGG AGTTTCTCTA AAGTCACTAG TGAACAATTC GGTGGTAAAA GTACCACACA 1500 

AACTATGGGA TCCAAGGGGC AGTCTTGCAA CAGTGCCATG TTAGGGTTAT GTTTTTAGGA 1560 

TTCCCCTCAA TGCAGTCAGT GTTTCTTTTA AGTATACAAC AGGAGAGAGA TGGACATGGC 1620 

TCATTGTAGC ACAATCCTAT TACTCTTCCT CTAACATTTT TGAGGAAGTT TTGTCTAATT 1680 

ATCAATATTG AGGATCAGGG CTCCTAGGCT CAGTGGTAGC TCTGGCTTAG ACACCACCTG 1740 

GAGTGATCAC CTCTTGGGGA CCCTGCCTAT CCCACTTCAC AGGTGAGGCA TGGCAATTCT 1800 

1184 



WO 03/042661 



PCT/US02/36810 



10 
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GGAAGCTGAT 
. CTATATAATT 
GCATTCCCAG 
TCTTTTCTTT 
CTGTCTGCTG 
TAAGCAGGCA 
GGGGTTTTTT 
TGAGACATCT 
TCCTTTTTGG 
ATTCTATTTC 
CTAGGCTGAG 
TGATTTCATT 
CAGACATATC 
ACCATGTATT 
ATTACTCTGG 
ATACTATAAT 
TTCTGATTCC 



TAAAACACAC 
GTGAAGTATT 
GAAAATACGA 
AAAATAAAAA 
CCGCAGGAGC 
TTGCTTTGCC 
GGGAAGACGT 
TGCCTACTTT 
GGAGTTGTTA 
TCTATGTTTA 
GTTAGAGAGA 
ATCACATGAT 
CAAAGGGAAT 
CCCTTATCTT 
TGGATTGTTC 
TGTAAATATT 
CTTCAAAAAA 



ATAAACCAAA 
AAGCCTACCG 
AAATCCCATG 
AGCAAAAACT 
ACCTCTATAC 
CTGGAGCAGC 
TTTCTTTATC 
TCTTTATTAG 
TGCCATGATT 
TTCTAGTTAA 
TTGGCCAGCA 
TATAGAAGGC 
ACTCACATTT 
TTACTTTTTT 
TAGTACTGTA 
TTGATACAAA 
AAAAAA 



ACCAAACAAC 
TATTTCAGCC 
AGATAAATAA 
CTTGTGGTAC 
AGGACTTAGA 
TATTTTAAGC 
GCCCTGAGAA 

TTTGGTATTT 
GGAAATGTTG 
AAAACTGTGG 
TGTCTTAGTG 
TGTTAAGAAG 
TCTGTGACAT 

TGTTTATAAC 



AGGCCCTTGG 
ATGATAAGAA 
AAATATAGGT 
CTAGTCAGAT 
AGTAGTATGT 
CATCTCAGAT 
GATCTACCCC 
ATCCATTTCT 
ATGTAAAAGG 
AGGGCAAGCC 
GAAGATGAAC 
CAAAAAACAT 
TTGAACTATG 
TTATGTCTCA 
TCGTTAATAG 
TCTAGGGATA 



GTGAAAGGTG 
CAGAGTGCCT 
GATGGGCAGA 
GGTAGACGAG 
TATTCCTGGT 
TCTGTCTAAA 
AGGGAGAATC 
TTTATACCTT 
ATTATTACTA 
ACCAAATTAC 
TTTGTCATTA 
ACTTACATTT 
ACTGGAGTAA 
TGTAATTTGC 
ATTATTTCAT 
TAAAAACAGA 



1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 



51 



Seq ID NO: 608 Protein sequence 
Protein Accession #: NP_001414.1 

1 11 21 31 41 

I I I I I I 

MLVLLAGIFV VHIATVIMLF VSTIANVWLV SNTVDASVGL WKNCTNISCS DSLSYASEDA 
LKTVQAFMII* SIIFCVIALL VFVPQLPTME KGNRFFLSGA TTLVCWLCIL VGVSIYTSHY 
ANRDGTQYHH GYSYILGWIC FCFSFIIGVL YLVLRKK 

Seq ID NO: 609 DNA sequence 

Nucleic Acid Accession #i NM_004961.2 

Coding sequence : 55 . .1575 



60 
120 



1 11 21 31 41 51 

I I I I I I 

GCCAGAGCGT GAGCCGCGAC CTCCGCGCAG GTGGTCGCGC CGGTCTCCGC GGAAATGTTG 
TCCAAAGTTC TTCCAGTCCT CCTAGGCATC TTATTGATCC TCCAGTCGAG GGTCGAGGGA 
CCTCAGACTG AATCAAAGAA TGAAGCCTCT TCCCGTGATG TTGTCTATGG CCCCCAGCCC 
CAGCCTCTGG AAAATCAGCT CCTCTCTGAG GAAACAAAGT CAACTGAGAC TGAGACTGGG 
AGCAGAGTTG GCAAACTGCC AGAAGCCTCT CGCATCCTGA ACACTATCCT GAGTAATTAT 
GACCAGAAAC TGCGCCCTGG CATTGGAGAG AAGCCCACTG TGGTCACTGT TGAGATCGCC 
GTCAACAGCC TTGGTCCTCT CTCTATCCTA GACATGGAAT ACACCATTGA CATCATCTTC 
TCCCAGACCT GGTACGACGA ACGCCTCTGT TACAACGACA CCTTTGAGTC TCTTGTTCTG 
AATGGCAATG TGGTGAGCCA GCTATGGATC CCGGACACCT TTTTTAGGAA TTCTAAGAGG 
ACCCACGAGC ATGAGATCAC CATGCCCAAC CAGATGGTCC GCATCTACAA GGATGGCAAG 
GTGTTGTACA CAATTAGGAT GACCATTGAT GCCGGATGCT CACTCCACAT GCTCAGATTT 
CCAATGGATT CTCACTCTTG CCCTCTATCT TTCTCTAGCT TTTCCTATCC TGAGAATGAG 
ATGATCTACA AGTGGGAAAA TTTCAAGCTT GAAATCAATG AGAAGAACTC CTGGAAGCTC 
TTCCAGTTTG ATTTTACAGG AGTGAGCAAC AAAACTGAAA TAATCACAAC CCCAGTTGGT 
GACTTCATGG TCATGACGAT TTTCTTCAAT GTGAGCAGGC GGTTTGGCTA TGTTGCCTTT 
CAAAACTATG TCCCTTCTTC CGTGACCACG ATGCTCTCCT GGGTTTCCTT TTGGATCAAG 
ACAGAGTCTG CTCCAGCCCG GACCTCTCTA GGGATCACCT CTGTTCTGAC CATGACCACG 
TTGGGCACCT TTTCTCGTAA GAATTTCCCG CGTGTCTCCT ATATCACAGC CTTGGATTTC 
TATATCGCCA TCTGCTTCGT CTTCTGCTTC TGCGCTCTGT TGGAGTTTGC TGTGCTCAAC 
TTCCTGATCT ACAACCAGAC AAAAGCCCAT GCTTCTCCTA AACTCCGCCA TCCTCGTATC 
AATAGCCGTG CCCATGCCCG TACCCGTGCA CGTTCCCGAG CCTGTGCCCO CCAACATCAG 
GAAGCTTTTG TGTGCCAGAT TGTCACCACT GAGGGAAGTG ATGGAGAGGA GCGCCCGTCT 
TGCTCAGCCC AGCAGCCCCC TAGCCCAGGT AGCCCTGAGG GTCCCOGCAG CCTCTGCTCC 
AAGCTGGCCT GCTGTGAGTG GTGCAAGCGT TTTAAGAAGT ACTTCTGCAT GGTCCCCGAT . 
TGTGAGGGCA GTACCTGGCA GCAGGGCCGC CTCTGCATCC ATGTCTACCG CCTGGATAAC 
TACTCGAGAG TTGTTTTCCC AGTGACTTTC TTCTTCTTCA ATGTGCTCTA CTGGCTTGTT 
TGCCTTAACT TGTAGGTACC AGCTGGTACC CTGTGGGGCA ACCTCTCCAG TTCCCCAGGA 
GGTCCAAGCC CCTTGCCAAG GGAGTTGGGG GAAAGCAGCA GCAGCAGCAG GAGCGACTAG 
AGTTTTTCCT GCCCCATTCC CCAAACAGAA GCTTGCAGAG GGTTTGTCTT TGCTGCCCCT 
CTCCCCTACC TGGCCCATTC ACTGAGTCTT CTCAGCAGAC CATTTCAAAT TATTAATAAA 
TGGGCCACCT CCCTCTTCTT CAAGGAGCAT COGTGATGCT CAGTGTTCAA AACCACAGCC 
ACTTAGTGAT CAGCTCCCTA AAACCATGCC TAAGTACAGG CGGATTAGCT ATCTTCCAAC 
AATGCTGACC ACCAGACAAT TACTGCATTT TTCCAGAAGC CCACTATTGC CTTTGTAGTG 
CTTTCGGCCC AGTTCTGGCC TCAGCCTCAA AGTGCACCGA CTAGTTGCTT GCCTATACCT 
GGCACCTCAT TAAGATGCTG GGCAGCAGTA TAACAGGAGG AAGAGATCCC TCTCCTTTGG 
TCAGATTATT ATGTTCTCAG TTCTCTCTCC CTGCTACCCC TTTCTCTGCA GATAGATAGA 
CACTGGCATT ATCCCTTTAG GAAGAGGGGG GGGCAGCAAG AGAGCCTATT TGGGACAGCA 
TTCCTCTCTC TCTGCTGCTG TGACATCTCC CTCTCCTTGC TGGCTCCATC TTTCGTCTGC 
ACTACCAATT CAATGCCCTT CATCCAATGG GTATCTATTT TTGTGTGTGA TTATAGTAAC 
TACTCCCTGC TTTATATGCC ACCCTCTTCC TTCTCTTTGA CCCCTGTGAC TCTTTCTGTA 
ACTTTCCCAG TGACTTCCCC TAGCCCTGAC CCAGGCACTA GGCCTTGGTG ACTTCCTGGG 
GCCAAGAAAC TAAGGAAACT CGGCTTTGCA ACAGGCATTA CTCGCCATTG ATTGGTGCCC 
ACCCAGGGCA CACTGTCGGA GTTCTATCAC TTGCTTGACC CCTGGACCCA TAAACCAGTC 
CACTGTTATA CCCGGGGCAC TCTAACCATC ACAATCAATC AATCAAATTC CCTTAAATTT 
GTATGGCACT GGAACTTTGG CAAAGCACTT TTGACAAGTT GTGTCTGATT GGAGCTTCAT 
GATAGCCTTG TGACATCTTT AGGGCAGGAT TCTTATCCCC ATTTTGCAGA TGAAAACCCT 
GAGTCACAGA TTTCTGTGGG ACTGTGGATC TCACTGGAAG CTATCCAAGA GCCCACTGTC 
ACCTTCTAGA CCACATGATA GGGCTAGACA GCTCAGTTCA CCATGATTCT CTTCTGTCAC 
CTCTGCTGGC ACACCAGTGG CAAGGCCCAG AATGGCGACC TCTCTTTAGC TCAATTTCTG 
GGCCTGAGGT GCTCAGACTG CCCCCAAGAT CAAATCTCTC CTGGCTGTAG TAACCCAGTG 
GAATGAATTT GGACATGCCC CAATGCTTCT ATATGCTAAG TGAAATCTGT GTCTGTAATT 
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TGTTGGGGGG TGGATAGGGT GGGGTCTCCA TCTACTTTTT GTCACCATCA TCTGAAATGG 
GGAAATATGT AAATAAATAT ATCAGCAAAG CAAAAAGAAA AAAAAAAA 

Seq ID NO: 610 Protein sequence 
Protein Accession tf: NP_004952.1 
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MLSKVLPVLIi 
TQSRVGKLPE 
IPSQTWYDER 
GKVLYTIRMT 
XLFQFDFTGV 
IKTESAPART 
LNFLIYNQTK 
PSCSAQQPPS 
DNYSRWFPV 



11 
I 

GILLILQSRV 
ASRILNTILS 
LCYNDTFESL 
IDAGCSLHML 
SNKTEIITTP 
SLGITSVLTM 
AHASPKLRHP 
PGSPEGPRSL 
TFFFFNVLYW 



21 
I 

EGPO.TESKNE 
NYDHKLRPGI 
VLNGNWSQL 
RFPMDSHSCP 
VGDFMVMTIF 
TTLGTFSRKN 
RINSRAHART 
CSKLACCEWC 
LVCLNL 



31 
I 

ASSRDWYGP 
GEKPTWTVE 
WIPDTFFRNS 
LSFSSFSYPE 
FNVSRRFGYV 
FPRVSYITAL 
RARSRACARQ 
KRFKKYFCMV 



41 
I 

QPQPLENQLL 
IAVNSLGPLS 
KRTHEHEITM 
NEMIYKWENF 
AFQNYVPSSV 
DFYIAICFVF 
HQEAFVCQIV 
PDCEGSTWQQ 



51 
I 

SEETKSTETE 
ILDMEYTIDI 
PNQMVRIYKD 
KLEINEKNSW 
TTMLSNVSFW 
CFCALLEFAV 



GRLCIHVYRL 



60 
120 
180 
240 
300 
360 
420 
480 



Seq ID NO: 611 DMA sequence 
Nucleic Acid Accession fh NM_021984.1 
Coding sequence: 572.. 1753 

1 11 21 31 41 51 

I I I I I I 

GCCAGAGCGT GAGCCGCGAC CTCCGCGCAG GTGGTCGCGC CGGTCTCCGC GGAAATGTTG 
TCCAAAGTTC TTCCAGTCCT CCTAGGCATC TTATTGATCC TCCAGTCGAG AACATGTATA 
CAGAGAAGTG CTCAAATCAT AAGTGTACAG CTGATGAGTT GTCAAAAAAT GACCACAGCG 
GTGTAAAGAA AGCCAAATCA AGGACCCGAA TGTGAGCAGG ACCTCAGAAG CCCCCTTTGT 
CACTGCCTCC CAGCAAAGGC AGCACTATCC GGACTTCTAA CACCATCGGG TCGAGGGACC 
TCAGACTGAA TCAAAGAATG AAGCCTCTTC CCGTGATGTT GTCTATGGCC CCCAGCCCCA 
GCCTCTGGAA AATCAGCTCC TCTCTGAGGA AACAAAGTCA ACTGAGACTG AGACTGGGAG 
CAGAGTTGGC AAACTGCCAG AAGCCTCTCG CATCCTGAAC ACTATCCTGA GTAATTATGA 
CCACAAACTG CGCCCTGGCA TTGGAGAGAA GCCCACTGTG GTCACTGTTG AGATCTCCGT 
CAACAGCCTT GGTCCTCTCT CTATCCTAGA CATGGAATAC ACCATTGACA TCATCTTCTC 
CCAGACCTGG TACGACGAAC GCCTCTGTTA CAACGACACC TTTGAGTCTC TTGTTCTGAA 
TGGCAATGTG GTGAGCCAGC TATGGATCCC GGACACCTTT TTTAGGAATT CTAAGAGGAC 
CCACGAGCAT GAGATCACCA TGCCCAACCA GATGGTCCGC ATCTACAAGG ATGGCAAGGT 
GTTGTACACA ATTAGGATGA CCATTGATGC CGGATGCTCA CTCCACATGC TCAGATTTCC 
AATGGATTCT CACTCTTGCC CTCTATCTTT CTCTAGCTTT TCCTATCCTG AGAATGAGAT 
GATCTACAAG TGGGAAAATT TCAAGCTTGA AATCAATGAG AAGAACTCCT GGAAGCTCTT 
CCAGTTGGAT TTTACAGGAG TGAGCAACAA AACTGAAATA ATCACAACCC CAGTTGGTGA 
CTTCATGGTC ATGACGATTT TCTTCAATGT GAGCAGGOGG TTTGGCTATG TTGCCTTTCA 
AAACTATGTC CCTTCTTCCG TGACCACGAT GCTCTCCTGG GTTTCCTTTT GGATCAAGAC 
AGAGTCTGCT CCAGCCCGGA CCTCTCTAGG GATCACCTCT GTTCTGACCA TGACCACGTT 
GGGCACCTTT TCTCGTAAGA ATTTCCCGCG TGTCTCCTAT ATCACAGCCT TGGATTTCTA 
TATCGCCATC TGCTTCGTCT TCTGCTTCTG CGCTCTGTTG GAGTTTGCTG TGCTCAACTT 
CCTGATCTAC AACCAGACAA AAGCCCATGC TTCTCCTAAA CTCCGCCATC CTCGTATCAA 
TAGCCGTGCC CATGCCCGTA CCCGTGCACG TTCCCGAGCC TGTGCCOGOC AACATCAGGA 
AGCTTTTGTG TGCCAGATTG TCACCACTGA GGGAAGTGAT GGAGAGGAGC GCCCGTCTTG 
CTCAGCCCAG CAGCCCCCTA GCCCAGGTAG CCCTGAGGGT CCCCGCAGCC TCTGCTCCAA 
GCTGGCCTGC TGTGAGTGGT GCAAGCGTTT TAAGAAGTAC TTCTGCATGG TCCCCGATTG 
TGAGGGCAGT ACCTGGCAGC AGGCCCGCCT CTGCATCCAT GTCTACCGCC TGGATAACTA 
CTCGAGAGTT GTTTTCCCAG TGACTTTCTT CTTCTTCAAT GTGCTCTACT GGCTTGTTTG 
CCTTAACTTG TAGGTACCAG CTGGTACCCT GTGGGGCAAC CTCTCCAGTT CCCCAGGAGG 
TCCAAGCCCC TTGCCAAGGG AGTTGGGGGA AAGCAGCAGC AGCAGCAGGA GCGACTAGAG 
TTTTTCCTGC CCCATTCCCC AAACAGAAGC TTGCAGAGGG TTTGTCTTTG CTGCCCCTCT 
CCCCTACCTG GCCCATTCAC TGAGTTTTCT CAGCAGAGCA TTTCAAATTA TTAATAAATG 
GGCCACCTCC CTCTTCTTCA AGGAGCATCC GTGATGCTCA GTGTTCAAAA CCACAGCCAC 
TTAGTGATCA GCTCCCTAAA ACCATGCCTA AGTACAGGCG GATTAGCTAT CTTCCAACAA 
TGCTGACCAC CAGACAATTA CTGCATTTTT CCAGAAGCCC ACTATTGCCT TTGCAGTGCT 
TTOGGCCCAG TTCTGGCCTC AGCCTCAAAG TGCACCGACT AGTTGCTTGC CTATACCTGG 
CACCTCATTA AGATGCTGGG CAGCAGTATA ACAGGAGGAA GAGATCCCTC TCCTTTGGTC 
AGATTATTAT GTTCTCAGTT CTCTCTCCCT GCTACCCCTT TCTCTGCAGA TAGATAGACA 
CTGGCATTAT CCCTTTAGGA AGAGGGGGGG GCAGCAAGAG AGCCTATTTG GGACAGCATT 
CCTCTCTCTC TGCTGCTGTG ACATCTCCCT CTCCTTGCTG GCTCCATCTT TOGTCTGCAC 
TACCAATTCA ATGCCCTTCA TCCAATGGGT ATCTATTTTT GTGTGTGATT ATAGTAACTA 
CTCCCTGCTT TATATGCCAC CCTCTTCCTT CTCTTTGACC CCTGTGACTC TTTCTGTAAC 
TTTCCCAGTG ACTTCCCCTA GCCCTGACCC AGGCACTAGG CCTTGGTGAC TTCCTGGGGC 
CAAGAAACTA AGGAAACTCG GCTTTGCAAC AGGCATTACT CGCCATTGAT TGGTGCCCAC 
CCAGGGCACA CTGTCGGAGT TCTATCACTT GCTTGACCCC TGGACCCATA AACCAGTCCA 
CTGTTATACC CGGGGCACTC TAACCATCAC AATCAATCAA TCAAATTCCC TTAAATTTGT 
ATGGCACTGG AACTTTGGCA AAGCACTTTT GACAAGTTGT GTCTGATTGG AGCTTCATGA 
TAGCCTTGTG ACATCTTTAG GGCAGGATTC TTATCCCCAT TTTGCAGATG AAAACCCTGA 
GTCACAGATT TCTGTGGGAC TGTGGATCTC ACTGGAAGCT ATCCAAGAGC CCACTGTCAC 
CTTCTAGACC ACATGATAGG GCTAGACAGC TCAGTTCACC ATGATTCTCT TCTGTCACCT 
CTGCTGGCAC ACCAGTGGCA AGGCCCAGAA TGGCGACCTC TCTTTAGCTC AATTTCTGGG 
CCTGAGGTGC TCAGACTGCC CCCAAGATCA AATCTCTCCT GGCTGTAGTA ACCCAGTGGA 
ATGAATTTGG ACATGCCCCA ATGCTTCTAT ATGCTAAGTG AAATCTGTGT CTGTAATTTG 
TTGGGGGGTG GATAGGGTGG GGTCTCCATC TACTTTTTGT CACCATCATC TGAAATGGGG 
AAATATGTAA ATAAATATAT CAGCAAAGC 

Seq ID NO: 612 Protein sequence 
Protein Accession #: NP_068819.1 
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1 11 21 31 41 51 

I 1 I I I I 

MEYTIDIIFS QTWYDERLCY NDTFESLVLN GNWSQLWIP DTFFRNSKRT KEHEITMPNQ 60 

MVRIYKDGKV LYTIRMTIDA GCSLHMLRFP MDSHSCPLSF SSFSYPENEM IYKWENFKLE 120 

INEKNSWKLF QLDFTGVSNK TEIITTPVGD FMVMTIFFNV SRRFGYVAFQ NYVPSSVTTM 180 

LSWVSFWIKT ESAPARTSLG ITSVLTMTTL GTFSRKNFPR VSYITALDFY IAICFVFCFC 240 

ALLEFAVLNF LIYNQTKAHA SPKLRHPRIN SRAHARTRAR SRACARQHQE AFVCQIVTTB 300 

GSDGEERPSC SAQQPPSPGS PEGPRSLCSK LACCEWCKRF KKYFCMVPDC EGSTWQQARL 360 
CIHVYRLDNY SRWFPVTFF FFNVLYWLVC LNL 

Seq ID NO: 613 DMA sequence 
Nucleic Acid Accession ft: NM_021987.1 
Coding sequence: 572.. 1657 

1 11 21 31 41 51 

1 I I I I I 

GCCAGAGCGT GAGCCGCGAC CTCCGCGCAG GTGGTCGCGC CGGTCTCCGC GGAAATGTTG 60 

TCCAAAGTTC TTCCAGTCCT CCTAGGCATC TTATTGATCC TCCAGTCGAG AACATGTATA 120 

CAGAGAAGTG CTCAAATCAT AAGTGTACAG CTGATGAGTT GTCAAAAAAT GACCACAGCG 180 

GTGTAAAGAA AGCCAAATCA AGGACCCGAA TGTGAGCAGG ACCTCAGAAG CCCCCTTTGT 240 

CACTGCCTCC CAGCAAAGGC AGCACTATCC GGACTTCTAA CACCATCGGG TCGAGGGACC 300 

TCAGACTGAA TCAAAGAATG AAGCCTCTTC CCGTGATGTT GTCTATGGCC CCCAGCCCCA 360 

GCCTCTGGAA AATCAGCTCC TCTCTGAGGA AACAAAGTCA ACTGAGACTG AGACTGGGAG 420 

CAGAGTTGGC AAACTGCCAG AAGCCTCTCG CATCCTGAAC ACTATCCTGA GTAATTATGA 480 

CCACAAACTG CGCCCTGGCA TTGGAGAGAA GCCCACTGTG GTCACTGTTG AGATCTCCGT 540 

CAACAGCCTT GGTCCTCTCT CTATCCTAGA CATGGAATAC ACCATTGACA TCATCTTCTC 600 

CCAGACCTGG AATTCTAAGA GGACCCACGA GCATGAGATC ACCATGCCCA ACCAGATGGT 660 

CCGCATCTAC AAGGATGGCA AGGTGTTGTA CACAATTAGG ATGACCATTG ATGCGGGATG 720 

CTCACTCCAC ATGCTCAGAT TTCCAATGGA TTCTCACTCT TGCCCTCTAT CTTTCTCTAG 780 

CTTTTCCTAT CCTGAGAATG AGATGATCTA CAAGTGGGAA AATTTCAAGC TTGAAATCAA 840 

TGAGAAGAAC TCCTGGAAGC TCTTCCAGTT TGATTTTACA GGAGTGAGCA ACAAAACTGA 900 

AATAATCACA ACCCCAGTTG GTGACTTCAT GGTCATGACG ATTTTCTTCA ATGTGAGCAG 960 

GCGGTTTGGC TATGTTGCCT TTCAAAACTA TGTCCCTTCT ' TCCGTGACCA CGATGCTCTC 1020 

CTGGGTTTCC TTTTGGATCA AGACAGAGTC TGCTCCAGCC CGGACCTCTC TAGGGATCAC 1080 

CTCTGTTCTG ACCATGACCA CGTTGGGCAC CTTTTCTCGT AAGAATTTCC CGCGTGTCTC 1140 

CTATATCACA GCCTTGGATT TCTATATCGC CATCTGCTTC GTCTTCTGCT TCTGCGCTCT 1200 

GTTGGAGTTT GCTGTGCTCA ACTTCCTGAT CTACAACCAG ACAAAAGCCC ATGCTTCTCC 1260 

TAAACTCCGC CATCCTCGTA TCAATAGCCG TGCCCATGCC CGTACCCGTG CACGTTCCCG 1320 

AGCCTGTGCC CGCCAACATC AGGAAGCTTT TGTGTGCCAG ATTGTCACCA CTGAGGGAAG 1380 

TGATGGAGAG GAGCGCCCGT CTTGCTCAGC CCAGCAGCCC CCTAGCCCAG GTAGCCCTGA 1440 

GGGTCCCCGC AGCCTCTGCT CCAAGCTGGC CTGCTGTGAG TGGTGCAAGC GTTTTAAGAA 1500 

GTACTTCTGC ATGGTCCCCG ATTGTGAGGG CAGTACCTGG CAGCAGGGCC GCCTCTGCAT 1560 

CCATGTCTAC CGCCTGGATA ACTACTCGAG AGTTGTTTTC CCAGTGACTT TCTTCTTCTT 1620 

CAATGTGCTC TACTGGCTTG TTTGCCTTAA CTTGTAGGTA CCAGCTGGTA CCCTGTGGGG 1680 

CAACCTCTCC AGTTCCCCAG GAGGTCCAAG CCCCTTGCCA AGGGAGTTGG GGGAAAGCAG 1740 

CAGCAGCAGC AGGAGCGACT AGAGTTTTTC CTGCCCCATT CCCCAAACAG AAGCTTGCAG 1800 

AGGGTTTGTC TTTGCTGCCC CTCTCCCCTA CCTGGCCCAT TCACTGAGTT TTCTCAGCAG 1860 

ACCATTTCAA ATTATTAATA AATGGGCCAC CTCCCTCTTC TTCAAGGAGC ATCCGTGATG 1920 

CTCAGTGTTC AAAACCACAG CCACTTAGTG ATCAGCTCCC TAAAACCATG CCTAAGTACA 1980 

GGCGGATTAG CTATCTTCCA ACAATGCTGA CCACCAGACA ATTACTGCAT TTTTCCAGAA 2040 

GCCCACTATT GCCTTTGCAG TGCTTTCGGC CCAGTTCTGG CCTCAGCCTC AAAGTGCACC 2100 

GACTAGTTGC TTGCCTATAC CTGGCACCTC ATTAAGATGC TGGGCAGCAG TATAACAGGA 2160 

GGAAGAGATC CCTCTCCTTT GGTCAGATTA TTATGTTCTC AGTTCTCTCT CCCTGCTACC 2220 

CCTTTCTCTG CAGATAGATA GACACTGGCA TTATCCCTTT AGGAAGAGGG GGGGGCAGCA 2280 

AGAGAGCCTA TTTGGGACAG CATTCCTCTC TCTCTGCTGC TGTGACATCT CCCTCTCCTT 2340 

GCTGGCTCCA TCTTTCGTCT GCACTACCAA TTCAATGCCC TTCATCCAAT GGGTATCTAT 2400 

TTTTGTGTGT GATTATAGTA ACTACTCCCT GCTTTATATG CCACCCTCTT CCTTCTCTTT 2460 

GACCCCTGTG ACTCTTTCTG TAACTTTCCC AGTGACTTCC CCTAGCCCTG ACCAGGCACT 2520 

AGGCCTTGGT GACTTCCTGG GGCCAAGAAA CTAAGGAAAC TCGGCTTTGC AACAGGCATT 2580 

ACTCGCCATT GATTGGTGCC CACCCAGGGC ACACTGTCGG AGTTCTATCA CTTGCTTGAC 2640 

CCCTGGACCC ATAAACCAGT CCACTGTTAT ACCCGGGGCA CTCTAACCAT CACAATCAAT 2700 

CAATCAAATT CCCTTAAATT TGTATGGCAC TGGAACTTTG GCAAAGCACT TTTGACAAGT 2760 

TGTGTCTGAT TGGAGCTTCA TGATAGCCTT GTGACATCTT TAGGGCAGGA TTCTTATCCC 2820 

CATTTTGCAG ATGAAAACCC TGAGTCACAG ATTTCTGTGG GACTGTGGAT CTCACTGGAA 2880 

GCTATCCAAG AGCCCACTGT CACCTTCTAG ACCACATGAT AGGGCTAGAC AGCTCAGTTC 2940 

ACCATGATTC TCTTCTGTCA CCTCTGCTGG CACACCAGTG GCAAGGGCCA GAATGGCGAC 3000 

CTCTCTTTAG CTCAATTTCT GGGCCTGAGG TGCTCAGACT GCCCCCAAGA TCAAATCTCT 3060 

CCTGGCTGTA GTAACCCAGT GGAATGAATT TGGACATGCC CCAATGCTTC TATATGCTAA 3120 

GTGAAATCTG TGTCTGTAAT TTGTTGGGGG GTGGATAGGG TGGGGTCTCC ATCTACTTTT 3180 
TGTCACCATC ATCTGAAATG GGGAAATATG TAAATAAATA TATCAGCAAA GC 

Seq ID NO: 614 Protein sequence 
Protein Accession #: NP_06B822.l 

1 11 21 31 41 51 

I I I I I I 

MEYTIDIIFS QTWNSKRTHE HEITMPNQMV RIYKDGKVLY TIRMTIDAGC SLHMLRFFMD 60 

SHSCPLSFSS FSYPENEMIY KWENFKLEIN EKNSWKLFQF DFTGVSNKTE IITTPVGDFM 120 

VMTIFFNVSR RFGYVAFQNY VPSSVTTMLS WVSFWIKTES APARTSLGIT SVLTMTTLGT 180 

FSRKNFPRVS YITALDFYIA ICFVFCFCAL LBFAVLNFLI YNQTKAHASP KLRHPRINSR 240 

AHARTRARSR ACARQHQEAF VCQIVTTEGS DGEERPSCSA QQPPSPGSPE GPRSLCSKLA 300 

CCEHCKRFKK YFCMVPDCEG STWQQGRLCI HVYRLDNYSR WFPVTFFFF NVLYWLVCLN 360 
Ii 
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Seq ID NO: 615 DNA sequence 

Nucleic Acid Accession #: NM_021990.1 

Coding sequence: 1309.. 2490 

1 11 21 31 41 51 

I I I I I I 

GCCAGAGCGT GAGCCGCQAC CTCCGCGCAG GTGGTCGCGC CGGTCTCCGC GGAAATGTTG 60 

TCCAAAGTTC TTCCAGTCCT CCTAGGCATC TTATTGATCC TCCAGTCGAG AACATGTATA 120 

CAGAGAAGTG CTCAAATCAT AAGTGTACAG CTGATGAGTT GTCAAAAAAT GACCACAGCG 180 

GTGTAAAGAA AGCCAAATCA AGGACCCGAA TGTGAGCAGG ACCTCAGAAG CCCCCTTTGT 240 

CACTGCCTCC CAGCAAAGGC AGCACTATCC GGACTTCTAA CACCATCGGT GAGTTTCATA 300 

CCTTGGCAGA TGGCCTTTAA CATTTTTGTT TAATTCAATT ATTCTTACTA ATCTTCTTCT 360 

TTTTCTTGGC TGTGGTGCAT GGCTGTGGAG CTCAGGGTGG ACTCCTGTTG GGCAGCCAGT 420 

TCCTGGATGG CTGTCTGTGG GTGGAGGACT CCTGCCTTTC CTGTTTAGAC ACCCACAAAG 480 

GCTGCTCTTT AGCCTCCTTC CCTTCATCCC CTTCCCCTGC CCCCAGTGCA ACGAGTATTA 540 

CACAACCAAC AAAACCGCAA AATATTCCCA CAATTTTCTG GTCCTCTCTG GGAGAGGCCG 600 

CTCTGGCTTT TCCTCTCAGC CCTGGCCCTC TGCCTGCTCC TCACTCCTGG TTGGTGCTGG 660 

TCAGGCTGAC TAGAGGCCAA GGCGACCAAC ACTAGGCAAA CGCGGCCAGC GCTCAGACAT 720 

AAATGCCCTC TTCATTTCAC GTGTAACATT CTTTTAAAAT CTAGGTCTTG GTTTTGTTGA 780 

TTTTTTCTTA AATAAAAGAG TGATCATAAA AGAGGGACAG CATAGAAAGT CCCCAAAGAG 840 

CAGCAAGGTT TTAAAGAAAT TCACAAGCCT AATCTGTCAC TGTCTTATAA TTTGCTATTA 900 

CCAGTCACAA TTTAACTAGG TTTTGTGTTG AAAACTTGTT TTGGTTTGCT TCTGTCCCAA 960 

GAGGCACTAG CTGGGGCCCC TACAGAGTGC AGGGCAGAGC TTCATTTTTC GTTTGAATGT 1020 

TCTAGGGTCG AGGGACCTCA GACTGAATCA AAGAATGAAG CCTCTTCCCG TGATGTTGTC 1080 

TATGGCCCCC AGCCCCAGCC TCTGGAAAAT CAGCTCCTCT CTGAGGAAAC AAAGTCAACT 1140 

GAGACTGAGA CTGGGAGCAG AGTTGGCAAA CTGCCAGAAG CCTCTCGCAT CCTGAACACT 1200 

ATCCTGAGTA ATTATGACCA CAAACTGCGC CCTGGCATTG GAGAGAAGCC CACTGTGGTC 1260 

ACTGTTGAGA TCTCCGTCAA CAGCCTTGGT CCTCTCTCTA TCCTAGACAT GGAATACACC 1320 

ATTGACATCA TCTTCTCCCA GACCTGGTAC GACGAACGCC TCTGTTACAA CGACACCTTT 1380 

GAGTCTCTTG TTCTGAATGG CAATGTGGTG AGCCAGCTAT GGATCCCGGA CACCTTTTTT 1440 

AGGAATTCTA AGAGGACCCA CGAGCATGAG ATCACCATGC CCAACCAGAT GGTCCGCATC 1500 

TACAAGGATG GCAAGGTGTT GTACACAATT AGGATGACCA TTGATGCCGG ATGCTCACTC 1560 

CACATGCTCA GATTTCCAAT GGATTCTCAC TCTTGCCCTC TATCTTTCTC TAGCTTTTCC 1620 

TATCCTGAGA ATGAGATGAT CTACAAGTGG GAAAATTTCA AGCTTGAAAT CAATGAGAAG 1680 

AACTCCTGGA AGCTCTTCCA GTTTGATTTT ACAGGAGTGA GCAACAAAAC TGAAATAATC 1740 

ACAACCCCAG TTGGTGACTT CATGGTCATG ACGATTTTCT TCAATGTGAG CAGGCGGTTT 1800 

GGCTATGTTG CCTTTCAAAA CTATGTCCCT TCTTCCGTGA CCACGATGCT CTCCTGGGTT 1860 

TCCTTTTGGA TCAAGACAGA GTCTGCTCCA GCCCGGACCT CTCTAGGGAT CACCTCTGTT 1920 

CTGACCATGA CCACGTTGGG CACCTTTTCT CGTAAGAATT TCCCGCGTGT CTCCTATATC 1980 

ACAGCCTTGG ATTTCTATAT CGCCATCTGC TTCGTCTTCT GCTTCTGCGC TCTGTTGGAG 2040 

TTTGCTGTGC TCAACTTCCT GATCTACAAC CAGACAAAAG CCCATGCTTC TCCTAAACTC 2100 

CGCCATCCTC GTATCAATAG CCGTGCCCAT GCCCGTACCC GTGCACGTTC CCGAGCCTGT 2160 

GCCCGCCAAC ATCAGGAAGC TTTTGTGTGC CAGATTGTCA CCACTGAGGG AAGTGATGGA 2220 

GAGGAGCGCC CGTCTTGCTC AGCCCAGCAG (XCCCTAGCC CAGGTAGCCC TGAGGGTCCC 2280 

CGCAGCCTCT GCTCCAAGCT GGCCTGCTGT GAGTGGTGCA AGCGTTTTAA GAAGTACTTC 2340 

TGCATGGTCC CCGATTGTGA GGGCAGTACC TGGCAGCAGG GCCGCCTCTG CATCCATGTC 2400 

TACOGGCTGG ATAACTACTC GAGAGTTGTT TTCCCAGTGA CTTTCTTCTT CTTCAATGTG 2460 

CTCTACTGGC TTGTTTGCCT TAACTTGTAG GTACCAGCTG GTACCCTGTG GGGCAACCTC 2520 

TCCAGTTCCC CAGGAGGTCC AAGCCCCTTG CCAAGGGAGT TGGGGGAAAG CAGCAGCAGC 2580 

AGCAGGAGCG ACTAGAGTTT TTCCTGCCCC ATTCCCCAAA CAGAAGCTTG CAGAGGGTTT 2640 

GTCTTTGCTG CCCCTCTCCC CTACCTGGCC CATTCACTGA GTTTTCTCAG CAGACCATTT 2700 

CAAATTATTA ATAAATGGGC CACCTCCCTC TTCTTCAAGG AGCATCCGTG ATGCTCAGTG 2760 

TTCAAAACCA CAGCCACTTA GTGATCAGCT CCCTAAAACC ATGCCTAAGT ACAGGCGGAT 2820 

TAGCTATCTT CCAACAATGC TGACCACCAG ACAATTACTG CATTTTTCCA GAAGCCCACT 2880 

ATTGCCTTTG CAGTGCTTTC GGCCCAGTTC TGGCCTCAGC CTCAAAGTGC ACCGACTAGT 2940 

TGCTTGCCTA TACCTGGCAC CTCATTAAGA TGCTGGGCAG CAGTATAACA GGAGGAAGAG 3000 

ATCCCTCTCC TTTGGTCAGA TTATTATGTT CTCAGTTCTC TCTCCCTGCT ACCCCTTTCT 3060 

CTGCAGATAG ATAGACACTG GCATTATCCC TTTAGGAAGA GGGGGGGGCA GCAAGAGAGC 3120 

CTATTTGGGA CAGCATTCCT CTCTCTCTGC TGCTGTGACA TCTCCCTCTC CTTGCTGGCT 3180 

CCATCTTTCG TCTGCACTAC CAATTCAATG CCCTTCATCC AATGGGTATC TATTTTTGTG 3240 

TGTGATTATA GTAACTACTC CCTGCTTTAT ATGCCACCCT CTTCCTTCTC TTTGACCCCT 3300 

GTGACTCTTT CTGTAACTTT CCCAGTGACT TCCCCTAGCC CTGACCAGGC ACTAGGCCTT 3360 

GGTGACTTCC TGGGGCCAAG AAACTAAGGA AACTCGGCTT TGCAACAGGC ATTACTCGCC 3420 

ATTGATTGGT GCCCACCCAG GGCACACTGT CGGAGTTCTA TCACTTGCTT GACCCCTGGA 3480 

CCCATAAAOC AGTCCACTGT TATACCCGGG GCACTCTAAC CATCACAATC AATCAATCAA 3540 

ATTCCCTTAA ATTTGTATGG CACTGGAACT TTGGCAAAGC ACTTTTGACA AGTTGTGTCT 3600 

GATTGGAGCT TCATGATAGC CTTGTGACAT CXTTAGGGCA GGATTCTTAT CCCCATTTTG 3660 

CAGATGAAAA CCCTGAGTCA CAGATTTCTG TGGGACTGTG GATCTCACTG GAAGCTATCC 3720 

AAGAGCCCAC TGTCACCTTC TAGACCACAT GATAGGGCTA GACAGCTCAG TTCACCATGA 3780 

TTCTCTTCTG TCACCTCTGC TGGCACACCA GTGGCAAGGC CCAGAATGGC GACCTCTCTT 3840 

TAGCTCAATT TCTGGGCCTG AGGTGCTCAG ACTGCCCCCA AGATCAAATC TCTCCTGGCT 3900 

GTAGTAACCC AGTGGAATGA ATTTGGACAT GCCCCAATGC TTCTATATGC TAAGTGAAAT 3960 

CTGTGTCTGT AATTTGTTGG GGGGTGGATA GGGTGGGGTC TCCATCTACT TTTTGTCACC 4020 
ATCATCTGAA ATGGGGAAAT ATGTAAATAA ATATATCAGC AAAGC 

Seq ID NO: 616 Protein sequence 
Protein Accession #: NP 068830.1 



or . 1 11 21 31 41 51 

80 | | i | | | 

MEYTIDIIFS QTWYDERIiCY NDTFESLVLN GNWSQLWIP DTFFRNSKRT HEHEITMPNQ 60 

MVRIYKDGKV LYTIRMTIDA GCSLHMLRFP MDSHSCPLSF SSPSYPENEM IYKWENFKLE 120 

INEKNSWKLF QFDFTGVSNK TEIITTPVGD FMVMTIFFNV SRRFGYVAFQ NYVPSSVTTM 180 

LSWVSFWIKT ESAPARTSLG ITSVLTMTTL GTFSRKNFPR VSYITALDFY IAICFVFCFC 240 

1188 



WO 03/042661 



ALLEFAVLNF LIYNQTKAHA SPKLRHPRIN SRAHARTRAR SRACARQHQE APVCQIVTTE 300 

GSDGEERPSC SAQQPPSPGS PEGPRSLCSK LACCEWCKRF KKYFCMVPDC EGSTWQQGRL 360 
CIHVYRLDNY SRWFPVTFF FFNVLYWLVC LNL 

Seq ID NO: 617 DNA sequence 

Nucleic Acid Accession tt: NMJ)04864.1 

Coding sequence: 26.. 952 

1 11 21 31 41 51 

I I I I I I 

CGGAACGAGG GCAACCTGCA CAGCCATGCC CGGGCAAGAA CTCAGGACGG TGAATGGCTC 60 

TCAGATGCTC CTGGTGTTGC TGGTGCTCTC GTGGCTGCCG CATGGGGGCG CCCTGTCTCT 120 

GGCCQAGGOG AGCCGCGCAA GTTTCCCGGG ACCCTCAGAG TTGCACTCCG AAGACTCCAG 1B0 

ATTCCGAGAG TTGCGGAAAC GCTACGAGGA CCTGCTAACC AGGCTGCGGG CCAACCAGAG 240 

CTGGGAAGAT TCGAACACCG ACCTCGTCCC GGCCCCTGCA GTCCGGATAC TCACGCCAGA 300 

AGTGCGGCTG GGATCCGGCG GCCACCTGCA CCTGCGTATC TCTCGGGCCG CCCTTCCCGA 360 

GGGGCTCCCC GAGGCCTCCC GCCTTCACCG GGCTCTGTTC CGGCTGTCCC CGACGGCGTC 420 

AAGGTCGTGG GACGTGACAC GACCGCTGCG GCGTCAGCTC AGCCTTGCAA GACCCCAAGC 480 

GCCCGCGCTG CACCTGCGAC TGTCGCCGCC GCCGTCGCAG TCGGACCAAC TGCTGGCAGA 540 

ATCTTCGTCC GCACGGCCCC AGCTGGAGTT GCACTTGCGG CCGCAAGCCG CCAGGGGGCG 600 

CCGCAGAGCG CGTGCGCGCA ACGGGGACGA CTGTCCGCTC GGGCCCGGGC GTTGCTGCCG 660 

TCTGCACACG GTCCGCGCGT CGCTGGAAGA CCTGGGCTGG GCCGATTGGG TGCTGTCGCC 720 

ACGGGAGGTG CAAGTGACCA TGTGCATCGG CGCGTGCCCG AGCCAGTTCC GGGCGGCAAA 7Q0 

CATGCACGCG CAGATCAAGA CGAGCCTGCA CCGCCTGAAG CCCGACACGG AGCCAGCGCC 840 

CTGCTGCGTG CCCGCCAGCT ACAATCCCAT GGTGCTCATT CAAAAGACCG ACACCGGGGT 900 

GTCGCTCCAG ACCTATGATG ACTTGTTAGC CAAAGACTGC CACTGCATAT GAGCAGTCCT 960 

GGTCCTTCCA CTGTGCACCT GCGOGGGGGA GGCGACCTCA GTTGTCCTGC CCTGTGGAAT 1020 

GGGCTCAAGG TTCCTGAGAC ACCCGATTCC TGCCCAAACA GCTGTATTTA TATAAGTCTG 1080 

TTATTTATTA TTAATTTATT GGGGTGACCT TCTTGGGGAC TCGGGGGCTG GTCTGATGGA 1140 

ACTGTGTATT TATTTAAAAC TCTGGTGATA AAAATAAAGC TGTCTGAACT GTTAAAAAAA 1200 



Seq ID NO: 618 Protein sequence 
Protein Accession ft: NP_004855.1 

1 11 21 31 41 51 

I I I I I | 

MPGQELRTVN GSQMLLVLLV LSWIiPHGGAL SLAEASRASF PGPSELHSED SRFRELRKRY 60 

EDLLTRLRAN QSWEDSNTDL VPAPAVRILT PEVRLGSGGH LHLRISRAAL PEGI*PEASRL 120 

HRAI*FRIiSPT ASRSWDVTRP LRRQLSLARP QAPALHLRLS PPPSQSDQLL AESSSARPQL 180 

ELHLRPQAAR GRRRARARNG DDCPLGPGRC CRLHTVRASL EDLGWADWVL SPREVQVTMC 240 

IGACPSQFRA ANMHAQIKTS LHRLKPDTEP APCCVPASYN PMVLIQKTDT GVSLQTYDDL 300 
LAKDCHCI 

Seq ID NO: 619 DNA sequence 

Nucleic Acid Accession fh NM 003979.2 

Coding sequence: 254.. 1357 ~ 

1 11 21 31 41 51 

I I I I I I 

A TAAC AGCAT GAAGTGCCGT GGAACTGGAA TAGGCGTGTC CTCTCCCTCG ACCCTCCCCC 60 

TCCTTGTCCC TCTGCTCACC CCTCGCTCGT TCCCTCCCTC CGGCGAGGGC CGCCTTTATA 120 

ACAACTGCTC AGAGTGCGAG GGCGGGATAG CTGTCCAAGG TCTCCCCCAG CACTGAGGAG 180 

CTCGCCTGCT GCCCTCTTGC GCGCGGGAAG CAGCACCAAG TTCACGGCCA ACGCCTTGGC 240 

ACTAGGGTCC AGAAT GGCTA CAACAGTCCC TGATGGTTGC CGCAATGGCC TGAAATCCAA 300 

GTACTACAGA CTTTGTGATA AGGCTGAAGC TTGGGGCATC GTCCTAGAAA OGGTGGCCAC 360 

AGCCGGGGTT GTGACCTOGG TGGCCTTCAT GCTCACTCTC CCGATCCTCG TCTGCAAGGT 420 

GCAGGA CTCC AACAGGCGAA AAATGCTGCC TACTCAGTTT CTCTTCCTCC TGGGTGTGTT 480 

GGGCATCTTT GG CCTCA CCT TCGCCTTCAT CATCGGACTG GACGGGAGCA CAGGGCCCAC 540 

ACGCTTCTTC CTCTTTGGGA TCCTCTTTTC CATCTGCTTC TCCTGCCXGC TGGCTCATGC 600 

TGTCAGTCTG ACCAAGCTCG TCCGGGGGAG GAAGCCCCTT TCCCTGTTGG TGATTCTGGG 660 

TCTGGCCGTG GGCTTCAGCC TAGTCCAGGA TGTTATCGCT ATTGAATATA TTGTCCTGAC 720 

CATGAATAGG ACCAACGTCA ATGTCTTTTC TGAGCTTTCC GCTCCTCGTC GCAATGAAGA 780 

CTTTGTCCTC CTGCTCACCT ACGTCCTCTT CTTGATGGCG CTGACCTTCC TCATGTCCTC 840 

CTTCACCTTC TGTGGTTCCT TCACGGGCTG GAAGAGACAT GGGGCCCACA TCTACCTCAC 900 

GATGCTCCTC TCCATTGCCA TCTGGGTGGC CTGGATCACC CTGCTCATGC TTCCTGACTT 960 

TGACCGCAGG TGGGATGACA CCATCCTCAG CTCCGCCTTG GCTGCCAATG GCTGGGTGTT 1020 

CCTGTTGGCT TATGTTAGTC CCGAGTTTTG GCTGCTCACA AAGCAACGAA ACCCCATGGA 1080 

TTATCCTGTT GAGGATGCTT TCTGTAAACC TCAACTCGTG AAGAAGAGCT ATGGTGTGGA 1140 

GAACAGAGCC TACTCTCAAG AGGAAATCAC TCAAGGTTTT GAAGAGACAG GGGACACGCT 1200 

CTATGCCCCC TATTCCACAC ATTTTCAGCT GCAGAACCAG CCTCCCCAAA AGGAATTCTC 1260 

CATCCCACGG GCCCACGCTT GGCCGAGCCC TTACAAAGAC TATGAAGTAA AGAAAGAGGG 1320 

CAGCTAACTC TGTCCTGAAG AGTGGGACAA ATGCAGCCGG GCGGCAGATC TAGCGGGAGC 1380 

TCAAAGGGAT GTGGGCGAAA TCTTGAGTCT TCTGAGAAAA CTGTACAAGA CACTACGGGA 1440 

ACAGTTTGCC TCCCTCCCAG CCTCAACCAC AATTCTTCCA TGCTGGGGCT GATGTGGGCT 1500 

AGTAAGACTC CAGTTCTTAG AGGCGCTGTA GTATTTTTTT TTTTTTGTCT CATCCTTTGG 1560 

ftTACTTCTTT TAAGTGGGAG TCTCAGGCAA CTCAAGTTTA GACCCTTACT C T TT TT G T TT 1620 

GTTTTTTGAA ACAGGATCTT GCTCTGTCAC CCAGGCTTGA GTGCAGTGGT GCGATCACAG 1680 

CCCAGTGCAG CCTCGACCAC CTGTGCTCAA GCAATCCTCC CATCTCCATC TCCCAAAGTG 1740 

CTGGGATGAC AGGCGTGAGC CACAGCTCCC AGCCTAGGCC CTTAATCTTG CTGTTATTTT 1800 

CCATGGACTA AAGGTCTGGT CATCTGAGCT CAOGCTGGCT CACACAGCTC TAGGGGCCTG 1860 

CTCCTCTAAC TCACAGTGGG TTTTGTGAGG CTCTGTGGCC CAGAGCAGAC CTGCATATCT 1920 

GAGCAAAAAT AGCAAAAGCC TCTCTCAGCC CACTGGCCTG AATCTACACT GGAAGCCAAC 1980 

TTGCTGGCAC CCCCGCTCCC CAACCCTTCT TGCCTGGGTA GGAGAGGCTA AAGATCACCC 2040 

1189 
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TAAATTTACT 
ATTCACAGGT 
TAATCTCCCC 
CTCCTTGTCA 
CTCATCTTGC 
GTGGGCATGO 
GCAATAAAQA 



CATCTCTCTA 
CACCCCTCTC 
CTACGCTCTG 
GGAGAATTTG 
ACCCCAACCT 
TCTCCTAATG 



GTGCTGCCTC 
TTCTTGCACT 
CCAGGAATTC 
TAGATCATTC 
CTGTAAATAG 
GAGGAGTGTT 
ACTCTTTCAT 



ACATTGGGCC 
GTCCCCAAAC 
TTTCAGACCT 
TCACTTCAAA 
ATTTACCGCA 
CATTGTATAA 
GGTGGTGGCA 



TCAGCAGCTC 
TTGCTGTCAA 
CACTAGCACA 
TTCCTGGGGC 
TTTACGGCTG 
TAAOTTATTC 
GCAAAAAAAA 



CCCAGCACCA 2100 

TTCCGAGATC 2160 

AGCCCGGTTG 2220 

TGATACTTCT 2280 

CATTCTGTAA 2340 

ACCTGAGTAT 2400 
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Seq ID NO: 620 Protein sequence 
Protein Accession #: KP 003970.1 



MATTVPDGCR 
RRKMLPTQFL 
KLVRGRKPLS 
LTYVLFLMAL 
DDTILSSALA 
SQEEITQGFE 



11 
I 

NGLKSKYYRL 
FLLGVLGIFG 
LLVILGLAVG 
TFLMSSFTFC 
ANGWVFLLAY 
ETGDTLYAPY 



21 
I 

CDKAEAWGIV 
LTFAFI IGLD 
FSLVQDVIAI 
GSFTGWKRHG 
VSPEFWLLTK 
STHFQLQNQP 



31 
I 

LETVATAGW 
GSTGPTRFFL 
EYIVLTMNRT 
AHIYLTMLLS 
QRNPMDYPVE 
PQKEFSIPRA 



41 

I 

TSVAFMLTLP 
FGILFSICFS 
NVNVFSELSA 
IAIWVAWITL 
DAFCKPQLVK 
HAWPSPYKDY 



51 
I 

ILVCKVQDSN 
CLLAHAVSLT 
PRRNEDFVLL 
LMLPDFDRRW 
KSYGVENRAY 
EVKKEGS 



Seq ID NOs 621 DNA sequence 
Nucleic Acid Accession #: NM_002423.2 
Coding sequence: 4 8.. 851 



ACCAAATCAA 
TGCTGTGTGC 
GAGGCATGAG 
ATGACTCAGA 
TCTTTGGCCT 
CCAGATGTGG 
CTTCCAAAGT 
TGGATCGATT 
GGAAAGTTGT 
ACTCCTACCC 
GTCTCGGAGG 
GGATTAACTT 
CCTCTGATCC 
AACTTTCCCA 
GAAAGAAATA 
TGTTGCACAA 
CTTTTTTATT 
ATGGTGTGAC 
ATGTTACATA 



11 
I 

CCATAGGTCC 
TGTGTGCCTG 
TGAGCTACAG 
AACAAAAAAT 
ACCTATAACT 
AGTGCCAGAT 
GGTCACCTAC 
AGTGTCAAAG 
ATGGGGAACT 
ATTTGATGGG 
AGATGCTCAC 
CCTGTATGCT 
TAATGCAGTG 
GGATGATATT 
GAAACTTCAG 
TCAGAATTGA 
GCAGTTGGTT 
TGTGTCTTAT 
CACAAATAAA 



21 
I 

AAGAACAATT 
CTGCCTGGCA 
TGGGAACAGG 
GCCAACAGTT 
GGAATGTTAA 
GTTGCAGAAT 
AGGATCGTAT 
GCTTTAAACA 
GCTGACATCA 
CCAGGAAACA 
TTCGATGAGG 
GCAACTCATG 
ATGTATCCAA 
AAAGGCATTC 
GCAGAACATC 
TAAGGACTGT 
TTTGAATGTC 
TCCATCTATG 
TAAAATGTTT 



31 
I 

GTCTCTGGAC 
GCCTGGCCCT 
CTCAGGACTA 
TAGAAGCCAA 
ACTCCCGCGT 
ACTCACTATT 
CATATACTCG 
TGTGGGGCAA 
TGATTGGCTT 
CGCTGGCTCA 
ATGAACGCTG 
AACTTGGCCA 
CCTATGGAAA 
AGAAACTATA 
CATTCATTCA 
TCCTCCACTC 
TTTCACTCCT 
AGCTTTGTCA 
ATTCCATGGT 



41 
I 

GGCAGCTATG 
GCCGCTGCCT 
TCTCAAGAGA 
ACTCAAGGAG 
CATAGAAATA 
TCCAAATAGC 
AGACTTACCG 
AGAGATCCCC 
TGCGCGAGGA 
TGCCTTTGCG 
GACGGATGGT 
TTCTTTGGGT 
TGGAGATCCC 
TGGAAAGAGA 
TTCATTGGAT 
CATTTAGCAA 
TTTATTGGTT 
GTGCGGGTAG 
AAATTTA 



51 
I 

CGACTCACCG 
CAGGAGGCGG 
TTTTATCTCT 
ATGCAAAAAT 
ATGCAGAAGC 
CCAAAATGGA 
CATATTACAG 
CTGCATTTCA 
GCTCATGGGG 
CCTGGGACAG 
AGCAGTCTAG 
ATGGGACATT 
CAAAATTTTA 
AGTAATTCAA 
TGTATATCAT 
TTATGTCACC 
AAACTCCTTT 
ATGTCAATAA 



Seq ID NO: 622 Protein sequence 
Protein Accession #: NP 002414.1 



51 



1 11 21 31 41 

I I I ! I 

MRLTVLCAVC LLPGSLALPL PQEAGGMSEL QWEQAQDYLK RFYLYDSETK NANSLEAKLK 
EMQKFFGLPI TGMLNSRVIE IMQKPROGVP DVAEYSLFPN SPKWTSKWT YRIVSYTRDL 
PHITVDRLVS KALNMWGKEI PLHFRKWWG TADIMIGFAR GAHGDSYPFD GPGNTLAHAF 
APGTGLGGDA HFDEDERWTD GSSLGIHFLY AATHELGHSL GMGHSSDPNA VMYPTYGNGD 
PQNFKLSQDD IKGIQKLYGK RSNSRKK 

Seq ID NO: 623 DNA sequence 

Nucleic Acid Accession #: NMJJ31457.1 

Coding sequence: 204.. 956 



1 

I 

AAACAGGAAA 
CAGGAAATAG 
ATTCTCTCAG 
ACGTGGCTGC 
CTGTGTTGGT 
TGCCCCTGTA 
TGGTGTCGAA 
GGGCCATCCA 
TTCTCGTAGG 
TGTGGTTTAT 
GCCTGCTGTC 
TCATACTCTT 
CTTACGCCTG 
TCCTGGAGTT 
AATCAAGCAA 
CAGAACCGGT 
ACAGATTCTG 
CCATAACCCA 
TCTCACCTTC 
ACAGAGATTT 
TATGTGGGCA 



11 
I 

TAAATACGAA 
AGGACTTCGG 
GAAAAAAAAC 
TGGCAGAGCA 
GGTGGCACCC 
TCCAAACAGC 
TGTGAATGGG 
GATCATCATT 
GGAATACCTG 
CATTTCAGGA 
TGGCAGTTTG 
CATCACAGAT 
GGGTGTGAAC 
TGGCATCGCA 
TGTGAGTGTC 
GACCTCACCA 
GAAGCATCTT 
GGTCGTTCCT 
ATTCTTCAAT 
TAAACAGATG 
TCCAGCCTCT 



21 
I 

TGAAACTGAG 
ATCCTTCTAA 
AAGGTCCCCA 
AGCATGAATT 
CACAATGGTT 
CAGCCGCAAG 
CAGCCTGTGC 
GGCCTGGCTC 
TCTATTTCAT 
TCTCTCTCCG 
GGCTTGAACA 
CTAAGTATTC 
CCTGGAATGG 
TGCGCATCTT 
ATCTATCCAA 
CCAAGTTATT 
TCACTGGGAC 
GTTCTGACAG 
TCAGTCTAGG 
TTAACCAAGA 
GGGGCCTTGG 



31 
I 

CTCTAAGCAG 
CCCTACCACC 
CAGCAAAGAA 
CGATGACTTC 
ATCCTGTGAC 
TCCACCTAGT 
AGAAAGCTCT 
ACATCGGCCT 
TCTACGGAGG 
TGGCAGCAGA 
TCGTCAGTGC 
CCCACCCATA 
CGATTTCTGG 
CCCACTTTGG 
ACATCTATGC 
CCAGTGAGAT 
CAAAAGAAGT 
CTGAGGAAAC 
AAACCATGCT 
GGGACTCCCT 
CACACACACA 



41 

I 

CATGTAACCT 
CAACTGGCCC 
AAGGAATAGG 
AGCAGTTCOG 
CCCAGGAATT 
TCCTGGGAAC 
GAAAGAAGGC 
CGGCTCCATC 
CTTTCCCTTC 
AAATCAGCCA 
AATCTGCTCT 
TGCCTACCCC 
CGTGCTGCTG 
CTGCCAGTTG 
AGCAAACCCA 
CCAAGCAAAT 
CCTCCTCCCT 
GTCTCTCCCA 
GTTTCTCTAT 
AGGGCACATG 
TTCGTGTGCT 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
B40 
900 
960 
1020 
1080 



60 
120 
180 
240 



51 

! 

GGCCTGCATC 
CAGTACATTC 
ATCAAGAGAT 
GTGGCCAATT 
ATGTCTCACG 
CCACCTAGTT 
AAAACCTTGG 
ATGGCGACGG 
TGGGGAGGCT 
TATTCTTATT 
GCAGTTGGAG 
GACTATTATC 
GTCTTCTGCC 
GTCTGCTGTC 
GTGATCACCC 
AAGTAAGGCT 
TTCTGGGCTT 
CTGTTTGTAC 
CAAGAAGAAG 
CATCAGCACA 
CTGCTGCATG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
7B0 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



1190 



WO 03/042661 



PCT/US02/36810 



TGAGCTTGTG GGTTAGAGGA ACAAATATCT AGACATTCAA TCTTCACTCT TTCAATTGTG 1320 
CATTCATTTA ATAAATAGAT ACTGAGCATT CAAAAAAAAA AAAAAAAAA 

Seq ID NO: 624 Protein sequence 
J Protein Accession fl: NP_113645.1 

1 11 21 31 41 51 

I I I I I I 

MNSMTSAVPV ANSVLWAPH NGYPVTPGIM SHVPLYPNSQ PQVHLVPGNP PSLVSNVNGQ 60 

1U PVQKALKEGK TLGAIQIIIG LAHIGLGSIM ATVLVGEYLS ISFYGGFPFW GGLWFIISGS 120 

LSVAAENQPY SYCLLSGSLG LNIVSAICSA VGVILPITDL SIPHPYAYPD YYPYAWGVNP 180 

GMAISGVLLV FCLLEFGIAC ASSHFGCQLV CCQSSNVSVI YPNIYAANPV ITPEPVTSPP 240 
SYSSEIQANK 



15 



30 
35 
40 
45 
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60 
65 
70 



80 



Seq ID NO: 625 DNA sequence 

Nucleic Acid Accession #: NMJ)0S221.3 

Coding sequence: 1..B70 



-^1 11 21 31 41 51 

20 | | | | | | 

ATGACAGGAG TGTTTGACAG AAGGGTCCCC AGCATCCGAT CCGGCGACTT CCAAGCTCCG 60 

TTCCAGACGT CCGCAGCTAT GCACCATCCG TCTCAGGAAT CGCCAACTTT GCCCGAGTCT 120 

TCAGCTACCG ATTCTGACTA CTACAGCCCT ACGGGGGGAG CCCCGCACGG CTACTGCTCT 180 

CCTACCTCGG CTTCCTATGG CAAAGCTCTC AACCCCTACC AGTATCAGTA TCACGGCGTG 240 

25 AACGGCTCCG CCGGGAGCTA CCCAGCCAAA GCTTATGCCG ACTATAGCTA CGCTAGCTCC 300 

TACCACCAGT ACGGCGGCGC CTACAACOGC GTCCCAAGCG CCACCAACCA GCCAGAGAAA 360 

GAAGTGACCG AGCCCGAGGT GAGAATGGTG AATGGCAAAC CAAAGAAAGT TCGTAAACCC 420 

AGGACTATTT ATTCCAGCTT TCAGCTGGCC GCATTACAGA GAAGGTTTCA GAAGACTCAG 480 

TACCTCGCCT TGCCGGAACG CGCCGAGCTG GCCGCCTCGC TGGGATTGAC ACAAACACAG 540 

GTGAAAATCT GGTTTCAGAA CAAAAGATCC AAGATCAAGA AGATCATGAA AAACGGGGAG 600 

ATGCCCCCGG AGCACAGTCC CAGCTCCAGC GACCCAATGG CGTGTAACTC GCCGCAGTCT 660 

CCAGCGGTGT GGGAGCCCCA GGGCTCGTCC CGCTCGCTCA GCCACCACCC TCATGCCCAC 720 

CCTCCGACCT CCAACCAGTC CCCAGCGTCC AGCTACCTGG AGAACTCTGC ATCCTGGTAC 780 

ACAAGTGCAG CCAGCTCAAT CAATTCCCAC CTGCCGCCGC CGGGCTCCTT ACAGCACCCG 840 
CTGGCGCTGG CCTCCGGGAC ACTCTATTAG 



Seq ID NO: 626 Protein sequence 
Protein Accession #: NP_005212.1 

1 11 21 31 41 51 

I I I I I I 

MTGVFDRRVP SIRSGDFQAP FQTSAAMHHP SQESPTLPES SATDSDYYSP TGGAPHGYCS 60 
PTSASYGKAL NPYQYQYHGV NGSAGSYPAK AYADYSYASS YHQYGGAYNR VPSATNQPEK 120 
EVTEPEVRMV NGKPKKVRKP RTIYSSPQLA ALQRRFQKTQ YLALPERAEL AASLGLTQTQ 180 
VKIWFQNKRS KIKKIMKNGB MPPEHSPSSS DPMACNSPQS PAVWEPQGSS RSLSHHPHAH 240 
PPTSNQSPAS SYUSNSASWY TSAASSINSH LPPPGSLQHP LALASGTLY 



Seq ID NO: 627 DNA sequence 
Nucleic Acid Accession #: NMJU4420 
J\) Coding sequence: 118., 792 



1 11 21 31 41 51 

1 I I I I I 

GCACGAGAGA CGACGTGCTG AGCTGCCAGC TTAGTGGAAG CTCTGCTCTG GGTGGAGAGC 60 

AGCCTCGCTT TGGTGACGCA CAGTGCTGGG ACCCTCCAGG AGCCCCGGGA TTGAAGGATG 120 

GTGGCGGCCG TCCTGCTGGG GCTGAGCTGG CTCTGCTCTC CCCTGGGAGC TCTGGTCCTG 180 

GACTTCAACA ACATCAGGAG CTCTGCTGAC CTGCATGGGG CCCGGAAGGG CTCACAGTGC 240 

CTGTCTGACA CGGACTGCAA TACCAGAAAG TTCTGCCTCC AGCCCCGCGA TGAGAAGCCG 300 

TTCTGTGCTA CATGTCGTGG GTTGCGGAGG AGGTGCCAGC GAGATGCCAT GTGCTGCCCT 360 

GGGACACTCT GTGTGAACGA TGTTTGTACT ACGATGGAAG ATGCAACCCC AATATTAGAA 420 

AGGCAGCTTG ATGAGCAAGA TGGCACACAT GCAGAAGGAA CAACTGGGCA CCCAGTCCAG 480 

GAAAACCAAC CCAAAAGGAA GCCAAGTATT AAGAAATCAC AAGGCAGGAA GGGACAAGAG 540 

GGAGAAAGTT GTCTGAGAAC TTTTGACTGT GGCCCTGGAC TTTGCTGTGC TCGTCATTTT 600 

TGGACGAAAA TTTGTAAGCC AGTCCTTTTG GAGGQACAGG TCTGCTCCAG AAGAGGGCAT 660 

AAAGACACTG CTCAAGCTCC AGAAATCTTC CAGCGTTGCG ACTGTGGCCC TGGACTACTG 720 

TGTCGAAGCC AATTGACCAG CAATCGGCAG CATGCTCGAT TAAGAGTATG CCAAAAAATA 780 

GAAAAGCTAT AAATATTTCA AAATAAAGAA GAATCCACAT TGCAAAAAAA AAAAAAAAAA 840 
A 

Seq ID NO: 628 Protein sequence 
Protein Accession #: NP 055235 



1 11 21 31 41 51 

7 r I I I I I I 

/D MVAAVLLGLS WLCSPLGALV LDFNNIRSSA DLHGARKGSQ CLSDTDCNTR KFCLQPRDEK 60 
PFCATCRGLR RRCQRDAMCC PGTLCVNDVC TTMEDATPIL ERQLDEQDGT HAEGTTGHPV 120 
QENQPKRKPS IKKSQGRKGQ EGESCLRTFD CGPGLCCARH FWTKICKPVL LEGQVCSRRG 180 
HKDTAQAPEI FQRCDCGPGL LCRSQLTSNR QHARLRVCQK IEKL 



Seq ID NO: 629 DNA sequence 

Nucleic Acid Accession ft: NM_002448.1 

Coding sequence: 241.. 1134 

1 11 21 31 41 51 

1191 



WO 03/042661 



PCT/US02/36810 
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GCGCGAGTGC 
ACGCCGGAGC 
AGGCCCCGGG 
CGGCTGGCCA 
ATGACTTCTT 
GGAGGCGCGG 
6AGGAGGGGG 
ATGGCCGACC 
CAGGCGGCGG 
QACQCQCCCT 
CCAGAAGATO 
CAGAGCCCCC 
CGCAAACACA 
CTGGAGCGCA 
AGCTCGCTCA 
GCAAAGAGAC 
CCACCGGCTG 
GCGGGTGCCT 
CCCGTGGGAC 
CCAGGTCCCC 
CTCCCCTGCT 
CTCTTTGCTC 
GTCCCTTAGT 
AGGGCAGAGA 
CTAGAGGCCA 
GAGAAATCGG 
AAAAAAAAAA 
TCCATTGGAA 
ACACAGATGT 



TCCCGGGAAC 
TGGCCTGCTG 
CCCTCGCAGA 
GTGCTGCGGC 
TGCCACTCGQ 
GCCAGGCCCC 
CCAAGCCCAA 
ACAGGAAGCC 



CTTCGCCGCG 
CGCTCGTCAA 
GCTTCTCCCC 
AGACGAACCG 
AGTTCCGCCA 
GCCTCACTGA 
TACAAGAGGC 
CCTTCGGCCT 
CGCTCTACGG 
TCTACACGGC 
ACCTGTGGGC 
CGGCACCGCC 
CCTGAGTTCA 
ACTCTTCTAG 
GGTTAACAGA 
TGTCTCCTGC 
TGTCTGACGA 
GAAAAGAGAA 
AATTTAAGAC 
GTTGCAAAGG 



I 

TCTGCCTGCG 
GGGAGGGGCG 
GGCCGGCCGC 
AGAAGGGGGG 
TGTCAAAGTG 
CAGCGCCGCC 
AGTGTCCCCT 
GGGGGCCAAG 
GCAGCCACTG 
GCCGCTOGGC 
AGCCGAGAGC 
GCCGCCGGCC 
TAAGCCGCGG 
GAAGCAGTAC 
GACGCAGGTG 
AGAGCTGGAG 
CTCCTTCCCT 
TGCCTCTGGC 
CCATGTGGGC 
CAGCCGATTC 
AGCCGCCTTC 
CTCTCCGAAG 
CATTTAGATC 
TTTATCTAGG 
ATAGCTTTTC 
TTTTGGAAAT 
AAAAAAGACT 
AAGTTCAACA 
TAGGTTGAAG 



I 

CGGCGGCAGC 
GGAGGCGCGC 
GCTCCCAGCC 
GCCCGGCTCT 
GAGGACTCCG 
GCGGCCACGG 
TCGCTCCTGC 
GAGAGCGCCC 
GGCGTCCCGC 
CATTTCTCGG 
CCCGAGAAGC 
AGGCGGCTGA 
ACGCCCTTCA 
CTGTCCATCG 
AAGATATGGT 
AAGCTGAAGA 
CTCGGCGGCC 
CCCTTCCAGC 
TACAGCATGT 
CTCCAGCCCT 
CCTTTAACCC 
TCTGATCCCT 
TACACTCTCG 
GTCCCCAGCA 
CCTGTCCTGA 
GAGAACAATC 
AGCCAGCCAG 
ACAAAACATT 
GGA 



GACCGGAGGC 
GOGGGAGGGT 
CGCCCGGAGC 
GCATGGCCCC 
CCTTCGGCAA 
CAGCCGCCAT 
CCTTCAGCGT 
TGGCGCCCTC 
CGGGGTCGCT 
TGGGGGGACT 
CCGAGAGGAC 
GCCCCCCAGC 
CCACCGCGCA 
CCGAGCGCGC 
TCCAGAACCG 
TGGCCGCCAA 
CCGCAGCTGT « 
GCGCCGCGCT 
ACCACCTGAC 
GGTGCTGTAC 
TCACACTGCT 
GCCAAAAAGT 
AGTTAAAGAT 
GAATTGACAG 
CACCAGGCAA 
TCAAAAAAAA 
GAAGATGAAT 



Seq ID NO: 630 Protein sequence 
Protein Accession #: NP_002439.l 



l 
1 

MTSLPLGVKV 
MADHRKPGAK 
PEDALVKAES 
LERKFRQKQY 
PPAAFGLSFP 



11 



21 



41 



CAGGCCCAGC 
CCGCCCGGCC 
CCATGCCCGG 
GGCTGCTGAC 
GCCGGCGGGG 
GGGCGCGGAC 
GGAGGCGCTC 
CGAGGGCGTG 
GGGAGCCCCG 
CCTCAAGCTG 
CCCGTGGATG 
CTGCACCCTC 
GCTGCTGGCG 
GGAGTTCTCC 
CCGCGCCAAG 
GCCCATGCTG 
. AGCGGCCGCG 
GCCTGTGGCG 
ATAGAGGGTC 
CCCCGACGTG 
CCAGTTTCAC 
GGCTGGAAGA 
GGGGAAACTG 
TTGAACAGAG 
GAAAAGCGCA 
AAAAAAAAAA 
CCTAGCTTCT 
GGCAGGGAAA 



51 



Seq ID NO: 631 DNA sequence 

Nucleic Acid Accession ft: NM_002557.l 

Coding sequence: 13.. 2049 



11 



21 



31 



41 
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31 

I ! I 

EDSAFGKPAG GGAGQAPSAA AATAAAMGAD EEGAKPKVSP SLLPFSVEAL 
ESALAPSEGV QAAGGSAQPL GVPPGSLGAP DAPSSPRPLG HFSVGGLLKL 
PEKPERTPWM QSPRFSPPPA RRLSPPACTL RKHKTNRKPR TPFTTAQLIA 
LSIAERAEFS SSLSLTETQV KIWFQNRRAK AKRLQEAELE KLKMAAKPML 
LGGPAAVAAA AGASLYGASG PFQRAALPVA PVGLYTAHVG YSMYHLT 



60 
120 
180 
240 



51 

I I I 

CAGACCATTG AGATGTGGAA GCTGTTGCTG TGGGTTGGGC TGGTTCTTGT GCTGAAACAC 
CAOGATGGTG CTGCCCATAA ACTCGTGTGT TATTTCACCA ACTGGGCACA CAGTCGGCCA 
GGCCCTGCCT CGATCTTGCC CCATGACCTG GACCCCTTTC TCTGCACCCA CCTGATATTT 
GCCTTTGCCT CAATGAACAA CAATCAGATT GTTGCTAAGG ATCTCCAGGA TGAGAAAATT 
CTCTACCCAG AGTTCAACAA ACTAAAGGAG AGGAACAGAG AGCTGAAAAC ACTACTGTCC 
ATCGGCGGGT GGAACTTTGG CACCTCAAGA TTCACCACTA TGTTGTCCAC ATTTGCCAAC 
CGTGAAAAGT TTATTGCTTC AGTTATATCC CTTCTGAGGA CACATGACTT TGATGGTCTT 
GACCTTTTCT TCTTATATCC TGGACTAAGA GGCAGCCCCA TGCATGACCG GTGGACTTTT 
CTCTTCTTAA TTGAAGAGCT CCTGTTTGCC TTCCGGAAGG AGGCACTGCT CACCATGCGC 
CCGAGGCTGC TGCTGTCTGC TGCTGTTTCT GGGGTCCCAC ACATCGTCCA AACATCCTAT 
GATGTGCGCT TTCTAGGAAG ACTCCTGGAT TTCATCAATG TCTTGTCTTA TGACTTACAT 
GGAAGTTGGG AAAGGTTCAC AGGACATAAT AGCCCCCTCT TCTCTCTGCC TGAAGACCCC 
AAATCTTCGG CATATGCTAT GAATTATTGG AGAAAGCTTG GGGCACCCTC AGAGAAGCTC 
ATCATGGGGA TCCCCACCTA TGGAOGTACC TTTCGCCTCC TCAAAGCCTC TAAGAATGGG 
TTGCAGGCCA GAGCGATCGG ACCAGCATCT CCAGGGAAGT ACACCAAGCA AGAAGGCTTC 
TTGGCTTATT TTGAGATTTG TTCCTTTGTC TGGGGAGCGA AGAAGCACTG GATTGATTAC 
CAGTATGTCC CGTATGCCAA CAAGGGGAAA GAGTGGGTTG GCTATGACAA TGCCATCAGC 
TTCAGTTACA AGGCATGGTT TATAAGGCGA GAGCATTTTG GGGGGGCCAT GGTGTGGACA 
TTGGACATGG ATGACGTCAG GGGCACGTTC TGTGGCACTG GCCCTTTCCC CCTTGTCTAC 
GTATTGAATG ATATCCTGGT GCGGGCTGAG TTCAGTTCAA CTTCTTTACC ACAATTTTGG 
CTGTCATCTG CTGTGAATTC TTCAAGCACT GACCCTGAAA GGCTGGCTGT GACCACGGCA 
TGGACCACTG ATAGTAAGAT TTTGCCCCCA GGAGGAGAGG CTGGGGTCAC TGAGATCCAC 
GGAAAGTGTG AAAATATGAC TATAACCCCT AGAGGTACAA CTGTGACCCC TACAAAGGAA 
ACTGTATCCC TTGGAAAGCA CACTGTAGCT CTAGGAGAGA AGACTGAGAT CACTGGGGCA 
ATGACCATGA CTTCTGTGGG TCATCAGTCC ATGACCCCTG GAGAGAAGGC CCTGACCCCT 
GTGGGTCATC AATCTGTGAC CACTGGACAG AAGACCCTGA CCTCTGTGGG TTATCAGTCT 
GTGACCCCTG GGGAAAAGAC CCTGACCCCT GTGGGTCATC AGTCTGTGAC CCCTGTGAGT 
CATCAGTCTG TGAGCCCTGG AGGAACGACT ATGACCCCTG TCCATTTTCA GACTGAGACC 
CTTAGACAGA ATACAGTGGC CCCTAGAAGG AAGGCTGTGG CCCGTGAAAA GGTGACTGTC 
CCCTCCAGAA ACATATCAGT CACCCCTGAA GGGCAGACTA TGCCTTTAAG AGGGGAGAAT 
TTGACTTCTG AGGTGGGCAC TCACCCCAGG ATGGGTAACT TGGGTCTTCA GATGGAAGCT 
GAAAACAGGA TGATGCTGTC CTCCAGCCCC GTCATCCAGC TCCCGGAACA AACTCCTCTA 
GCTTTTGACA ACCGCTTTGT TCCCATCTAT GGAAACCATT CCTCTGTCAA CTCAGTAACC 
CCTCAAACAA GTCCTCTTTC TCTAAAAAAA GAAATCCCAG AAAACTCTGC TGTGGATGAA 
GAAGCCTAAG CCCCTCTGGT GTCAGAAACC AGGGAAAACC CTTGTCTTTT CTTCTAAGTG 
ACATGTTGGA AGCCTTCTCA TCCCGGGGCA AAGCAGGCAT CAAAACCAGA ATAGGCCAAT 
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CTCTTTTCCA TTAAATAAAC TGTAAACACA AGAACCCA 

Seq ID NO: 632 Protein sequence 
Protein Accession ft: NP_002548.1 

1 11 21 31 41 SI 

I I I I I I 

MWKLLLWVGL VLVLKHHDGA AHKLVCYFTN WAHSRPGPAS ILPHDLDPFL CTHLIFAFAS 60 

MNNNQIVAKD LQDEKILYPE FNKLKERNRE LKTLLSIGGW NFGTSRFTTM LSTFANREKF 120 

IASVISLLRT HDFDGLDL.FF LYPGLRGSPM HDRWTFLFLI EELLFAFRKE ALLTMRPRLL 180 

LSAAVSGVPH IVQTSYDVRF LGRLLDFINV LSYDLHGSWB RFTGHNSPLF SLPEDPKSSA 240 

YAMNYWRKLG APSEKLIMGI PTYGRTFRLL KASKNGLQAR AIGPASPGKY TKQEGFLAYF 300 

EICSFVWGAK KHWIDYQYVP YANKGKEWVG YDNAISFSYK AWFIRREHFG GAMVWTLDMD 360 

DVRGTFCGTG PFPLVYVLND ILVRAEFSST SLPQFWLSSA VNSSSTDPER LAVTTAWTTD 420 

SKILPPGGEA GVTEIHGKCE NMTITPRGTT VTPTKETVSL GKHTVALGEK TEITGAMTMT 480 

SVGHQSMTPG EKALTPVGHQ SVTTGQKTLT SVGYQSVTPG EKTLTPVGHQ SVTPVSHQSV 540 

SPGGTTMTPV HFQTETLRQN TVAPRRKAVA REKVTVPSRN ISVTPEGQTM PLRGENLTSE 600 

VGTHPRMGNL GLQMEAENRM MLSSSPVIQL PEQTPLAFDN RFVPIYGNH3 SVNSVTPQTS 660 
PLSLKKEIPE NSAVDEEA 

Seq ID NO: 633 DNA sequence 

Nucleic Acid Accession NM_003BBS.l 

Coding sequence: 98.. 1021 



1 11 21 31 41 51 

I I I I I I 

AAACTCAGAA TTTTCGCGGG CTCGGTGAGC GGTTTTATCC CTCCGGCCGG CAGGCTGGGC 60 

GCAGGGGGCG AGCCCCCGCC CGGGGCGCAG CAGCACCATG GGCACGGTGC TGTCCCTGTC 120 

TCCCAGCTAC CGGAAGGCCA CGCTGTTTGA GGATGGCGCG GCCACCGTGG GCCACTATAC 180 

GGCCGTACAG AACAGCAAGA ACGCCAAGGA CAAGAACCTG AAGCGCCACT CCATCATCTC 240 

CGTGCTGCCT TGGAAGAGAA TCGTGGCCGT GTCGGCCAAG AAGAAGAACT CCAAGAAGGT 300 

GCAGCCTAAC AGCAGCTACC AGAACAACAT CACGCACCTC AACAATGAGA ACCTGAAGAA 360 

GTCGCTGTCG TGCGCCAACC TGTCCACATT CGCCCAGCCC CCACCGGCCC AGCCGCCTGC 420 

ACCCCCGGCC AGCCAGCTCT CGGGTTCCCA GACCGGGGGC TCCTCCTCAG TCAAGAAAGC 480 

CCCTCACCCr GCCGTCACCT CCGCAGGGAC GCCCAAACGG GTCATCGTCC AGGCGTCCAC 540 

CAGTGAGCTG CTTCGCTGCC TGGGTGAGTT TCTCTGCCGC CGGTGCTACC GCCTGAAGCA 600 

CCTGTCCCCC ACGGACCCCG TGCTCTGGCT GCGCAGOGTG GACCGCTCGC TGCTTCTGCA 660 

GGGCTGGCAG GACCAGGGCT TCATCACGCC GGCCAACGTG GTCTTCCTCT ACATGCTCTG 720 

CAGGGATGTT ATCTCCTCCG AGGTGGGCTC GGATCACGAG CTCCAGGCCQ TCCTGCTGAC 780 

4U ATGCCTGTAC CTCTCCTACT CCTACATGGG CAACGAGATC TCCTACCCGC TCAAGCCCTT 840 

CCTGGTGGAG AGCTGCAAGG AGGCCTTTTG GGACCGTTGC CTCTCTGTCA TCAACCTCAT 900 

GAGCTCAAAG ATGCTGCAGA TAAATGCCGA CCCACACTAC TTCACACAGG TCTTCTCCGA 960 

CCTGAAGAAC GAGAGCGGCC AGGAGGACAA GAAGCGGCTC CTCCTAGGCC TGGATCGGTG 1020 

AGCACTGTAG CCTGCGTCAT GGCTCAAGGA TTCAATGCAT TTTTAAGAAT TTATTATTAA 1080 
ATCAGTTTTG TGTACAG 



Seq ID NO: 634 Protein sequence 
Protein Accession #: NP_003876,1 

1 11 21 31 41 51 

I I I I I I 

MQTVLSLSPS YRKATLFEDG AATVGHYTAV QNSKNAKDKN LKRHSIISVL PWKRIVAVSA 60 
KKKNSKKVQP NSSYQNNITH LNNENLKKSL SCANLSTFAQ PPPAQPPAPP ASQLSGSQTG 120 
GSSSVKKAPH PAVTSAGTPK RVIVQASTSE LLRCLGEFLC RRCYRLKHLS PTDPVLWLRS 180 
VDRSLLLQGW QDQGFITPAN WFLYMLCRD VISSEVGSDH ELQAVLLTCL YLSYSYWGNE 240 
ISYPLKPFLV ESCKEAFWDR CLSVINLMSS KMLQINADPH YFTQVFSDIiK NESGQEDKKR 300 
LLLGLDR 
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Pkey: Unique Eos probesei identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UntgenelD: Unkjene number 

Unigene Title: Unigene gene title 

Seq ID Noj Sequence identification number linking information in Table 79A to sequences in Table 80 



Pkey 


ExAccn 


UnigenelD 


424212 


NMJ05814 


Hs.143131 


424503 


NM_0Q2205 


Hs.149609 


418007 


M13509 


Hs.83169 


418007 


M13509 


Hs.83169 


418738 


AW388633 


Hs.6682 


443646 


AI085198 


Hs.164226 


409956 


AW103364 


Hs727 


422B67 


L32137 


Hs.1584 


444381 


BE387335 


Hs.283713 


421582 


AI910275 


Hs.350470 


411789 


AF245505 


Hs72157 


452281 


T93500 


Hs.28792 


428698 


M852773 


Hs.334838 


421552 


AF026692 


Hs.105700 


425247 


NMJJ05940 


Hs.155324 


432201 


AI538613 


Hs.298241 


447377 


X77343 


Hs.334334 


446921 


AB012113 


Hs.16530 


4188B8 


AU076801 


Hs.89436 


432179 


X75208 


Hs.2913 


422578 


AF239666 


Hs.1545 


409889 


AW630041 


Hs.56937 


447033 


AI357412 


Hs.157601 


447033 


AI357412 


Hs.1 57601 


411975 


AI916058 


Hs.144583 


434206 


AW136973 


Hs.362915 


423936 


U77629 


Hs.135639 


447400 


AK000322 


Hs.18457 


449032 


AA045573 


Hs.22900 


415214 


AI445236 


Hs.125124 


443247 


BE614387 


Hs.333893 


422048 


NMJJ12445 


Hs.288126 


410418 


D31382 


Hs.63325 


446342 


BE298665 


Hs.14646 


411274 


NM-002776 


Hs.69423 


104978 


AI199268 


Hs.19322 


422260 


AA315993 


Hs.105484 


409041 


AB033025 


H3.50081 


420344 


BE463721 


Hs.97101 


422163 


AF027208 


Hs.1 12360 


437935 


AW939591 


Hs.5940 


422330 


D30783 


Hs.115263 


408908 


BE296227 


Hs.250822 


407811 


AW190902 


Hs.40098 


437852 


BE001836 


Hs.256897 


408243 


Y00787 


Hs.624 


426088 


AF038007 


Hs.166196 


439738 


BE246502 


Hs.9598 


419741 


NVL007019 


Hs,93002 


450983 


AA305384 


Hs.25740 


417771 


AA804698 


Hs.82547 


421379 


Y15221 


Hs.103982 


442006 


AW975183 


Hs.372210 


413048 


M93221 


Hs.75182 


443324 


R44013 


Hs.164225 


424917 


A1636208 


Hs.96901 


424917 


AJ636208 


Hs.96901 


444527 


NMJW5408 


Hs.113B3 


442652 


A1005163 


Hs.201378 


450726 


AW204600 


Hs.355462 


416965 


N26223 


Hs.160436 


442275 


AW449467 


Hs.54795 


431745 


AW972448 


Hs.163425 


431745 


AW972443 


Hs.163425 


453142 


AA033648 


Hs.7473 


421659 


NM.014459 


Hs.106511 


444090 


S59115 


Hs.10306 


421563 


NM_006433 


Hs.105806 


430413 


AW842182 


Hs.241392 


414991 


C17898 




419833 


AA251131 


Hs.220697 


424943 


AU077260 


Hs.153924 



glycoprotein A33 (transmembrane) 
Integrin, alpha 5 (fforonectin receptor, 
matrix metalloproteinase 1 (interstitial 
matrix metal lo proteinase 1 (interstitial 
solute carrier family 7, (cationic amino 
Thrombospondin 1 
Inhlbin, beta A (acCvin A, activin AB a 
cartilage oOgomeric matrix protein (pse 
hypothetical protein BC014245 
trefoil factor 1 (breast cancer, estroge 
Ad Bean 

Homo sapiens cONA FU11041 fis, done PL 

K1AA1866 protein 

secreted mzzled-related protein 4 

matrix metallaproteinase 1 1 (stromelystn 

Transmembrane protease, serine 3 

transcription factor AP-2 alpha 

small inducible cytokine subfamily A (Cy 

cadherin 17, LI cadherin (rrver-lntestin 

EphB3 

caudal type homeo box transcription fact 
suppression of tumcrigenicity 14 (colon 
Predicted gene: Eos cloned; secreted w/V 
Predicted gene; Eos cloned; secreted w/V 
3UTR of: dead ringer (DrosophBa)-Bke 
ESTs. Weakty similar to S69890 mitogen i 
achaete-scute complex (Drosophila) homo) 
hypothetical protein FU20315 
nuclear factor (erythroid-derived 2)-Kk 
EphB2 

c-Myc target JP01 

spondin 2. extracellular matrix protein 
transmembrane protease, serine 4 
sofute carrier famtfy 7 (cationic amino 
kaffikrein 10 

Homo sapiens. Similar to RIKEN cDNA 2010 

regenerating gene type IV 

Hypothetical protein, XPJB1B60 (WAA119 

putative G protein-coupled receptor 

prominin (mouseHike 1 

mucin 13, epithelial transmembrane 

epiregulin 

serine/threonine kinase 15 

cysteine knot superfamily 1, BMP antagon 

putative GPCR 

interleukin8 

ATPase, Class I, type 88, member 1 

sema domain, trnmunogtobuSn domain (Ig), 

ubicnifin carrier protein E2-C 

ER01 (S. cerevlslae)-Gke 

refinoic acid receptor responder (tazaro 

smafi inducible cytokine subfamily B (Cy 

ESTs. Weakly similar to S72482 hypothec" 

mannose receptor, C type 1 

ESTs 

hypothetical protein FU23049 

hypothetical protein FU23049 

small inducible cytokine subfamily A (Cy 

Homo sapiens cDMA FU40427 fis 

HUMPSPBA Human pulmonary surfactant-asso 

MDAC1 

Homo sapiens secret ogtobin, family 3A, m 
Novel FGENESH predicted cadherin repeat 
Novel FGENESH predicted cadherin repeat 
Homo sapiens gap junction protein, alpha 
protocadherin 17 

natural killer ceO group 7 sequence 



small inducible cytokine A5 (RANTES) 
Homo sapiens unregulated by BCG-CWS (LO 
Homo sapiens tryptophanyl-lRNA syrrthetas 
death-associated protein kinase 1 



Seq ID No. 

Seq ID No. C1&C217 

SeqIDNo.C2AC218 

Seq!DNo.C3&C219 

SeqtDNo.C4&C220 

SeqlDNo.C5&C221 

SeqlDNo.C6&C222 

Seq ID No. C7&C223 

SeqlDNo.C8&C224 

SeqiDNo.C9&C225 

Seq ID No. C10&C226 

Seq ID No. C11&C227 

Seq ID No. C1 2 

SeqlDNo.C13&C228 

SeqlDNo.C14&C229 

SeqtDNo.C15&C230 

SeqlDNo.C16&C231 

Seq ID No. C17&C232 

Seq ID No. C18&C233 

SeqlDNo.C19&C234 

SeqlDNo.C20&C235 

SeqlDNo.C21&C236 

SeqlDNo.C22&C237 

Seq ID NO.C23&C238 

SeqIDNo.C24&C239 

SeqlDNo.C25&C240 

SeqtDNo.C26&C241 

SeqlDNo,C27&C242 

SeqlDNo.C28&C243 

SeqlDNaC29&C244 

SeqlDNaC30&C245 

SeqlDNaC3UC246 

Seq !DNaC32&C247 

SeqlDNaC33&C248 

SeqtDNaC34&C249 

SeqtDNo.C35&C250 

SeqtD NO.C36&C251 

SeqtDNo.C37&C252 

Seq ID No. C38&C253 

SeqlDNo.C39&C254 

SeqlDNo.C40&C255 

SeqlDNo.C41&C256 

SeqtD No. C42&C257 

SeqlDNo.C43&C258 

SeqtDNoC44&C259 

SeqlDNo.C45&C260 

SeqlDNaC46&C261 

SeqlDNo.C47&C262 

SeqlDNo.C48&C263 

SeqlDNo.C49&C264 

SeqlDNaC50&C265 

SeqlDNaC51&C266 

SeqlDNaC52&C267 

SeqlDNaC53&C268 

SeqlDNo.C54&C269 

SeqIDNaC55&C270 

SeqlDNaC56&C271 

SeqlDNaC57&C272 

Seq ID No. C58&C273 

Seq ID No, C59&C274 

SeqlDNo.C60&C275 

Seq ID No. C61&C276 

Seq ID No. C62&C277 

SeqlDNaC63&C278 

SeqlDNaC64&C279 

SeqlDNo.C65&C280 

Seq ID No. C65&C281 

SeqlDNaC67&C282 

Seq ID No. C68&C283 

Seq ID No. C69&C284 

SeqlDNo.C70&C285 

Seq ID No. C71&C286 

Seq ID NO.C72&C287 
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430890 


X54232 


452401 


NM_007115 


439180 


AI3S3742 


410407 


X66839 


418526 


BE019020 


422627 


BE336857 


430486 


BE0621Q9 


423673 


BE003054 


423673 


BE003054 


431846 


BE019924 


431958 


X63629 


448733 


NM 005629 


426440 


BE382756 


428484 


AF104032 


429211 


AF052693 


423634 


AW959908 


457819 


AA057484 


424687 


J05070 


418462 


BE001596 


439506 


W79123 


407720 


AB037776 


418543 


NM_005329 


417512 


X76534 


415817 


U88967 


415817 


U88967 


415817 


U8B967 


415817 


U88967 


415817 


U88957 


415817 


U88967 


415817 


U88967 


415817 


U88967 


421817 


AF146074 


421817 


AF146074 


409420 


Z15008 


440659 


AF134160 


408790 


AW580227 


408790 


AW580227 


408790 


AW580227 


450701 


H39960 


414774 


X02419 


413691 


AB023173 


453857 


AL080235 


449101 


AA205847 


429263 


AA0190O4 


421474 


U76362 


421753 


BE314828 


408482 


NMJ»0676 


426761 


A1015709 


429736 


AF125304 


430985 


AA490232 


431890 


X17033 


432583 


AW023624 


446872 


X9705B 


453102 


NM.007197 


428513 


BE220808 


430280 


AA361258 


428486 


AW583497 


457489 


AI693815 


432874 


W94322 


445891 


AW391342 


445891 


AW391342 


404682 




429547 


AW009166 


404287 




404287 




404287 




418318 


U47732 


444754 


T83911 


432596 


AJ224741 


444006 


BE395085 


428505 


AL035461 


448844 


AI581519 


448844 


AI581519 


428392 


H10233 


448030 


N30714 


422109 


S73265 


449048 


Z45051 


417931 


W95642 


419216 


AU076718 


426227 


U6705B 


413554 


AA319146 


445417 


AK001058 


426322 


J05068 



Ks.2699 glypfcan 1 

Hs.29352 tumor necrosis factor, alpha-induced pro 
Hs.199067 v-erb-b2 avian erythroblastic leukemia v 
Hs.63287 carbonic anhydrase IX 

solute carrier family 16 (monocarboxyBc 
transforming growth factor, beta-induced 
chloride channel, calcium activated, tarn 
matrix metalloproteinase 12 (macrophage 
matrix metalloproteinase 12 (macrophage 
Hs.271580 uroplaWn 1B 

Hs.2877 cadherin 3, type 1, P-cadherin (placenta 
Hs.1 87958 solute carrier family 6 (neurotransmife 
Hs.169902 solute carrier family 2 (facilitated g!u 
Hs.1 84601 solute carrier family 7 (cationic amino 
Hs.1 98249 gap junction protein, beta 5 (connexin 3 
Hs.1 690 heparin-binding growth factor binding pr 
Hs.35406 FU20522 Hypothetical protein FU20522 
Hs.1 51738 matrix metalloproteinase 9 (getatinase B 
Hs.85266 integrin, beta 4 

G protein-coupled receptor 87 
Immunoglobulin superfamily, member 9 
hyaluronan synthase 3 
glycoprotein (transmembrane) nmb 
protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase, receptoM 
protein tyrosine phosphatase, receptor-t 
Hs.108660 ATP-bindlng cassette, sub-family C (CFTR 
Hs.108660 ATP-bindlng cassette, sub-family C (CFTR 
laminin, gamma 2 (nfcein (100kD), kalinl 
claudinl 

neurotrophic tyrosine kinase, receptor, 
neurotrophic tyrosine kinase, receptor, 
neurotrophic tyrosine kinase, receptor, 
Hs.288467 hypothetical protein XP_098151 (leucine- 
Hs.77274 plasminogen activator, urokinase 
ATPase.aassV1.type11B 
Ras-induced senescence 1 (RIS1) 
G protein-coupled receptor 
Hs.1 98396 ATP-bindlng cassette, sub-femily A (ABC1 
Hs.1 046 37 solute carrier family 1 (giutamate trans 

ATP-birufing cassette, sub-famHy B (MOW 
adenosine A2b receptor 
POR1MIN Pro-oncosis receptor inducing me 
tumor necrosis factor receptor supertami 
ESTs, Weakty simitar to (78885 serine/th 
Hs.271986 tntegrin, alpha 2 (CD49B, alpha 2 suburri 
Hs.162282 potassium channel TASK-4; potassium chan 
Hs.16362 pyrimidinergic receptor P2Y, G-protein c 
Hs.31 664 frizzled (Drosophita) homoiog 10 
Hs.1 84697 piexinCI 
Hs.237868 Interleukin 7 receptor 
Hs.1 84604 pancreatic polypeptide 
Hs,127179 crypScgene 
Hs.279651 melanoma inhibitory activity 
Hs.199460 DPCR1 protein 
Hs.199460 DPCR1 protein 

ortholog of mouse porydomatn protein 
FGENESH predicted novel secreted protein 
FGENESH predicted novel CUB-domain conta 
FGENESH predicted novel CUB-domam coma 
FGENESH predicted novel CUB-dornain conta 
transmembrane 4 superfamily member 3 
transmembrane 4 superfamily member 4 
matriHn 3 

Hs.334762 type I transmembrane protein Fn14 
Hs.2281 chrornogranin B (secretogranin 1) 
Hs.1 771 64 FGENESH predicted novel cell surface pr 
Hs.177164 FGENESH predicted novel cefl surface pr 
Hs.2265 secretory granule, neuroendocrine protel 
Hs.325960 membrane-spanning 4-domains, subfamily A 
gastrirweleasing peptide 
similar to S68401 (cattle) glucose indue 
trefoil factor 3 (intestinaO 
small inducible cytokine subfamily B (Cy 
Hs.154299 Human proteinase activated receptor-2 mR 
Hs.75426 secretogranin II (chrornogranin C) 

a disintegrin-like and metalloprotease w 
transcobalamin I (vitamin B12 binding pr 



Hs.85838 
Hs.1 18787 
Hs.241551 
Hs.1695 
Hs.1695 



Hs.58561 
Hs.38002 
Hs.85962 
Hs.82226 
Hs.78867 
Hs.78867 
Hs.78867 
Hs.78867 
Hs.78867 
Hs.78867 
Hs.78867 
Hs.78867 



Hs.54451 

Hs.7327 

Hs.47860 

Hs.47860 

Hs.47860 



Hs.75478 
Hs.35861 
Hs.23016 



Hs.107911 

Hs.45743 

Hs.172089 

Hs.212680 

Hs.27323 



Hs.99376 



Hs.84072 
Hs.11881 
Hs.278461 



Hs.1473 
Hs.22920 
Hs.82961 
Hs.1 64021 



Hs.12680 
Hs.2012 
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SeqID No. C80&C295 
SeqID No. C81&C296 
SeqID No. C82&C297 
SeqID No. C83&C298 
SeqID No, C84&C299 
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SeqID No. C86&C301 
SeqID No. C87&C302 
SeqID No. C88&C303 
SeqID No. C89&C304 
Seq IDNO.C90&C305 
SeqID No. C91&C306 
SeqID NO.C92&C307 
Seq IDNO.C93&C308 
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Seq ID No. C100&C315 
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SeqlDNo.C102&C317 
SeqlDNo.C103&C318 
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SeqlDNo.C106&C321 
SeqID No. C107&C322 
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Seq ID No, C109&C324 
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SeqID No. C119&C334 
SeqlDNo.C120&C335 
SeqlDNo.C121&C336 
SeqlDNo.C122&C337 
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SeqlDNo,C124&C339 
SeqID Na C125&C340 
SeqlDNaC126&C341 
SeqIDNa C127&C342 
SeqID No. C128&C343 
SeqlDNo.C129&C344 
SeqlDNo.C130&C345 
Seq!DNo.C131&C346 
SeqIDNaC132&C347 
SeqtDNo.C133&C348 
SeqtDNo.C134&C349 
Seq(DNo.C135&C350 
SeqlDNaC136&C351 
SeqlDNaC137&C352 
SeqlDNaC138&C353 
SeqlDNaC139&C354 
SeqlDNo.C140&C355 
SeqlDNo.C14UC356 
Seq ID No. C142&C357 
SeqlDNo.C143&C358 
. SeqlDNo.C1444C359 
Seq ID No. C145&C360 
SeqlDNo.C146&C361 
SeqlDNo.C147&C362 
SeqiDNo.C148&C363 
SeqlDNo.C149&C364 
SeqlDNaC150AC365 
SeqIDNa C151&C366 
SeqlDNo.C'152&C367 
SeqIDNa C153&C368 
SeqIDNa C154&C369 
SeqIDNa C155&C370 
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409041 
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small Inducible cytokine subfamily A (Cy 
granin-Iike neuroendocrine peptide precu 
potassium channel, subfamily K, member 1 
potassium voltage-gated channel, Isk-reJ 
FAT tumor suppressor (DrosophBa) homolo 
transmembrane 4 superfamify member 1 
solute carrier family 4, sodium bicarbon 
ESTs 

plasminogen activator, urokinase recepto 
hypothetical protein DKFZp564D0462 
decay accelerating factor for complement 
G protein-coupled receptor 49 
G protein-coupled receptor 49 
frizzled (Drosophjla) homolog 6 
secreted phosphoproteln 1 (osteopontln, 
PTK7 protein tyrosine kinase 7 
leukemia inhibitory factor (cholinergic 
leuctne-flch repeat-containing G protein 
solute carrier family 39 {zinc transport 
G protein-coupled receptor 27 (GPR27) (S 
smoothened (Drosophila) homolog 
transmembrane receptor Unc5H2 mRNA 
wingless-type MMTV integration site fami 
ATP-bMng cassette, sub-family C (CFTR 
wingless-type MMTV integration site fami 
frizzled (Orosophila) homolog 8 
secreted frizzled-related protein 2 (sir 
daudin3 

six transmembrane epithelial antigen of 

C033 antigen (p45) 

ESTs 

WAA1324 protein 
aJpha-melhylacyl-CoA racemase 
Homo sapiens similar to Echinoidln (LOC1 
WAAM18 protein 
deiodinase, iodothyronlne, type II 
caidum/ealmodufin^pendent protein kin 
Epithetiat catdum channel 2, CaT-Iike A 
FXYD domain-containing ion transport reg 
FXYD domain-containing ton transport reg 
gamma-arninobutyric acid (GABA) A recepto 
gamnia-aminobu^fric acid (GABA) A recepto 
garnma-ammobutyrfc acid (GABA) A recepto 
garnma-animobulyrfc add (GABA) A recepto 
ERGL protein; ERGt053-fike protein 
ESTs 

prostate cancer associated protein 7 
hypothetical protein FU13593 
hypothetical protein FU 13593 
potassium intermediate/small conductance 
NG22 protein 
AIM-1 protein 

transmembrane 7 superfamlty member 2 
endoglycan 

BCL2-interactIng kBIer (apoptosis-induc 
neural pmEferalion, ditferentiafion an 
EphB6 

KIAA1181 protein 

hepsin (transmembrane protease, serine 1 
phosphofipase A2. group I1A (platelets, 
prostate differentiation factor 
ESTs 

Hypothetical protein. XP_051860 (WAA1 19 
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Seq IDNO.C187&C402 
SeqlDNaC188&C403 
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SeqlDNo.C197&C412 
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SeqlDNo.C200&C415 
Seq ID Mo. C201&C416 
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SeqlDNaC209&C424 
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TABLE 79B 



Pkey: Unique Eos probeset Identifier number 
CAT number Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT Number Accession 

414991 1785136J D78831 C17898D78863 

TABLE 79C 



Pkey: Unique number corresponding to an Eos probeset 

Ret Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et aL" refers to the publication entitled The DNA 

sequence of human chromosome 22." Dunham I. el a!., Nature (1999) 402:489495. 
Strand: Indicates DNA strand from which exons were predicted. 
Ntpositkm: Indicates nucleotide posi&ons of predicted exons. 
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Table 80: 

Seq ID NO: CI DNA Sequence 
Nucleic Acid Accession #: NM_005814 
5 Coding sequence: 345.. 1304 

1 11 21 31 41 51 

I I I I I I 

CTACCCCTTT GTGAGCAGTC TAGGACTTTG TACACCTGTT AAGTAGGGAG AAGGCAGGGG 60 

1U AGGTGGCTGG TTTAAGGGGA ACTTGAGGGA AGTAGGGAAG ACTCCTCTTO GGACCTTTGG 120 

AGTAGGTGAC ACATGAGCCC AGCCCCAGCT CACCTGCCAA TCCAGCTGAG GAGCTCACCT 180 

GCCAATCCAG CTGAGGCTGG GCAGAGGTGG GTGAGAAGAG GGAAAATTGC AGGGACCTCC 240 

AGTTGGGCCA GGCCAGAAGC TGCTGTAGCT TTAACCAGAC AGCTCAGACC TGTCTGGAGG 300 

CTGCCAGTGA CAGGTTAGGT TTAGGGCAGA GAAGAAGCAA GACCATGGTG GGGAAGATGT 360 

ID GGCCTGTGTT GTGGACACTC TGTGCAGTCA GGGTGACCGT CGATGCCATC TCTGTGGAAA 420 

CTCCGCAGGA CGTTCTTCGG GCTTCGCAGG GAAAGAGTGT CACCCTGCCC TGCACCTACC 480 

ACACTTCCAC CTCCAGTCGA GAGGGACTTA TTCAATGGGA TAAGCTCCTC CTCACTCATA 540 

CGGAAAGGGT GGTCATCTGG CCGTTTTCAA ACAAAAACTA CATCCATGGT GAGCTTTATA 600 

AGAATCGCGT CAGCATATCC AACAATGCTG AGCAGTCCGA TGCCTCCATC ACCATTGATC 660 

1\) AGCTGACCAT GGCTGACAAC GGCACCTACG AGTGTTCTGT CTCGCTGATG TCAGACCTGG 720 

AGGGCAACAC CAAGTCACGT GTCCGCCTGT TGGTCCTCGT GCCACCCTCC AAACCAGAAT 780 

GCGGCATCGA GGGAGAGACC ATAATTGGGA ACAACATCCA GCTGACCTGC CAATCAAAGG 840 

AGGGCTCACC AACCCCTCAG TACAGCTGGA AGAGGTACAA CATCCTGAAT CAGGAGCAGC 900 

CCCTGGCCCA GCCAGCCTCA GGTCAGCCTG TCTCCCTGAA GAATATCTCC ACAGACACAT 960 

CGGGTTACTA CATCTGTACC TCCAGCAATG AGGAGGGGAC GCAGTTCTGC AACATCACGG 1020 

TGGCCGTCAG ATCTCCCTCC ATGAACGTGG CCCTGTATGT GGGCATCGCG GTGGGCGTGG 1080 

TTGCAGCCCT CATTATCATT GGCATCATCA TCTACTGCTG CTGCTGCCGA GGGAAGGACG 1140 

ACAACACTGA AGACAAGGAG GATGCAAGGC CGAACCGGGA AGCCTATGAG GAGCCACCAG 1200 

AGCAGCTAAG AGAACTTTCC AGAGAGAGGG AGGAGGAGGA TGACTACAGG CAAGAAGAGC 1260 

AGAGGAGCAC TGGGCGTGAA TCCCCGGACC ACCTCGACCA GTGACAGGCC AGCAGCAGAG 1320 

GGCGGCGGAG GAAGGGTTAG GGGTTCATTC TCCCGCTTCC TGGCCTCCCT TCTCCTTTCT 1380 

AAGCCCTGTT CTCCTGTCCC TCCATCCCAG ACATTGATGG GGACATTTCT TCCCCAGTGT 1440 

CAGCTGTGGG GAACATGGCT GGCCTGGTAA GGGGGTCCCT GTGCTGATCC TGCTGACCTC 1500 

ACTGTCCTGT GAAGTAACCC CTCCTGGCTG TGACACCTGG TGCGGGCCTG GCCCTCACTC 1560 

AAGACCAGGC TGCAGCCTCC ACTTCCCTCG TAGTTGGCAG GAGCTCCTGG AAGCACAGCG 1620 

CTGAGCATGG GGCGCTCCCA CTCAGAACTC TCCAGGGAGG CGATGCCAGC CTTGGGGGGT 1680 

GGGGGCTGTC CTGCTCACCT GTGTGCCCAG CACCTGGAGG GGCACCAGGT GGAGGGTTTG 1740 

CACTCCACAC ATCTTTCTTG AATGAATGAA AGAATAAGTG AGTATGCTTG GGCCCTGCAT 1800 

TGGCCTGGCC TCCAGCTCCC ACTCCCTTTC CAACCTCACT TCCCGTAGCT GCCAGTATGT 1860 

TCCAAACCCT CCTGGGAAGG CCACCTCCCA CTCCTGCTGC ACAGGCCCTG GGGAGCTTTT 1920 

GCCCACACAC TTTCCATCTC TGCCTGTCAA TATCGTACCT GTCCCTCCAG GCCCATCTCA 1980 

AATCACAAGG ATTTCTCTAA CCCTATCCTA ATTGTCCACA TACGTGGAAA CAATCCTGTT 2040 

ACTCTGTCCC ACGTCCAATC ATGGGCCACA AGGCACAGTC TTCTGAGCGA GTGCTCTCAC 2100 

TGTATTAGAG CGCCAGCTCC TTGGGGCAGG GCCTGGGCCT CATGGCTTTT GCTTTCCCTG 2160 

AAGCCCTAGT AGCTGGCGCC CATCCTAGTG GGCACTTAAG CTTAATTGGG GAAACTGCTT 2220 

TGATTGGTTG TGCCTTCCCT TCTCTGGTCT CCTTGAGATG ATCGTAGACA CAGGGATGAT 2280 

TCCCACCCAA ACCCACGTAT TCATTCAGTG AGTTAAACAC GAATTGATTT AAAGTGAACA 2340 

CACACAAGGG AGCTTGCTTG CAGATGGTCT GAGTTCTTGT GTCCTGGTAA TTCCTCTCCA 2400 

GGCCAGAATA ATTGGCATGT CTCCTCAACC CACATGGGGT TCCTGGTTGT TCCTGCATCC 2460 

JU CGATACCTCA GCCCTGGCCC TGCCCAGCCC ATTTGGGCTC TGGTTTTCTG GTGGGGCTGT 2520 

CCTGCTGCCC TCCCACAGCC TCCTTCTGTT TGTCGAGCAT TTCTTCTACT CTTGAGAGCT 2580 

CAGGCAGCGT TAGGGCTGCT TAGGTCTCAT GGACCAGTGG CTGGTCTCAC CCAACTGCAG 2640 

TTTACTATTG CTATCTTTTC TGGATGATCA GAAAAATAAT TCCATAAATC TATTGTCTAC 2700 

TTGOGATTTT TTAAAAAATG TATATTTTTA TATATATTGT TAAATCCTTT GCTTCATTCC 2760 

AAATGCTTTC AGTAATAATA AAATTGTGGG TGG 2793 
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Seq ID NO: C2 DNA Sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence-. 1..3150 



1 11 21 31 41 51 

I 1 I I I I 

ATGGGGAGCC GGACGCCAGA GTCCCCTCTC CACGCCGTGC AGCTGCGCTG GGGCCCCCGG 60 

CGCCGACCCC CGCTSSTGCC GCTGCTGTTG CTGCTSSTGC CGCCGCCACC CAGGGTCGGG 120 

GGCTTCAACT TAGACGCGGA GGCCCCAGCA GTACTCTCGG GGCCCCCGGG CTCCTTCTTC 180 

GGATTCTCAG TGGAGTTTTA CCGGCCGGGA ACAGACGGGG TCAGTGTGCT GGTGGGAGCA 240 

CCCAAGGCTA ATACCAGCCA GCCAGGAGTG CTGCAGGGTG GTGCTGTCTA CCTCTGTCCT 300 

TGGGGTGCCA GCCCCACACA GTGCACCCCC ATTGAATTTG ACAGCAAAGG CTCTCGGCTC 360 

CTGGAGTCCT CACTGTCCAG CTCAGAGGGA GAGGAGCCTG TGGAGTACAA GTCCTTGCAG 420 

/U TGGTTCGGGG CAACAGTTCG AGCCCATGGC TCCTCCATCT TGGCATGCGC TCCACTGTAC 480 

AGCTGGCGCA CAGAGAAGGA GCCACTGAGC GACCCCGTGG GCACCTGCTA CCTCTCCACA 540 

GATAACTTCA CCCGAATTCT GGAGTATGCA CCCTGCCGCT CAGATTTCAG CTGGGCAGCA 600 

GGACAGGGTT ACTGCCAAGG AGGCTTCAGT GCCGAGTTCA CCAAGACTGG CCGTGTGGTT 660 

TTAGGTGGAC CAGGAAGCTA TTTCTGGCAA GGCCAGATCC TGTCTGCCAC TCAGGAGCAG 720 

ATTGCAGAAT CTTATTACCC CGAGTACCTG ATCAACCTGG TTCAGGGGCA GCTGCAGACT 780 

CGCCAGGCCA GTTCCATCTA TGATGACAGC TACCTAGGAT ACTCTGTGGC TGTTGGTGAA 840 

TTCAGTGGTG ATGACACAGA AGACTTTGTT GCTGGTGTGC CCAAAGGGAA CCTCACTTAC 900 

GGCTATGTCA CCATCCTTAA TGGCTCAGAC ATTCGATCCC TCTACAACTT CTCAGGGGAA 960 

CAGATGGCCT CCTACTTTGG CTATGCAGTG GCCGCCACAG ACGTCAATGG GGACGGGCTG 1020 

GATGACTTGC TGGTGGGGGC ACCCCTGCTC ATGGATCGGA CCCCTGACGG GCGGCCTCAG 1080 

GAGGTGGGCA GGGTCTACGT CTACCTGCAG CACCCAGCCG GCATAGAGCC CACGCCCACC 1140 

CTTACCCTCA CTGGCCATGA TGAGTTTGGC CGATTTGGCA GCTCCTTGAC CCCCCTGGGG 1200 

GACCTGGACC AGGATGGCTA CAATGATGTG GCCATCGGGG CTCCCTTTGG TGGGGAGACC 1260 

CAGCAGGGAG TAGTGTTTGT ATTTCCTGGG GGCCCAGGAG GGCTGGGCTC TAAGCCTTCC 1320 
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CAGGTTCTGC AGCCCCTGTG GGCAGCCAGC CACACCCCAG ACTTCTTTGG CTCTGCCCTT 1380 

CGAGGAGGCC GAGACCTGGA TGGCAATGGA TATCCTGATC TGATTGTGGG GTCCTTTGGT 1440 

GTGGACAAGG CTGTGGTATA CAGGGGCCGC CCCATCGTGT CCGCTAGTGC CTCCCTCACC 1500 

ATCTTCCCCG CCATGTTCAA CCCAGAGGAG CGGAGCTGCA GCTTAGAGGG GAACCCTGTG 1560 

GCCTGCATCA ACCTTAGCTT CTGCCTCAAT GCTTCTGGAA AACACGTTGC TGACTCCATT 1620 

GGTTTCACAG TGGAACTTCA GCTGGACTGG CAGAAGCAGA AGGGAGGGGT ACGGCGGGCA 1680 

CTGTTCCTGG CCTCCAGGCA GGCAACCCTG ACCCAGACCC TGCTCATCCA GAATGGGGCT 1740 

CGAGAGGATT GCAGAGAGAT GAAGATCTAC CTCAGGAACG AGTCAGAATT TCGAGACAAA 1800 

CTCTCGCCGA TTCACATCGC TCTCAACTTC TCCTTGGACC CCCAAGCCCC AGTGGACAGC 1860 

CACGGCCTCA GGCCAGCCCT ACATTATCAG AGCAAGAGCC GGATAGAGGA CAAGGCTCAG 1920 

ATCTTGCTGG ACTGTGGAGA AGACAACATC TGTGTGCCTG ACCTGCAGCT GGAAGTGTTT 1980 

GGGGAGCAGA ACCATGTGTA CCTGGGTGAC AAGAATGCCC TGAACCTCAC TTTCCATGCC 2040 

CAGAATGTGG GTGAGGGTGG CGCCTATGAG GCTGAGCTTC GGGTCACCGC CCCTCCAGAG 2100 

GCTGAGTACT CAGGACTCGT CAGACACCCA GGGAACTTCT CCAGCCTGAG CTGTGACTAC 2160 

TTTGCCGTGA ACCAGAGCCG CCTGCTGGTG TGTGACCTGG GCAACCCCAT GAAGGCAGGA 2220 

GCCAGTCTGT GGGGTGGCCT TCGGTTTACA GTCCCTCATC TCCGGGACAC TAAGAAAACC 2280 

ATCCAGTTTG ACTTCCAGAT CCTCAGCAAG AATCTCAACA ACTCGCAAAG CGACGTGGTT 2340 

TCCTTTCGGC TCTCCGTGGA GGCTCAGGCC CAGGTCACCC TGAAOGGTGT CTCCAAGCCT 2400 

GAGGCAGTGC TATTCCCAGT AAGCGACTGG CATCCCCGAG ACCAGCCTCA GAAGGAGGAG 2460 

GACCTGGGAC CTGCTGTCCA CCATGTCTAT GAGCTCATCA ACCAAGGCCC CAGCTCCATT 2520 

AGCCAGGGTG TGCTGGAACT CAGCTGTCCC CAGGCTCTGG AAGGTCAGCA GCTCCTATAT 2580 

GTGACCAGAG TTACGGGACT CAACTGCACC ACCAATCACC CCATTAACCC AAAGGGCCTG 2640 

GAGTTGGATC COGAGGGTTC CCTGCACCAC CAGCAAAAAC GGGAAGCTCC AAGCCGCAGC 2700 

TCTGCTTCCT CGGGACCTCA GATCCTGAAA TGCCCGGAGG CTGAGTGTTT CAGGCTGCGC 2760 

TGTGAGCTGG GGCCCCTGCA CCAACAAGAG AGCCAAAGTC TGCAGTTGCA TTTCCGAGTC 2820 

TGGGCCAAGA CTTTCTTGCA GCGGGAGCAC CAGCCATTTA GCCTGCAGTG TGAGGCTGTG 28 BO 

TACAAAGCCC TGAAGATGCC CTACCGAATC CTGCCTCGGC AGCTGCCCCA AAAAGAGCGT 2940 

CAGGTGGCCA CAGCTGTGCA ATGGACCAAG GCAGAAGGCA GCTATGGCGT CCCACTGTGG 3000 

ATCATCATCC TAGCCAT CCT GTTTGGCCTC CTGCTCCTAG GTCTACTCAT CTACATCCTC 3060 

TACAAGCTTG GATTCTTCAA ACGCTCCCTC CCATATGGCA CCGCCATGGA AAAAGCTCAG 3120 

CTCAAGCCTC CAGCCACCTC TGATGCCTGA 3150 



Seq ID NO: C3 DNA Sequence 
„ Nucleic Acid Accession #: NM_002421.2 

35 Coding sequence: 1..1410 

1 11 21 31 41 51 

1 I I I I I 

ATGCACAGCT TTCCTCCACT GCTGCTGCTG CTGTTCTGGG GTGTGGTGTC ACACAGCTTC 60 

4U CCAGCGACTC TAGAAACACA AGAGCAAGAT GTGGACTTAG TCCAGAAATA CCTGGAAAAA 120 

TACTACAACC TGAAGAATGA TGGGAGGCAA GTTGAAAAGC GGAGAAATAG TGGCCCAGTG 180 

GTTGAAAAAT TGAAGCAAAT GCAGGAATTC TTTGGGCTGA AAGTGACTGG GAAACCAGAT 240 

GCTGAAACCC TGAAGGTGAT GAAGCAGCCC AGATGTGGAG TGCCTGATGT GGCTCAGTTT 300 

GTCCTCACTG AGGGGAACCC TCGCTGGGAG CAAACACATC TGACCTACAG GATTGAAAAT 360 

TACACGCCAG ATTTGCCAAG AGCAGATGTG GACCATGCCA TTGAGAAAGC CTTCCAACTC 420 

TGGAGTAATG TCACACCTCT GACATTCACC AAGGTCTCTG AGGGTCAAGC AGACATCATG 480 

ATATCTTTTG TCAGGGGAGA TCATCGGGAC AACTCTCCTT TTGATGGACC TGGAGGAAAT S40 

CTTGCTCATG CTTTTCAACC AGGCCCAGGT ATTGGAGGGG ATGCTCATTT TGATGAAGAT 600 

GAAAGGTGGA CCAACAATTT CAGAGAGTAC AACTTACATC GTGTTGCGGC TCATGAACTC 660 

GGCCATTCTC TTGGACTCTC CCATTCTACT GAXATCGGGG CTTTGATGTA CCCTAGCTAC 720 

ACCTTCAGTG GTGATGTTCA GCTAGCTCAG GATGACATTG ATGGCATCCA AGCCATATAT 780 

GGACGTTCCC AAAATCCTGT CCAGCCCATC GGCCCACAAA CCCCAAAAGC ATGTGACAGT 840 

AAGCTAACCT TTGATGCTAT AACTACGATT CGGGGAGAAG TGATGTTCTT TAAAGACAGA 900 

TTCTACATGC GCACAAATCC CTTCTACCCG GAAGTTGAGC TCAATTTCAT TTCTGTTTTC 960 

TGGCCACAAC TGCCAAATGG GCTTGAAGCT GCTTACGAAT TTGCCGACAG AGATGAAGTC 1020 

CGGTTTTTCA AAGGGAATAA GTACTGGGCT GTTCAGGGAC AGAATGTGCT ACACGGATAC 1080 

CCCAAGGACA TCTACAGCTC CTTTGGCTTC CCTAGAACTG TGAAGCATAT CGATGCTGCT 1140 

CTTTCTGAGG AAAACACTGG AAAAACCTAC TTCTTTGTTG CTAACAAATA CTGGAGGTAT 1200 

GATGAATATA AACGATCTAT GGATCCAGGT TATCCCAAAA TGATAGCACA TGACTTTCCT 1260 

OU GGAATTGGCC ACAAAGTTGA TGCAGTTTTC ATGAAAGATG GATTTTTCTA TTTCTTTCAT 1320 

GGAACAAGAC AATACAAATT TGATCCTAAA ACGAAGAGAA TTTTGACTCT CCAGAAAGCT 1380 

AATAGCTGGT TCAACTGCAG GAAAAATTAG 1410 

Seq ID NO: C4 DNA Sequence 
CO Nucleic Acid Accession 8; Eos sequence 
Coding sequence : 1 . . 1410 

1 11 21 31 41 51 

I I I I I I 

ATGCACAGCT TTCCTCCACT GCTGCTGCTG CTGTTCTGGG GTGTGGTGTC ACACAGCTTC 60 

CCAGCGACTC TAGAAACACA AGAGCAAGAT GTGGACTTAG TCCAGAAATA CCTGGAAAAA 120 

TACTACAACC TGAAGAATGA TGGGAGGCAA GTTGAAAAGC GGAGAAATAG TGGCCCAGTG 180 

GTTGAAAAAT TGAAGCAAAT GCAGGAATTC TTTGGGCTGA AAGTGACTGG GAAACCAGAT 240 

GCTGAAACCC TGAAGGTGAT GAAGCAGCCC AGATGTGGAG TGCCTGATGT GGCTCAGTTT 300 

GTCCTCACTG AGGGGAACCC TCGCTGGGAG CAAACACATC TGACCTACAG GATTGAAAAT 360 

TACACGCCAG ATTTGCCAAG AGCAGATGTG GACCATGCCA TTGAGAAAGC CTTCCAACTC 420 

TGGA GTAAT G TCACACCTCT GACATTCACC AAGGTCTCTG AGGGTCAAGC AGACATCATG 480 

ATATCTTTTG TCAGGGGAGA TCATCGGGAC AACTCTCCTT TTGATGGACC TGGAGGAAAT 540 

CTTGCTCATG CTTTTCAACC AGGCCCAGGT ATTGGAGGGG ATGCTCATTT TGATGAAGAT 600 

GAAAGGTGGA CCAACAATTT CAGAGAGTAC AACTTACATC GTGTTGCGGC TCATGCCCTC 660 

GGCCATTCTC TTGGACTCTC CCATTCTACT GATATCGGGG CTTTGATGTA CCCTAGCTAC 720 

ACCTTCAGTG GTGATGTTCA GCTAGCTCAG GATGACATTG ATGGCATCCA AGCCATATAT 780 

GGACGTTCCC AAAATCCTGT CCAGCCCATC GGCCCACAAA CCCCAAAAGC ATGTGACAGT 840 

AAGCTAACCT TTGATGCTAT AACTACGATT CGGGGAGAAG TGATGTTCTT TAAAGACAGA 900 
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TTCTACATGC GCACAAATCC CTTCTACCCG GAAGTTGAGC TCAATTTCAT TTCTGTTTTC 960 

TGGCCACAAC tgccaaatgg gcttgaagct gcttaogaat TTGCCGACAG AGATGAAGTC 1020 

CGGTTTTTCA AAGGGAATAA GTACTGGGCT GTTCAGGGAC AGAATGTGCT ACACGGATAC 10 80 

CCCAAGGACA TCTACAGCTC CTTTGGCTTC CCTAGAACTG TGAAGCATAT CGATGCTGCT 1140 

CTTTCTGAGG AAAACACTGG AAAAACCTAC TTCTTTGTTG CTAACAAATA CTGGAGGTAT 1200 

GATGAATATA AACGATCTAT GGATCCAGGT TATCCCAAAA TGATAGCACA TGACTTTCCT 1260 

GGAATTGGCC ACAAAGTTGA TGCAGTTTTC ATGAAAGATG GATTTTTCTA TTTCTTTCAT 1320 

GGAACAAGAC AATACAAATT TGATCCTAAA ACGAAGAGAA TTTTGACTCT CCAGAAAGCT 1380 

AATAGCTGGT TCAACTGCAG GAAAAATTAG 1410 

Seq ID NO: C5 DNA Sequence 

Nucleic Acid Accession fh NMJ>14331.2 

Coding sequence: 1..1506 

1 11 21 31 41 51 

I I I i I ! 

ATGGTCAGAA AGCCTGTTGT GTCCACCATC TCCAAAGGAG GTTACCTGCA GGGAAATGTT 60 

AACGGGAGGC TGCCTTCCCT GGGCAACAAG GAGCCACCTG GGCAGGAGAA AGTGCAGCTG 120 

AAGAGGAAAG TCACTTTACT GAGGGGAGTC TCCATTATCA TTGGCACCAT CATTGGAGCA 180 

GGAATCTTCA TCTCTCCTAA GGGCGTGCTC CAGAACACGG GCAGCGTGGG CATGTCTCTG 240 

ACCATCTGGA CGGTGTGTGG GGTCCTGTCA CTATTTGGAG CTTTGTCTTA TGCTGAATTG 300 

GGAACAACTA TAAAGAAATC TGGAGGTCAT TACACATATA TTTTGGAAGT CTTTGGTCCA 360 

TTACCAGCTT TTGTACGAGT CTGGGTGGAA CTCCTCATAA TACGCCCTGC AGCTACTGCT 420 

GTGATATCCC TGGCATTTGG ACGCTACATT CTGGAACCAT TTTTTATTCA ATGTGAAATC 480 

CCTGAACTTG CGATCAAGCT CATTACAGCT GTGGGCATAA CTGTAGTGAT GGTCCTAAAT 540 

AGCATGAGTG TCAGCTGGAG CGCCCGGATC CAGATTTTCT TAACCTTTTG CAAGCTCACA 600 

GCAATTCTGA TAATTATAGT CCCTGGAGTT ATGCAGCTAA TTAAAGGTCA AACGCAGAAC 660 

TTTAAAGACG CGTTTTCAGG AAGAGATTCA AGTATTACGC GGTTGCCACT GGCTTTTTAT 720 

TATGGAATGT ATGCATATGC TGGCTGGTTT TACCTCAACT TTGTTACTGA AGAAGTAGAA 780 

AACCCTGAAA AAACCATTCC CCTTGCAATA TGTATATCCA TGGCCATTGT CACCATTGGC 840 

TATGTGCTGA CAAATGTGGC CTACTTTACG ACCATTAATG CTGAGGAGCT GCTGCTTTCA 900 

AATGCAGTGG CAGTGACCTT TTCTGAGCGG CTACTGGGAA ATTTCTCATT AGCAGTTCCG 960 

ATCTTTGTTG CCCTCTCCTG CTTTGGCTCC ATGAACGGTG GTGTGTTTGC TGTCTCCAGG 1020 

TTATTCTATG TTGCGTCTCG AGAGGGTCAC CTTCCAGAAA TCCTCTCCAT GATTCATGTC 1080 

CGCAAGCACA CTCCTCTACC AGCTGTTATT GTTTTGCACC CTTTGACAAT GATAATGCTC 1140 

TTCTCTGGAG ACCTCGACAG TCTTTTGAAT TTCCTCAGTT TTGCCAGGTG GCTTTTTATT 1200 

GGGCTGGCAG TTGCTGGGCT GATTTATCTT CGATACAAAT GCCCAGATAT GCATCGTCCT 1260 

TTCAAGGTGC CACTGTTCAT CCCAGCTTTG TTTTCCTTCA CATGCCTCTT CATGGTTGCC 1320 

CTTTCCCTCT ATTCGGACCC ATTTAGTACA GGGATTGGCT TCGTCATCAC TCTGACTGGA 1380 

GTCCCTGCGT ATTATCTCTT TATTATATGG GACAAGAAAC CCAGGTGGTT TAGAATAATG 1440 

TCAGAGAAAA TAACCAGAAC ATTACAAATA ATACTGGAAG TTGTACCAGA AGAAGATAAG 1500 

TTATGAACTA ATGGACTTGA GATCTTGGCA ATCTGCCCAA GGGGAGACAC AAAATAGGGA 1560 

TTTTTACTTC ATTTTCTGAA AGTCTAGAGA ATTACAACTT TGGTGATAAA CAAAAGGAGT 1620 

CAGTTATTTT TATTCATATA TTTTAGCATA TTCGAACTAA TTTCTAAGAA ATTTAGTTAT 1680 

AACTCTATGT AGTTATAGAA AGTGAATATG CAGTTATTCT ATGAGTCGCA CAATTCTTGA 1740 

GTCTCTGATA CCTACCTATT GGGGTTAGGA GAAAAGACTA GACAATTACT ATGTGGTCAT 1800 

TCTCTACAAC ATATGTTAGC ACGGCAAAGA ACCTTCAAAT TGAAGACTGA GATTTTTCTG 1860 

TATATATGGG TTTTGTAAAG ATGGTTTTAC ACACTACAGA TGTCTATACT GTGAAAAGTG 1920 

TTTTCAATTC TGAAAAAAAG CATACATCAT GATTATGGCA AAGAGGAGAG AAAGAAATTT 1980 

ATTTTACATT GACATTGCAT TGCTTCCCCT TAGATACCAA TTTAGATAAC AAACACTCAT 2040 

GCTTTAATGG ATTATACCCA GAGCACTTTG AACAAAGGTC AGTGGGGATT GTTGAATACA 2100 

TTAAAGAAGA GTTTCTAGGG GCTACTGTTT ATGAGACACA TCCAGGAGTT ATGTTTAAGT 2160 

AAAAATCCTT GAGAATTTAT TATGTCAGAT GTTTTTTCAT TCATTATCAG GAAGTTTTAG 2220 

TTATCTGTCA TTTTTTTTTT TCACATCAGT TTGATCAGGA AAGTGTATAA CACATCTTAG 2280 

AGCAAGAGTT AGTTTGGTAT TAAATCCTCA TTAGAACAAC CACCTGTTTC ACTAATAACT 2340 

TACCCCTGAT GAGTCTATCT AAACATATGC ATTTTAAGCC TTCAAATTAC ATTATCAACA 2400 

TGAGAGAAAT AACCAACAAA GAAGATGTTC AAAATAATAG TCCCATATCT GTAATCATAT 2460 

CTACATGCAA TGTTAGTAAT TCTGAAGTTT TTTAAATTTA TGGCTATTTT TACACGATGA 2520 

TGAATTTTGA CAGTTTGTGC ATTTTCTTTA TACATTTTAT ATTCTTCTGT TAAAATATCT 2580 

CTTCAGATGA AACTGTCCAG ATTAATTAGG AAAAGGCATA TATTAACATA AAAATTGCAA 2640 

AAGAAATGTC GCTGTAAATA AGATTTACAA CTGATGTTTC TAGAAAATTT CCACTTCTAT 2700 

ATCTAGGCTT TGTCAGTAAT TTCCACACCT TAATTATCAT TCAACTTGCA AAAGAGACAA 2760 

CTGATAAGAA GAAAATTGAA ATGAGAATCT GTGGATAAGT GTTTGTGTTC AGAAGATGTT 2820 

GTTTTGCCAG TATTAGAAAA TACTGTGAGC CGGGCATGGT GGCTTACATC TGTAATCCCA 2880 

GCACTTTGGG AGGCTGAGGG GGTGGATCAC CTGAGGTCGG GAGTTCTAGA CCAGCCTGAC 2940 

CAACATGGAG AAACCCCATC TCTACTAAAA ATACAAAATT AGCTGGGCAT GGTGGCACAT 3000 

GCTGGTAATC TCAGCTATTG AGGAGGCTGA GGCAGGAGAA TTGCTTGAAC CCGGGAGGCG 3060 

GAGGTTGCAG TGAGCCAAGA TTGCACCACT GTACTCCAGC CTGGGTGACA AAGTCAGACT 3120 

CCATCTCCAA AAAAAAAAAA AAAA 3144 



Seq ID NO: C6 DNA Sequence 

Nucleic Acid Accession #: NMJ)03246.1 

Coding sequence: 112.. 3624 

1 11 21 31 41 51 

1 I I I I I 

GGACGCACAG GCATTCCCCG CGCCCCTCCA GCCCTCGCCG CCCTCGCCAC CGCTCCCGGC 60 

CGCCGCGCTC CGGTACACAC AGGATCCCTG CTGGGCACCA ACAGCTCCAC CATGGGGCTG 120 

GCCTGGGGAC TAGGCGTCCT GTTCCTGATG CATGTGTGTG GCACCAACCG CATTCCAGAG 180 

TCTGGCGGAG ACAACAGCGT GTTTGACATC TTTGAACTCA CCGGGGCCGC CCGCAAGGGG 240 

TCTGGGCGCC GACTGGTGAA GGGCCCCGAC CCTTCCAGCC CAGCTTTCCG CATCGAGGAT 300 

GCCAACCTGA TCCCCCCTGT GCCTGATGAC AAGTTCCAAG ACCTGGTGGA TGCTGTGCGG 360 

GCAGAAAAGG GTTTCCTCCT TCTGGCATCC CTGAGGCAGA TGAAGAAGAC CCGGGGCACG 420 
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CTGCTGGCCC TGGAGCGGAA AGACCACTCT GGCGAGGTCT TCAGCGTGGT GTCCAATGGC 480 

AAGGCGGGCA CCCTGGACCT CAGCCTGACC GTCCAAGGAA AGCAGCACGT GGTGTCTGTG 540 

GAAGAAGCTC TCCTGGCAAC CGGCCAGTGG AAGAGCATCA CCCTGTTTGT GCAGGAAGAC 600 

AGGGCCCAGC TGTACATCGA CTGTGAAAAG ATGGAGAATG CTGAGTTGGA CGTCCCCATC 660 

CAAAGCGTCT TCACCAGAGA CCTGGCCAGC ATCGCCAGAC TCCGCATCGC AAAGGGGGGC 720 

GTCAATGACA ATTTCCAGGG GGTGCTGCAG AATGTGAGGT TTGTCTTTGG AACCACACCA 780 

GAAGACATCC TCAGGAACAA AGGCTGCTCC AGCTCTACCA GTGTCCTCCT CACCCTTGAC 640 

AACAACGTGG TGAATGGTTC CAGCCCTGCC ATCCGCACTA ACTACATTGG CCACAAGACA 900 

AAGGACTTGC AAGCCATCTG CGGCATCTCC TGTGATGAGC TGTCCAGCAT GGTCCTGGAA 960 

lU CTCAGGGGCC TGCGCACCAT TGTGACCACG CTGCAGGACA GCATCCGCAA AGTGACTGAA 1020 

GAGAACAAAG AGTTGGCCAA TGAGCTGAGG CGGCCTCCCC TATGCTATCA CAACGGAGTT 1080 

CAGTACAGAA ATAACGAGGA ATGGACTGTT GATAGCTGCA CTGAGTGTCA CTGTCAGAAC 1140 

TCAGTTACCA TCTGCAAAAA GGTGTCCTGC CCCATCATGC CCTGCTCCAA TGCCACAGTT 1200 

CCTGATGGAG AATGCTGTCC TCGCTGTTGG CCCAGCGACT CTGCGGACGA TGGCTGGTCT 1260 

lO CCATGGTCCG AGTGGACCTC CTGTTCTACG AGCTGTGGCA ATGGAATTCA GCAGCGCGGC 1320 

CGCTCCTGCG ATAGCCTCAA CAACCGATGT GAGGGCTCCT CGGTCCAGAC ACGGACCTGC 1380 

CACATTCAGG AGTGTGACAA AAGATTTAAA CAGGATGGTG GCTGGAGCCA CTGGTCCCCG 1440 

TGGTCATCTT GTTCTGTGAC ATGTGGTGAT GGTGTGATCA CAAGGATCCG GCTCTGCAAC 1500 

TCTCCCAGCC CCCAGATGAA TGGGAAACCC TGTGAAGGCG AAGCGCGGGA GACCAAAGCC 1560 

TGCAAGAAAG ACGCCTGCCC CATCAATGGA GGCTGGGGTC CTTGGTCACC ATGGGACATC 1620 

TGTTCTGTCA CCTGTGGAGG AGGGGTACAG AAACGTAGTC GTCTCTGCAA CAACCCCGCA 1680 

CCCCAGTTTG GAGGCAAGGA CTGCGTTGGT GATGTAACAG AAAACCAGAT CTGCAACAAG 1740 

CAGGACTGTC CAATTGATGG ATGCCTGTCC AATCCCTGCT TTGCCGGCGT GAAGTGTACT 1800 

AGCTACCCTG ATGGCAGCTG GAAATGTGGT GCTTGTCCCC CTGGTTACAG TGGAAATGGC I860 

ATCCAGTGCA CAGATGTTGA TGAGTGCAAA GAAGTGCCTG ATGCCTGCTT CAACCACAAT 1920 

GGAGAGCACC GGTGTGAGAA CACGGACCCC GGCTACAACT GCCTGCCCTG CCCCCCACGC 1980 

TTCACOGGCT CACAGCCCTT CGGCCAGGGT GTCGAACATG CCACGGCCAA CAAACAGGTG 2040 

TGCAAGCCCC GTAACCCCTG CACGGATGGG ACCCACQACT GCAACAAGAA CGCCAAGTGC 2100 

on AACTACCTGG GCCACTATAG CGACCCCATG TACCGCTGCG AGTGCAAGCC TGGCTACGCT 2160 

3U GGCAATGGCA TCATCTGCGG GGAGGACACA GACCTGGATG GCTGGCCCAA TGAGAACCTG 2220 

GTGTGCGTGG CCAATGCGAC TTACCACTGC AAAAAGGATA ATTGCCCCAA CCTTCCCAAC 2280 

TCAGGGCAGG AAGACTATGA CAAGGATGGA ATTGGTGATG CCTGTGATGA TGACGATGAC 2340 

AATGATAAAA TTCCAGATGA CAGGGACAAC TGTCCATTCC ATTACAACCC AGCTCAGTAT 2400 

GACTATGACA GAGATGATGT GGGAGACCGC TGTGACAACT GTCCCTACAA CCACAACCCA 2460 

GATCAGGCAG ACACAGACAA CAATGGGGAA GGAGACGCCT GTGCTGCAGA CATTGATGGA 2S20 

GACGGTATCC TCAATGAACG GGACAACTGC CAGTACGTCT ACAATGTGGA CCAGAGAGAC 2S80 

ACTGATATGG ATGGGGTTGG AGATCAGTGT GACAATTGCC OCTTGGAACA CAATCCGGAT 2640 

CAGCTGGACT CTGACTCAGA CCGCATTGGA GATACCTGTG ACAACAATCA GGATATTGAT 2700 

GAAGATGGCC ACCAGAACAA TCTGGACAAC TGTCCCTATG TGCCCAATGC CAACCAGGCT 2760 

4U GACCATGACA AAGATGGCAA GGGAGATGCC TGTGACCACG ATGATGACAA CGATGGCATT 2820 

CCTGATGACA AGGACAACTG CAGACTCGTG CCCAATCCCG ACCAGAAGGA CTCTGACGGC 2880 

GATGGTCGAG GTGATGCCTG CAAAGATGAT TTTGACCATG ACAGTGTGCC AGACATCGAT 2940 

GACATCTGTC CTGAGAATGT TGACATCAGT GAGACCGATT TCCGCCGATT CCAGATGATT 3000 

CCTCTGGACC CCAAAGGGAC ATCCCAAAAT GACCCTAACT GGGTTGTACG CCATCAGGGT 3060 

45 AAAGAACTCG TCCAGACTGT CAACTGTGAT CCTGGACTCG CTGTAGGTTA TGATGAGTTT 3120 

AATGCTGTGG ACTTCAGTGG CACCTTCTTC ATCAACACCG AAAGGGACGA TGACTATGCT 3180 

GGATTTGTCT TTGGCTACCA GTCCAGCAGC CGCTTTTATG TTGTGATGTG GAAGCAAGTC 3240 

ACCCAGTCCT ACTGGGACAC CAACCCCACG AGGGCTCAGG GATACTCGGG CCTTTCTGTG 3300 

AAAGTTGTAA ACTCCACCAC AGGGCCTGGC GAGCACCTGC GGAACGCCCT GTGGCACACA 3360 

J\) GGAAACACCC CTGGCCAGGT GCGCACCCTG TGGCATGACC CTCGTCACAT AGGCTGGAAA 3420 

GATTTCACCG CCTACAGATG GCGTCTCAGC CACAGGCCAA AGACGGGTTT CATTAGAGTG 3480 

GTGATGTATG AAGGGAAGAA AATCATGGCT GACTCAGGAC CCATCTATGA TAAAACCTAT 3540 

GCTGGTGGTA GACTAGGGTT GTTTGTCTTC TCTCAAGAAA TGGTGTTCTT CTCTGACCTG 3600 

AAATACGAAT GTAGAGATCC CTAATCATCA AATTGTTGAT TGAAAGACTG ATCATAAACC 3660 

DO AATGCTGGTA TTGCACCTTC TGGAACTATG GGCTTGAGAA AACCCCCAGG ATCACTTCTC 3720 

CTTGGCTTCC TTCTTTTCTG TGCTTGCATC AGTGTGGACT CCTAGAACGT GCGACCTGCC 3780 

TCAAGAAAAT GCAGTTTTCA AAAACAGACT CATCAGCATT CAGCCTCCAA TGAATAAGAC 3840 

ATCTTCCAAG CATATAAACA ATTGCTTTGG TTTCCTTTTG AAAAAGCATC TACTTGCTTC 3900 

AGTTGGGAAG GTGCCCATTC CACTCTGCCT TTGTCACAGA GCAGGGTGCT ATTGTGAGGC 3960 

CATCTCT 3967 
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Seq ID NO: C7 DNA Sequence 

Nucleic Acid Accession #: NM_002192 

Coding sequence! 86.. 1366 

1 11 21 31 41 51 

> I I I I I 

TCCACACACA CAAAAAACCT GCGCGTGAGG GGGGAGGAAA AGCAGGGCCT TTAAAAAGGC 60 

AATCACAACA ACTTTTGCTG CCAGGATGCC CTTGCTTTGG CTGAGAGGAT TTCTGTTGGC 120 

AAGTTGCTGG ATTATAGTGA GGAGTTCCCC CACCCCAGGA TCCGAGGGGC ACAGCGCGGC 180 

CCCCGACTGT CCGTCCTGTG CGCTGGCCGC CCTCCCAAAG GATGTACCCA ACTCTCAGCC 240 

AGAGATGGTG GAGGCCGTCA AGAAGCACAT TTTAAACATG CTGCACTTGA AGAAGAGACC 300 

GGATGTCACC CAGCCGGTAC CCAAGGCGGC GCTTCTGAAC GCGATCAGAA AGCTTCATGT 360 

GGGCAAAGTC GGGGAGAACG GGTATGTGGA GATAGAGGAT GACATTGGAA GGAGGGCAGA 420 

AATGAATGAA CTTATGGAGC AGACCTCGGA GATCATCACG TTTGCCGAGT CAGGAACAGC 480 

CAGGAAGACG CTGCACTTCG AGATTTCCAA GGAAGGCAGT GACCTGTCAG TGGTGGAGCG 540 

TGCAGAAGTC TGGCTCTTCC TAAAAGTCCC CAAGGCCAAC AGGACCAGGA CCAAAGTCAC 600 

CATCCGCCTC TTCCAGCAGC AGAAGCACCC GCAGGGCAGC TTGGACACAG GGGAAGAGGC 660 

CGAGGAAGTG GGCTTAAAGG GGGAGAGGAG TGAACTGTTG CTCTCTGAAA AAGTAGTAGA 720 

CGCTCGGAAG AGCACCTGGC ATGTCTTCCC TGTCTCCAGC AGCATCCAGC GGTTGCTGGA 780 

CCAGGGCAAG AGCTCCCTGG ACGTTCGGAT TGCCTGTGAG CAGTGCCAGG AGAGTGGCGC 840 

CAGCTTGGTT CTCCTGGGCA AGAAGAAGAA GAAAGAAGAG GAGGGGGAAG GGAAAAAGAA 900 

GGGCGGAGGT GAAGGTGGGG CAGGAGCAGA TGAGGAAAAG GAGCAGTCGC ACAGACCTTT 960 

CCTCATGCTG CAGGCCCGGC AGTCTGAAGA CCACCCTCAT CGCCGGCGTC GGCGGGGCTT 1020 
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GGAGTGTGAT GGCAAGGTCA ACATCTGCTG TAAQAAACAG TTCTTTGTCA GTTTCAAGGA 1080 

CATCGGCTGO AATGACTGGA TCATTGCTCC CTCTGGCTAT CATGCCAACT ACTGCGAGGG 1140 

TGAGTGCCCO AGCCATATAG CAGGCACGTC CGGGTCCTCA CTGTCCTTCC ACTCAACAGT 1200 

CATCAACCAC TACCGCATGC GGGGCCATAG CCCCTTTGCC AACCTCAAAT CGTGCTGTGT 1260 

J GCCCACCAAG CTGAGACCCA TGTCCATGTT GTACTATGAT GATGGTCAAA ACATCATCAA 1320 

AAAGGACATT CAGAACATGA TCGTGGAGGA GTGTGGGTGC TCATAGAGTT GCCCAGCCCA 1380 

GGGGGAAAGG GAGCAAGAGT TGTCCAGAGA AGACAGTGGC AAAATGAAGA AATTTTTAAG 1440 

GTTTCTGAGT TAACCAGAAA AATAGAAATT AAAAACAAAA CAAAACAAAA AAAAAAACAA IS 00 

AAAAAAACAA AAGTAAATTA AAAACAAACC TGATGAAACA GATGAAACAG ATGAAGGAAG 1560 

1U ATGTGGAAAT CTTAGCCTGC CTTAGCCAGG GCTCAGAGAT GAAGCAGTGA AGAGACAGAT 1620 

TGGGAGGGAA AGGGAGAATG GTGTACCCTT TATTTCTTCT GAAATCACAC TGATGACATC 1680 

AGTTGTTTAA ACGGGGTATT GTCCTTTCCC CCCTTGAGGT TCCCTTGTGA GCTTGAATCA 1740 

ACCAATCTGA TCTGCAGTAG TGTGGACTAG AACAACCCAA ATAGCATCTA GAAAGCCATG 1800 

AGTTTGAAAG GGCCCATCAC AGGCACTTTC CTAGCCTAAT 1840 
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Seg ID NO: CB DNA Sequence 

Nucleic Acid Accession #: NM_000095.i 

Coding sequence: 26.. 2299 



1 11 21 31 41 51 

i ! I II t 

CAGCACCCAG CTCCCOGCCA CCGCCATGGT CCCCGACACC GCCTGCGTTC TTCTGCTCAC 60 

0 CCTGGCTGCC CTCGGCGCGT CCGGACAGGG CCAGAGCCCG TTGGGCTCAG ACCTGGGCCC 120 

Lj GCAGATGCTT CGGGAACTGC AGGAAACCAA CGCGGCGCTG CAGGACGTGC GGGACTGGCT 180 

GCGGCAGCAG GTCAGGGAGA TCACGTTCCT GAAAAACACG GTGATGGAGT GTGACGCGTG 240 

CGGGATGCAG CAGTCAGTAC GCACCGGCCT ACCCAGCGTG CGGCCCCTGC TCCACTGCGC 300 

GCCCGGCTTC TGCTTCCCCG GCGTGGCCTG CATCCAGACG GAGAGCGGOG GCCGCTGCGG 360 

CCCCTGCCCC GCGGGCTTCA CGGGCAAGGG CTCGCACTGC ACCGACGTCA ACGAGTGCAA 420 

J\) CGCCCACCCC TGCTTCCCCC GAGTCCGCTG TATCAACACC AGCCCGGGGT TCCGCTGCGA 480 

GGCTTGCCCG CCGGGGTACA GCGGCCCCAC CCACCAGGGC GTGGGGCTGG CTTTCGCCAA 540 

GGCCAACAAG CAGGTTTGCA CGGACATCAA CGAGTGTGAG ACCGGGCAAC ATAACTGCGT 600 

CCCCAACTCC GTGTGCATCA ACACCCGGGG CTCCTTCCAG TGCGGCCCGT GCCAGCCCGG 660 

^ CTTCGTGGGC GACCAGGCGT CCGGCTGCCA GCGCGGCGCA CAGCGCTTCT GCCCCGACGG 720 

J J CTCGCCCAGC GAGTGCCACG AGCATGCAGA CTGCGTCCTA GAGCGCGATG GCTCGCGGTC 780 

GTGCGTGTGT CGCGTTGGCT GGGCCGGCAA CGGGATCCTC TGTGGTCGCG ACACTGACCT 840 
AGACGGCTTC COGGACGAGA AGCTGCGCTG CCCGGAGCCG CAGTGCCGTA AGGACAACTG 



Seq ID NO: C9 DNA Sequence 

Nucleic Acid Accession &: XM_057014 

Coding sequence t 143.. 874 



900 



CGTGACTGTG CCCAACTCAG GGCAGGAGGA TGTGGACCGC GATGGCATCG GAGACGCCTG 960 

CGATCCGGAT GCCGACGGGG ACGGGGTOOC CAATGAAAAG GACAACTGCC CGCTGGTGCG 1020 

4U GAACCCAGAC CAGCGCAACA CGGACGAGGA CAAGTGGGGC GATGCGTGCG ACAACTGCCG 1080 

GTCCCAGAAG AACGACGACC AAAAGGACAC AGACCAGGAC GGCCGGGGCG ATGCGTGCGA 1140 

CGACGACATC GACGGCGACC GGATCCGCAA CCAGGCCGAC AACTGCCCTA GGGTACCCAA 1200 

CTCAGACCAG AAGGACAGTG ATGGCGATGG TATAGGGGAT GCCTGTGACA ACTGTCCCCA 1260 

. GAAGAGCAAC COGGATCAGG CGGATGTGGA CCACGACTTT GTGGGAGATG CTTGTGACAG 1320 

40 CGATCAAGAC CAGGATGGAG ACGGACATCA GGACTCTCGG GACAACTGTC CCACGGTGCC 1380 

TAACAGTGCC GAGGAGGACT CAGACCACGA TGGCCAGGGT GATGCCTGCG ACGACGACGA 1440 

CGACAATGAC GGAGTCCCTG ACAGTCGGGA CAACTGCCGC CTGGTGCCTA ACCCCGGCCA 1500 

GGAGGACGCG GACAGGGACG GCGTGGGCGA CGTGTGCCAG GACGACTTTG ATGCAGACAA 1560 

C(\ GGTGGTAGAC AAGATCGACG TGTGTCCGGA GAACGCTGAA GTCACGCTCA CCGACTTCAG 1620 

D\J GGCCTTCCAG ACAGTCGTGC TGGACCCGGA GGGTGACGCG CAGATTGACC CCAACTGGGT 1680 

GGTGCTCAAC CAGGGAAGGG AGATCGTGCA GACAATGAAC AGCGACCCAG GCCTGGCTGT 1740 

GGGTTACACT GOCTTCAATG GCGTGGACTT CGAGGGCACG TTCCATGTGA ACACGGTCAC 1800 

GGATGACGAC TATGCGGGCT TCATCTTTGG CTACCAGGAC AGCTCCAGCT TCTACGTGGT 1860 

CATGTGGAAG CAGATGGAGC AAACGTATTG GCAGGCGAAC CCCTTCCGTG CTGTGGCCGA 1920 

30 GCCTGGCATC CAACTCAAGG CTGTGAAGTC TTCCACAGGC CCCGGGGAAC AGCTGCGGAA 1980 

CGCTCTGTGG CATACAGGAG ACACAGAGTC CCAGGTGCGG CTGCTGTGGA AGGACCCGOG 2040 

AAACGTGGGT TGGAAGGACA AGAAGTCCTA TCGTTGGTTC CTGCAGCACC GGCCCCAAGT 2100 

GGGCTACATC AGGGTGCGAT TCTATGAGGG CCCTGAGCTG GTGGCOGACA GCAACGTGGT 2160 

^ CTTGGACACA ACCATGCGGG GTGGCCGCCT GGGGGTCTTC TGCTTCTCCC AGGAGAACAT 2220 

OK) CATCTGGGCC AACCTGCGTT ACCGCTGCAA TGACACCATC CCAGAGGACT ATGAGACCCA 2280 

TCAGCTGCGG CAAGCCTAGG GACCAGGGTG AGGACCCGCC GGATGACAGC CACCCTCACC 2340 

GCGGCTGGAT GGGGGCTCTG CACCCAGCCC AAGGGGTGGC CGTCCTGAGG GGGAAGTGAG 2400 

AAGGGCTCAG AGAGGACAAA ATAAAGTGTG TGTGCAGGG 2439 



1 11 21 31 41 51 

I I I I I I 

GGGAGGGAGA GAGGOGCGCG GGTGAAAGGC GCATTGATGC AGCCTGCGGC GGCCTCGGAG 60 

CGCGGCGGAG CCAGACGCTG ACCACGTTCC TCTCCTCGGT CTCCTCCGCC TCCAGCTCCG 120 

fJ CGCTGCCCGG CAGCCGGGAG CCATGCGACC CCAGGGCCCC GCCGCCTCCC CGCAGCGGCT 180 

CCGCGGCCTC CTGCTGCTCC TGCTGCTGCA GCTGCCCGCG CCGTCGAGCG CCTCTGAGAT 240 

CCCCAAGGGG AAGCAAAAGG CGCAGCTCCG GCAGAGGGAG GTGGTGGACC TGTATAATGG 300 

AATGTGCTTA CAAGGGCCAG CAGGAGTGCC TGGTCGAGAC GGGAGCCCTG GGGCCAATGG 360 

%rx CATTCCGGGT AC ACCTG GGA TCCCAGGTCG GGATGGATTC AAAGGAGAAA AGGGGGAATG 420 

OU TCTGAGGGAA AGCTTTGAGG AGTCCTGGAC ACCCAACTAC AAGCAGTGTT CATGGAGTTC 480 

ATTGAATTAT GGCATAGATC TTGGGAAAAT TGCGGAGTGT ACATTTACAA AGATGCGTTC S40 

AAATAGTGCT CTAAGAGTTT TGTTCAGTGG CTCACTTCGG CTAAAATGCA GAAATGCATG 600 

CTGTCAGCGT TGGTATTTCA CATTCAATGG AGCTGAATGT TCAGGACCTC TTCCCATTGA 660 

AGCTATAATT TATTTGGACC AAGGAAGCCC TGAAATGAAT TCAACAATTA ATATTCATCG 720 
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CACTTCTTCT 
CTGGGTTGGC 
TTCTCGCATC 
TTTTTTTATT 
CATCTGAATG 
TTTAAATCTA 
TGGTTAGAAT 
GGTCTTTTGT 
TGTACAATTT 
CAACCTTAAA 



GTGGAAGGAC 
ACTTGTTCAG 
ATTATTGAAG 
ATGCCTTGGA 
AAAAGCAAAG 
GCATTATTCA 
ACTTTCTTCA 
TTTTTCTCTT 
GTAAATGTTA 
AAAAAAAAAA 



TTTGTGAAGG 
ATTACCCAAA 
AACTACCAAA 
ATGGTTCACT 
CTAAATATGT 
TTTTGCTTCA 
TAGTCACATT 
AGTATAGCAT 
AGAATTTTTT 
AAAA 



AATTGGTGCT 
AGGAGATGCT 
ATAAATGCTT 
TAAATGACAT 
TTACAGACCA 
ATCAAAAGTG 
CTCTCAACCT 
TTTTAAAAAA 
TTATATCTGT 



GGATTAGTGG 
TCTACTGGAT 
TAATTTTCAT 
TTTAAATAAG 
AAGTGTGATT 
GTTTCAATAT 
ATAATTTGGA 
ATATAAAAGC 
TAAATAAAAA 



ATGTTGCTAT 
GGAATTCAGT 
TTGCTACCTC 
TTTATGTATA 
TCACACTGTT 
TTTTTTTA6T 
ATATTGTTGT 
TACCAATCTT 
TTATTTCCAA 



Seq ID NO: CIO DNA Sequence 
Nucleic Acid Accession #: NMJ)03225 
Coding sequence: 41. '.2 95 



ATCCCTGACT 
GGTGATCTGC 
GACAGAGACG 
GCCCTCCCAG 
GTGCTTCTAT 
TCTGCAGGGA 
GAGCTCGGCT 
CACAACACAG 
GATCGATATT 



11 
I 

CGGGGTCGCC 
GCCCTGGTCC 
TGTACAGTGG 
TGTGCAAATA 
CCTAATACCA 
TCTGCCTGCA 
GCCACCTCCA 
ATTGACTGCT 
AAAAAAAAAA 



21 
I 

TTTGGAGCAG 
TGGTGTCCAT 
CCCCCCGTGA 
AGGGCTGCTG 
TCGACGTCCC 
TCCTGACGGG 
CCGGACACCT 
CTGACTTTGA 
AAAAAAAAAA 



31 
I 

AGAGGAGGCA 
GCTGGCCCTC 
AAGACAGAAT 
TTTCGACGAC 
TCCAGAAGAG 
GTGCCGTCCC 
CAGACACGCT 
CTACTCAAAA 
AAAAAAAAAA 



41 
I 

ATGGCCACCA 
GGCACCCTGG 
TGTGGTTTTC 
ACCGTTCGTG 
GAGTGTGAAT 
CAGCACGGTG 
TCTGCAGCTG 
TTGGCCTAAA 
AAAAAAAAAA 



51 
I 

TGGAGAACAA 
CCGAGGCCCA 
CTGGTGTCAC 



TTTAGACACT 
ATTAGTCCCA 
TGCCTCGGCT 
AATTAAAAGA 
AAAAAAAAAA 



Seq ID NO: Cll DNA Sequence 

Nucleic Acid Accession ft: NM_015419.1 

Coding sequence : 1 . . 84 8 7 



1 
I 

ATGCCCAAGC 
CCGOGAGTGG 
TGCAOGTTCC 
AATTTGGGGT 
TTGGAGCTAC 
GACCTCAGCT 
CAGACCCTCC 
TTTATCCACC 
AATCTCCTCC 
AGACTCTCCA 
AGCATGCTTC 
TGCGATTGTG 
TGTAAAAAGG 
AAGTTGTACA 
GAGTCCCCTC 
GATGGTGGCA 
AATATGACCG 
GATGTGTACA 
GTTGCCTTGG 
ATAGCATACT 
CCCAGAGTCA 
GTGAGAGCCC 
CTGAACCGAC 
CAAACAATAT 
CCTAGTGGAG 
TGCAACGTGA 
CTOAAAGCGC 
AGGATCAAGT 
GATGAAATGG 
GAGAAAGACA 
GCTTTAGCAA 
GATTTGGCTA 
GTCCAAGTCA 
CATTTTACGG 
AGACGCCCAG 
GGCTCGGGCA 
GAGGTGTTCC 
AGAAGAAAGC 
GGTCGCAGAG 
GAGCGCTGGG 
GTACCCCCGT 
TTTCCTGCTG 
TCCTCAGCAG 
GCCAGCATGG 
AGCACACCTC 
ACTGAAGGAG 
TCTCCTACTC 
ACAGAGGGTT 
GAGCCTCCAT 
GATTTGGAGA 
CTTACTCCAA 



11 
I 

GCGCGCACTG 
CGCTGGCCTG 
GATCCCTGGC 
TTAATAGCAT 
TTATGATTCA 
CTCTTCAGGT 
AGGGTCTCTC 
CTCAAGCTTT 
ACCAGCTGCA 
CCATAAGGCA 
GGAACATGCC 
AGATGAGATG 
ACAAAGCTTA 
AACATGAGAT 
TGAGACAGAA 
GCCAGCTCAT 
ACGAGCACGG 
AGATTCACTT 
ACTTTGAGTG 
ACAGTGAAGT 
GCTACCAGTA 
AGATTCTTGC 
GTCAGAGTAC 
CCACCAAAGA 
CTGTGCAAAG 
AAGCTTCTGA 
CCATGGATGA 
CCATGGAGCC 
ACCGCATGGT 
CAGTGACAAT 
TACCCGAAGC 
ACACATCACA 
GTGATAGTGG 
TGGGAATCAC 
GTGCAAAGGC 
TGGGAGATGA 
TCAAAACAAA 
TGAAACTCTG 
TGTTTGAATC 
CTGATATTTT 
TGATTAAAAC 
TTTCTCCCCC 
ATGTACCTCT 
GGCTAGAACA 
TGGAGGAAGT 
ACCTGAAGGG 
TGCACACATT 
GGTCTGCAGC 
TGGATGCTGT 
CTAAGTCACA 
CCCCCACCAT 



21 
I 

GGGGGCCCTC 
CCCGCATCCT 

ACAGGCCCTG 
CGGCAATGAG 
TTTCAAGTTC 
TAACTTAATG 
CAACGGCTTA 
CCCCAGCACC 
CCTCTACTTA 
GCTTCTGGAG 
GTTTTTGGAA 
TGAAGGCGGT 
ACACAAGCTG 
CAGGAGCAGG 
CCTGGAGAAA 
GAACATGGTG 
GAACCAAACG 
TCCAATGACC 
TCCCGTGAAG 
CAGGCAGGAT 
AGAACCAGAA 
GGCCAAGAAG 
TACAAGGCAO 
AGATCAGACT 
GAGTCCATCT 
CCCAGACAGC 
ATCTQACTCA 
ATATAGGGTA 
TGGCAAGAAC 
CCACCTTAGC 
TGTATACATG 
TTACTACAGA 
AGTGACCAAG 
TCTTTCCAGA 
AGAGAACACT 
GGATGATGCC 
GAAGCATTCG 
TAGACGAAGG 
AGCCAAAGTC 
CACAAGTCCT 
CTCAGCATCT 
ACTTGGTGAA 
CAACCACAAT 
TGTTGATGAC 
GACAGCAGCC 
AGACACAGTC 
AGATGTTGGA 



ACCAGATGAG 
CTGGGTTAAT 



31 
i 

TCCGTGGTGC 
TGTGCCTGCT 
GCTGGCATTG 
TCAGAAACCT 
ATCCCAAGCA 
AGCTACAACA 
AGGCTGCACA 
ACGTCTCTGA 
TTCTCCACGT 
GCAGAGAACA 
AATCTTTACT 
TGGGATGCAA 
CAGTTGTGTG 
AAGGACATGA 
AGTATTGAGG 
TTCCAACTGC 
AACTTGGTCT 
GATCCTCCAG 
CGAGAAAACT 
CTACACAGAG 
GCTGATGAGG 
TGGGTCATGC 
GTGCTACTTT 
GCTCGGGGCA 
GTCCTGGAAG 
ATCTTCTGGG 
AAGTTCTCCA 
GGCTTGTACC 
CTTGTGCAGT 
CCAGGGGAGT 
TGGATTCTTC 
TTGCCAAATG 



AAAGGGTCTG 
GTCAGAGAAG 
TCAAGGAGAC 
ATCAATGGAG 
GAAAAAGAAC 
ATAAACATGG 
CGTGGGAAAA 
CCATCCTTGA 
CCTGTGCAGA 
GAAGAGCACG 
GGAGTTATTC 
CTTTCTGAGA 
CCTACACTTA 
TATGAAAAGC 
TCGTCACCAG 
GAGTCTGAGC 
GATAAGATGA 
GACTCCAGTA 



41 

I 

TGATCCTGCT 
ACGTCCCCAG 
CTAGACACGT 
CATTTGCAGG 
TCCCCGATGG 
AGCTGAGAGT 
TTGACCACAA 
GGCTACTCCA 
TCACATTTTT 
TGGTTAGAAC 
TGCAGGGAAA 
AATCCAGAGG 
CAATGTGCTT 
CTTGTCTGAA 
AGGAGCAAGA 
CCCAGTGGAG 
GTGACATCAA 
ATATTGACAT 
ATGAAAAGCT 
AGCTCATGCT 
AAGCTCTTTA 
AGCCATCCAT 
CCTACTACAC 
GAAGCTGGGT 
GGGGTCCATG 
TGCTTCCAGA 
TTCTCAGCAG 
AGTGCATTGC 
CTCCCTCCAC 
CGGTGACATT 
CAAACAGAAG 
GAACTCTTTC 
TCAACCAGCA 
GCTTGCCATC 
ACATCGTGGA 
TTCTGCATCC 
ACAAGAAAGC 
CAGAGACCAA 
CAAACAAACA 
ATCTCCCTAA 
GCCTAGAAGT 
CAGTAACCAG 
TTTTGGGTAC 
TTGTTGAACC 
AGACTGAGGA 
TATCTGAGCC 
CCACCCATGA 
AGCCCACATC 
CCATGCAATA 
AAGAAGACAC 
CATCACAGTT 



780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1284 



60 

120 

180 

240 

300 

360 

420 

480 

540 



51 

I 

TTGGGGCCAT 
CGAGGTCCAC 
GGAAAGAATC 
AGTGACCAAG 
AGCTTTAAGA 
GATCACAGGA 
CAAGATCGAG 
TTTGGAAGGA 
GGATTATTTC 
TCTTCCTGCC 
TCCGTGGACC 
AATTCTGAAG 
CAGTCCAAAG 
GCCTTCAATA 
ACAGGAAGAG 
CATCTCTTTG 
GAAACCAATG 
AAATGCAACA 
ATGGAAATTG 
CAGCAAAGAC 
CTACACAGGT 
AGATATCCAG 
CCAGTATTCT 
AATGATTGAG 
CCAGTTGAGC 
TGGCTCCATC 
TGGCTGGCTG 
TCAAGTGAGG 
TCAGCCAGCC 
GCCTTGCAAT 
GATAATTAAT 
CATCCCAAAG 
AGGGGCAGAC 
CAAAAGAGGC 
GGATGAAGGG 
AAAGGACCAA 
CAAGAAAGGG 
TGTTGCAGAA 
GATTAATCCG 
GGGCACAGAA 
CACACCACCT 
TGCTGAAGAA 
CATTTCCTCA 
TGAAGTAACA 
GATAACTTCC 
TTATGAACCA 
AGAGACGGCA 
CAGTGAGTAT 
CTTTGACCCA 
CTTTGCACAC 
ATTTGAGGAT 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 
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TCTACTATAG GGGAACCAGG TGTCCCAGGC CAATCACATC TACAAGGACT GACAGACAAC 3120 

ATCCACCTTG TGAAAAGTAG TCTAAGCACT CAAGACACCT TACTGATTAA AAAGGGTATG 3180 

AAAGAGATGT CTCAGACACT ACAGGGAGGA AATATGCTAG AGGGAGACCC CACACACTCC 3240 

AGAAGTTCTG AGAGTGAGGG CCAAGAGAGC AAATCCATCA CTTTGCCTGA CTCCACACTG 3300 

5 GGTATAATGA GCAGTATGTC TCCAGTTAAG AAGCCTGCGG AAACCACAGT TGGTACCCTC 3360 

CTAGACAAAG ACACCACAAC AGTAACAACA ACACCAAGGC AAAAAGTTGC TCCGTCATCC 3420 

ACCATGAGCA CTCACCCTTC TCGAAGGAGA CCCAACGGGA GAAGGAGATT ACGCCCCAAC 3480 

AAATTCCGCC ACCGGCACAA GCAAACCCCA CCCACAACTT TTGCCCCATC AGAGACTTTT 3540 

TCTACTCAAC CAACTCAAGC ACCTGACATT AAGATTTCAA GTCAAGTGGA GAGTTCTCTG 3600 

10 GTTCCTACAG CTTGGGTGGA TAACACAGTT AATACCCCCA AACAGTTGGA AATGGAGAAG 3660 

AATGCAGAAC CCACATCCAA GGGAACACCA CGGAGAAAAC ACGGGAAGAG GCCAAACAAA 3720 

CATOGATATA CCCCTTCTAC AGTGAGCTCA AGAGCGTCCG GATCCAAGCC CAGCCCTTCT 3780 

CCAGAAAATA AACATAGAAA CATTGTTACT CCCAGTTCAG AAACTATACT TTTGCCTAGA 3840 

ACTGTTTCTC TGAAAACTGA GGGCCCTTAT GATTCCTTAG ATTACATGAC AACCACCAGA 3900 

15 AAAATATATT CATCTTACCC TAAAGTCCAA GAGACACTTC CAGTCACATA TAAACCCACA 3960 

TCAGATGGAA AAGAAATTAA GGATGATGTT GCCACAAATG TTGACAAACA TAAAAGTGAC 4020 

ATTTTAGTCA CTGGTGAATC AATTACTAAT GCCATACCAA CTTCTCGCTC CTTGGTCTCC 4080 

ACTATGGGAG AATTTAAGGA AGAATCCTCT CCTGTAGGCT TTCCAGGAAC TCCAACCTGG 4140 

AATCCCTCAA GGACGGCCCA GCCTGGGAGG CTACAGACAG ACATACCTGT TACCACTTCT 4200 

20 GGGGAAAATC TTACAGACCC TCCCCTTCTT AAAGAGCTTG AGGATGTGGA TTTCACTTCC 4260 

GAGTTTTTGT CCTCTTTGAC AGTCTCCACA CCATTTCACC AGGAAGAAGC TGGTTCTTCC 4320 

ACAACTCTCT CAAGCATAAA AGTGGAGGTG GCTTCAAGTC AGGCAGAAAC CACCACCCTT 4380 

GATCAAGATC ATCTTGAAAC CACTGTGGCT ATTCTCCTTT CTGAAACTAG ACCACAGAAT 4440 

CACACCCCTA CTGCTGCCCG GATGAAGGAG CCAGCATCCT CX3TCCCCATC CACAATTCTC 4500 

25 ATGTCTTTGG GACAAACCAC CACCACTAAG CCAGCACTTC CCAGTCCAAG AATATCTCAA 4560 

GCATCTAGAG ATTCCAAGGA AAATGTTTTC TTGAATTATG TGGGGAATCC AGAAACAGAA 4620 

GCAACCCCAG TCAACAATGA AGGAACACAG CATATGTCAG GGCCAAATGA ATTATCAACA 4680 

CCCTCTTCCG ACCGGGATGC ATTTAACTTG TCTACAAAGC TGGAATTGGA AAAGCAAGTA 4740 

TTTGGTAGTA GGAGTCTACC ACGTGGCCCA GATAGCCAAC GCCAGGATGG AAGAGTTCAT 4800 

30 GCTTCTCATC AACTAACCAG AGTCCCTGCC AAACCCATCC TACCAACAGC AACAGTGAGG 4860 

CTACCTGAAA TGTCCACACA AAGCGCTTCC AGATACTTTG TAACTTCCCA GTCACCTCGT 4920 

CACTGGACCA ACAAACCGGA AATAACTACA TATCCTTCTG GGGCTTTGCC AGAGAACAAA 4980 

CAGTTTACAA CTCCAAGATT ATCAAGTACA ACAATTCCTC TCCCATTGCA CATGTCCAAA 5040 

CCCAGCATTC CTAGTAAGTT TACTGACCGA AGAACTGACC AATTCAATGG TTACTCCAAA 5100 

35 GTGTTTGGAA ATAACAACAT CCCTGAGGCA AGAAACCCAG TTGGAAAGCC TCCCAGTCCA 5160 

AGAATTCCTC ATTATTCCAA TGGAAGACTC CCTTTCTTTA CCAACAAGAC TCTTTCTTTT 5220 

CCACAGTTGG GAGTCACCCG GAGACCCCAG ATACCCACTT CTCCTGCCCC AGTAATGAGA 5280 

GAGAGAAAAG TTATTCCAGG TTCCTACAAC AGGATACATT CCCATAGCAC CTTCCATCTG 5340 

GACTTTGGCC CTCCGGCACC TCCGTTGTTG CACACTCCGC • AGACCACGGG ATCACCCTCA 5400 

40 ACTAACTTAC AGAATATCCC TATGGTCTCT TCCACCCAGA GTTCTATCTC CTTTATAACA 5460 

TCTTCTGTCC AGTCCTCAGG AAGCTTCCAC CAGAGCAGCT CAAAGTTCTT TGCAGGAGGA 5520 

CCTCCTGCAT CCAAATTCTG GTCTCTTGGG GAAAAGCCCC AAATCCTCAC CAAGTCCCCA 5580 

CAGACTGTGT CCGTCACCGC TGAGACAGAC ACTGTGTTCC CCTGTGAGGC AACAGGAAAA 5640 

CCAAAGCCTT TCGTTACTTG GACAAAGGTT TCCACAGGAG CTCTTATGAC TCCGAATACC 5700 

45 AGGATACAAC GGTTTGAGGT TCTCAAGAAC GGTACCTTAG TGATACGGAA GGTTCAAGTA 5760 

CAAGATCGAG GCCAGTATAT GTGCACCGCC AGCAACCTGC ACGGCCTGGA CAGGATGGTG 5820 

GTCTTGCTTT CGGTCACCGT GCAGCAACCT CAAATCCTAG CCTCCCACTA CCAGGACGTC 5880 

ACTGTCTACC TGGGAGACAC CATTGCAATG GAGTGTCTGG CCAAAGGGAC COCAGCCCCC 5940 

CAAATTTCCT GGATCTTCCC TGACAGGAGG GTGTGGCAAA CTGTGTCCCC CGTGGAGAGC 6000 

50 CGCATCACCC TGCACGAAAA CCGGACCCTT TCCATCAAGG AGGCGTCCTT CTCAGACAGA 6060 

GGCGTCTATA AGTGCGTGGC CAGCAATGCA GCCGGGGCGG ACAGCCTGGC CATCCGCCTG 6120 

CACGTGGOGG CACTGCCCCC CGTTATCCAC CAGGAGAAGC TGGAGAACAT CTCGCTGCCC 6180 

CCGGGGCTCA GCATTCACAT TCACTGCACT GCCAAGGCTG OGCCCCTGCC CAGCGTGCGC 6240 

TGGGTGCTOG GGGACGGTAC CCAGATCCX3C CCCTCGCAGT TCCTCCACGG GAACTTGTTT 6300 

55 GTTTTCCCCA ACGGGACGCT CTACATCCGC AACCTCGCGC CCAAGGACAG CGGGCGCTAT 6360 

GAGTGGGTGG CCGCCAACCT GGTAGGCTCC GCGCGCAGGA CGGTGCAGCT GAACGTGCAG 6420 

CGTGCAGCAG CCAACGCGCG CATCACGGGC ACCTCCCCGC GGAGGACGGA CGTCAGGTAC 6480 

GGAGGAACCC TCAAGCTGGA CTGCAGCGCC TCGGGGGACC CCTGGCCGCG CATCCTCTGG 6540 

AGGCTGCCGT CCAAGAGGAT GATCGACGCG CTCTTCAGTT TTGATAGCAG AATCAAGGTG 6600 

60 TTTGCCAATG GGACCCTGGT GGTGAAATCA GTGAOGGACA AAGATGCCGG AGATTACCTG 6660 

TGCGTAGCTC GAAATAAGGT TGGTGATGAC TACGTGGTGC TCAAAGTGGA TGTGGTGATG 6720 

AAACCGGCCA AGATTGAACA CAAGGAGGAG AACGACCACA AAGTCTTCTA CGGGGGTGAC 6780 

CTGAAAGTGG ACTGTGTGGC CACCGGGCTT CCCAATCCCG AGATCTCCTG GAGCCTCCCA 6840 

GACGGGAGTC TGGTGAACTC CTTCATGCAG TCGGATGACA GCGGTGGACG CACCAAGOGC 6900 

65 TATGTCGTCT TCAACAATGG GACACTCTAC TTTAACGAAG TGGGGATGAG GGAGGAAGGA 6960 

GACTACACCT GCTTTGCTGA AAATCAGGTC GGGAAGGACG AGATGAGAGT CAGAGTCAAG 7020 

GTGGTGACAG CGCCCGCCAC CATCCGGAAC AAGACTTACT TGGCGGTTCA GGTGCCCTAT 7080 

GGAGACGTGG TCACTGTAGC CTGTGAGGCC AAAGGAGAAC CCATGCCCAA GGTGACTTGG 7140 

TTGTCCCCAA CCAACAAGGT GATCCCCACC TCCTCTGAGA AGTATCAGAT ATACCAAGAT 7200 

70 GGCACTCTCC TTATTCAGAA AGCCCAGCGT TCTGACAGCG GCAACTACAC CTGCCTGGTC 7260 

AGGAACAGCG CGGGAGAGGA TAGGAAGACG GTGTGGATTC ACGTCAACGT CCAGCCACCC 7320 

AAGATCAACG GTAACCCCAA CCCCATCACC ACCGTGCGGG AGATAGCAGC CGGGGGCAGT 7380 

CGGAAACTGA TTGACTGCAA AGCTGAAGGC ATCCCCACCC OGAGGGTGTT ATGGGCTTTT 7440 

CCCGAGGGTG TGGTTCTGCC AGCTCCATAC TATGGAAACC GGATCACTGT CCATGGCAAC 7500 

75 GGTTCCCTGG ACATCAGGAG TTTGAGGAAG AGCGACTCCG TCCAGCTGGT ATGCATGGCA 7560 

CGCAACGAGG GAGGGGAGGC GAGGTTGATC GTGCAGCTCA CTGTCCTGGA GCCCATGGAG 7620 

AAACCCATCT TCCACGACCC GATCAGCGAG AAGATCACGG CCATGGCGGG CCACACCATC 7680 

AGCCTCAACT GCTCTGCOGC GGGGACCCCG ACACCCAGCC TGGTGTGGGT CCTTCCCAAT 7740 

GGCACOGATC TGCAGAGTGG ACAGCAGCTG CAGCGCTTCT ACCACAAGGC TGACGGCATG 7800 

80 CTACACATTA GCGGTCTCTC CTCGGTGGAC GCTGGGGCCT ACCGCTGCGT GGCCCGCAAT 7860 

GCCGCTGGCC ACACGGAGAG GCTGGTCTCC CTGAAGGTGG GACTGAAGCC AGAAGCAAAC 7920 

AAGCAGTATC ATAACCTGGT CAGCATCATG AATGGTGAGA CCCTGAAGCT CCCCTGCACC 7980 

CCTCCCGGGG CTGGGCAGGG ACGTTTCTCC TGGACGCTCC CCAATGGCAT GCATCTGGAG 8040 

GGCCCCCAAA CCCTGGGACG CGTTTCTCTT CTGGACAATG GCACCCTCAC GGTTCGTGAG 8100 
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GCCTCGGTGT TTGACAGGGG TACCTATGTA TGCAGGATGG AGACGGAGTA CGGCCCTTCG 8160 

GTCACCAGCA TCCCCGTGAT TGTGATCGCC TATCCTCCCC GGATCACCAG CGAGCCCACC 8220 

CCGGTCATCT ACACCCGGCC CGGGAACACC GTGAAACTGA ACTGCATGGC TATGGGGATT 8280 

CCCAAAGCTG ACATCACGTG GGAGTTACCG GATAAGTCGC ATCTGAAGGC AGGGGTTCAG 8340 

5 GCTCGTCTGT ATGGAAACAG ATTTCTTCAC CCCCAGGGAT CACTGACCAT CCAGCATGCC 8400 

ACACAGAGAG ATGCCGGCTT CTACAAGTGC ATGGCAAAAA ACATTCTCGG CAGTGACTCC 8460 

AAAACAACTT ACATCCACGT CTTCTGAAAT GTGGATTCCA GAATGATTGC TTAGGAACTG 8520 

ACAACAAAGC GGGGTTTGTA AGGOAAGCCA GGTTGGGGAA TAGGAGCTCT TAAATAATGT 8580 

GTCACAGTGC ATGGTGGCCT CTGGTGGGTT TCAAGTTGAG GTTGATCTTG ATCTACAATT 8640 

GTTGGGAAAA GGAAGCAATG CAGACACGAG AAGGAGGGCT CAGCCTTGCT GAGACACTTT 8700 

CTTTTGTGTT TACATCATGC CAGGGGCTTC ATTCAGGGTG TCTGTGCTCT GACTGCAATT 8760 

TTTCTTCTTT TGCAAATGCC ACTCGACTGC CTTCATAAGC GTCCATAGGA TATCTGAGGA 8820 

ACATTCATCA AAAATAAGCC ATAGACATGA ACAACACCTC ACTACCCCAT TGAAGACGCA 8880 

TCACCTAGTT AACCTGCTGC AGTTTTTACA TGATAGACTT TGTTCCAGAT TGACAAQTCA 8940 

TCTTTCAGTT ATTTCCTCTG TCACTTCAAA ACTCCAGCTT GCCCAATAAG GATTTAGAAC 9000 

CAGAGTGACT GATATATATA TATATATTTT AATTCAGAGT TACATACATA CAGCTACCAT 9060 

TTTATATGAA AAAAGAAAAA CATTTCTTCC TGGAACTCAC TTTTTATATA ATGTTTTATA 9120 

TATATATTTT TTCCTTTCAA ATCAGACGAT GAGACTAGAA GGAGAAATAC TTTCTGTCTT 9180 

ATTAAAATTA ATAAATTATT GGTCTTTACA AGACTTGGAT ACATTACAGC AGACATGGAA 9240 

ATATAATTTT AAAAAATTTC TCTCCAACCT CCTTCAAATT CAGTCACCAC TGTTATATTA 9300 

CCTTCTCCAG GAACCCTCCA GTGGGGAAGG CTGCGATATT AGATTTCCTT GTATGCAAAG 9360 

TTTTTGTTGA AAGCTGTGCT CAGAGGAGGT GAGAGGAGAG GAAGGAGAAA ACTGCATCAT 9420 

AACTTTACAG AATTGAATCT AGAGTCTTCC CCGAAAAGCC CAGAAACTTC TCTGCAGTAT 9480 

CTGGCTTGTC CATCTGGTCT AAGGTGGCTG CTTCTTCCCC AGCCATGAGT CAGTTTGTGC 9540 

CCATGAATAA TACACGACCT GTTATTTCCA TGACTGCTTT ACTGTATTTT TAAGGTCAAT 9600 

ATACTGTACA TTTGATAATA AAATAATATT CTCCCAAAAA AAAAA 9645 
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Seg ID NO: C12 DNA Sequence 
Nucleic Acid Accession #: AK001903 
Coding sequence: none 

1 11 21 31 41 51 

! I 1 I I I 

TATCATGCAT GTGGGAAGGT GGGTGTGGTG AGAAAAGTTT TAAGGCAAGA GTAGATGGCC 60 

ATGTTCAACT TTACAAAATT TCTTGGAAAA CTGGCAGTAT TTTGAACTGC ATCTTCTTTG 120 

GTACCGGAAC CTGCAGAAAC AGTGTGAGAA ATTAAGTCCT GGTTCACTGC GCAGTAGCAA 180 

AGATGGTCAA GGCCATGGAA AAAGCAGAAA TTTACCAAGA AAGCTGATAC CCATGTATAG 240 

TTCCCACTCA TCTCAAATAC ATCTGCTATC TTTTTAAGCT AAGTCCTAGA CATATCGGGG 300 

ATAACATGGG GGTTGATTAG TGACCACAGT TATCAGAAGC AGAGAAATGT AATTCCATAT 360 

TTTATTTGAA ACTTATTCCA TATTTTAATT GGATATTGAG TGATTGGGTT ATCAAACACC 420 

CACAAACTTT AATTTTGTTA AATTTATATG GCTTTGAAAT AGAAGTATAA GTTGCTACCA 480 

TTTTTTGATA ACATTGAAAG ATAGTATTTT ACCATCTTTA ATCATCTTGG AAAATACAAG 540 

TCCTGTGAAC AACCACTCTT TCACCTAGCA GCATGAGGCC AAAAGTAAAG GCTTTAAATT 600 

ATAACATATG GGATTCTTAG TAGTATGTTT TTTTCTTGAA ACTCAGTGGC TCTATCTAAC 660 

CTTACTATCT CCTCACTCTT TCTCTAAGAC TAAACTCTAG GCTCTTAAAA ATCTGCCCAC 720 

ACCAATCTTA GAAGCTCTGA AAAGAATTTG TCTTTAAATA TCTTTTAATA GTAACATGTA 780 

TTTTATGGAC CAAATTGACA TTTTCGACTA TTTTTTCCAA AAAAGTCAGG TGAATTTCAG 840 

CACACTGAGT TGGGAATTTC TTATCCCAGA AGACCAACCA ATTTCATATT TATTTAAGAT 900 

TGAT TCCATA CT CCGTTT TC AAGGAGAATC CCTGCAGTCT CCTTAAAGGT AGAACAAATA 960 

CTTTCTATTT TTTTTTTCAC CATTGTGGGA TTGGACTTTA AGAGGTGACT CTAAAAAAAC 1020 

AGAGAACAAA TATGTCTCAG TTGTATTAAG CACGGACCCA TATTATCATA TTCACTTAAA 1080 

AAAATGATTT CCTGTGCACC TTTTGGCAAC TTCTCTTTTC AATGTAGGGA AAAACTTAGT 1140 

CACCCTGAAA ACCCACAAAA TAAATAAAAC TTGTAGATGT GGGCAGAAGG TTTGGGGGTG 1200 

GACATTGTAT GTGTTTAAAT TAAACCCTGT ATCACTGAGA AGCTGTTGTA TGGGTCAGAG 1260 

AAAATGAATG CTTAGAAGCT GTTCACATCT TCAAGAGCAG AAGCAAACCA CATGTCTCAG 1320 

CTA TATTA TT ATTTATTTTT TATGCATAAA GTGAATCATT TGTTCTGTAT TAATTTCCAA 1380 

AGGGTTTTAC CCTCTATTTA AATGCTTTGA AAAACAGTGC ATTGACAATG GGTTGATATT 1440 

TTTCTTTAAA AGAAAAATAT AATTATGAAA GCCAAGATAA TCTGAAGCCT GTTTTATTTT 1500 

AAAACTTTTT ATGTTCTGTG GTTGATGTTG TTTGTTTGTT TGTTTCTATT TTGTTGGTTT 1560 

TTTACTTTGT TTTTTGTTTT GTTTTQTTTT GTTTTGCATA CTACATGCAG TTCTTTAACC 1620 

AATGTC TGTT TGGCTAATGT AATTAAAGTT GTTAATTTAT ATGAGTGCAT TTCAACTATG 1680 

TCAATGGTTT CTTAATATTT ATTGTGTAGA AGTACTGGTA ATTTTTTTAT TTACAATATG 1740 

TTTAAAGAGA TAACAGTTTG ATATGTTTTC ATGTGTTTAT AGCAGAAGTT ATTTATTTCT 1800 

ATGGCATTCC AGCGGATATT TTGGTGTTTG CGAGGCATGC AGTCAATATT TTGTACAGTT 1860 

AGTGGACAGT ATTCAGCAAC GCCTGATAGC TTCTTTGGCC TTATGTTAAA TAAAAAGACC 1920 

TGTTTGGGAT GT 1932 

Seq ID NO: C13 Protein Sequence 
Nucleic Acid Accession #: Eos sequence 
Coding sequence: 1..5001 



nc 1 11 21 31 41 51 

75 I I I I I I 

ATGCCAGGCA CAAAACTAAC CCGAACAGGC GCCCCAGCAG ACTACAGAGT GATATTQAAG 60 

ACCTCTCAAG AGGACGAATT GGATGTACCT GACGACATCA GCGTCCGGGT TATGTCATCT 120 

CAGTCTGTGC TTGTGTCCTG GGTGGATCCT GTTCTGGAAA AACAGAAGAA AGTTGTTGCA 180 

Q ^ TCAAGACAGT ACACCGTGCG CTATCGAGAG AAGGGGGAAT TGGCCAGGTG GGATTATAAG 240 

OU CAGATCGCTA ACAGGCGTGT GCTGATTGAG AACCTGATTC CAGACACTGT GTATGAATTT 300 

GCAGTCCGTA TTTCACAGGG TGAAAGAGAT GGCAAATGGA GTACGTCAGT CTTCCAAAGA 360 

ACACCAGAAT CTGCCCCTAC CACAGCTCCT GAAAACTTGA ACGTCTGGCC AGTCAATGGC 420 

AAACCTACAG TTGTCGCTGC ATCTTGGGAT GCGCTACCAG AGACTGAGGG GAAAGTGAAA 480 

GTCTGTCTGC TGGACACAGG ACTGTTTTCA GTTTCCTCCT TCCAACCATC TGCCAAATCA S40 
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TTTCAGAATA CATTCTTTCA TACGCCCOGG CTCTCAAACC ATTTGGAGCA AAGTCCCTCA 600 

CCTATCCTGG AGACACTACT TCTGCCCTGG TGGATGGTCT GCAGCCTGGG GAACGCTATC 660 

TTTTCAAAAT CCGGGCCACA AACAGGAGAG GCCTGGGACC TCACTCCAAA GCCTTCATTG 720 

TCGCTATGCC AACAAGAATG CAGCTGTACC CAGAAGGATT TCAGTTGTCT AGCTTACCTG 780 

J ATCGATATCC AAACCAAACA AGTTAATAAA GATCCACAAC TGGAAGGGAG TGTTTTTGGA 840 

CCATGTTTTC TTTTCTACTT CCTCACATTT ATGCTGGATA TTGGOGGCTT TTCCTTCATT 900 

ATGTGCTATG AAGACCCANN TGTTTCTTCT TTGACAGGCA ATTCTTTAAA ATCTGTTGCA 960 

GCCAGTAAGG CGGATGTTCA GCAGAACACG GAGGACAATG GGAAACCCGA AAAACCTGAG 1020 

CCTTCCTCAC CTTCTCCCAG AGCTCCAGCT TCCTCCCAAC ACCCCTCTGT GCCTGCTTCT 1080 

ID CCCCAAGGGA GAAATGCCAA GGACCTTCTT CTTGACTTGA AGAACAAAAT ATTGGCTAAT 1140 

GGTGGGGCGC CCCGAAAACC CCAGCTTCGC GCCAAGAAGG CAGAGGAGCT GGATCTTCAG 1200 

TCGACAGAAA TCACTGGGGA GGAGGAGCTG GGTTCCCGGG AGGACTCGCC CATGTCACCC 1260 

TCAGACACCC AAGACCAGAA ACGGACCCTG AGGCCGCCAA GTAGACACGG CCACTCGGTG 1320 

GTTGCTCCCG GCAGGACTGC AGTGAGGGCC CGGATGCCAG CGCTGCCCCG AAGGGAAGGC 1380 

GTAGATAAGC CTGGCTTTTC CCTGGCCACG CAGCCCCGCC CAGGGGCGCC CCCCTCGGCT 1440 

TCGGCCTCTC CTGCCCACCA CGCGTCCACC CAGGGCACCT CTCATCGTCC TTCCCTGCCT 1500 

GCCAGCTTGA ATGACAACGA CTTGGTGGAC TCAGACGAAG ATGAGCGCGC TGTGGGCTCC 1560 

CTCCACCCCA AGGGCGCCTT CGCCCAGCCC CGGCCAGCCC TGTCCCCCAG CCGCCAGTCC 1620 

CCGTCCAGCG TTCTCCGCGA CAGAAGCTCT GTGCACCCCG GOGCAAAGCC AGCCTCGCCG 1680 

GCGCGGAGGA CCCCCCATTC AGGGGCCGCA GAGGAAGATT CCAGTGCCTC AGCCCCACCC 1740 

TCAAGACTTT CTCCACCCCA TGGGGGATCA TCTCGGCTGC TGCCCACCCA GCCACACCTG 1800 

AGCTCTCCAC TTTCCAAGGG CGGGAAGGAT GGTGAGGACG CCCCAGCCAC CAACTCCAAT 1860 

GCGCCATCAC GGTCCACCAT GTCCTCCTCC GTCTCTTCTC ATCTCTCGTC CAGGACGCAG 1920 

GTCTCTGAGG GAGCGGAGGC TTCTGATGGT GAAAGCCACG GTGACGGCGA TAGGGAAGAC 1980 

GGCGGAAGGC AGGCGGAGGC CACGGCCCAG ACGCTGCGGG CCCGGCCTGC CTCTGGACAC 2040 

TTCCATTTGC TCAGACACAA ACCCTTTGCT GCCAACGGGA GGTCTCCAAG CAGGTTCAGC 2100 

ATTGGGCGGG GACCTCGGCT GCAGCCCTCC AGCTCCCCAC AGTCGACTGT GCCCTCOCGA 2160 

GCCCACCCCA GGGTTCCCTC TCACTCTGAT TCCCACCCTA AGCTTAGCTC AGGTATCCAT 2220 

GGAGACGAGG AGGATGAGAA GCCGCTTCCT GCCACCGTTG TCAATGACCA CGTGCCTTCC 2280 

TCCTCCAGGC AGCCCATCTC CCGGGGCTGG GAGGACTTAA GGAGAAGCCC GCAGAGAGGG 2340 

GCCAGCCTGC ATCGGAAGGA ACCCATCCCA GAGAACCCCA AATCCACAGG GGCAGATACA 2400 

CATCCTCAGG GCAAGTACTC CTCCCTGGCC TCCAAGGCTC AGGATGTTCA ACAGAGCACA 2460 

GACGCGGACA CGGAGGGTCA TTCTCCCAAA GCACAGCCAG GGTCCACAGA CCGCCACGCG 2520 

TCCCCTGCTC GTCCTCCCGC AGCACGGTCA CAGCAGCATC CCAGTGTTCC CAGAAGGATG 258 0 

ACACCCGGCC GGGCCCCAGA ACAGCAGCCC CCTCCTCCOG TCGCCACGTC CCAGCACCAC 2640 

CCGGGACCCC AGAGCAGAGA CGCGGGTCGG TCACCTTCCC AGCCCAGGCT CTCACTGACC 2700 

CAGGCCGGGC GGCCCCGCCC CACGTCGCAG GGCCGCTCCC ACTCCTCCTC GGACCCTTAC 2760 

ACGGCGAGCT CCAGAGGGAT GCTCCCCACG GCCCTCCAGA ACCAGGACGA GGATGCCCAG ,2820 

GGCAGCTACG ACGACGACAG CACAGAAGTC GAGGCCCAGG ATGTGCGGGC CCCCGCGCAC 2880 

GCCGCGCGCG CCAAGGAGGC AGCTGCGTCC CTTCCCAAGC ACCAGCAGGT GGAGTCTCCC .2940 

ACAGGCGCAG GGGCAGGTGG CGACCACAGG TCCCAGCGCG GACATGCGGC CTCCCCCGCC 3000 

AGGCCCAGCC GACCCGGCGG CCCCCAGTCC CGCGCCCGGG TCCCCAGCAG GGCAGCGCCG 3060 

GGGAAGTCGG AGCCTCCTTC CAAGCGGCCC CTGTCCTCCA AGTCCCAGCA GTCGGTCTCA 3120 

GCCGAGGACG AGGAGGAGGA GGACGCGGGG TTTTTTAAAG GCGGGAAAGA AGACCTTCTG 3180 

TCTTCCTCTG TGCCAAAGTG GCCCTCTTCC TCCACTCCCA GGGGCGGCAA AGACGCCGAT 3240 

GGGAGCCTCG CCAAGGAAGA GAGGGAGCCT GCCATCGCGC TTGCCCCTCG CGGAGGGAGC 3300 

CTGGCTCCTG TGAAGCGACC TCTCCCCCCA CCTCCAGGCA GCTCCCCCAG GGCCTCCCAC 3360 

GTCCCTTCCC GACCGCCGCC TCGCAGCGCT GCCACOGTGA GCCCCGTCGC GGGCACCCAC 3420 

CCCTGGCCGC GGTACACCAC GCGCGCCCCV CCTGGCCACT TCTCCACCAC CCCGATGCTG 3480 

jU TCCTTGCGCC AGAGGATGAT GCATGCCAGA TTCCGTAACC CTCTCTCCCG ACAGCCTGCC 3540 

AGACCCTCTT ACAGACAAGG TTATAATGGC AGACCAAATG TAGAAGGGAA AGTCCTTCCT 3600 

GGTAGTAATG GAAAACCGAA TGGACAGAGA ATTATCAATG GCCCTCAAGG AACAAAGTGG 3660 

GTTGTGGACC TTGATCGTGG GTTAGTATTG AATGCAGAAG GAAGGTACCT CCAAGATTCA 3720 

CATGGAAATC CTCTTCGGAT TAAACTAGGA GGAGATGGTC GAACCATTGT AGATCTGGAA 3780 

GGGACCCCCG TGGTGAGTCC TGACGGCCTC CCACTCTTTG GGCAGGGGOG ACATGGCACA . 3840 
CCTCTGGCCA ATGCCCAAGA TAAGCCAATT TTGAGTCTTG GAGGAAAGCC GCTGGTGGGC . 3900 

TTGGAGGTCA TCAAAAAAAC CACCCATCCC CCTACCACTA CCATGCAGCC CACCACTACT 3960 

ACGACGCCCC TGCCTACCAC TACAACCCCG AGGCCCACCA CTGCCACCAC CATGCAGCCC 4020 

ACCACTACTA CGACGCCCCT GCCTACCACT ACACCGAGGC CCACCACTGC CACCACCCGC 4080 

CGCACGACCA CCAGGCGTCC AACAACCACA GTCCGAACCA CTACGCGGAC AACCACCACC 4140 

ACCACCCCCA AACCCACCAC TCCCATCCCC ACCTGTCCCC CTGGGACCTT GGAACGGCAC 4200 

GACGATGATG GCAACCTGAT AATGAGCTCC AATGGGATCC CAGAGTGCTA CGCTGAAGAA 4260 

GATGAGTTCT CAGGCTTGGA GACTGACACT GCAGTACCTA CGGAAGAGGC CTACGTTATA 4320 

TATGATGAAG ATTATGAATT TGAGACGTCA AGGCCACCAA CCACCACTGA GCCTTCGACC 4380 

ACTGCTACCA CACCGAGGGT GATCCCAGAG GAAGGCGCCA TCAGTTCCTT TCCTGAAGAA 4440 

GAATTTGATC TGGCTGGAAG GAAACGATTT GTTGCTCCTT ACGTGACGTA CCTAAATAAA 450 0 

GACCCATCAG CCCCGTGCTC TCTGACTGAT GCACTGGATC ACTTCCAAGT GGACAGCCTG 4560 

GATGAAATCA TCCCCAATGA CCTGAAGAAG AGTGATCTGC CTCCCCAGCA TGCTCCCCGC 4620 

AACATCACCG TGGTGGCCGT GOAAGGTTGC CACTCATTTG TCATTGTGGA TTGGGACAAA 4680 

GCCACCCCAG GAGATTTGGT CACAGGTTAT TTGGTTTACA GTGCATCCTA TGAAGATTTC 4740 

ATCAGGAACA AGTTTTCCAC TCAAGCTTCA TCAGTAACTC ACTTGCCCAT TGAGAACCTA 4800 

AAGCCCAACA CGAGGTATTA TTTTAAAGTG CAAGCACAAA ATCCTCATGG CTACGGACCT 4860 

ATCAGCCCTT CGGTCTCATT TGTCACCGAA TCAGATAATC CTCTGCTTGT TGTGAGGCCC 4920 

CCAGGCGGTG AGCTATCTGG ATCCCATTCG CTTTCAAACA TGATCCCAGC TACACGGACT 4980 

GCCATGGACG GCAATATGTG AAGCGCACGT GGTATCGAAA GTTCGTGGGA GTTG T T CT TT 5040 

GTAATTCACT GAGGTATAAA ATCTACCTCA GTGACAACCT GAAAGATACA TTCTACAGCA 5100 

TTGGAGACAG CTGGGGAAGA GGTGAAGACC ATTGCCAATT TGTGGATTCA CACCTTGATG 5160 

GAAGAACAGG GCCTCAGTCC TATGTAGAAG CCCTCCCTAC TATTCAAGGC TACTATCGCC 5220 

AGTATCGTCA GGAGCCTGTC AGGTTTGGGA ACATCGGCTT CGGAACCCCC TACTACTATG 5280 

TGGGCTGGTA CGAGTGTGGG GTCTCCATCC CTGGAAAGTG GTAATCACAG GACCGTCATG 5340 

CTGCAAGCTT GCCCTGCCCA GCCCCACCAA CTAAGTCGCA CTAGGGGCTG TGAGCAAAGA 5400 

CAGCCAGCAT GCTCAGCCCC GCTGCCCTAG GTGCCAGGAA GGTCACAGAT GGACACTGGC 5460 

CATTCTGGTC ATCTCAGTCT GGAACTCAGT CCCACTTCTT GGCCTGGACA ATGAACAGGA 5520 

TTCAGTTTTG CTGTTAACTT TGCTTCTCTA CTTTTTTTTG TTTGTTTGTA ATAGCACATC 5580 
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CCAGAGACAT CAGAAACCAG CAACTGATTC AGTGTQATTT CCCAGACTTT TTAGGCATGA 5640 

AATTCGGACA CTTCAGTATT TCCAGGAATA GCATATGCAC GCTGTTCTTG CTTCATGGAA 5700 

TGCTACATGC TTTCTGTTTT TCTCATTTTG GATTTCTCCA AAACTAACTG AATTTAAGCT 5760 

TCAGGTCCCT TTGTATGCAG TAGAAAGGAA TTATTAAAAA CACCACCAAA GAAAATAAAT 5820 

ATATCCTACT TGAAATTTAC TCTATGGACT TACCCACTGC TAGAATAAAT GTATCAAATC 5680 

TTATTTGTAA ATTCTCAATT TTGATATATA TATGTATATA TGCATATACA TATCCACACT 5940 

TGTCTGCAAG AATATTGATT AAAATTGCTA AATTTGTACT TGTTCACCAA AAAAAAAAAA 6000 

AAAAAAA 6007 

Seq ID NO: C14 DNA Sequence 
Nucleic Acid Accession #: NM_003014 
Coding sequence: 238.. 1278 



ir 1 11 21 31 41 51 

15 | I J | | | 

GGCGGGTTCG CGCCCCGAAG GCTGAGAGCT GGCGCTGCTC GTGCCCTGTG TGCCAGACGG 60 

CGGAGCTCCG CGGCCGGACC CCGCGGCCCC GCTTTGCTGC CGACTGGAGT TTGGGGGAAG 120 

AAACTCTCCT GCGCCCCAGA AGATTTCTTC CTCGGCGAAG GGACAGCGAA AGATGAGGGT 180 

GGCAGGAAGA GAAGGCGCTT TCTGTCTGCC GGGGTCGCAG CGCGAGAGGG CAGTGCCATG 240 

2Aj ttcctctcca TCCTAGTGGC GCTGTGCCTG TGGCTGCACC TGGCGCTGGG CGTGCGCGGC 300 

GCGCCCTGCG AGGCGGTGCG CATCCCTATG TGCCGGCACA TGCCCTGGAA CATCACGCGG 360 

ATGCCCAACC ACCTGCACCA CAGCACGCAG GAGAAOGCCA TCCTGGCCAT CGAGCAGTAC 420 

GAGGAGCTGG TGGACGTGAA CTGCAGCGCC GTGCTGCGCT TCTTCTTCTG TGCCATGTAC 480 

0 - GCGCCCATTT GCACCCTGGA GTTCCTGCAC GACCCTATCA AGCCGTGCAA GTCGGTGTGC 540 

2.J CAACGCGCGC GCGACGACTG CGAGCCCCTC ATGAAGATGT ACAACCACAG CTGGCCCGAA 600 

AGCCTGGCCT GGGACGAGCT GCCTGTCTAT GACCGTGGCG TGTGCATTTC GCCTGAAGCC 660 

ATCGTCACGG ACCTCCCGGA GGATGTTAAG TGGATAGACA TCACACCAGA CATGATGGTA 720 

CAGGAAAGGC CTCTTGATGT TGACTGTAAA CGCCTAAGCC CCGATCGGTG CAAGTGTAAA 780 

AAGGTGAAGC CAACTTTGGC AACGTATCTC AGCAAAAACT ACAGCTATGT TATTCATGCC 840 

J\J AAAATAAAAG CTGTGCAGAG GAGTGGCTGC AATGAGGTCA CAACGGTGGT GGATGTAAAA 900 

GAGATCTTCA AGTCCTCATC ACCCATCCCT CGAACTCAAG TCCCGCTCAT TACAAATTCT 960 

TCTTGCCAGT GTCCACACAT CCTGCCCCAT CAAGATGTTC TCATCATGTG TTACGAGTGG 1020 

CGTTCAAGGA TGATGCTTCT TGAAAATTGC TTAGTTGAAA AATGGAGAGA TCAGCTTAGT 1080 

AAAAGATCCA TACAGTGGGA AGAGAGGCTG CAGGAACAGC GGAGAACAGT TCAGGACAAG 1140 

DJ AAGAAAACAG CCGGGCGCAC CAGTCGTAGT AATCCCCCCA AACCAAAGGG AAAGCCTCCT 1200 

GCTCCCAAAC CAGCCAGTCC CAAGAAGAAC ATTAAAACTA GGAGTGCCCA GAAGAGAACA 1260 

AACCCGAAAA GAGTGTGAGC TAACTAGTTT CCAAAGCGGA GACTTCCGAC TTCCTTACAG 1320 

GATGAGGCTG GGCATTGCCT GGGACAGCCT ATGTAAGGCC ATGTGCCCCT TGCCCTAACA 1380 

A O ACTCACTGCA GTGCTCTTCA TAGACACATC TTGCAGCATT TTTCTTAAGG CTATGCTTCA 1440 

4U GTTTTTCTTT GTAAGCCATC ACAAGCCATA GTGGTAGGTT TGCCCTTTGG TACAGAAGGT 1500 

GAGTTAAAGC TGGTGGAAAA GGCTTATTGC ATTGCATTCA GAGTAACCTG TGTGCATACT 1560 

CTAGAAGAGT AGGGAAAATA ATGCTTGTTA CAATTCGACC TAATATGTGC ATTGTAAAAT 1620 

AAATGCCATA TTTCAAACAA AACACGTAAT TTTTTTACAG TATGTTTTAT TACCTTTTGA 1680 

TATCTGTTGT TGCAATGTTA GTGATGTTTT AAAATGTGAT GAAAATATAA TGTTTTTAAG 1740 

4 J AAGGAACAGT AGTGGAATGA ATGTTAAAAG ATCTTTATGT GTTTATGGTC TGCAGAAGGA 1800 

TTTTTGTGAT GAAAGGGGAT TTTTTGAAAA ATTAGAGAAG TAGCATATGG AAAATTATAA 1860 

TGTGTTTTTT TACCAATGAC TTCAGTTTCT GTTTTTAGCT AGAAACTTAA AAACAAAAAT 1920 

AATAATAAAG AAAAATAAAT AAAAAGGAGA GGCAGACAAT GTCTGGATTC CTGTTTTTTG 1980 

GTTACCTGAT TTCCATGATC ATGATGCTTC TTGTCAACAC CCTCTTAAGC AGCACCAGAA 2040 

J" ACAGTGAGTT TGTCTGTACC ATTAGGAGTT AGGTACTAAT TAGTTGGCTA ATGCTCAAGT 2100 

ATTTTATACC CACAAGAGAG GTATGTCACT CATCTTACTT OCCAGGACAT CCACCCTGAG 2160 

AATAATTTGA CAAGCTTAAA AATGGCCTTC ATGTGAGTGC CAAATTTTGT TTTTCTTCAT 2220 

TTAAATATTT TCTTTGCCTA AATACATGTG AGAGGAGTTA AATATAAATG TACAGAGAGG 2280 

AAAGTTGAGT TCCACCTCTG AAATGAGAAT TACTTGACAG TTGGGATACT TTAATCAGAA 2340 

D J AAAAAGAACT TATTTGCAGC ATTTTATCAA CAAATTTCAT AATTGTGGAC AATTGGAGGC 2400 

ATTTATTTTA AAAAACAATT TTATTGGCCT TTTGCTAACA CAGTAAGCAT GTATTTTATA 2460 

AGGCATTCAA TAAATGCACA ACGCCCAAAG GAAATAAAAT CCTATCTAAT CCTACTCTCC 2520 

ACTACACAGA GGTAATCACT ATTAGTATTT TGGCATATTA TTCTCCAGGT GTTTGCTTAT 2580 

<ft GCACTTATAA AATGATTTGA ACAAATAAAA CTAGGAACCT GTATACATGT GTTTCATAAC 2640 

OU CTGOCTOCTT TGCTTGGCCC TTTATTGAGA TAAGTTTTCC TGTCAAGAAA GCAGAAACCA 2700 

TCTCATTTCT AACAGCTGTG TTATATTCCA TAGTATGCAT TACTCAACAA ACTGTTGTGC 2760 

TATTGGATAC TTAGGTGGTT TCTTCACTGA CAATACTGAA TAAACATCTC ACCGGAATTC 2820 

- Seq ID NO: CI 5 DNA Sequence 

Oj Nucleic Acid Accession #: NM_005940 
Coding sequence: 23.. 1489 

1 11 21 31 41 51 

7n I i I I I I 

fVJ AAGCCCAGCA GCCCCGGGGC GGATGGCTCC GGCCGCCTGG CTCCGCAGCG CGGCCGCGCG 60 

CGCCCTCCTG CCCCCGATGC TGCTGCTGCT GCTCCAGCCG CCGCCGCTGC TGGCCCGGGC 120 

TCTGCCGCCG GACGTCCACC ACCTCCATGC CGAGAGGAGG GGGCCACAGC CCTGGCATGC 180 

AGCCCTGCCC AGTAGCCCGG CACCTGCCCC TGCCACGCAG GAAGCCCCCC GGCCTGCCAG 240 

CAGCCTCAGG CCTCCCCGCT GTGGCGTGCC CGACCCATCT GATGGGCTGA GTGCCCGCAA 300 

O CCGACAGAAG AGGTTCGTGC TTTCTGGCGG GCGCTGGGAG AAGACGGACC TCACCTACAG 360 

GATCCTTCGG TTCCCATGGC AGTTGGTGCA GGAGCAGGTG CGGCAGACGA TGGCAGAGGC 420 

CCTAAAGGTA TGGAGCGATG TGACGCCACT CACCTTTACT GAGGTGCACG AGGGCCGTGC 480 

TGACATCATG ATCGACTTCG CCAGGTACTG GCATGGGGAC GACCTGCCGT TTGATGGGCC S40 

cn TGGGGGCATC CTGGCCCATG CCTTCTTCCC CAAGACTCAC CGAGAAGGGG ATGTCCACTT 600 

OU CGACTATGAT GAGACCTGGA CTATCGGGGA TGACCAGGGC ACAGACCTGC TGCAGGTGGC 660 

AGCCCATGAA TTTGGCCACG TGCTGGGGCT GCAGCACACA ACAGCAGCCA AGGCCCPGAT 720 

GTCCGCCTTC TACACCTTTC GCTACCCACT GAGTCTCAGC CCAGATGACT GCAGGGGCGT 780 

TCAACACCTA TATGGCCAGC CCTGGCCCAC TGTCACCTCC AGGACCCCAG CCCTGGGCCC 840 

• CCAGGCTGGG ATAGACACCA ATGAGATTGC ACCGCTGGAG CCAGACGCCC CGCCAGATGC 900 
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CTGTGAGGCC TCCTTTGACG CGGTCTCCAC CATCCGAGGC GAGCTCTTTT TCTTCAAAGC 960 

GGGCTTTGTG TGGCGCCTCC GTGGGGGCCA GCTGCAGCCC GGCTACCCAG CATTGGCCTC 1020 

TCGCCACTGG CAGGGACTGC CCAGCCCTGT GGAGGCTGCC TTCGAGGATG CCCAGGGCCA 1060 

CATTTGGTTC TTCCAAGGTG CTCAGTACTG GGTGTACGAC GGTGAAAAGC CAGTCCTGGG 1140 

CCCCGCACCC CTCACCGAGC TGGGCCTGGT GAGGTTCCCG GTCCATGCTG CCTTGGTCTG 1200 

GGGTCCCGAG AAGAACAAGA TCTACTTCTT CCGAGGCAGG GACTACTGGC GTTTCCACCC 1260 

CAGCACCCGG CGTGTAGACA GTCCCGTGCC CCGCAGGGCC ACTGACTGGA GAGGGGTGCC 1320 

CTCTGAGATC GACGCTGCCT TCCAGGATGC TGATGGCTAT GCCTACTTCC TGCGCGGCCG 1380 

CCTCTACTGG AAGTTTGACC CTGTGAAGGT GAAGGCTCTG GAAGGCTTCC CCCGTCTCGT 1440 

1U GGGTCCTGAC TTCTTTGGCT GTGCCGAGCC TGCCAACACT TTCCTCTGAC CATGGCTTGG 1SO0 

ATGCCCTCAG GGGTGCTGAC CCCTGCCAGG CCACGAATAT CAGGCTAGAG ACCCATGGCC 1560 

ATCTTTGTGG CTGTGGGCAC CAGGCATGGG ACTGAGCCCA TGTCTCCTGC AGGGGGATGG 1620 

GGTGGGGTAC AACCACCATG ACAACTGCCG GGAGGGCCAC GCAGGTCGTG GTCACCTGCC 1680 

AGCGACTGTC TCAGACTGGG CAGGGAGGCT TTGGCATGAC TTAAGAGGAA GGGCAGTCTT 1740 

GGGACCCGCT ATGCAGGTCC TGGCAAACCT GGCTGCCCTG TCTCATCCCT GTCCCTCAGG 1800 

GTAGCACCAT GGCAGGACTG GGGGAACTGG AGTGTCCTTG CTGTATCCCT GTTGTGAGGT 1860 

TCCTTCCAGG GGCTGGCACT GAAGCAAGGG TGCTGGGGCC CCATGGCCTT CAGCCCTGGC 1920 

TGAGCAACTG GGCTGTAGGG CAGGGCCACT TCCTGAGGTC AGGTCTTGGT AGGTGCCTGC 1980 

ATCTGTCTGC CTTCTGGCTG ACAATCCTGG AAATCTGTTC TCCAGAATCC AGGCCAAAAA 2040 

GTTCACAGTC AAATGGGGAG GGGTATTCTT CATGCAGGAG ACCCCAGGCC CTGGAGGCTG 2100 

CAACATACCT CAATCCTGTC CCAGGCCGGA TCCTCCTGAA GCCCTTTTCG CAGCACTGCT 2160 

ATCCTCCAAA GCCATTGTAA ATGTGTGTAC AGTGTGTATA AACCTTCTTC TTCTTTTTTT 2220 

TTTTTAAACT GAGGATTGTC ATTAAACACA GTTGTTTTCT 2260 

25 Seq ID NO: CI 6 DNA Sequence 

Nucleic Acid Accession #: NM_024022 
Coding sequence: 202.. 1563 

„ 1 11 21 31 41 51 

30 I | | II | 

ACCGGGCACC GGACGGCTCG GGTACTTTCG TTCTTAATTA GGTCATGCCC GTGTGAGCCA 60 

GGAAAGGGCT GTGTTTATGG GAAGCCAGTA ACACTGTGGC CTACTATCTC TTCCGTGGTG 120 

CCATCTACAT TTTTGGGACT CGGGAATTAT GAGGTAGAGG TGGAGGCGGA GCCGGATGTC 180 

AGAGGTCCTG AAATAGTCAC CATGGGGGAA AATGATCCGC CTGCTGTTGA AGCCCCCTTC 240 

TCATTCCGAT CGCTTTTTGG CCTTGATGAT TTGAAAATAA GTCCTGTTGC ACCAGATGCA 300 

GATGCTGTTG CTGCACAGAT CCTGTCACTG CTGCCATTGA AGTTTTTTCC AATCATCGTC 360 

ATTGGGATCA TTGCATTGAT ATTAGCACTG GCCATTGGTC TGGGCATCCA CTTCGACTGC 420 

TCAGGGAAGT ACAGATGTCG CTCATCCTTT AAGTGTATCG AGCTGATAGC TCGATGTGAC 480 

GGAGTCTCGG ATTGCAAAGA CGGGGAGGAC GAGTACCGCT GTGTCCGGGT GGGTGGTCAG 540 

AATGCCGTGC TCCAGGTGTT CACAGCTGCT TCGTGGAAGA CCATGTGCTC CGATGACTGG 600 

AAGGGTCACT ACGCAAATGT TGCCTGTGCC CAACTGGGTT TCCCAAGCTA TGTGAGTTCA 660 

GATAACCTCA GAGTGAGCTC GCTGGAGGGG CAGTTCCGGG AGGAGTTTGT GTCCATCGAT 720 

CACCTCTTGC CAGATGACAA GGTGACTGCA TTACACCACT CAGTATATGT GAGGGAGGGA 7 BO 

TGTGCCTCTG GCCACGTGGT TACCTTGCAG TGCACAGCCT GTGGTCATAG AAGGGGCTAC 840 

AGCTCACGCA TCGTGGGTGG AAACATGTCC TTGCTCTCGC AGTGGCCCTG GCAGGCCAGC 900 

CTTCAGTTCC AGGGCTACCA CCTGTGCGGG GGCTCTGTCA TCACGCCCCT GTGGATCATC 960 

ACT GCTG CAC ACTGTGTTTA TGACTTGTAC CTCCCCAAGT CATGGACCAT CCAGGTGGGT 1020 

CTAGTTTCCC TGTTGGACAA TCCAGCCCCA TCCCACTTGG TGGAGAAGAT TGTCTACCAC 1080 

AGCAAGTACA AGCCAAAGAG GCTGGGCAAT GACATCGCCC TTATGAAGCT GGCCGGGCCA 1140 

CTCACGTTCA ATGAAATGAT CCAGCCTGTG TGCCTGCCCA ACTCTGAAGA GAACTTCCCC 1200 

GATGGAAAAG TGTGCTGGAC GTCAGGATGG GGGGCCACAG AGGATGGAGG TGACGCCTCC 1260 

CCTGTCCTGA ACCACGCGGC CGTCCCTTTG ATTTCCAACA AGATCTGCAA CCACAGGGAC 1320 

GTGTACGGTG GCATCATCTC CCCCTCCATG CTCTGCGCGG GCTACCTGAC GGGTGGCGTG 1380 

GACAGCTGCC AGGGGGACAG CGGGGGGCCC CTGGTGTGTC AAGAGAGGAG GCTGTGGAAG 1440 

TTAGTGGGAG CGACCAGCTT TGGCATCGGC TGCGCAGAGG TGAACAAGCC TGGGGTGTAC 1500 

ACCCGTGTCA CCTCCTTCCT GGACTGGATC CACGAGCAGA TGGAGAGAGA CCTAAAAACC 1560 

TGAAGAGGAA GGGGACAAGT AGCCACCTGA GTTCCTGAGG TGATGAAGAC AGCCCGATCC 1620 

TCCCCTGGAC TCCCGTGTAG GAACCTGCAC ACGAGCAGAC ACCCTTGGAG CTCTGAGTTC 1680 

CGGCACCAGT AGCAGGCCCG AAAGAGGCAC CCTTCCATCT GATTCCAGCA CAACCTTCAA 1740 

OU GCTGCTTTTT GTTTTTTGTT TTTTTGAGGT GGAGTCTCGC TCTGTTGCCC AGGCTGGAGT 1800 

GCAGTGGCGA AATCCCTGCT CACTGCAGCC TCCGCTTCCC TGGTTCAAGC GATTCTCTTG 1860 

C CTCAGCT TC CCCAGTAGCT GGGACCACAG GTGCCCGCCA CCACACCCAA CTAATTTTTG 1920 

TATTTTTAGT AGAGACAGGG TTTCACCATG TTGGCCAGGC TGCTCTCAAA CCCCTGACCT 1980 

CAAATGATGT GCCTGCTTCA GCCTCCCACA GTGCTGGGAT TACAGGCATG GGCCACCACG 2040 

CCTAG CCTCA CGCTCCTTTC TGATCTTCAC TAAGAACAAA AGAAGCAGCA ACTTGCAAGG 2100 

GCGGCCTTTC CCACTGGTCC ATCTGGTTTT CTCTCCAGGG GTCTTGCAAA ATTCCTGACG 2160 

AGATAAGCAG TTATGTGACC TCACGTGCAA AGCCACCAAC AGCCACTCAG AAAAGACGCA 2220 

CCAGCCCAGA AGTGCAGAAC TGCAGTCACT GCACGTTTTC ATCTCTAGGG ACCAGAACCA 2280 

AACCCACCCT TTCTACTTCC AAGACTTATT TTCACATGTG GGGAGGTTAA TCTAGGAATG 2340 

ACTCGTTTAA GGCCTATTTT CATGATTTCT TTGTAGCATT TGGTGCTTGA CGTATTATTG 2400 

TCCTTTGATT CCAAATAATA TGTTTCCTTC CCTCAAAAAA AAAAAAAAAA AAAAAAAAAA 2460 

AAAAA 2465 
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Seq ID NO: CI 7 DNA Sequence 
Nucleic Acid Accession #t NM_00322o 
Coding sequence: 63.. 1376 

1 U 21 31 41 51 

I 1 I I I I 

GAATT CCGGC TCTCTGGGTG AGAGACCGAG AGGGGCATAT CCGTTCACGC CGATCCATGA 60 

AAATGCTTTG GAAATTGACG GATAATATCA AGTACGAGGA CTGCGAGGAC CGTCACGACG 120 

GCACCAGCAA CGGGACGGCA CGGTTGCCCC AGCTGGGCAC TGTAGGTCAA TCTCCCTACA 180 

CGAGCGCCCC GCCGCTGTCC CACACCCCCA ATGCCGACTT CCAGCCCCCA TACTTCCCCC 240 

CACCCTACCA GCCTATCTAC CCCCAGTCGC AAGATCCTTA CTCCCACGTC AACGACCCCT 300 

1208 



WO 03/042661 



PCT/US02/36810 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



ACAGCCTGAA 
GGCAGAGCCA 
GCCTGGATCC 
TCAGCTCAGG 
CGCATGTAGA 
TGTCCCTGTC 

TCCTCAGCTC 
CACCCGAGTG 
ATGGAGGAAG 
GACGTAAAGC 
TAGCCAGGGA 
TTCTCAACCG 
CTACAAAACA 
GGAACTCACG 
ACCTCATCTC 
ACTATCTCAC 
GCCACACGGA 
CTCCTCCCGC 
GTGACAGCTC 
GCCGCCGCCG 
AGTGGGGCAG 
TGTGCCCCCG 



CCCCCTGCAC 
GGAGTCTGGG 
TCGCAGGGAC 
ACTCGGAGAC 
AGACCCGGGT 
CAAGTCCAAC 
CGTGGTGAAC 
CACCTCGAAG 
TCTCAACGCG 
ATCTTTAAGA 
TGCCAACGTT 
CTTTGGGTAC 
ACAACATTCC 
GATATGCAAA 
GCCCAACCCC 
CCACGGCTTC 
CGAGGCCCTC 
CAACAACGCC 
CCCGCCCCTC 
CGGGATCAGC 
CCGCTGCCCT 
CCTCTCCGAC 
GAATTC 



GCCCAGCCGC 
CTCCTGCACA 
TACAGGCGGC 
CTCTCGATCC 
ATTAACATCC 
AGCAATGCCG 
CCCAACGAAG 
TACAAGGTCA 
TCGCTGCTGG 
GAAAAACTGG 
ACCCTGCTCA 
GTGTGCGAAA 
GATCCCAATG 
GAGTTCACCG 
ATCCTGGAGC 
GGCAGCCCCG 
AAGGCCATGG 
AAAAGCAGTG 
CCACGCCTCA 
AACCCTTCCT 
TGGGTCCCCC 
TCTGCACCCG 



AGCCGCAGCA 
CGCACCGGGG 
ACGAGGACCT 
ACTCCTTACC 
CAGATCAAAC 
TCTCCGCCAT 
TCTTCTGTTC 
CGGTGGCGGA 
GCGGAGTGCT 
ACAAAATAGG 
CATCACTAGT 
CCGAATTTCC 
AGCAAGTGAC 
ACCTGCTGGC 
CCGGCATCCA 
CGGTGTGTGC 
ACAAAATGTA 
ACAAAGAGGA 
CCAGCCCCCC 
GCTGCTGCTA 
CGAGTCTCCG 
CCTCGACCTC 



CCCAGGCTGG 
GCTGCCTCAC 
CCTGCACGGC 
TCACGCCATC 
TGTAATTAAG 
CCCTATTAAC 
AGTTCCGGGT 
AGTGCAGCGG 
CCGGAGGGCG 
ATTAAATCTG 
AGAGGGAGAA 
TGCCAAAGCA 
AAGAAAAAAC 
TCAGGACCGA 
GAGCTGCTTG 
CGCGGTCACG 
CCTCAGCAAC 
GAAGCACAGA 
GCGCGCCCAC 
CTGCTGCTGC 
GGACTGCCCT 
CCCACCCGCT 



CCCGGCCAGA 
CAGCTGTCGG 
CCACACGCGC 
GAGGAGGTCC 
AAAGGCCCCG 
AAGGACAACC 
CGCCTCTCGC 
CGGCTCTCAC 
AAGTCTAAAA 
CCTGCAGGGA 
GCTGTCCACC 
GTAGCTGAAT 
ATGCTCCTGG 
TCTCCCCTGG 
ACCCACTTCA 
GCCCTGCAGA 
AACCCCAACA 
AAGTGAGGCT 
CCTCCGGCGG 
TGCTGCCGCC 
CTCGACTGTC 
CCCACACCCC 



Seq ID NO: CI 8 DNA Sequence 
Nucleic Acid Accession #: NM_002988 
Coding sequence : 71 . . 340 



1 
I 

CCGGCACGAG 
GCCCAGCATC 
CTGCTCCTGT 
GCAGATTCCA 
AGGTGTCATC 
GGTCCAGAAA 
GCCCAGTGAA 
CCACCCTGGA 
CTTTAATCTT 
ATTATGAAAT 
TTCCCTTCAA 
CAGACATTGT 
ACCTTTTAAT 
AAAAAAAAAA 




GGCCACCTCT 
AGTTTATGCA 
TAGTATTTTC 
CTCTTOGTAC 
GCCATATGTA 
ATATTGGCAG 
AAAAAAAAAA 



21 
1 

AGTTTCCAAG 
TTGCAGCTGC 
GTACCAACAA 
TAGTTGACTA 
AGAGAGGCCG 
ACCTGAAGCT 
CCAGGAGGGA 
TCTAAGAGTC 
TCATATTTCA 
TCTGACATCT 
ATTCAATGCA 
TCAAATGACA 
TACTTATTAT 
AAA 



31 
I 

CCCCAGCTCA 
CCTCCTTGTC 
AGAGCTCTGC 
TTCTGAAACC 
GCAGATCTGT 
GAATGCCTGA 
ACAGGAGCCT 
CCATCTGCTA 
TTTTGAAATT 
CATGACATTG 
TGGATCAATC 
AATCTTTATT 
ATAAAAGGTA 



41 
I 

CTCTGACCAC 
CTCGTCTGCA 
TGCCTCGTCT 
AGCCCCCAGT 
GCTGACCCCA 
GGGGCCTGGA 
GAGCCAGGGC 
TGCCCAGCCA 
GATTTCTATT 
TCTTTATCAT 
AGTGTGATTA 
GAATGGTTTT 
AACCAGCATT 



51 
I 

CCATGGCCCT 
ATACCTCCTG 
GCCCCAAGCC 
ATAAGAAGTG 
AGCTGCGAGG 
AATGGCCCTG 
CATTAACTAA 
GTTGAGCTGC 

GCTTTCTCAG 
GCTCAGCACC 
CTCACTGTGA 



Seq ID NO j C19 DNA Sequence 
Nucleic Acid Accession #: NM_004063 
Coding sequence: 121.. 2619 



1 
I 

AGGGAGTGTT 
GAAAAGGACT 
ATGATACTTC 
TATGGCCAAG 
GGCCAAGAAC 
GAACTAACTG 
AACAGAGCCT 
GCTAATGGAA 
GACAATCGAC 
CCAGGAAAGC 
GGCCAGCTTT 
CAGATCAACA 
CCTGCTAAGA 
GAGAATTCCT 
GCACCAAAAC 
CAGGTGCGGT 
AGATTCCCAT 
GAAGAAAAGG 
TCATATCCGC 
CCGTCACCAG 
ACCCTTACTG 
ATTGTGGAGC 
GGAATGTTAC 
ACGATAGAGG 
GATATCAATG 
GAAGACACAA 
TTTACTGGGA 
GGGGTTGACA 
TTTGAAACAG 
TTTGGTGTGA 
AATGAAGCAC 
ATAGGCACTA 
TCACTGAGGG 



11 
I 

CCCGGGGGAG 
TTTAACCACC 
AGGCCCATCT 
AGGGGAAGTT 
OGAGTCAAAT 
GGGAGACAGA 
TGGACAGGGA 
TTATAGTGGA 
CCACGTTTCT 
CCTTCTTGTA 
ATTACCAGAT 
ACAAAACGGG 
ATCCTTCCTA 
TCAGTGATAC 
CTGTGGAGAT 
GGAATGATCC 
TTTCAATTGA 
ATGCATATGT 
TGGAAATTCA 
TAACCGTATT 
CACATGACAG 
AAACTCCCAA 
AGTTAGCTAA 
TGTCTGACAA 
ATCAGATCCC 
ACATTGGGTC 
GTTCTAAAAT 
CAGATCCCCA 
CAGCTGTTTC 
AGTACAATGC 
CTCAATTTTC 
AAGTGGGCAA 
GAGACACAAG 



21 
1 

ATACTCCAGT 
ATTTTGTGAC 
TCACTCCCTG 
TAGTGGACCC 
TATATTCCAG 
CAACATATTT 
AACAAGATCT 
GGGTCCAGTC 
CCAGTCAAAG 
TGTCAATGCC 
TGTCATCCAG 
AGCCATCTCT 
TAATCTGGTG 
CACATCTGTG 
GGTGGAAAAC 
OGGTGCACAA 
CCAGGAAGGA 
TTTTTATGCA 
TGTAAAAGTT 
TGAGGXCCAG 
GGATGAAGAA 
ACTTCCCATG 
ACAGTCCTTG 
AGATTTCAAG 
CATCTTTGAA 
CACCATCTTA 
TCTGTATCAT 
TACCAACACC 
CAACATTGTG 
AAGTTCTTTT 
CCAACACGTA 
TGTGACTGCC 
AGGTTGGCTT 



31 
I 

CGTAGCAAGA 
TTACAGAAAG 
TGTCTTCTTA 
CTGAAACCCA 
TTTAAGGCCA 
GTGATAGAAC 
ACTCACAATC 
CCTATCACCA 
TACGAAGGCT 
ACAGACCTGG 
CTTCCCATGA 
CTTACCCGAG 
ATCTCAGTGA 
GATATCATAG 
TCAACTGATC 
TATTCCTTAG 
GATATTTACG 
GTTGCAAAGG 
AAAGATATTA 
GAGAATGAAC 
AATACTGCCA 
GATGGACTCT 
AAGAAGCAAG 
ACCCTTTGTT 
AAATCAGATT 
ACCATCCAGG 
ATCATAAAGG 
GGATATGTCA 
TTCAAAGCAG 
GCCAAGTTCA 
TTCCAAGCGA 
AAGGATCCAG 
AAAATTGACC 



41 
I 

GTCTCGACCA 
GAATTTGAAT 
TGCTTTATTT 
TGACATTTTC 
ATCCTCCTGC 
GGGAGGGACT 
TCCAGGTTGC 
TAGAAGTGAA 
CAGTAAGGCA 
ATGATCGGGC 
TCAACAATGT 
AGGGATCTCA 
AGGACATGGG 
TGACAGAGAA 
CTCACCCCAT 
TTGACAAAGA 
TGACTCAGCC 
ATGAGTACGG 
ATGATAATCC 
GACTGGGTAA 
ACAGTTTTCT 
TCCTAATCCA 
ATACTCCTCA 
TTGTGCAAAT 
ATGGAAACCT 
CCACTGATGC 
GAGACAGTGA 
TAATTAAAAA 
AAAATCCTGA 
CGCTTATTGT 
AAGTCAGTGA 
AAGGTCTGGA 
ACGTGACTGG 



360 

420 

480 

540 

600 

660 

720 

780 

B40 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1360 

1440 

1500 

1560 

1620 

1636 



51 
I 

CTGAATGGAA 
AAAGAAAACT 
GGCAACTGGA 
TATTTATGAA 
TGTGACTTTT 
TCTGTATTAC 
AGCCCTGGAC 
GGACATCAAC 
GAACTCTCGC 
CACTCCCAAT 
CATGTACTTT 
GGAATTGAAT 
AGGCCAGAGT 
TATTTGGAAA 
CAAAATCACT 
GAAGCTGCCA 
CTTGGACCGA 
AAAACCACTT 
ACCTACATGT 
CAGTATCGGG 
AAACTACAGG 
AACCTATGCT 
GTACAACTTA 
CAACGTTATT 
GACTCTTGCT 
TGATGAGCCA 
GGGACGCCTG 
GCCTCTTGAT 
GCCTCTAGTG 
GACAGATGTG 
GGATGTAGCT 
CATAAGCTAT 
TGAGATCTTT 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

803 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

640 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

I960 



1209 



WO 03/042661 



PCT/US02/36810 



10 



AGTGTGGCTC CATTGGACAG AGAAGCCGOA AGTCCATATC GGGTACAAGT GGTGGCCACA 2040 

GAAGTAGGGG GGTCTTCCTT GAGCTCTGTG TCAGAGTTCC ACCTGATCCT TATGGATGTG 2100 

AATGACAACC CTCCCAGGCT AGCCAAGGAC TACACGGGCT TGTTCTTCTG CCATCCCCTC 2160 

AGTGCACCTG GAAGTCTCAT TTTCGAGGCT ACTGATGATG ATCAGCACTT ATTTCGGGGT 2220 

CCCCATTTTA CATTTTCCCT CGGCAGTGGA AGCTTACAAA ACGACTGGGA AGTTTCCAAA 2280 

ATCAATGGTA CTCATGCCCG ACTGTCTACC AGGCACACAG AGTTTGAGGA GAGGGAGTAT 2340 

GTCGTCTTGA TCCGCATCAA TGATGGGGGT CGGCCACCCT TGGAAGGCAT TGTTTCTTTA 2400 

CCAGTTACAT TCTGCAGTTG TGTGGAAGGA AGTTGTTTCC GGCCAGCAGG TCACCAGACT 2460 

GGGATACCCA CTGTGGGCAT GGCAGTTGGT ATACTGCTGA CCACCCTTCT GGTGATTGGT 2520 

ATAATTTTAG CAGTTGTGTT TATCCGCATA AAGAAGGATA AAGGCAAAGA TAATGTTGAA 2580 

AGTGCTCAAG CATCTGAAGT CAAACCTCTG AGAAGCTGAA TTTGAAAAGG AATGTTTGAA 2640 

TTTATATAGC AAGTGCTATT TCAGCAACAA CCATCTCATC CTATTACTTT TCATCTAACG 2700 

TGCATTATAA TTTTTTAAAC AGATATTCCC TCTTGTCCTT TAATATTTGC TAAATATTTC 2760 

TTTTTTGAGG TGGAGTCTTG CTCTGTCGCC CAGGCTGGAG TACAGTGGTG TGATCCCAGC 2820 

ID TCACTGCAAC CTCCGCCTCC TGGGTTCACA TGATTCTCCT GCCTCAGCTT CCTAAGTAGC 2880 

TGGGTTTACA GGCACCCACC ACCATGCCCA GCTAATTTTT GTATTTTTAA TAGAGAGGGG 2940 

GTTTCGCCAT TTGGCCAGGC TGGTCTTGAA CTCCTGACGT CAAGTGATCT GCCTGCCTTG 3000 

GTCTCCCAAT ACAGGCATGA ACCACTGCAC CCACCTACTT AGATATTTCA TGTGCTATAG 3060 

ACATTAGAGA GATTTTTCAT TTTTCCATGA CATTTTTCCT CTCTGCAAAT GGCTTAGCTA 3120 

CTTGTGTTTT TCCCTTTTGG GGCAAGACAG ACTCATTAAA TATTCTGTAC ATTTTTTCTT 3180 

TATCAAGGAG ATATATCAGT GTTGTCTCAT AGAACTGCCT GGATTCCATT TATGTTTTTT 3240 

CTGATTCCAT CCTGTGTCCC CTTCATCCTT GACTCCTTTG GTATTTCACT GAATTTCAAA 3300 

CATTTGTCAG AGAAGAAAAA CGTGAGGACT CAGGAAAAAT AAATAAATAA AAGAACAGCC 3360 

TTTTCCCTTA GTATTAACAG AAATGTTTCT GTGTCATTAA CCATCTTTAA TCAATGTGAC 3420 

ATGTTGCTCT TTGGCTGAAA TTCTTCAACT TGGAAATGAC ACAGACCCAC AGAAGGTGTT 3480 

CAAACACAAC CTACTCTGCA AACCTTGGTA AAGGAACCAG TCAGCTGGCC AGATTTCCTC 3540 

ACTACCTGCC ATGCATACAT GCTGCGCATG TTTTCTTCAT TCGTATGTTA GTAAAGTTTT 3600 

GGTTATTATA TATTTAACAT GTGGAAGAAA ACAAGACATG AAAAGAGTGG TGACAAATCA 3660 

AGAATAAACA CTGGTTGTAG TCAGTTTTGT TTGTTAA 3697 
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Seq ID NO: C20 DKA Sequence 
Nucleic Acid Accession »: NM_004443 
Coding sequence: 28.. 3024 

1 11 21 31 , 41 SI 

I I I I I I . 

GGCTCGGCTC CTAGAGCTGC CACGGCCATG GCCAGAGCCC GCCCGCCGCC GCOGCCGTCG 60 

CCGCCGCCGG GGCTTCTGCC GCTGCTCCCT CCGCTGCTGC TGCTGCCGCT GCTGCTGCTG 120 

CCCGCCGGCT GCCX3GGCGCT GGAAGAGACC CTCATGGACA CAAAATGGGT AACATCTGAG 180 

TTGGCGTGGA CATCTCATCC AGAAAGTGGG TGGGAAGAGG TGAGTGGCTA CGATGAGGCC 240 

ATGAATCCCA TCCGCACATA CCAGGTGTGT AATGTGCGCG AGTCAAGCCA GAACAACTGG 300 

CTTCGCACGG GGTTCATCTG GCGGCGGGAT GTGCAGCGGG TCTACGTGGA GCTCAAGTTC , 360 

ACTGTGCGTG ACTGCAACAG CATCCCCAAC ATCCCCGGCT CCTGCAAGGA GACCTTCAAC 420 

CTCTTCTACT ACGAGGCTGA CAGCGATGTG GCCTCAGCCT CCTCCCCCTT CTGGATGGAG 480 

AACCCCTACG TGAAAGTGGA CAOCATTGCA CCCGATGAGA GCTTCTCGCG GCTGGATGCC 540 

GGCCGTGTCA ACACCAAGGT GCGCAGCTTT GGGCCACTTT CCAAGGCTGG CTTCTACCTG 600 

GCCTTCCAGG ACCAGGGCGC CTGCATGTCG CTCATCTCCG TGCGCGCCTT CTACAAGAAG 660 

TGTGCATCCA CCACCGCAGG CTTOGCACTC TTCCCCGAGA CCCTCACTGG GGCGGAGCCC 720 

ACCTCGCTGG TCATTGCTCC TGGCACCTGC ATCCCTAACG CCGTGGAGGT GTCGGTGCCA 780 

CTCAAGCTCT ACTGCAACGG CGATGGGGAG TGGATGGTGC CTGTGGGTGC CTGCACCTGT 840 

GCCACCGGCC ATGAGOCAGC TGCCAAGGAG TCCCAGTGCC GCCCCTGTCC CCCTGGGAGC 900 

TACAAGGCGA AGCAGGGAGA GGGGCCCTGC CTCCCATGTC CCCCCAACAG CCGTACCACC .960 

TCCCCAGCCG CCAGCATCTG CACCTGCCAC AATAACTTCT ACCGTGCAGA CTCGGACTCT 1020 

GCGGACAGTG CCTGTACCAC CGTGCCATCT CCACCCCGAG GTGTGATCTC CAATGTGAAT 1080 

GAAACCTCAC TGATCCTCGA GTGGAGTGAG CCCCGGGACC TGGGTGGCCG GGATGACCTC 1140 

CTGTACAATG TCATCTGCAA GAAGTGCCAT GGGGCTGGAG GGGCCTCAGC CTGCTCAOGC 1200 

TGTGATGACA ACGTGGAGTT TGTGCCTCGG CAGCTGGGCC TGACGGAGCG CCGGGTCCAC 1260 

ATCAGCCATC TGCTGGCCCA CACGCGCTAC ACCTTTGAGG TGCAGGCGGT CAACGGTGTC 1320 

TOGGGCAAGA GCCCTCTGCC GOCTCGTTAT GCGGCCGTGA ATATCACCAC AAACCAGGCT 1380 

GCCCCGTCTG AAGTGCCCAC ACTACGCCTG CACAGCAGCT CAGGCAGCAG CCTCACCCTA 1440 

TCCTGGGCAC CCCCAGAGCG GCCCAACGGA GTCATCCTGG ACTACGAGAT GAAGTACTTT 1500 

GAGAAGAGCG AGGGCATCGC CTCCACAGTG ACCAGCCAGA TGAACTCCGT GCAGCTGGAC 1560 

GGGCTTCGGC CTGACGCCCG CTATGTGGTC CAGGTCCGTG CCCGCACAGT AGCTGGCTAT 1620 

GGGCAGTACA GCCGCCCTGC CGAGTTTGAG ACCACAAGTG AGAGAGGCTC TGGGGCCCAG 1680 

CAGCTCCAGG AGCAGCTTCC CCTCATCGTG GGCTCCGCTA CAGCTGGGCT TGTCTTCGTQ 1740 

GTGGCTGTCG TGGTCATCGC TATCGTCTGC CTCAGGAAGC AGCGACACGG CTCTGATTCG 1800 

GAGTACACGG AGAAGCTGCA GCAGTACATT GCTCCTGGAA TGAAGGTTTA TATTGACCCT 1860 

TTTACCTACG AGGACCCTAA TGAGGCTGTT CGGGAGTTTG GCAAGGAGAT CGACGTGTCC 1920 

TGCGTCAAGA TCGAGGAGGT GATCGGAGCT GGGGAATTTG GGGAAGTGTG CCGTGGTCGA 1980 

CTGAAACAGC CTGGCCGCCG AGAGGTGTTT GTGGCCATCA AGACGCTGAA GGTGGGCTAC 2040 

ACCGAGAGGC AGCGGCGGGA CTTCCTAAGC GAGGCCTCCA TCATGGGTCA GTTTGATCAC 210O 

CCCAATATAA TCCGGCTCGA GGGCGTGGTC ACCAAAAGTC GGCCAGTTAT GATCCTCACT 2160 

GAGTTCATGG AAAACTGCGC CCTGGACTCC TTCCTCCGGC TCAACGATGG GCAGTTCACG 2220 

GTCATCCAGC TGGTGGGCAT GTTGCGGGGC ATTGCTGCCG GCATGAAGTA CCTQTCCGAG 2280 

ATGAACTATG TGCACCGCGA CCTGGCTGCT CGCAACATCC TTGTCAACAG CAACCTGGTC 2340 

TGCAAAGTCT CAGACTTTGG CCTCTCCCGC TTCCTGGAGG ATGACCCCTC CGATCCTACC 2400 

TACACCAGTT CCCTGGGCGG GAAGATCCCC ATCCGCTGGA CTGCCCCAGA GGCCATAGCC 2460 

TATCGGAAGT TCACTTCTGC TAGTGATGTC TGGAGCTACG GAATTGTCAT GTGGGAGGTC 2520 

ATGAGCTATG GAGAGCGACC CTACTGGGAC ATGAGCAACC AGGATGTCAT CAATGCCGTG 2580 

GAGCAGGATT ACCGGCTGCC ACCACCCATG GACTGTCCCA CAGCACTGCA CCAGCTCATG 2640 

CTGGACTGCT GGGTGCGGGA CCGGAACCTC AGGCCCAAAT TCTCCCAGAT TGTCAATACC 2700 

CTGGACAAGC TCATCCGCAA TGCTGCCAGC CTCAAGGTCA TTGCCAGCGC TCAGTCTGGC 2760 

ATGTCACAGC CCCTCCTGGA CCGCAOGGTC CCAGATTACA CAACCTTCAC GACAGTTGGT 2820 
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GATTGGCTGG ATGCCATCAA GATGGGGCGG TACAAGGAGA GCTTCGTCAG TGCGGGGTTT 2880 

GCATCTTTTG ACCTGGTGGC CCAGATGAOG GCAGAAGACC TGCTCGGTAT TGGGGTCACC 2940 

CTGGCCGGCC ACCAGAAGAA GATCCTGAGC AGTATCCAGG ACATGCGGCT GCAGATGAAC 3000 

CAGAOGCTGC CTGTGCAGGT CTGACACCGG CTCCCACGGG GACCCTGAGG ACCGTGCAGG 3060 

J GATGCCAAGC AGCCGGCTGG ACTTTCGGAC TCTTGGACTT TTGGATGCCT GGCCTTAGGC 3120 

TGTGGCCCAG AAGCTGGAAG TTTGGGAAAG GCCCAAGCTG GGACTTCTCC AGGCCTGTGT 3180 

TCCCTCCCCA GGAAGTGCGC CCCAAACCTC TTCATATTGA AGATGGATTA GGAGAGGGGG 3240 

TGATGACCCC TCCCCAAGCC CCTCAGGGCC CAGACCTTCC TGCTCTCCAG CAGGGGATCC 3300 

CCACAACCTC ACACTTGTCT GTTCTTCAGT GCTGGAGGTC CTGGCAGGGT CAGGCTGGGG 3360 

1U TAAGCCGGGG TTCCACAGGG CCCAGCCCTG GCAGGGGTCT GGCCCCCCAG GTAGGCGGAG 3420 

AGCAGTCCCT CCCTCAGGAA CTGGAGGAGG GGACTCCAGG AATGGGGAAA TGTGACACCA 3480 

CCATCCTGAA GCCAGCTTGC ACCTCCAGTT TGCACAGGGA TTTGTCCTGG GGGCTGAGGG 3540 

CCCTGTCCCC ACCCCCGCCC TTGGTGCTGT CATAAAAGGG CAGGCAGGGG CAGGCTGAGG 3600 

AGTTGCCCTT TGCCCCCCAG AGACTGACTC TCAGAGCCAG AGATGGGATG TGTGAGTGTG 3660 

lj TGTGTGTGTG TGTGCGCGCG CGCGCGCGTG TGTGTGTGCA CGCACTGGCC TGCACAGAGA 3720 

GCATGGGTGA GCGTGTAAAA GCTTGGCCCT GTGCCCTACA ATGGGGCCAG CTGGGCCGAC 3780 

AGCAGAATAA AGGCAATAAG ATGAA 3805 
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Seq ID NO: C21 DNA Sequence 
Nucleic Acid Accession #: NM_001804 
Coding sequence: 82.-879 



or 1 11 21 31 41 51 

25 | | | | | | 

AGGTGAGCGG TTGCTCGTCG TCGGGGCGGC CGGCAGCGGC GGCTCCAGGG CCCAGCATGC 60 

GCGGGGGACC CCGCGGCCAC CATGTATGTG GGCTATGTGC TGGACAAGGA TTCGCCCGTG 120 

TACCCCGGCC CAGCCAGGCC AGCCAGCCTC GGCCTGGGCC CGGCAAACTA CGGCCCCCCG 180 

GCCCCGCCCC CX3GCGCCCCC GCAGTACCCC GACTTCTCCA GCTACTCTCA CGTGGAGCOG 240 

GCCCCCGCGC CCCCGACGGC CTGGGGGGCG CCCTTCCCTG CGCCCAAGGA CGACTGGGCC 300 

GCCGCCTACG GCCCGGGCCC CGCGGCCCCT GCCGCCAGCC CAGCTTCGCT GGCATTCGGG 360 

CCCCCTCCAG ACTTTAGCCC GGTGCCGGCG CCCCCTGGGC CCGGCCCGGG CCTCCTGGCG 420 

CAGCCCCTCG GGGGCCCGGG CACACCGTCC TCGCCCGGAG CGCAGAGGCC GACGCCCTAC 480 

GAGTGGATGC GGCGCAGCGT GGCGGCCGGA GGCGGCGGTG GCAGOGGTAA GACTCGGACC 540 

AAGGACAAGT ACCGCGTGGT CTACACCGAC CACCAACGCC TGGAGCTGGA GAAGGAGTTT 600 

CATTACAGCC GTTACATCAC AATCCGGCGG AAATCAGAGC TGGCTGCCAA TCTGGGGCTC 660 

ACTGAACGGC AGGTGAAGAT CTGGTTCCAA AACCGGCGGG CAAAGGAGCG CAAAGTGAAC 720 

AAGAAGAAAC AGCAGCAGCA ACAGCCCCCA CAGCCGCCGA TGGCCCACGA CATCACGGCC 780 

ACCCCAGCCG GGCCATCCCT GGGGGGCCTG TGTCCCAGCA ACACCAGCCT CCTGGCCACC 840 

TCCTCTCCAA TGCCTGTGAA AGAGGAGTTT CTGCCATAGC CCCATGCCCA GCCTGTGCGC 900 

CGGGGGACCT GGGGACTCGG GTGCTGGGAG TGTGGCTCCT GTGGGCCCAG GAGGTCTGGT 960 

CCGAGTCTCA GCCCTGACCT TCTGGGACAT GGTGGACAGT CACCTATCCA CCCTCTGCAT 1020 

CCCCTTGGCC CATTGTGTGC AGTAAGCCTG TTGGATAAAG ACCTTCCAGC TCCTGTGTTC 1080 

TAGACCTCTG GGGGATAAGG GAGTCCAGGG TGGATGATCT CAATCTCCCG TGGGCATCTC 1140 

AAGCCCCAAA TGGTTGGGGG AGGGGCCTAG ACAAGGCTCC AGGCCCCACC TCCTCCTCCA 1200 

TACGTTCAGA GGTGCAGCTG GAGGCCTGTG TGGGGACCAC ACTGATCCTG GAGAAAAGGG 1260 

ATGGAGCTGA AAAAGATGGA ATGCTTGCAG AGCATGACCT GAGGAGGGAG GAACGTGGTC 1320 

AACTCACACC TGCCTCTTCT GCAGCCTCAC CTCTACCTGC CCCCATCATA AGGGCACTGA 1380 

GCCCTTCCCA GGCTGGATAC TAAGCACAAA GCCCATAGCA CTGGGCTCTG ATGGCTGCTC 1440 

CACTGGGTTA CAGAATCACA GCCCTCATGA TCATTCTCAG TGAGGGCTCT GGATTGAGAG 1500 

GGAGGCCCTG GGAGGAGAGA AGGGGGCAGA GTCTTCCCTA CCAGGTTTCT ACACCCCCGC 1560 

CAGGCTGCCC ATCAGGGCCC AGGGAGCCCC CAGAGGACTT TATTCGGACC AAGCAGAGCT 1620 

CACAGCTGGA CAGGTGTTGT ATATAGAGTG GAATCTCTTG GATGCAGCTT CAAGAATAAA 1680 

TTTTTCTTCT CTTTTCAAA 1699 



Seq ID NO: 022 DNA Sequence 
Nucleic Acid Accession #: NM_021978 
Coding sequence : 36.. 2 60 3 ~" 



60 i 11 21 31 41 SI 

I I I I I I 

GACGCCTGTG AGACCCGCGA GCGGCCTCGG GGACCATGGG GAGCGATCGG GCCCGCAAGG 60 

GCGGAGGGGG CCCGAAGGAC TTCGGCGCGG GACTCAAGTA CAACTCCCGG CACGAGAAAG 120 

TGAATGGCTT GGAGGAAGGC GTGGAGTTCC TGCCAGTCAA CAAOGTCAAG AAGGTGGAAA 180 

0-> AGCATGGCCC GGGGCGCTGG GTGGTGCTGG CAGCCGTGCT GATCGGCCTC CTCTTGGTCT 240 

TGCTGGGGAT CGGCTTCCTG GTGTGGCATT TGCAGTACCG GGACGTGCGT GTCCAGAAGG 300 

TCTTCAATGG CTACATGAGG ATCACAAATG AGAATTTTGT GGATGCCTAC GAGAACTCCA 360 

ACTCCACTGA GTTTGTAAGC CTGGCCAGCA AGGTGAAGGA CGCGCTGAAG CTGCTGTACA 420 

GCGGAGTCCC ATTCCTGGGC CCCTACCACA AGGAGTCGGC TGTGACGGCC TTCAGCGAGG 480 

/U GCAGCGTCAT CGCCTACTAC TGGTCTGAGT TCAGCATCCC GCAGCACCTG GTGGAGGAGG 540 

CCGA GCGCG T CATGGCCGAG GAGCGCGTAG TCATGCTGCC CCCGCGGGCG CGCTCCCTGA 600 

AGTCCTTTGT GGTCACCTCA GTGGTGGCTT TCCCCACGGA CTCCAAAACA GTACAGAGGA 660 

CCCAGGACAA CAGCTGCAGC TTTGGCCTGC ACGCCCGCGG TGTGGAGCTG ATGCGCTTCA 720 

CCACGCCCGG CTTCCCTGAC AGCCCCTACC CCGCTCATGC CCGCTGCCAQ TGGGCCCTGC 780 

/J GGGGGGACGC CGACTCAGTG CTGAGCCTCA CCTTCCGCAG CTTTGACCTT GCGTCCTGCG 840 

ACGAGCGCGG CAGCGACCTG GTGACGGTGT ACAACACCCT GAGCCCCATG GAGCCCCACG 900 

CCCTGGTGCA GTTGTGTGGC ACCTACCCTC CCTCCTACAA CCTGACCTTC CACTCCTCCC 960 

AGAACGTCCT GCTCATCACA CTGATAACCA ACACTGAGCG GCGGCATCCC GGCTTTGAGG 1020 

CCACCTTCTT CCAGCTGCCT AGGATGAGCA GCTGTGGAGG CCGCTTACGT AAAGCCCAGG 1080 

OU GGACATTCAA CAGCCCCTAC TACCCAGGCC ACTACCCACC CAACATTGAC TGCACATGGA 1140 

ACATTGAGGT GCCCAACAAC CAGCATGTGA AGGTGCGCTT CAAATTCTTC TACCTGCTGG 1200 

AGCCCGGCGT GCCTGCGGGC ACCTGCCCCA AGGACTACGT GGAGATCAAT GGGGAGAAAT 1260 

ACTGCGGAGA GAGGTCCCAG TTCGTCGTCA CCAGCAACAG CAACAAGATC ACAGTTCGCT 1320 

TCCACTCAGA TCAGTCCTAC ACCGACACCG GCTTCTTAGC TGAATACCTC TCCTACGACT 1380 
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CCAGTGACCC ATGCCCGGGG CAGTTCACGT GCCGCACGGG GCGGTGTATC CGGAAGGAGC 1440 

TGCGCTGTGA TGGCTGGGCC GACTGCACCG ACCACAGCGA TGAGCTCAAC TGCAGTTGCG 1S00 

ACGCCGGCCA CCAGTTCACG TGCAAGAACA AGTTCTGCAA GCCCCTCTTC TGGGTCTGCG IS 60 

ACAGTGTGAA CGACTGCGGA GACAACAGCG ACGAGCAGGG GTGCAGTTGT CCGGCCCAGA 1S20 

CCTTCAGGTG TTCCAATGGG AAGTGCCTCT CGAAAAGCCA GCAGTGCAAT GGGAAGGACG 1680 

ACTGTGGGGA CGGGTCCGAC GAGGCCTCCT GCCCCAAGGT GAACGTCGTC ACTTGTACCA 1740 

AACACACCTA CCGCTGCCTC AATGGGCTCT GCTTGAGCAA GGGCAACCCT GAGTGTGACG 1800 

GGAAGGAGGA CTGTAGCGAC GGCTCAGATG AGAAGGACTG CGACTGTGGG CTGCGGTCAT 1860 

TCACGAGACA GGCTCGTGTT GTTGGGGGCA OGGATGOGGA TGAGGGCGAG TGGCCCTGGC 1920 

AGGTAAGCCT GCATGCTCTG GGCCAGGGCC ACATCTGCGG TGCTTCCCTC ATCTCTCCCA I960 

ACTGGCTGGT CTCTGCCGCA CACTGCTACA TCGATGACAG AGGATTCAGG TACTCAGACC 2040 

CCACGCAGTG GACGGCCTTC CTGGGCTTGC ACGACCAGAG CCAGCGCAGC GCCCCTGGGG 2100 

TGCAGGAGCG CAGGCTCAAG CGCATCATCT CCCACCCCTT CTTCAATGAC TTCACCTTCG 2160 

ACTATGACAT CGCGCTGCTG GAGCTGGAGA AACCGGCAGA GTACAGCTCC ATGGTGCGGC 2220 

CCATCTGCCT GCCGGACGCC TCCCATGTCT TCCCTGCCGG CAAGGCCATC TGGGTCACGG 2280 

GCTGGGGACA CACCCAGTAT GGAGGCACTG GCGCGCTGAT CCTGCAAAAG GGTGAGATCC 2340 

GCGTCATCAA CCAGACCACC TGCGAGAACC TCCTGCCGCA GCAGATCACG CCGCGCATGA 2400 

TGTGCGTGGG CTTCCTCAGC GGCGGCGTGG ACTCCTGCCA GGGTGATTCC GGGGGACCCC 2460 

TGTCCAGCGT GGAGGCGGAT GGGCGGATCT TCCAGGCCGG TGTGGTGAGC TGGGGAGACG 2520 

GCTGCGCTCA GAGGAACAAG CCAGGCGTGT ACACAAGGCT CCCTCTGTTT CGGGACTGGA 2580 

TCAAAGAGAA CACTGGGGTA TAGGGGCCGG GGCCACCCAA ATGTGTACAC CTGCGGGGCC 2640 

ACCCATCGTC CACCCCAGTG TGCACGCCTG CAGGCTGGAG ACTGGACCGC TGACTGCACC 2700 

AGCGCCCCCA GAACATACAC TGTGAACTCA ATCTCCAGGG CTCCAAATCT GCCTAGAAAA 2760 

CCTCTCGCTT CCTCAGCCTC CAAAGTGGAG CTGGGAGGTA GAAGGGGAGG ACACTGGTGG 2820 

TTCTACTGAC CCAACTGGGG GCAAAGGTTT GAAGACACAG CCTCCCCCGC CAGCCCCAAG 2880 

CTGGGCCGAG GCGCGTTTGT ' GTATATCTGC CTCCCCTGTC TGTAAGGAGC AGCGGGAACG 2940 

GAGCTTCGGA GCCTCCTCAG TGAAGGTGGT GGGGCTGCCG GATCTGGGCT GTGGGGCCCT 3000 

TGGGCCACGC TCTTGAGGAA GCCCAGGCTC GGAGGACCCT GGAAAACAGA CGGGTCTGAG 3060 

ACTGAAAATG GTTTACCAGC TCCCAGGTGA CTTCAGTGTG TGTATTGTGT AAATGAGTAA 3120 

AACATTTTAT TTCTTTTTAA AAAAAAAAA 3149 

Seq ID NO: C23 DNA Sequence 

Nucleic Acid Accession fl: Eos sequence 

Coding sequence: 1..2268 

1 11 21 31 41 51 

I 1 i I I I 

ATGCCCCCTT TCCTGTTGCT GGAAGCCGTC TQTQTTTTCC TGTTTTCCAG AGTGCCCCCA 60 

TCTCTCCCTC TCCAGGAAGT CCATGTAAGC AAAGAAACCA TCGGGAAGAT TTCAGCTGCC 120 

AGCAAAATGA TGTGGTGCTC GGCTGCAGTG GACATCATGT TTCTGTTAGA TGGGTCTAAC 180 

AGCGTCGGGA AAGGGAGCTT TGAAAGGTCC AAGCACTTTG CCATCACAGT CTGTGACGGT 240 

CTGGACATCA GCCCCGAGAG GGTCAGAGTG GGAGCATTCC AGTTCAGTTC CACTCCTCAT 300 

CTGGAATTCC CCTTGGATTC ATTTTCAACC CAACAGGAAG TGAAGGCAAG AATCAAGAGG 360 

ATGGTTTTCA AAGGAGGGCG CACGGAGACG GAACTTGCTC TGAAATACCT TCTGCACAGA 420 

GGGTTGCCTG GAGGCAGAAA TGCTTCTGTG CCCCAGATCC TCATCATCGT CACTGATGGG 480 

AAGTCCCAGG GGGATGTGGC ACTGCCATCC AAGCAGCTGA AGGAAAGGGG TGTCACTGTG 540 

TTTGCTGTGG GGGTCAGGTT TCCCAGGTGG GAGGAGCTGC ATGCACTGGC CAGCGAGCCT 600 

AGAGGGCAGC ACGTGCTGTT GGCTGAGCAG GTGGAGGATG CCACCAACGQ CCTCTTCAGC 660 

ACCCTCAGCA GCTCGGCCAT CTGCTCCAGC GCCACGCCAG ACTGCAGGGT CGAGGCTCAC 720 

CCCTGTGAGC ACAGGACGCT GGAGATGGTC CGGGAGTTCG CTGGCAATGC CCCATGCTGG 780 

AGAGGATCGC GGCGGACCCT TGCGGTGCTG GCTGCACACT GTCCCTTCTA CAGCTGGAAG 840 

AGAGTGTTCC TAACCCACCC TGCCACCTGC TACAGGACCA CCTGCCCAGG CCCCTGTGAC 900 

TCGCAGCCCT GCCAGAATGG AGGCACATGT GTTCCAGAAG GACTGGACGG CTACCAGTGC 960 

CTCTGCCCGC TGGCCTTTGG AGGGGAGGCT AACTGTGCCC TGAAGCTGAG CCTGGAATGC 1020 

AGGGTCGACC TCCTCTTCCT GCTGGACAGC TCTGCGGGCA CCACTCTGGA CGGCTTCCTG 1080 

CGGGCCAAAG TCTTCGTGAA GCGGTTTGTG CGGGCCGTGC TGAGCGAGGA CTCTCGGGCC 1140 

CGAGTGGGTG TGGCCACATA CAGCAGGGAG CTGCTGGTGG CGGTGCCTGT GGGGGAGTAC 1200 

CAGGATGTGC CTGACCTGGT CTGGAGCCTC GATGGCATTC CCTTCCGTGG TGGCCCCACC 1260 

CTGACGGGCA GTGCCTTGCG GCAGGCGGCA GAGCGTGGCT TCGGGAGCGC CACCAGGACA 1320 

GGCCAGGACC GGCCACGTAG AGTGGTGGTT TTGCTCACTG AGTCACACTC CGAGGATGAG 1380 

GTTGCGGGCC CAGCGCGTCA CGCAAGGGCG CGAGAGCTGC TCCTGCTGGG TGTAGGCAGT 1440 

GAGGCCGTGC GGGCAGAGCT GGAGGAGATC ACAGGCAGCC CAAAGCATGT GATGGTCTAC 1S00 

TOGGATCCTC AGGATCTGTT CAACCAAATC CCTGAGCTGC AGGGGAAGCT GTGCAGCCGG 1S60 

CAGCGGCCAG GGTGCCGGAC ACAAGCCCTG GACCTCGTCT TCATGTTGGA CACCTCTGCC 1620 

TCAGTAGGGC CCGAGAATTT TGCTCAGATG CAGAGCTTTG TGAGAAGCTG TGCCCTCCAG 1680 

TTTGAGGTGA ACCCTGACGT GACACAGGTC GGCCTGGTGG TGTATGGCAG CCAGGTGCAG 1740 

ACTGCCTTCG GGCTGGACAC CAAACCCACC CGGGCTGCGA TGCTGCGGGC CATTAGCCAG 1800 

GCCCCCTACC TAGGTGGGGT GGGCTCAGCC GGCACCGCCC TGCTGCACAT CTATGACAAA 1860 

GTGATGACCG TCCAGAGGGG TGCCCGGCCT GGTGTCCCCA AAGCTGTGGT GGTGCTCACA 1920 

GGCGGGAGAG GCGCAGAGGA TGCAGCCGTT CCTGCCCAGA AGCTGAGGAA CAATGGCATC 1980 

TCTGTCTTGG TCGTGGGCGT GGGGCCTGTC CTAAGTGAGG GTCTGCGGAG GCTTGCAGGT 2040 

CCCCGGGATT CCCTGATCCA CGTGGCAGCT TACGCCGACC TGCGGTACCA CCAGGACGTG 2100 

CTCATTGAGT GGCTGTGTGG AGAAGCCAAG CGGCCAGTCA ACCTCTGCAA ACCCAGCCCG 2160 

TGCATGAATG AGGGCAGCTG CGTCCTGCAG AATGGGAGCT ACCGCTGCAA GTGTCGGGAT 2220 

GGCTGGGAGG GCCCCCACTG CGAGAACCGA TTCTTGAGAC GCCCCTGA 2268 

Seq ID NO: C24 DNA Sequence 

Nucleic Acid Accession 8: Eos sequence 

Coding sequence: 1. .2424 

1 11 21 31 41 51 

I I 1 I I I 

ATGCCCCCTT TCCTGTTGCT GGAGGCCGTC TGTGTTTTCC TGTTTTCCAG AGTGCCCCCA 60 

TCTCTCCCTC TCCAGGAAGT CCATGTAAGC AAAGAAACCA TCGGGAAGAT TTCAGCTGCC 120 
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AGCAAAATGA 
AGCGTCGGGA 
CTGGACATCA 
CTGGAATTCC 
ATGGTTTTCA 
GGGTTGCCTG 
AAGTCCCAGG 
TTTGCTGTGG 
AGAGGGCAGC 
ACCCTCAGCA 
CCCTGTGAGC 
AGAGGATCGC 
AGAGTGTTCC 
TCGCAGCCCT 
CTCTGCCCGC 
AGGGTCGACC 
CGGGCCAAAG 
CGAGTGGGTG 
CAGGATGTGC 
CTGACGGGCA 
GGCCAGGACC 
GTTGCGGGCC 
GAGGCCGTGC 
TCGGATCCTC 
CAGCGGCCAG 
TCAGTAGGGC 
TTTGAGGTGA 
ACTGCCTTCG 
GCCCCCTACC 
GTGATGACCG 
GGCGGGAGAG 
TCTGTCTTGG 
CCCCGGGATT 
CTCATTGAGT 
TGCATGAATG 
GGCTGGGAGG 
CAGGGATGGA 
CGTACCCCTC 
AATGTCTGTG 



TGTGGTGCTC 
AAGGGAGCTT 
GCCCCGAGAG 
CCTTGGATTC 
AAGGAGGGCG 
GAGGCAGAAA 
GGGATGTGGC 
GGGTCAGGTT 
ACGTGCTGTT 
GCTCGGCCAT 
ACAGGACGCT 
GGCGGACCCT 
TAACCCACCC 
GCCAGAATGG 
TGGCCTTTGG 
TCCTCTTCCT 
TCTTCGTGAA 
TGGCCACATA 
CTGACCTGGT 
GTGCCTTGCG 
GGCCACGTAG 
CAGCGCGTCA 
GGGCAGAGCT 
AGGATCTGTT 
GGTGCCGGAC 
CCGAGAATTT 
ACCCTGACGT 
GGCTGGACAC 
TAGGTGGGGT 
TCCAGAGGGG 
GCGCAGAGGA 
TCGTGGGCGT 
CCCTGATCCA 
GGCTGTGTGG 
AGGGCAGCTG 
GCCCCCACTG 
TTCTTGAGAC 
CCAGCAACTA 
CCCCAGGTCC 



GGCTGCAGTG 
TGAAAGGTCC 
GGTCAGAGTG 
ATTTTCAACC 
CACGGAGACG 
TGCTTCTGTG 
ACTGCCATCC 
TCCCAGGTGG 
GGCTGAGCAG 
CTGCTCCAGC 
GGAGATGGTC 
TGCGGTGCTG 
TGCCACCTGC 
AGGCACATGT 
AGGGGAGGCT 
GCTGGACAGC 
GCGGTTTGTG 
CAGCAGGGAG 
CTGGAGCCTC 
GCAGGCGGCA 
AGTGGTGGTT 
CGCAAGGGOG 
GGAGGAGATC 
CAACCAAATC 
ACAAGCCCTG 
TGCTCAGATG 
GACACAGGTC 
CAAACCCACC 
GGGCTCAGCC 
TGCCCGGCCT 
TGCAGCCGTT 
GGGGCCTGTC 
CGTGGCAGCT 
AGAAGCCAAG 
CGTCCTGCAG 
CGAGAACOQT 
GCCCCTGAGG 
CAGAGAAGGC 
TTAG 



GACATCATGT 
AAGCACTTTG 
GGAGCATTCC 
CAACAGGAAG 
GAACTTGCTC 
CCCCAGATCC 
AAGCAGCTGA 
GAGGAGCTGC 
GTGGAGGATG 
GCCACGCCAG 
CGGGAGTTCG 
GCTGCACACT 
TACAGGACCA 
GTTCCAGAAG 
AACTGTGCCC 
TCTGCGGGCA 
CGGGCCGTGC 
CTGCTGGTGG 
GATGGCATTC 
GAGCGTGGCT 
TTGCTCACTG 
CGAGAGCTGC 
ACAGGCAGCC 
CCTGAGCTGC 
GACCTCGTCT 
CAGAGCTTTG 
GGCCTGGTGG 
CGGGCXGOGA 
GGCACCGCCC 
GGTGTCCCCA 
CCTGCCCAGA 
CTAAGTGAGG 
TACGCCGACC 
CAGCCAGTCA 
AATGGGAGCT 
GAGTGGAGCT 
CACATGGCTC 
CTGGGCACTG 



TTCTGTTAGA 
CCATCACAGT 
AGTTCAGTTC 
TGAAGGCAAG 
TGAAATACCT 
TCATCATCGT 
AGGAAAGGGG 
ATGCACTGGC 
CCACCAACGG 
ACTGCAGGGT 
CTGGCAATGC 
GTCCCTTCTA 
CCTGCCCAGG 
GACTGGACGG 
TGAAGCTGAG 
CCACTCTGGA 
TGAGCGAGGA 
CGGTGCCTGT 
CCTTCCGTGG 
TCGGGAGCGC 
AGTCACACTC 
TCCTGCTGGG 
CAAAGCATGT 
AGGGGAAGCT 
TCATGTTGGA 
TGAGAAGCTG 
TGTATGGCAG 
TGCTGCGGGC 
TGCTGCACAT 
AAGCTGTGGT 
AGCTGAGGAA 
GTCTGCGGAG 
TGCGGTACCA 
ACCTCTGCAA 
ACCGCTGCAA 
CTTGCTCTGT 
CCGTGCAGGA 
AAATGGTGCC 



TGGGTCTAAC 
CTGTGACGGT 
CACTCCTCAT 
AATCAAGAGG 
TCTGCACAGA 
CACTGATGGG 
TGTCACTGTG 
CAGCGAGCCT 
CCTCTTCAGC 
CGAGGCTCAC 
CCCATGCTGG 
CAGCTGGAAG 
CCCCTGTGAC 
CTACCAGTGC 
CCTGGAATGC 
CGGCTTCCTG 
CTCTCGGGCC 
GGGGGAGTAC 
TGGCCCCACC 
CACCAGGACA 
CGAGGATGAG 
TGTAGGCAGT 
GATGGTCTAC 
GTGCAGCCGG 
CACCTCTGCC 
TGCCCTCCAG 
CCAGGTGCAG 
CATTAGCCAG 
CTATGACAAA 
GGTGCTCACA 
CAATGGCATC 
GCTTGCAGGT 
CCAGGACGTG 
ACCCAGCCCG 
GTGTCGGGAT 
ATGTGTGAGC 
GGGCAGCAGC 
TACCTTCTGG 



Seq ID NO: C25 DNA Sequence 

Kucleic Acid Accession ft: XM__097386.3 

Coding sequence: 142.. 795 



CTCGCAGAAC 
GGGACCCCOG 
CTACCTCAGG 
TCCCGTGCAC 
CCTGGACCCC 
GGGTCGAGGG 
GTGTCTGTGG 
TTGAGCCTGC 
CGAGGCTGGG 
GGAAATGGGA 
CCGTGTCTGC 
TCTCCTGCCA 
CAGCACCCCC 
CTGGATGGGT 
CACAGTCAAG 
GGAACCTTCC 
ACGGGGGGCA 
TCTTATCATG 
CCTTGGCAAA 
GTTTGTCAGA 
AACTTAAAAA 
GTCTCGCCCT 
GTATTAAAAG 



11 
I 

CACCTGGACT 
TCCCACGCCT 
TCTGACTTTT 
CAGGGTCTGC 
TGGGGCCCCC 
CTGGGTCGTC 
GCGATCCGGC 
TGTTAACTCC 
TCCCGGCCCA 
GGCGGTTGCA 
AGAGGCTCCT 
CCGCCTGTCG 
CCTTATGCAG 
TCTAGTTCAC 
GCCAAOGGGG 
GTGGGACCCG 
GCTGGGGTCG 
TTTTACCCAC 
TGTAAACTCA 
CGGCGTGGGA 
AATTTTTAAA 
GGCCGCGGCA 
GAAAAAA 



21 
I 

CTGTCCGTGT 

GATGCCAAAA 
AGCAGCCACT 
GTGCACCTGC 
TCCCTOGGGC 
CTCCCGGCGG 
GATGATTGTA 
GGAGAAGGAA 
GAGGGTCTAT 
CCCTGCCTGA 
GTGGGGGTTT 
ACTGGGAGGG 
TTGGGACCGT 

TAAGTGGCTG 
TTGTTAAGGG 

GCCTTTCATT 
ACCACGCCTG 
AAACATAAAA 
GGGTGGCCTG 




TGGGTGGACC 
GGGACAGGCG 
GTCGCTGAAG 
GGGGCCCGGT 
GGTGGCCCCG 
AAATTCGGTG 
GGTCGGGCAG 
GGGGCCTGGC 
CTCTGAGATG 
TCCAGAAAGG 
TCACGCATCT 
TTTTCCCCAA 
CATGACGTGT 
AAACTCAGGT 
CTACTCTCTA 
TAACAATTTC 



TCCCCTCAGC 
TGCGTACTGA 
TTGGGAGAAA 
CGGGAGGGTG 
GTACAGTTGA 
TTGTGCTTTT 
GAAATTTCAG 
AATAGGAGGA 
CCTCTGGCTG 
AGTTTTOGCA 



Seq ID NO: C26 DNA Sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 95.. 2128 



GGGGTAGTTT 
CTCCTGGAAA 
TGCTTCCTGG 
CACACTGAGA 
GATCAATATA 
GCTTCTGAAA 
CTTCACCCCT 
GCGAGTCAGC 
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TATTAGAAGA TCAGAAG AG C TTTCCTTGTT AAAGCCGTCT GGTGACTATT TTAAGAAGAA 54 0 

GAAGAAAAAA GACAAAAATA ATAAGGAACC CATAATTGAA GATATTCTAA ACCTGGAGAG 60 0 

TTCTCCAACA GCTTCAGGTT CATCAGTAAG TCCTGGTTTA TACAGTAAAA CCATGACCCC 660 

TATATATGAC CCCATCAATG GAACACCAGC ATCATCCACC ATGACTTGGT TCAGTGACAG 720 

J CCCTTTGACG GAACAAAACT GCAGCATCCT CGCATTCAGC CAACCCCCCC AGTCCCCAGA 780 

AGCACTTQCG GATATGTACC AGCCTCGGTC TCTGGTTGAT AAAGCCAAGC TCAATGCAGG 840 

TTGGCTAGAC TCCTCACGCT CCCTTATGGA ACAAGGCATC CAAGAGGATG AGCAGCTGCT 900 

CTTACGATTT AAATATTATT CTTTCTTCGA CTTGAATCCT AAATATGATG CTGTCCGAAT 960 

AAACCAACTC TATGAGCAAG CCAGGTGGGC CATTCTCTTA GAAGAAATTG ATTGCACAGA 1020 

iV GGAAGAAATG TTGATCTTTG CAGCTCTACA GTACCACATT AGCAAACTGT CGTTGTCTGC 1080 

TGAAACACAG GATTTTGCAG GCGAGTCCGA GGTTGATGAA ATAGAAGCGG CGCTTTCTAA 1140 

TTTGGAAGTA ACCCTAGAAG GTGGAAAAGC GGACAGCCTT TTGGAGGACA TTACTGATAT 1200 

CCCTAAACTT GCAGATAATC TCAAATTATT TAGG CCCAAG AAGTTACTAC CAAAAGCTTT 1260 

CAAACAATAT TGGTTTATCT TTAAAGACAC ATCCATAGCA TACTTTAAAA ATAAGGAACT 1320 

10 TGAACAAGGA GAACCACTAG AAAAACTAAA TCTTAGAGGC TGCGAAGTTG TGCCCGATGT 1380 

AAATGTAGCA GGAAGAAAAT TTGGAATCAA GTTACTAATC CCTGTTGCCG ATGGTATGAA 1440 

TGAAATGTAT TTGAGATGTG ACCATGAGAA TCAATACGCC CAATGGATGG CTGCCTGCAT 1500 

GTTGGCATCG AAGGGCAAAA CCATGGCAGA CAGCTCCTAC CAGCCAGAGG TCCTCAACAT 1560 

« n CCTTTCATTT CTGAGGATGA AAAACAGGAA CTCTGCATCT CAGGTGGCTT CCAGTCTCGA 1620 

/X) AAACATGGAT ATGAACCCAG AATGTTTTGT GTCACCACGG TGTGCAAAAA GACACAAATC 1680 

CAAACAGCTG GCOGCCCGGA TCCTGGAGGC GCACCAGAAC GTGGCCCAGA TGCCCCTGGT 1740 

CGAAGCCAAG CTGCGGTTCA TCCAGGCGTG GCAGTCACTG CCTGAGTTTG GCCTCACCTA 1800 

CTACCTTGTC AGATTTAAAG GAAGCAAAAA AGATGACATT CTGGGAGTTT CATATAACAG 1860 

GTTGATTAAA ATTGATGCAG CCACCGGGAT TCCAGTGACA ACATGGAGAT TCACAAATAT 1920 

ZD CAAACAGTGG AATGTAAACT GGGAAACCCG GCAGGTGGTC ATCGAGTTTG ACCAAAACGT 1980 

CTTTACTGCT TTCACCTGCC TGAGTGCAGA TTGCAAGATT GTGCACGAGT ACATTGGCGG 2040 

CTACATTTTC TTGTCCACCC GCTCCAAGGA CCAGAATGAA ACACTCGATG AGGACTTGTT 2100 

CCACAAATTG ACOGGCGGTC AGGATTGAAA CAAGCACGCG TGCTCGGCTC ACACCAACAA 2160 

»a GGCAAGCCAA AGGCGCCCCT CCCCAGAGGG ATCCCTAACG TGCCCAGCAT GTAGATTCTG 2220 

D\) GACTAACAGA CAACATACAT TCACCGCTGG TCACCCAGAT CCTCATTCAA ACCCACTGCT 2280 

GGCACATCCC TTTCCTTACT TTGCCCTGTG CTACCAGCCA CGGAAGGAGC CTCTCTTGTT 2340 

TTTTCTATAA AATGGGTAGG CAGGAGAAAA GCAGGTGCCC TAAGATTGCT CTAAGGCCCA 2400 

GCATGTGGTT ACAGTTCTCT GACTTGCAGA ACCTGCCAGG TGTATGGCTA CAAGTTATCC 2460 

„ TCGTGCTGAT CTGTCTCATT ACTAAGTCAA TGGAGAAGAC AGAAAGGTAA AAATCACGTG 2520 

3D TAGCAA GAAC AACTCTTATT TCACAAACTC AGGTATGAAA CGAAACGCCT GTCCTTCATG 2580 

GAACTGCTTT TAGCTCCTGT CTTTTCAAAA TGGCAGAGGG AGTTCCTACA CACACTTTTT 2640 

CCCTGGAGGC CAAGGTCTAG GGGTAGAAAG GGGAGGGGTG GGGCTACCAG GTAGCAGTTG 2700 

ACAACCCAAG GTCAGAGGAG TGGCCCTCAG TGTCATCTGT CCACAGTGAT ACCTGCCAAG 2760 

ACi ATGACCACTG ACCCACATCT GGTCTTAGTC ATTGGTCTCC TCAGATTTCT GGGGCCACCT 2820 

4U GCAAGCCCCA TTCCATTCCT ACAGATCTCT CAGCCACCTG TAAGTCCTTT GTGAAGATGT 2880 

GGGTGACACA GGGGGACAGG AAAACCCATT TCTCAACCCA GATCCATGTC TCCACTGCTT 2940 

CTACTCTGGG TTGGGATTCA GGAAGACAGG CACAGTCCTC TCTGTTCATA GAAACACCTG 3000 

CCAGTGTCAA GGATTCCAGT CAGGTGTCTA TCCCAACTGG TCAGGGAGAG AAGGGCAGAC 3060 

CCATTCTCAA AGACCACCAT GTCCAAGGTC TGACAGCTCC CCACTGGCTG CCCCCACAGG 3120 

HD GGCTTTAGGC TGGTCTGGGT CATGGGGAAG CGTCCCTCTT ATCGCTGGTC TGTGTTCTCC 3180 

TGGATTTGGT ATCTATGTTG GTACGACTCC TGGCCTTTTA TCTAAAGGAC TTTGGCTTTT 3240 

GTAAATCACA AGCCAATAAT AGACTTTTTT CTCCCCCTCT GTTTTTTGCT GTGTCATCTC 3300 

TGCCTTGAGA CTGCCTTGAG ACAGTGCTTG CCTTGAGAGA GTGAGCCAAT TAACAGCTGC 3360 

CTGAATTGTC ATTTTCCATT TTGGTTTGTT AGAGGTGGGA GGGGTGGGTT TTGAGAAGGT 3420 

DU CAAAAGCAAT ACCAGAAGTA AAGGGAAATA TGAGACAATA TTTTATTATT TTTTCATAGA 3480 

TGTTCTGCCA CACAAAGAAC TTGGGGTGTA AGGATAAGGC AAAAGCTCCA ATCCCATTTT 3540 

TCAGTTCTCC TAGGATGCAC CCCTCAGGGA GCCTGGCCAG AGTTCCGAGG CCCGTGAGCG 3600 

TCAGCTGTTG CTTTATTTTC CATCAAAGCC CTCTGAGAAG TGAGACCTCA GCAATTCCGG 3660 

GAGCCACATA GAGACAGACT TGGCAAGGGA CCCCCTGGTT CTGAGCCAGT AGCTGCCATC 3720 

DO TGGAAATTCC TCTTTTAGCC TCTCCTTAGA GGTGAATGTG AATGAAGCCT CCCAGGCACC 3780 

CGCTGAATTT CTGAGGCCTT GCTTAAAGCT CAGAAGTGGT TTAGGCATTT GGAAAATCTG 3840 

GTTCACATCA TAAAGAACTT GATTTGAAAT GTTTTCTATA GAAACAAGTG CTAAGTGTAC 3900 

CGTATTATAC TTGATGTTGG TCATTTCTCA GTCCTATTTC TCAGTTCTAT TATTTTAGAA 3960 

<0 CCTAGTCAGT TCTTTAAGAT TATAACTGGT CCTACATTAA AATAATGCTT CTCGATGTCA 4020 

OU GATTTTACCT GTTTGCTGCT GAGAACATCT CTGCCTAATT TACCAAAGCC AGACCTTCAG 4080 

TTCAACATGC TTCCTTAGCT TTTCATAGTT GTCTGACATT TCCATGAAAA CAAAGGAACC 4140 

AACTTTGTTT TAACCAAACT TTGTTTGGTT ACAGTTTTCA GGGGAGCGTT TCTTCCATGA 4200 

CACACAGCAA CATCCCAAAG AAATAAACAA GTGTGACAAA AAAAAAAAAA AACAAACCTA 4260 

AATGCTACTG TTCCAAAGAG CAACTTGATG GTTTTTTTTA ATACTGAGTG CAAAAGGTCA 4320 

03 CCCAAATTCC TATGATGAAA TTTTAAATTA ATGGGCACCT TTCAACATCA TTTGCTTCCT 4380 

TATCTACAGT TGATTCAGAA ATCTGCATTT TTTATTCTTT TATATGACTT TTAAGTAAAA 4440 

GATTTATATG GATTTGAAAA AAAAAAAAAA A 44 71 

- Seq ID NO: C27 Protein Sequence 

/U Protein Accession #: NP_00516l.l 

1 11 21 31 41 51 

I I I I I I 

MDGGTLPRSA PPAPPVPVGC AARRRPASPE LLRCSRRRRP ATAETGGGAA AVARRNERER 60 

/D NRVKLVNLGF QALRQHVPHG GASKKLSKVE TLRSAVEYIR ALQRLLAEHD AVRNAIAGGL 120 

RPQAVRPSAP RGPPGTTPVA ASPSRASSSP GRGGSSEPGS PRSAYSSDDS GCEGALSPAE 180 

RELLDFSSWL GGY 193 

Seq ID NOt C28 DNA Sequence 
OU Nucleic Acid Accession # : NM 017763 
Coding sequence; 169.. 2520 

1 11 21 31 41 51 

I I I I I I 
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AAAAAAAAAA AACTTTAGAG AAAGGAAGGG CCAAAACTAC GACTTGGCTT TCTGAAACGG 
AAGCATAAAT GTTCTTTTCC TCCATTTGTC TGGATCTGAG AACCTGCATT TGGTATTAGC 
TAGTGGAAGC AGTATGTATG GTTGAAGTGC ATTGCTGCAG CTGGTAGCAT GAGTGGTGGC 
CACCAGCTGC AGCTGGCTGC CCTCTGGCCC TGGCTGCTGA TGGCTACCCT GCAGGCAGGC 
TTTGGACGCA CAGGACTGGT ACTGGCAGCA GCGGTGGAGT CTGAAAGATC AGCAGAACAG 
AAAGCTGTTA TCAGAGTGAT CCCCTTGAAA ATGGACCCCA CAGGAAAACT GAATCTCACT 
TTGGAAGGTG TGTTTGCTGG TGTTGCTGAA ATAACTCCAG CAGAAGGAAA ATTAATGCAG 
TCCCACCCAC TGTACCTGTG CAATGCCAGT GATGACGACA ATCTGGAGCC TGGATTCATC 
AGCATCGTCA AGCTGGAGAG TCCTCGACGG GCCCCCCGCC CCTGCCTGTC ACTGGCTAGC 
AAGGCTCGGA TGGCGGGTGA GCGAGGAGCC AGTGCTGTCC TCTTTGACAT CACTGAGGAT 
CGAGCTGCTG CTGAGCAGCT GCAGCAGCCG CTGGGGCTGA CCTGGCCAGT GGTGTTGATC 
TGGGGTAATG ACGCTGAGAA GCTGATGGAG TTTGTGTACA AGAACCAAAA GGCCCATGTG 

AGGATTGAGC tgaaggagcc cccggcctgg ccagattatg atgtgtggat CCTAATGACA 

GTGGTGGGCA CCATCTTTGT GATCATCCTG GCTTCGGTGC TGCGCATCCG GTGCCGCCCC 
CGCCACAGCA GGCCGGATCC GCTTCAGCAG AGAACAGCCT GGGCCATCAG CCAGCTGGCC 
ACCAGGAGGT ACCAGGCCAG CTGCAGGCAG GCCCGGGGTG AGTGGCCAGA CTCAGGGAGC 
AGCTGCAGCT CAGCCCCTGT GTGTGCCATC TGTCTGGAGG AGTTCTCTGA GGGGCAGGAG 
CTACGGGTCA TTTCCTGCCT CCATGAGTTC CATCGTAACT GTGTGGACCC CTGGTTACAT 
CAGCATCGGA CTTGCCCCCT CTGCGTGTTC AACATCACAG AGGGAGATTC ATTTTCCCAG 
TCCCTGGGAC CCTCTCGATC TTACCAAGAA CCAGGTCGAA GACTCCACCT CATTCGCCAG 
CATCCCGGCC ATGCCCACTA CCACCTCCCT GCTGCCTACC TGTTGGGCCC TTCCCGGAGT 
GCAGTGGCTC GGCCCCCACG ACCTGGTCCC TTCCTGCCAT CCCAGGAGCC AGGCATGGGC 
CCTCGGCATC ACCGCTTCCC CAGAGCTGCA CATCCCCGGG CTCCAGGAGA GCAGCAGCGC 
CTGGCAGGAG CCCAGCACCC CTATGCACAA GGCTGGGGAA TGAGCCACCT CCAATCCACC 
TCACAGCACC CTGCTGCTTG CCCAGTGCCC CTACGCCGGG CCAGGCCCCC TGACAGCAGT 
GGATCTGGAG AAAGCTATTG CACAGAACGC AGTGGGTACC TGGCAGATGG GCCAGCCAGT 
GACTCCAGCT CAGGGCCCTG TCATGGCTCT TCCAGTGACT CTGTGGTCAA CTGCACGGAC 
ATCAGCCTAC AGGGGGTCCA TGGCAGCAGT TCTACTTTCT GCAGCTCCCT AAGCAGTGAC 
TTTGACCCCC TAGTGTACTG CAGCCCTAAA GGGGATCCCC AGCGAGTGGA CATGCAGCCT 
AGTGTGACCT CTCGGCCTCG TTCCTTGGAC TCGGTGGTGC CCACAGGGGA AACCCAGGTT 
TCCAGCCATG TCCACTACCA CCGCCACCGG CACCACCACT ACAAAAAGCG GTTCCAGTGG 
CATGGCAGGA AGCCTGGCCC AGAAACCGGA GTCCCCCAGT CCAGGCCTCC TATTCCTCGG 
ACACAGCCCC AGCCAGAGCC ACCTTCTCCT GATCAGCAAG TCACCGGATC CAACTCAGCA 
GCCCCTTCGG GGCGGCTCTC TAACCCACAG TGCCCCAGGG CCCTCCCTGA GCCAGCCCCT 
GGCCCAGTTG ACGCCTCCAG CATCTGCCCC AGTACCAGCA GTCTGTTCAA CTTGCAAAAA 
TCCAGCCTCT CTGCCCGACA CCCACAGAGG AAAAGGCGGG GGGGTCCCTC CGAGCCCACC 
CCTGGCTCTC GGCCCCAGGA TGCAACTGTG CACCCAGCTT GCCAGATTTT TCCCCATTAC 
ACCCCCAGTG TGGCATATCC TTGGTCCCCA GAGGCACACC CCTTGATCTG TGGACCTCCA 
GGCCTGGACA AGAGGCTGCT ACCAGAAACC CCAGGCCCCT GTTACTCAAA TTCACAGCCA 
GTGTGGTTGT GCCTGACTCC TCGCCAGCCC CTGGAACCAC ATCCACCTGG GGAGGGGCCT 
TCTGAATGGA GTTCTGACAC CGCAGAGGGC AGGCCATGCC CTTATCCGCA CTGCCAGGTG 
CTGTCGGCCC AGCCTGGCTC AGAGGAGGAA CTCGAGGAGC TGTGTGAACA GGCTGTGTGA 
GATGTTCAGG CCTAGCTCCA ACCAAGAGTG TGCTCCAGAT GTGTTTGGGC CCTACCTGGC 
ACAGAGTCCT GCTCCTGGGA AAGGAAAGGA CCACAGCAAA CACCATTCTT TTTGCCGTAC 
TTCCTAGAAG CACTGGAAGA GGACTGGTGA TGGTGGAGGG TGAGAGGGTG CCGTTTCCTG 
CTCCAGCTCC AGACCTTGTC TGCAGAAAAC ATCTGCAGTG CAGCAAATCC ATGTCCAGCC 
AGGCAACCAG CTGCTGCCTG TGGCGTGTGT GGGCTGGATC CCTTGAAGGC TGAGTTTTTG 
AGGGCAGAAA GCTAGCTATG GGTAGCCAGG TGTTACAAAG GTGCTGCTCC TTCTCCAACC 
CCTACTTGGT TTCCCTCACC CCAAGCCTCA TGTTCATACC AGCCAGTGGG TTCAGCAGAA 
CGCATGACAC CTTATCACCT CCCTCCTTGG GTGAGCTCTG AACACCAGCT TTGGCCCCTC 
CACAGTAAGG CTGCTACATC AGGGGCAACC CTGGCTCTAT CATTTTCCTT TTTTGCCAAA 
AGGACCAGTA GCATAGGTGA GCCCTGAGCA CTAAAAGGAG GGGTCCCTGA AGCTTTCCCA 
CTATAGTGTG GAGTTCTGTC CCTGAGGTGG GTACAGCAGC CTTGGTTCCT CTGGGGGTTO 
AGAATAAGAA TAGTGGGGAG GGAAAAACTC CTCCTTGAAG ATTTCCTGTC TCAGAGTCCC 
AGAGAGGTAG AAAGGAGGAA TTTCTGCTGG ACTTTATCTG GGCAGAGGAA GGATGGAATG 
AAGGTAGAAA AGGCAGAATT ACAGCTGAGC GGGGACAACA AAGAGTTCTT CTCTGGGAAA 
AGTTTTGTCT TAGAGCAAGG ATGGAAAATG GGGACAACAA AGGAAAAGCA AAGTGTGACC 
CTTGGGTTTG GACAGCCCAG AGGCCCAGCT CCCCAGTATA AGCCATACAG GCCAGGGACC 
CACAGGAGAG TGGATTAGAG CACAAGTCTG GCCTCACTGA GTGGACAAGA GCTGATGGGC 
CTCATCAGGG TGACATTCAC CCCAGGGCAG CCTGACCACT CTTGGCCCCT CAGGCATTAT 
CCCATTTGGA ATGTGAATGT GGTGGCAAAG TGGGCAGAGG ACCCCACCTG GGAACCTTTT 
TCCCTCAGTT AGTGGGGAGA CTAGCACCTA GGTACCCACA TGGGTATTTA TATCTGAACC 
AGACAGACGC TTGAATCAGG CACTATGTTA AGAAATATAT TTATTTGCTA ATATATTTAT 
CCACAAAAAA AAAAAAAAAA AA 



Seq ID NO: C29 Protein Sequence 
Protein Accession ft: NP__004280.2 

1 11 21 31 41 51 

I 1 I I I I 

MNSSAHYHVN FSQAISQDVN LHEAILLCPN NTFRRDPTAR TSQSQEPFLQ LNSHTTNPEQ 
TLPGTNLTGF LSPVDNHMRN LTSQDLLYDL DINIFDEINL MSLATEDNFD PIDVSQLFDE 
PDSDSGLSLD SSHNNTSVIK SNSSHSVCDE GAIGYCTDHE SSSHHDLEGA VGGYYPEPSK 
LGHLDQSDSD FHGDLTFQKV FHNHTYHLQP TAPESTSEPF PWPGKSQKIR SHYLEDTDRN 
LSRDEQRAKA LHIPFSVDEI VGMPVDSFNS MLSRYYLTDL QVSLIRDIRR RGKNKVAAQN 
CRKRKLDIIL NLEDDVCNLQ AKKETLKREQ AQCNKAINIM KQKLHDLYHD IFSRLRDDQG 
RPVNPNHYAL QCTHDGSILI VPKELVASGH KKETQKGKRK 
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Seq ID MO: C30 DMA Sequence 
Nucleic Acid Accession #: NM_004442 
Coding sequence: 19.. 2982 
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GCCCCGGGAA GCGCAGCCAT GGCTCTGCGG AGGCTGGGGG CCGCGCTGCT GCTGCTGCCG 60 

CTGCTCGCCG CCGTGGAAGA AACGCTAATG GACTCCACTA CAGCGACTGC TGAGCTGGGC 120 

TGGATGGTGC ATCCTCCATC AGGGTGGGAA GAGGTGAGTG GCTACGATGA GAACATGAAC 180 

ACGATCCGCA CGTACCAGGT GTGCAACGTG TTTGAGTCAA GCCAGAACAA CTGGCTACGG 240 

ACCAAGTTTA TCCGGCGCCG TGGCGCCCAC CGCATCCACG TGGAGATGAA GTTTTCGGTG 300 

CGTGACTGCA GCAGCATCCC CAGCGTGCCT GGCTCCTGCA AGGAGACCTT CAACCTCTAT 360 

TACTATGAGG CTGACTTTGA CTCGGCCACC AAGACCTTCC CCAACTGGAT GGAGAATCCA 420 

TGGGTGAAGG TGGATACCAT TGCAGCOGAC GAGAGCTTCT CCCAGGTGGA CCTGGGTGGC 480 

CGCGTCATGA AAATCAACAC CGAGGTGCGG AGCTTCGGAC CTGTGTCCCG CAGCGGCTTC 540 

TACCTGGCCT TCCAGGACTA TGGCGGCTGC ATGTCCCTCA TCGCCGTGCG TGTCTTCTAC 600 

CGCAAGTGCC CCCGCATCAT CCAGAATGGC GCCATCTTCC AGGAAACCCT GTCGGGGGCT 660 

GAGAGCACAT CGCTGGTGGC TGCCCGGGGC AGCTGCATCG CCAATGCGGA AGAGGTGGAT 720 

GTACCCATCA AGCTCTACTG TAACGGGGAC GGCGAGTGGC TGGTGCCCAT CGGGCGCTGC 780 

ATGTGCAAAG CAGGCTTCGA GGCCGTTGAG AATGGCACCG TCTGCCGAGG TTGTCCATCT 840 

GGGACTTTCA AGGCCAACCA AGGGGATGAG GCCTGTACCC ACTGTCCCAT CAACAGCCGG 900 

ACCACTTCTG AAGGGGCCAC CAACTGTGTC TGCCGCAATG GCTACTACAG AGCAGACCTG 960 

GACCCCCTGG ACATGCCCTG CACAACCATC CCCTCCGCGC CCCAGGCTGT GATTTCCAGT 1020 

GTCAATGAGA CCTCCCTCAT GCTGGAGTGG ACCCCTCCCC GCGACTCCGG AGGCCGAGAG 1080 

GACCTCGTCT ACAACATCAT CTGCAAGAGC TGTGGCTCGG GCCGGGGTGC CTGCACCCGC 1140 

TGCGGGGACA ATGTACAGTA CGCACCACGC CAGCTAGGCC TGACCGAGCC ACGCATTTAC 1200 

ATCAGTGACC TGCTGGCCCA CACCCAGTAC ACCTTCGAGA TCCAGGCTGT GAACGGCGTT 1260 

ACTGACCAGA GCCCCTTCTC GCCTCAGTTC GCCTCTGTGA ACATCACCAC CAACCAGGCA 1320 

GCTCCATCGG CAGTGTCCAT CATGCATCAG GTGAGCCGCA CCGTGGACAG CATTACCCTG 1380 

TCGTGGTCCC AGCCAGACCA GCCCAATGGC GTGATCCTGG ACTATGAGCT GCAGTACTAT 1440 

GAGAAGGAGC TCAGTGAGTA CAACGCCACA GCCATAAAAA GCCCCACCAA CACGGTCACC 1500 

GTGCAGGGCC TCAAAGCCGG CGCCATCTAT GTCTTCCAGG TGCGGGCACG CACCGTGGCA 1560 

GGCTACGGGC GCTACAGCGG CAAGATGTAC TTCCAGACCA TGACAGAAGC CGAGTACCAG 1620 

ACAAGCATCC AGGAGAAGTT GCCACTCATC ATCGGCTCCT CGGCCGCTGG CCTGGTCTTC 1680 

CTCATTGCTG TGGTTGTCAT CGCCATCGTG TGTAACAGAA GACGGGGGTT TGAGCGTGCT 1740 

GACTCGGAGT ACACGGACAA GCTGCAACAC TACACCAGTG GCCACATGAC CCCAGGCATG 1800 

AAGATCTACA TCGATCCTTT CACCTACGAG GACCCCAACG AGGCAGTGCG GGAGTTTGCC 1860 

AAGGAAATTG ACATCTCCTG TGTCAAAATT GAGCAGGTGA TCGGAGCAGG GGAGTTTGGC 1920 

GAGGTCTGCA GTGGCCACCT GAAGCTGCCA GGCAAGAGAG AGATCTTTGT GGCGATCAAG 1980 

ACGCTCAAGT CGGGCTACAC GGAGAAGCAG CGCCGGGACT TCCTGAGCGA AGCCTCCATC 2040 

ATGGGCCAGT TCGACCATCC CAACGTCATC CACCTGGAGG GTGTCGTGAC CAAGAGCACA 2100 

CCTGTGATGA TCATCACCGA GTTCATGGAG AATGGCTCCC TGGACTCCTT TCTCGGGCAA 2160 

AAGGATGGGC AGTTCACAGT CATCCAGCTG GTGGGCATGC TTCGGGGCAT CGCAGCTGGC 2220 

ATGAAGTACC TGGCAGACAT GAACTATGTT CACCGTGACC TGGCTGCCCG CAACATCCTC 2280 

GTCAACAGCA ACCTGGTCTG CAAGGTGTCG GACTTTGGGC TCTCACGCTT TCTAGAGGAC 2340 

GATACCTCAG ACCCCACCTA CACCAGTGCC CTGGGCGGAA AGATCCCCAT CCGCTGGACA 2400 

GCCCCGGAAG CCATCCAGTA CCGGAAGTTC ACCTCGGCCA GTGATGTGTG GAGCTACGGC 2460 

ATTGTCATGT GGGAGGTGAT GTCCTATGGG GAGCGGCCCT ACTGGGACAT GACCAACCAG 2520 

GATGTAATCA ATGCCATTGA GCAGGACTAT CGGCTGCCAC CGCCCATGGA CTGCCCGAGC 2580 

GCCCTGCACC AACTCATGCT GGACTGTTGG CAGAAGGACC GCAACCACCG GCCCAAGTTC 2640 

GGCCAAATTG TCAACACGCT AGACAAGATG ATCCGCAATC CCAACAGCCT CAAAGCCATG 2700 

GCGCCCCTCT CCTCTGGCAT CAACCTGCCG CTGCTGGACC GCACGATCCC CGACTACACC 2760 

AGCTTTAACA CGGTGGACGA GTGGCTGGAG GCCATCAAGA TGGGGCAGTA CAAGGAGAGC 2820 

TTCGCCAATG CCGGCTTCAC CTCCTTTGAC GTCGTGTCTC AGATGATGAT GGAGGACATT 2880 

50 CTCCGGGTTG GGCTCACTTT GGCTGGCCAC CAGAAAAAAA TCCTGAACAG TATCCAGGTG 2940 

ATGCGGGCGC AGATGAACCA GATTCAGTCT GTGGAGGTTT GACATTCACC TGCCTCGGCT 3000 

CACCTCTTCC TCCAAGCCCC GCCCCCTCTG CCCCACGTGC CGGCCCTCCT GGTGCTCTAT 3060 

CCACTGCAGG GCCAGCCACT CGCCAGGAGG CCACGGGCCA CGGGAAGAAC CAAGCGGTGC 3120 

CAGCCACGAG ACGTCACCAA GAAAACATGC AACTCAAACG ACGGAAAAAA AAAGGGAATG 3180 

GGAAAAAAGA AAACAGATCC TGGGAGGGGG CGGGAAATAC AAGGAATATT TTTTAAAGAG 3240 

GATTCTCATA AGGAAAGCAA TGACTGTTCT TGCGGGGGAT AAAAAAGGGC TTGGGAGATT 3300 

CATGCGATGT GTCCAATCGG AGACAAAAGC AGTTTCTCTC CAACTCCCTC TGGGAAGGTG 3360 

ACCTGGCCAG AGCCAAGAAA CACTTTCAGA AAAACAAATG TGAAGGGGAG AGACAGGGGC 3420 

CGCCCTTGGC TCCTGTCCCT GCTGCTCCTC TAGGCCTCAC TCAACAACCA AGCGCCTGGA 3480 

GGACGGGACA GATGGACAGA CAGCCACCCT GAGAACCCCT CTGGGAAAAT CTATTCCTGC 3540 

CACCACTGGG CAAACAGAAG AATTTTTCTG TCTTTGGAGA GTATTTTAGA AACTCCAATG 3600 

AAAGACACTG TTTCTCCTGT TGGCTCACAG GGCTGAAAGG GGCTTTTGTC CTCCTGGGTC 3660 

AGGGAGAACG CGGGGACCCC AGAAAGGTCA GCCTTCCTGA GGATGGGCAA CCCCAGGTCT 3720 

GCAGCTCCAG GTACATATCA OGOGCACAGC CTGGCAGCCT GGCCCTCCTG GTGCCCACTC 3780 

COGCCAGCCC CTGCCTCGAG GACTGATACT GCAGTGACTG COGTCAGCTC CGACTGCCGC 3840 

TGAGAAGGGT TGATCCTGCA TCTGGGTTTG TTTACAGCAA TTCCTGGACT CGGGGGTATT 3900 

TTGGTCACAG GGTGGTTTTG GTTTAGGGGG TTTGTTTGTT GGGTTGTTTT TTGTTTTTTG 3960 

GTTTTTTTTA ATGACAATGA AGTGACACTT TGACATTTCC TACCTTTTGA GGACTTGATC 4020 

CTTCTOCAGG AAGAAGGTGC TTTCTGCTTA CTGACTTAGG CAATACACCA AGGGCGANAT 4080 

TTTATATGCA CATTTCTGGA TTTTTTTATA CGGTTTTCAT TGACACTCTT CCCTCCTCCC 4140 

ACCTGCCACC AGGCCTCACC AAAGCCCACT GCCATGGGGC CATCTGGGCC ATTCAGAGAC 4200 

TGGAGTGAGA TTTGGGTGTG GAGGGGGAGG CGCCAAGGTG GAGGAGCTTC CCACTCCAGG 4260 

ACTGTTGATG AAAGGGACAG ATTGAGGAGG AAGTGGGCTC TGAGGCTGCA GGGCTGGAAG 4320 

TCCTTGCCCA CTTCCCACTC TCCTGCCCCA ATCTATCTAG TACTTCCCAG GCAAATAGGC 4380 

CCCTTTGAGG CTCCTGAGTG CCCTCAGATG GTCAAAACCC AGTTTTCCCT CTGGGAGCCT 4440 

AAACCAGGCT GCATCGGAGG CCAGGACCCG GATCATTCAC TGTGATACCC TGCCCTCCAG 4500 

AGGGTGCGCT CAGAGACACG GGCAAGCATG CCTCTTCCCT TCCCTGGAGA GAAAGTGTGT 4560 

GATTTCTCTC CCACCTCCTT CCCCCCACCA GACCTTTGCT GGGCCTAAAG GTCTTGGCCA 4620 

TGGGGACGCC CTCAGTCTAG GGATCTGGCC ACAGACTCCC TCCTGTGAAC CAACACAGAC 4680 

OU ACCCAAGCAG AGCAATCAGT TAGTGAATTG GAATTCCCCA AGTCTTTGCT ATTGTGAATA 4740 

GTGCTGCAAT AAACATACGT GTGCATGTGT CTTTATAGTA GAATGATCTA TAATCCTCTG 4800 

GGTATGTACC CAGTAATGGG ATTGCTGGGT CAAATGGTTT TTCTGGTTCT AGATCCTTGA 4860 

GGAATTGCCA CACTGTCTTT CACAATGGTT GAACTAATTT ACACTCCTAC CAACAGTGTA 4920 

AAAGTCTTCC TGTTTCTCCA CATCCTCTCC AGCATCTGTT GTTTCCTGAC TTTTTAATGA 4980 
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TTGCCATTCT AACTGGTGTG AGATGGTATC TCATTGTGGG TTTGATTTGC ATTTCTCTAA 5040 

TGACCAGTGA AGATGAGCTT TTTTTCATAT GTTTGTTGGC CACATGTTTG TTGTTTCTTT 5100 

TGAGAAGTGT CTGTTCATAA CCTTCACCAA TTTTTGATGA GGTTGTTTGT TCTTTTCTTG 5160 

TAAACTTAAT GAAATAAAGC ATGAAGACAA GATTAGAAGA AAAAGAATGA AAAGGAACAA 5220 

ACAAAGCGTC CAAGAAATAT GGGACTATGT GACAAGAACA AACTTACGTT TGACTGGTGT 5280 

GCTGAAAATG ACAGGGAGAA TGAAACCAAG TTGGAAAACA CTCTTCAGGA TATTATCCAG 5340 

GAGAACTTCT CCAACCTAGC AAGACAGACC AACGTTCAAA TTCAGGAAAT ACAGAGAACA 5400 

CCCAAAGATA TTTCTCGAGA AGAGCAACCC GAAGACACAT AATTGTCAGA TTCACCAAGG 5460 

TTGAAATGAA GGAAAAAATG CTAAGGGCAG CCAGAGAGAA AGGTCAGGTT ACTCACAAAA 5520 

GGAAGCCCAT CAGACTAACA GCAGATCTCT CTGCAGAAAC CCTACAACCT AGAAGAGAGT 5580 

GGGGACCAAT ATTCAACATT CACAAAGAAA AGAATTTTCA ATCCAGAATT TCATATCCAG 5640 

CCAAACTAAG CTTCATAAGC AAAGGAGAAA TAAAATCCTT TACAGACAAG CAAATCCTGG 5700 

GAGATTTTGT CACCACCAGG CCTGCCTTAC AAGACATCCT GAAGGAAGCA CTAAATATGG 5760 

AAAAGAAAAA CTGGTGCCAG CCACTGCAAA AAATACCAAA TTGTAGAGAC CATTGACACT 5820 

ATGAAGAAAC CGTGTCAACT AATGGGCAAA ATAACCAGCT AGTATCATAA TGACAGGATC 5880 

AGATTCACAC ATAACAATAT TAACCTTATA TCTAAATGGG CTAAATCCCC CAATTAAAAG 5940 

ACGCAGACTG GCAAATTGGT TAAAGAGTCA AGACTCATTG GTGTGCCGTA TTCAGGAGAC 6000 

CCATCTCACG TGCAAAGACA CACATAGGCT CAGAGTAAAA GGGATACAGG GGAATTC 6057 

Seq ID NoY C31 DNA Sequence 

Nucleic Acid Accession 8: NM_031942.l 

Coding sequence: 14S..12G0 



^ r 1 11 21 31 41 51 

25 | | | | | | 

CCCGAGCCCC GCCCCTCCGG GCCCGGGTCG GCGCGCCCAG CCTGCCAGCC GCGCTGCTGC 60 

TGCTCCTCCT GCTGTGGGAC CGCTGACCGC GCGGCTGCTC CGCTCTCCCC GCTCCAAGCG 120 

CCGATCTGGG CACCCGCCAC CAGCATGGAC GCTCGCCGCG TGCCGCAGAA AGATCTCAGA 180 

GTAAAGAAGA ACTTAAAGAA ATTCAGATAT GTGAAGTTGA TTTCCATGGA AACCTCGTCA 240 

TCCTCTGATG ACAGTTGTGA CAGCTTTGCT TCTGATAATT TTGCAAACAC GAGGCTGCAG 300 

TCAGTTCGGG AAGGCTGTAG GACCCGCAGC CAGTGCAGGC ACTCTGGACC TCTCAGGGTG 360 

GCGATGAAGT TTGCAGCGCG GAGTACCAGG GGAGCAACCA ACAAAAAAGC AGAGTCCCGC 420 

CAGCCCTCAG AGAATTCTGT GACTGATTCC AACTCCGATT CAGAAGATGA AAGTGGAATG 480 

AATTTTTTGG AGAAAAGGGC TTTAAATATA AAGCAAAACA AAGCAATGCT TGCAAAACTC 540 

ATGTCTGAAT TAGAAAGCTT CCCTGGCTCG TTCCGTGGAA GACATCCCCT CCCAGGCTCC 600 

GACTCACAAT CAAGGAGACC GCGAAGGCGT ACATTCCCGG GTGTTGCTTC CAGGAGAAAC 660 

CCTGAACGGA GAGCTCGTCC TCTTACCAGG TCAAGGTCCC GGATCCTCGG GTCCCTTGAC 720 

GCTCTACCCA TGGAGGAGGA GGAGGAAGAG GATAAGTACA TGTTGGTGAG AAAGAGGAAG 780 

ACCGTGGATG GCTACATGAA TGAAGATGAC CTGCCCAGAA GCCGTCGCTC CAGATCATCC 840 

GTGACCCTTC CGCATATAAT TCGCCCAGTG GAAGAAATTA CAGAGGAGGA GTTGGAGAAC 900 

GTCTGCAGCA ATTCTCGAGA GAAGATATAT AACCGTTCAC TGGGCTCTAC TTGTCATCAA 960 

TGCCGTCAGA AGACTATTGA TACCAAAACA AACTGCAGAA ACCCAGACTG CTGGGGCGTT 1020 

CGAGGCCAGT TCTGTGGCCC CTGCCTTCGA AACCGTTATG GTGAAGAGGT CAGGGATGCT 1080 

CTGCTGGATC CGAACTGGCA TTGCCCGCCT TGTCGAGGAA TCTGCAACTG CAGTTTCTGC 1140 

CGGCAGOGAG ATGGACGGTG TGCGACTGGG GTCCTTGTGT ATTTAGCCAA ATATCATGGC 1200 

TTTGGGAATG TGCATGCCTA CTTGAAAAGC CTGAAACAGG AATTTGAAAT GCAAGCATAA 1260 

TATCTGGAAA ATTTGCTGCC TGCCTTCTAC TTCTCAAATC TTTCTTGTAA AAGTTTCCAA 1320 

TTTTTTCACT GAAACCTGAG TTAAAAATCT TGATGATCAG CCTGTTTCAT AAGAAACTCC 1380 

AATCAAGTTA ATCTTAGCAG ACATGTGTTT CTGGAGCATC ACAGAAGGTA TATTGCTAGT 1440 

TACACTTTGC CCTCCTGCAG TTTCTTCTCT GCTCCCAACC CCCATCTCAT AGCATCCCCC 1500 

TCTATTTCCA ATGCTCCTCT CCAACCGCTT AGTTTCTGAA TTTCTTTTAA ATTACAGTTT 1560 

TATGAAAGCA TATTTTATTT ACTTGGTGTT GAAATAGCCC TCATAAAACC TAAGCACTTG 1620 

GAAACACAAT AATAGTATTA ACTAACTAGA TCTATTGAAT TTCAGAGAAG AGCCTTCTAA 1680 

CTTGTTTACA CAAAAACGAG TATGATTTAG CACTCATACT AGTTGAAATT TTTAATAGAA 1740 

TCAAGGCACA AAAGTCTTAA AACCATGTGG AAAAATTAGG TAATTATTGC AGATTGATGT 1800 

CTCTCAATCC CATGTATTGC GCTTATGTTA CAAGTTGTTG TCACAGTTGA GACTTAATTT 1860 

CTCCTAATTT CTTCTGCCCG AAGGGTAAGT GGTGCGTCCA GCTTACACGA TCATAATTCA 1920 

AAGGTTGGTG GGCAATGTAA TACTTAATTA AAATAATGAT GGAAGAGCTA TCTGGAGATT 1980 

ATGAGTAAGC TGATTTGAAT TTTCAGTATA AAACTTTAGT ATAATTGTAG TTTGCAAAGT 2040 

TTATTTCAGT TCAGATGTAA GGTATTGCAA ATAAATTCTT GGACAATTTT GTATGGAAAC 2100 

TTGATATTAA AAACTAGTCT GTGGTTCTTT GCAGTTTCTT GTAAATTTAT AAACCAGGCA 2160 

CAAGGTTCAA GTTTAGATTT TAAGCACTTT TATAACAATG ATAAGTGCCT TTTTGGAGAT 2220 

GTAACTTTTA GCAGTTTGTT AACCTGACAT CTCTGCCAGT CTAGTTTCTG GGCAGGTTTC 2280 

CTGTGTCAGT ATTCCCCCTC CTCTTTGCAT TAATCAAGGT ATTTGGTAGA GGTGGAATCT 2340 

AAGTGTTTGT ATGTCCAATT TACTTGCATA TGTAAACCAT TGCTGTGCCA TTCAATGTTT 2400 

GATGCATAAT TGGACCTTGA ATCGATAAGT GTAAATACAG CTTTTGATCT GTAATGCTTT 2460 

TATACAAAAG TTTATTTTAA TAATAAAATG TTTGTTCTAA AAAAAAAAAA 2510 



„ Seq ID NOt C32 DNA Sequence 
/U Nucleic Acid Accession #: NM_012445.1 



Coding sequence: 276.. 1271 

1 11 21 31 41 51 

I I ! I I I 

GCACGAGGGA AGAGGGTGAT CCGACCCGGG GAAGGTCGCT GGGCAGGGCG AGTTGGGAAA 60 

GCGGCAGCCC CCGCCGCCCC CGCAGCCCCT TCTCCTCCTT TCTCCCACGT CCTATCTGCC 120 

TCTCGCTGGA GGCCAGGCCG TGCAGCATCG AAGACAGGAG GAACTGGAGC CTCATTGGCC 180 

GGCCCGGGGC GCCGGCCTCG GGCTTAAATA GGAGCTCCGG GCTCTGGCTG GGACCCGACC 240 

GCTGCCGGCC GCGCTCCCGC TGCTCCTGCC GGGTGATGGA AAACCCCAGC CCGGCCGCCG 300 

CCCTGGGCAA GGCCCTCTGC GCTCTCCTCC TGGCCACTCT CGGCGCCGCC GGCCAGCCTC 360 

TTGGGGGAGA GTCCATCTGT TCCGCCAGAG CCCCGGCCAA ATACAGCATC ACCTTCACGG 420 

GCAAGTGGAG CCAGACGGCC TTCCCCAAGC AGTACCCCCT GTTCCGCCCC CCTGCGCAGT 480 

GGTCTTCGCT GCTGGGGGCC GCGCATAGCT CCGACTACAG CATGTGGAGG AAGAACCAGT 540 

ACGTCAGTAA CGGGCTGCGC GACTTTGCGG AGCGCGGCGA GGCCTGGGCG CTGATGAAGG 600 
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AGATCGAGGC 
TCCCCAGCGG 
TCTCGTTTGT 
ACCTGTGCGA 
CCGGGACGGA 
CGGTGACCGA 
GGCTGAAGGC 
GGGCCTTCAT 
CCTCAGTTCC 
GCGGAGGCCA 
CCGCCAACAA 
ACTGCGTCTA 
GGCTCCTGTG 
GACCGCGGTG 
GGCATTGGGA 
TCTGCTCTCA 
AATTATGGTC 
CCTGGCTCCC 
CTCTCCCGAG 
GGAAGCGTCA 
TGCTCAC 



GGCGGGGGAG 
CACCGGGCAG 
GGTGCGCATC 
CGGGGACCGT 
CAGCGGCTTC 
GATAAOGTCC 
CCTGCCTCCC 
CCCTCCCGCC 
AGAAACGCCG 
CTGTGGGAGG 
CGGGAGCCCC 
AGACCAGAGC 
CAGGCTCATG 
AGGCCGCGCC 
AACAGCCTCC 
GCCTCCTCCT 
TCCTTATAAG 
ACGTGGTTGC 
GGCGCATCCA 
GTGTTTCCAT 



GCGCTGCAGA 
ACGTCGGCGG 
GTGCCCAGCC 
TGGCGGGAAC 
ACCTTCTCCT 
TCCTCTCCCA 
ATCGCCAGGG 
CCAGTCCTGC 
CTGGACTGCG 
CTCGGGACCA 
TGCCCCGAGC 
CCCGCAGCCC 
CTGCAGGCGG 
GACCATCTCT 
TCCTTTCCCA 
CCTGCAGGAT 
TTATTGCTGC 
AGATACCTCA 



GTTATGGATC 



GCGTGCACGC 
AGCTGGAGGT 
CCGACTGGTT 
AGGCGGCGCT 
CCCCCAACTT 
GCCACCCGGC 
TGACACTGGT 
CCAGCAGGGA 
AGGTCTCCCT 
AGAGCAGGAC 
TCGAAGAAGA 
CTGGGGCCCC 
CCGAGGCACA 
GCACTGAAGG 
ACCTTGCTTC 
AAAGTCATCC 
TCCAGGAGAT 
GACCTGGTGC 
ACTTGAGAAG 
TCTCTGCGTT 



Seq ID NO: C33 DNA Sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..1314 



ATGTTACAGG 
AAACCCCGTA 
CTGAGCCTGG 
TACTTCCTCT 
CTGGACTGTC 
GCAGTGGCAG 
GGGAACTGGT 
AGGCAGATGG 
GATCTGGATG 
GGGCCCTGTC 
AAGACCCCCC 
AGCATCCAGT 
CTCACGGCAG 
GGCTCAGACA 
TTCAACCCCA 
ACTTTCTCAG 
GCCACCCCAC 
GACATACTGC 
GCGTACCAGG 
GACACCTGCC 
GTGGGCATCG 
AAGGTCTCAG 



11 
I 

ATCCTGACAG 
TCCCCATGGA 
CGAGTATCAT 
GCGGGCAGCC 
CCTTGGGGGA 
TCCGCCTCTC 
TCTCTGCCTG 
GCTACAGCAG 
TTGTTGAAAT 
TCTCAGGCTC 
GTGTGGTGGG 
ACGACAAACA 
CCCACTGCTT 
AACTGGGCAG 
TGTACCCCAA 
GCACAGTCAG 
TCTGGATCAT 
TGCAGGCGTC 
GGGAAGTCAC 
AGGGTGACAG 
TTAGCTGGGG 
CCTATCTCAA 



21 
I 

TGATCAACCT 
GACCTTCAGA 
CATTGTGGTT 
TCTCCACTTC 
GGACGAGGAG 
CAAGGACCGA 
TTTCGACAAC 
CAAACCCACT 
CACAGAAAAC 
CCTGGTCTCC 
TGGGGAGGAG 
GCAOGTCTGT 
CAGGAAACAT 
CTTCCCATCC 
AGACAATGAC 
GCCCATCTGT 
TGGATGGGGC 
AGTCCAGGTC 
GGAGAAGATG 
TGGTGGGCCC 
CTATGGCTGC 
CTGGATCTAC 



31 
I 

CTGAACAGCC 
AAGGTGGGGA 
GTCCTCATCA 
ATCCCGAGGA 
CACTGTGTCA 
TCCACACTGC 
TTCACAGAAG 
TTCAGAGCTG 
AGCCAGGAGC 
CTGCACTGTC 
GCCTCTGTGG 
GGAGGGAGCA 
ACCGATGTGT 
CTGGCTGTGG 
ATCGCCCTCA 
CTGCCCTTCT 
TTTACGAAGC 
ATTGACAGCA 
ATGTGTGCAG 
CTGATGTACC 
GGGGGCCCGA 
AATGTCTGGA 



Seq ID NO: C34 DNA Sequence 

Nucleic Acid Accession #i NM_0 03 045.1 

Coding sequence: 148.. 2037 " 



1 
I 

CGATCCTGCC 
CTGAGACATC 
CGTCATATTC 
CAGATGCTGC 
CTGAACACTT 
GTCCTGGCTG 
ATCGCTGOGC 
CCCAAGACGG 
ATCACCGGCT 
TGGAGCGCCA 
ATGACTCTGA 
ATTCTCATCT 
ATATTCACTT 
GGATCGGTTA 
TGTTTGAACA 
TTCTCTGGTG 
ATCGCCACCA 
GCGTCCCTCT 
ATGCCCTACT 
TGGGAAGGTG 
CTAGGTTCCA 
TTTAAATTCT 

ATGTCCATTG 
TACCAGCCAG 
CCAGCAGACC 
GCAGAGATGT 



11 
I 

GGAGCCCCGC 
TTTGCTGCAA 
CAGCTCTGAA 
GGOGGAAGGT 
TTGATCTGGT 
GAGCTGTGGC 
TGGCCTCAGT 
GCTCAGCTTA 
GGAACTTAAT 
CCTTCGACGA 
ACGCCCCCGG 
TGACAGGACT 
GTATTAACGT 
AAAACTGGCA 
ATGACACAAA 
TCCTGTCGGG 
CAGGTGAAGA 
TGATCTGCTT 
TCTGCCTGGA 
CCAAGTACGC 
TGTTTCCCAT 
TAGCCAACQT 
TTGCTGCTGT 
GCACTCTCCT 
AGCAGCCTAA 
AAAATGAATT 
TCTCTTTGAA 



21 
I 

CGCOGCOGGC 
GATCGAGGCT 
CAGCAACATG 
GGTGGACTGT 
GGCCCTOGGG 
CCGTGAGAAT 
GCTGGCTGGC 
CCTCTACAGC 
CCTCTCCTAC 
GCTGATAGGC 
CGTGCTGGCT 
TTTAACTCTT 
CCTGGTCCTG 
GCTCACGGAG 
AGAAGGGAAG 
GGCAGCGACT 
GGTGAAGAAC 
CATCGCCTAC 
CAATAACAGC 
AGTGGCCGTG 
GCCTCGGGTT 
CAATGATAGG 
GATGGCCTTC 
GGCTTACTOG 
CCTGGTATAC 
GGCAAGCACC 
AACCATACTC 




TGCTTCTATG 
CCACAGAAGG 
TTTGGGGTGT 
CCCCTGCCCG 
GGCTCCCTCT 
ATCTATGCCA 
ACCAAAACAC 
CTCTTTGACC 
TTGGTGGCTG 
CAGATGGCCA 
AATGATTCCC 
TCACCCAAAA 



GGTGTTTTCG 
GCAGCGCAGG 
CGTGGGCGTG 
GGACCTGTAC 
CGCCACCATC 
CAACTCCTTC 
GCGGCTGCGA 
CAATGAGATT 
GTGGTOGTCC 
TCGCTACGTC 
GGCTGAGTGC 
CGGAGCCATG 
GGGGGTTTCG 

TTAGGGGCCC 
CCAAGGCTCC 
TGTCCTTCAT 
TCTAGGCTGT 
TGAATAAATG 
TGAATAAAGA 



41 
I 

TCGATGTCAA 
TCCCCATCAT 
AGGTGATTCT 
AGCAGCTGTG 
AGAGCTTCCC 
AGGTGCTGGA 
CTCTCGCTGA 
TGGAGATTGG 
TTCGCATGCG 
TTGCCTGTGG 
ATTCTTGGCC 
TCCTGGACCC 
TCAACTGGAA 
CCAAGATCAT 
TGAAGCTGCA 
TTGATGAGGA 
AGAATGGAGG 
CACGGTGCAA 
GCATCCCGGA 
AATCTGACCA 
GCACCCCAGG 
AGGCTGAGCT 



41 
I 

AAACCTTCCT 
GAGAAGGTGG 
TCCTGCTCAA 
AGACGCGGCT 
CACTGGGTGC 
CCATTGTCAT 
GCGAGTTTGG 
TTGGAGAGCT 
CTTCAAGCGT 
GGGAGTTCTC 
ACATATTCGC 
AGTCGGCCAT 
TGGTGTCAGG 
GGAACACATC 
GTGGATTCAT 
CCTTCGTGGG 
CCATCCCCGT 
CGGCTGCCCT 
ACGCCTTTAA 
GCGCTCTTTC 
TGGCTGAGGA 
CAATAATCGC 
TGAAGGACTT 
CCTGTGTGTT 
GTACTTCCGA 
AGCTGGGGTT 
ACATGGAGCC 



GCGCCCGCCG 
CACTCGCTGG 
GACAGCCTGG 
CCCTACGACG 
CCGCAGGACA 
TACTACCCGC 
CAGAGCCCCA 
GTAGACAGCG 
TGGGGACTGT 
CGGGTCCAGC 
GTCCCTGATA 
GGGTGTCGGG 
CGCTGCTCCT 
GGCCGGCACG 
CCGTGTCCCG 
AGCTACTCTA 
CGTCCAGGGG 
GCTGAGCCCA 
GGGCGGTTTC 
CTATCTCTGT 



51 
I 

ACCCCTGCGC 
CATAGCACTA 
GGATAAATAC 
TGACGGAGAG 
CGAAGGGCCT 
CTCGGCCACA 
GACAGCCTGT 
CCCAGACCAG 
GAACTCAAGT 
GAAGAGCCTG 
TTGGCAGGTC 
CCACTGGGTC 
GGTGCGGGCA 
CATCATTGAA 
GTTCCCACTC 
GCTCACTCCA 
GAAGATGTCT 
TGCAGACGAT 
AGGGGGTGTG 
GTGGCATGTG 
AGTATACACC 
GTAA 




CTTTGACTGC 
GGGGATCGTG 
CACGCTCATG 
GCACGTGGGC 
CGCCAGTCTT 
TGGACTGCTA 
CACATTAGCC 
GGTGGACCTC 
GGTCTTACGG 
CGAGTTAGAT 
TTTACCAGAG 
TTCCAAAATC 



660 

720 

780 

B40 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1S0O 

1560 

1620 

1660 

1740 

1800 

1807 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1060 

1140 

1200 

1260 

1314 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 
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TCTGGGCTAA TTGTGAACAT TTCAACCAGC CTTATAGCTG TTCTCATCAT CACCTTCTGC 1680 

ATTGTGACCG TGCTTGGAAG GGAGGCTCTC ACCAAAGGGQ CGCTGTGGGC AGTCTTTCTG 1740 

CTCGCAGGGT CTGCCCTCCT CTGTGCCGTG GTCACGGGCG TCATCTGGAG GCAGCCCGAG 1800 

AGCAAGACCA AGCTCTCATT TAAGGTTCCC TTCCTGCCAG TGCTCCCCAT CCTGAGCATC 1860 

TTCGTGAACG TCTATCTCAT GATGCAGCTG GACCAGGGCA CCTGGGTCCG GTTTGCTGTG 1920 

TGGATGCTGA TAGGCTTCAT CATCTACTTT GGCTATGGCC TGTGGCACAG CGAGGAGGCG 1980 

TCCCTGGATG CCGACCAAGC AAGGACTCCT GACGGCAACT TGGACCAGTG CAAGTGACGC 2040 

ACAGCCCCGC CCCCCGGAGG TGGCAGCAGC CCCGAGGGAC GCCCCCAGAG GACCGGGAGG 2100 

CACCCCACCC TCCCCACCAG TGCAACAGAA ACCACCTGCG TCCACACCCT CACTGCA 2157 

Seq ID NO: C35 DNA Sequence 

Nucleic Acid Accession 8: NM_002776.1 

Coding sequence : 82 . . 912 

1 11 21 31 41 51 

III III 

ACCAGCGGCA GACCACAGGC AGGGCAGAGG CACGTCTGGG TCCCCTCCCT CCTTCCTATC 60 

GGCGACTCCC AGATCCTGGC CATGAGAGCT CCGCACCTCC ACCTCTCCGC CGCCTCTGGC 120 

GCCCGGGCTC TGGCGAAGCT GCTGCCGCTG CTGATGGCGC AACTCTGGGC CGCAGAGGCG 180 

GCGCTGCTCC CCCAAAACGA CACGCGCTTG GACCCCGAAG CCTATGGCGC CCCGTGCGCG 240 

CGCGGCTCGC AGCCCTGGCA GGTCTCGCTC TTCAACGGCC TCTCGTTCCA CTGCGCGGGT 300 

GTCCTGGTGG ACCAGAGTTG GGTGCTGACG GCCGCGCACT GCGGAAACAA GCCACTGTGG 360 

GCTCGAGTAG GGGATGATCA CCTGCTGCTT CTTCAGGGCG AGCAGCTCOG CCGGACGACT 420 

CGCTCTGTTG TCCATCCCAA GTACCACCAG GGCTCAGGCC CCATCCTGCC AAGGCGAACG 480 

GATGAGCACG ATCTCATGTT GCTAAAGCTG GCCAGGCCCG TAGTGCCGGG GCCCCGCGTC 540 

CGGGCCCTGC AGCTTCCCTA CCGCTGTGCT CAGCCCGGAG ACCAGTGCCA GGTTGCTGGC 600 

TGGGGCACCA CGGCCGCCCG GAGAGTGAAG TACAACAAGG GCCTGACCTG CTCCAGCATC 660 

ACTATCCTGA GCCCTAAAGA GTGTGAGGTC TTCTACCCTG GCGTGGTCAC CAACAACATG 720 

ATATGTGCTG GACTGGACCG GGGCCAGGAC CCTTGCCAGA GTGACTCTGG AGGCCCCCTG 780 

GTCTGTGAGG AGACCCTCCA AGGCATCCTC TCGTGGGGTG TTTACCCCTG TGGCTCTGCC 840 

CAGCATCCAG CTGTCTACAC CCAGATCTGC AAATACATGT CCTGGATCAA TAAAGTCATA 900 

CGCTCCAACT GATCCAGATG CTACGCTCCA GCTGATCCAG ATGTTATGCT CCTGCTGATC 960 

CAGATGCCCA GAGGCTCCAT CGTCCATCCT CTTCCTCCCC AGTCGGCTGA ACTCtCCCCT 1020 

TGTCTGCACT GTTCAAACCT CTGCCGCCCT CCACACCTCT AAACATCTCC CCTCTCACCT 1080 

CATTCCCCCA CCTATCCCCA TTCTCTGCCT GTACTGAAGC TGAAATGCAG GAAGTGGTGG 1140 

CAAAGGTTTA TTCCAGAGAA GCCAGGAAGC CGGTCATCAC CCAGCCTCTG AGAGCAGTTA 1200 

CTGGGGTCAC CCAACCTGAC TTCCTCTGCC ACTCCCCGCT GTGTGACTTT GGGCAAGCCA 1260 

AGTGCCCTCT CTGAACCTCA GTTTCCTCAT CTGCAAAATG GGAACAATGA CGTGCCTACC 1320 

TCTTAGACAT GTTGTGAGGA GACTATGATA TAACATGTGT ATGTAAATCT TCATGTGATT 1380 

GTCATGTAAG GCTTAACACA GTGGGTGGTG AGTTCTGACT AAAGGTTACC TGTTGTCGTG 1440 

AAAAAAAAAA AAAA 1454 



Seq ID NO: C36 DNA Sequence 
Nucleic Acid Accession #i XM__095088 
45 Coding sequence: 1. .4074 



1 11 21 31 41 SI 

I I I I I I 

ATGACCCGGG CCGCGACAGC CGAGCCGGGC CGCGTGTCTC CCGCCTCGCC CGCGCGCAGC 60 

ACGGCGGGCC TGCCGCGCGC CTTCCTGCAG AGCCTGCGCA CCCTGTTAGA CATCCTGGAC 120 

GACTGGCAGC GCGGCTGCGT GCACCTGCGA GAGATCCAGT CCCTCTGGGT CGAAGCGCGG 180 

GAGCTGCCAA GCGGGGTGCT GGAGGGCCTG AGCCAGCGGC GCGGGCCGCA GCCGGGAGCA 240 

GCTGTACGCT CCCGCCGAGG CGGCGCAGTG CCCCGCGGGG CCCGAGCGGT TCCAGAGCGC 300 

TGCGCTGGAA CGGAGACCCG GCGCGGACGC AGGTGCAGCG GCCTGCAGCG CCTTGGAGGC 360 

GGATTCAGGG GATGCCCGGC GGACCCCTGT GCCCGAGGGG AACACCGGAG GCACACCATC 420 

ACCAGCGGCG TGGACTGCGG CCTGCTGAAG CAGATGAAGG AGCTGGAGCA GGAGAAGGAG 480 

GTGCTGCTAC AGGGTTTGGA GATGATGGCG CAGGGCCGCG ATTGGTACCA GCAGCAGCTG 540 

CAACAAGTGC AGGAGOGCCA GTGCCGCCTG GGCCAGAGCA GAGCCAGCGC CGACTTTGGG 600 

GCCGTGGGGA GCCCTCGCCC ACTGGGACGG CTACTGCCCA AGGTACAGGA GGTGGCCCGG 660 

TGGCTGGGGG AGCTGCTGGC TGAGGCCTGT GCCGGTCGGG CCCTGCCCAC ATCCTCCTCC 720 

GGGCCCCCTT GCTCTGCCCT GACGTCCACC TCQT CCCCGG GCTGGCAGCA GCAGATCATC 780 

CTCATGCTGA AGGAGCAGAA CCGACTCCTC ACCCAGGAGG TGACCGAGAA GAGTGAGCGC 840 

ATCACGCAGC TGGAGCAGAA GTCGGCGCTC ATTAAGCAGC TATTTGAGGC CCGCGCCCTG 900 

AGCCAGCAGG ATGGGGGCCT GTCCCCGGCT GGCCCCCACA TTGAGCCCTT GACTCGGTTT 960 

CGGCTTCCGG TGCTGACATG GGCTGGGGCT CTCTTGAGTC CGCATAGTCC GCAGCTACTA 1020 

CTGCCGCTGT CAGCGGACAG TGGGGGACCC CTCCACGAGT TACCAGATAC CTGGTTTCCA 1080 

GCGGTGCTGC TTTGGGTCCC ATCTCCAGGG AAAAGAACTG CTCACGCCAG GCTGCACTTC 1140 

CACCAAAGGC CAGCAGAGGG CGCGTGGCAG CTCGGATGCG GCGCTGAGGC TGCGCCCGAG 1200 

ACCTGCGGGA CGCTGCCCCA CTTTGAGTCC CACAAAACAA CCTGTGAGCC TGACTCCCTC 1260 

GGAGGGCCCT GTCCCCAGGA GGGGGATCGC AGCTGGAGCC ACCTGGGCGC AGCGTTTGAT 1320 

GTGGCACCTG CAGTGGCCAA AGTGACACCC AACCGTGAGG ACGCTGCAGG GAGTCGGCAC 1380 

GGAGACATCT GTCCCCTCTG TCCCAAGGGA CTGTTGACAT TCAGAGACAT AGCTATAGAA 1440 

TTCTCTCTGG CGGAGTGGCA ATGCCTGGAT CATGCTCAGC AGAATTTATA TAGAGATGTG 1500 

ATGTTAGAGA ACTACAGAAA CCTGTTCTCC CTGGGTATGA CTGTCTCTAA GCCAGACTTG 1S60 

ATCGCCTGTC TGGAGCAAAA TAAAGAGCCC CAGAATATAA AGAGAAATGA GATGGCAGCC 1620 

AAACACCCAG TTACATGTTC TCATTTCAAC CAAGACCTTC AGCCAGAGCA GAGCATAAAA 1680 

GATTCACTCC AAAAAGTAAT ACCAAGAACA TATGGAAAAT GTGGACATGA GAATTTACAA 1740 

TTAAAAAAAT GTTGTAAAAG AGTAGATGAG TGTGAGGTGC ACAAAGGAGG TTATAATGAC 1800 

CTTAACCAAT GTTTGTCAAA TACCCAAAAC AAAATATTTC AGACTCATAA ATGTGTCAAA 1860 

GTCTTCAGTA AATTTTCAAA TTCCAATAGA CACAATGCAA GATATACTGG AAAGAAACAT 1920 

TTGAAATGTA AAAAATATGG CAAATCATTT TGCATGTTTT CACACCTAAA TCAACATCAG 1980 

ATAATTCATA CTAAGGAGAA GTCCTACAAA TGTGAAGAAT GTGGCAAATC CTTTAACCAC 2040 

TCCTCAAGOG GTACTACACA TAAAAGAATT CTTACTGGAG AGAAACCCTA CAGATGTGAG 2100 

GAATGTGGCA AAGCCTTTAG GTGGCCCTCA AACCTTACTA GACATAAGAG AATTCACACT 2160 
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GGAGAGAAAC CCTACGCATG TGAAGAATGT GGCCAAGCCT TTAGGCGCTC CTCAACACTT 2220 

ACTAACCACA AGAGAATTCA TACTGGAGAG AGACCCTACA AATGTGAAGA ATGTGGCAAA 2200 

GCCTTTAGCG TATCCTCAGC CCTCATTTAC CACAAGAGAA TTCATACTGG AGAGAAACCC 2340 

C TACACATGTG AAGAATGTGG CAAAGCCTTT AACTGCTCCT CGACTCTTAA GACACATAAG 2400 

J ATAATTCATA CTGGAGAGAA ACCCTACACA TGTGAAGAAT GTGGCAGAAC CTTTAACTGC 2460 

TCCTCAACTG TAAAGGCACA TAAGAGAATT CATACTGGAG AGAAACCATA CAAATGTGAA 2S20 

GAATGTGACA AAGCTTTTAA GTGGCATTCA AGTCTTGCTA AACATAAGAT AATTCACACT 2580 

GGAGAGAAAC CCTACAAATG CAGTGACAGC AAAGCCTTAG CCAAATCATC AGAAGTGCAA 2640 

AAGGTCTACT CTGGAGATGG GGAAAATGGA ATCCGTGTAC ATAAGAAAAA GGAGACACAG 2700 

1U GGCTGGCTTG TGAGAAACAA GAACGAAAAT AGAACAGGGC TGTTCCAGAT CCGGGCTGCC 2760 

GTGAGACCTA ACAGGGACCC TTCATGGGGA CAGCAAGAAG GTTCACTGAC TGACCCAATT 2820 

CAGAGGAAGG AGGAACCTGA CCTTCAAAAT CACTATGACC ATCAGAATGC CTTAGAAGAT 2880 

CAAAGAAATA CTGGAGTGGG TGGACTGTTG ACATTCAGAG ATGTAGTCAT AGAATTCTCT 2940 

CTGGAGGAGT GGCAATGCCT GGATCACGCT CAGCAGAATT TATATAGAGA TGTGATGTTA 3000 

GAGAACTACA GAAACCTGGT CTCCCTGGGT ATTGCTGTCT CTAAGCCAGA CTTGATCACC 3060 

TGTCTGGAGC AAAATAAAGA GCCTTGGAAT ATAAAGAGAA ATGAGATGGT AACCAAACAC 3120 

CCAGACCTTC CGCCAGAGCT AGGCATAAAA GATTCACTCC AAAAAGTAAT ACCAAGAAGA 3180 

TATGGAAAAA GTGGACATGA CAATTTACAA GTAAAAACAT GTAAAAGCAT GGGTGAGTGT 3240 

GAGGTGCAAA AAGGAGGTTG TAATGAAGTT AACCAATGTT TGTCAACTAC CCAAAACAAA 3300 

Z,V ATATTTCAGA CTCATAAATG TGTCAAAGTC TTCGGCAAAT TTTCAAATTC CAATAGACAT 33 60 

AAGACAAGAC ATACTGGAAA GAAACATTTC AAATGTAAAA AATATGGCAA ATCATTTTGC 3420 

ATGGTTTCAC AACTACATCA ACATCAGATA ATTCATACTA GGGAGAATTC CTACCAATGT 3480 

GAAGAATGCG GCAAACCCTT CAACTGCTCT TCAACCCTTT CTAAACATAA AAGAATTCAT 3540 

- ACTGGAGAGA AACCCTACAG ATGTGAGGAA TGTGGCAAAG CTTTTACCTG GTCCTCAACC 3600 
AD CTTACTAAAC ATAGGAGAAT TCATACTGGA GAAAAACCCT ACACATGTGA AGAATGTGGC 3660 

CAAGCCTTTA GCCGCTCCTC AACACTTGCT AACCACAAGA GAATTCATAC TGGAGAGAAA 3720 

CCATACACAT GTGAAGAATG TGGCAAAGCC TTTAGCTTAT CCTCATCCCT CACTTACCAC 3780 

AAGAGAATTC ATACTGGAGA GAAACCCTAC ACATGTGAAG AATGTGGCAA AGCCTTTAAC 3840 

OH TGCTCCTCAA CCCTTAAGAA ACATAAGATA ATTCATACTG GAGAGAAACC CTACAAATGT 3900 

OV AAAGAATGTG GGAAAGCCTT TGCCTTCTCC TCAACTCTTA ATACTCATAA GAGGATTCAT 3960 

ACTGGAGAGG AACCCTACAA ATGTGAAGAA TGTGACAAAG CTTTTAAGTG GTCCTCAAGT 4020 

CTTGCTAATC ATAAGAGTAT GCATACTGGA GAGAAACCCT ACAAATGTGA ATAA 4074 

„ Seq ID NO: C37 DNA Sequence 
J J Nucleic Acid Accession ft* NM 032044 
Coding sequence : 182 . . 658 ~ 

J U 21 31 41 51 

40 AAGATATAAA AGCTCCAGAA ACGTTGACTG GGACCACTGG AGACACTGAA GAAGGCAGGG 60 

GCCCTTAGAG TCTTGGTTGC CAAACAGATT TGCAGATCAA GGAGAACCCA GGAGTTTCAA 120 

AGAAGCGCTA GTAAGGTCTC TGAGATCCTT GCACTAGCTA CATCCTCAGG GTAGGAGGAA 180 

GATGGCTTCC AGAAGCATGC GGCTGCTCCT - ATTGCTGAGC TGCCTGGCCA AAACAGGAGT 240 

AC CCTGGGTGAT ATCATCATGA GACCCAGCTG TGCTCCTGGA TGGTTTTACC ACAAGTCCAA 300 

HJ TTGCTATGGT TACTTCAGGA AGCTGAGGAA CTGGTCTGAT GCCGAGCTCG AGTGTCAGTC 360 

TTACGGAAAC GGAGCCCACC TGGCATCTAT CCTGAGTTTA AAGGAAGCCA GCACCATAGC 420 

AGAGTACATA AGTGGCTATC AGAGAAGCCA GCCGATATGG ATTGGCCTGC ACGACCCACA 480 

GAAGAGGCAG CAGTGGCAGT GGATTGATGG GGCCATGTAT CTGTACAGAT CCTGGTCTGG 540 

C.ft ^^5 CATG GGTGGGAACA AGCACTGTGC TGAGATGAGC TCCAATAACA ACTTTTTAAC 600 

TTGGAGCAGC AACGAATGCA ACAAGCGCCA ACACTTCCTG TGCAAGTACC GACCATAGAG 660 

CAAGAATCAA GATTCTGCTA ACTCCTGCAC AGCCCCGTCC TCTTCCTTTC TGCTAGCCTG 720 

GCTAAATCTG CTCATTATTT CAGAGGGGAA ACCTAGCAAA CTAAGAGTGA TAAGGGCCCT 780 

ACTACACTGG CTTTTTTAGG CTTAGAGACA GAAACTTTAG CATTGGCCCA GTAGTGGCTT 840 

- CTAGCTCTAA ATGTTTGCCC CGCCATCCCT TTCCACAGTA TCCTTCTTCC CTCCTCCCCT 900 
?!^ TCTGGCT GTCT CGAGCA GTCTAGAAGA GTGCATCTCC AGCCTATGAA ACAGCTGGGT 960 
CTTTGGCCAT AAGAAGTAAA GATTTGAAGA CAGAAGGAAG AAACTCAGGA GTAAGCTTCT 1020 
AGACCCCTTC AGCTTCTACA CCCTTCTGCC CTCTCTCCAT TGCCTCCACC CCACCCCAGC 1080 
CACTCAACTC CTGCTTGTTT TTCCTTTGGC CATAGGAAGG TTTACCAGTA GAATCCTTGC 1140 

gQ TAGGTTGATG TGGGCCATAC ATTCCTTTAA TAAACCATTG TGTACATAAG AAAAAAAAAA 1200 



65 



Seq ID NO t C38 DNA Sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 52.. 3 042 



} « 21 31 41 51 

GCTCACCCAG GAAAAATATG CAATCGTCCC ATTGATATAC AGGCCACTAC AATGGATGGA 60 

GTTAACCTCA GCACCGAGGT TGTCTACAAA AAAGGCCAGG ATTATAGGTT TGCTTGCTAC 120 

70 GACCGGGGCA GAGCCTGCCG GAGCTACCGT GTACGGTTCC TCTGTGGGAA GCCTGTGAGG 180 

/U CCCAAACTCA CAGTCACCAT TGACACCAAT GTGAACAGCA CCATTCTGAA CTTGGAGGAT 240 

AATGTACAGT CATGGAAACC TGGAGATACC CTGGTCATTG CCAGTACTGA TTACTCCATG 300 

TACCAGGCAG AAGAGTTCCA GGTGCTTCCC TGCAGATCCT GCGCCCCCAA CCAGGTCAAA 360 

GTGGCAGGGA AACCAATGTA CCTGCACATC GGGGAGGAGA TAGACGGCGT GGACATGCGG 420 

7C 2255^ GGCTTCTGAG CCGGAACATC ATAGTGATGG GGGAGATGGA GGACAAATGC 480 

I^S ACA GAAACCACAT CTGCAATTTC TTTGACTTCG ATACCTTTGG GGGCCACATC 540 

AAGTTTGCTC TGGGATTTAA GGCAGCACAC TTGGAGGGCA CGGAGCTGAA GCATATGGGA 600 

^^ TGG TGGGTCAGTA CCCGATTCAC TTCCACCTGG CCGGTGATGT AGACGAAAGG 660 

GGAGGTTATG ACCCACCCAC ATACATCAGG GACCTCTCCA TCCATCATAC ATTCTCTCGC 720 

Of) TCCATGGCTC CAATGGCTTG TTGATCAAGG ACGTTGTGGG CTATAACTCT 780 

JZS^?^ 07 GCTTCTTCAC . GGAAGATGGG CCGGAGGAAC GCAACACTTT TGACCACTGT 840 

CTTGGCCTCC TTGTCAAGTC TGGAACCCTC CTCCCCTCGG ACCGTGACAG CAAGATGTGC 900 

AAGATGATCA CAGGAGACTC CTACCCAGGG TACATCCCCA AGCCCAGGCA AGACTGCAAT 960 

GCTGTGTCCA CCTTCTGGAT GGCCAATCCC AACAACAACC TCATCAACTG TGCCGCTGCA 1020 
GGATCTGAGG AAACTGGATT TTGGTTTATT TTTCACCACG TACCAACGGG CCCCTCCGTG 
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GGAATGTACT CCCCAGGTTA TTCAGAGCAC ATTCCACTGG GAAAATTCTA TAACAACOGA 1140 

GCACATTCCA ACTACCGGGC TGGCATGATC ATAGACAACG GAGTCAAAAC CACCGAGGCC 1200 

TCTGCCAAGG ACAAGCGGCC GTTCCTCTCA ATCATCTCTG CCAGATACAG CCCTCACCAG 1260 

GACGCCGACC CGCTGAAGCC CCGGGAGCCG GCCATCATCA GACACTTCAT TGCCTACAAG 1320 

AACCAGGACC ACGGGGCCTG GCTGCGCGGC GGGGATGTGT GGCTGGACAG CTGCCGGTTT 1380 

GCTGACAATG GCATTGGCCT GACCCTGGCC AGTGGTGGAA CCTTCCCGTA TGACGACGGC 1440 

TCCAAGCAAG AGATAAAGAA CAGCTTGTTT GTTGGCGAGA GTGGCAACGT GGGGAGGGAA 1500 

ATGATGGACA ATAGGATCTG GGGCCCTGGC GGCTTGGACC ATAGCGGAAG GACCCTCCCT 1560 

ATAGGCCAGA ATTTTCCAAT TAGAGGAATT CAGTTATATG ATGGCCCCAT CAACATCCAA 1620 

10 AACTGCACTT TCCGAAAGTT TGTGGCCCTG GAGGGCCGGC ACACCAGCGC CCTGGCCTTC 1690 

CGCCTGAATA ATGCCTGGCA GAGCTGCCCC CATAACAACG TGACOGGCAT TGCCTTTGAG 1740 

GACGTTCCGA TTACTTCCAG AGTGTTCTTC GGAGAGCCTG GGCCCTGGTT CAACCAGCTG 1800 

GACATGGATG GGGATAAGAC ATCTGTGTTC CATGACGTCG ACGGCTCCGT GTCCGAGTAC I860 

CCTGGCTCCT ACCTCACGAA GAATGACAAC TGGCTGGTCC GGCACCCAGA CTGCATCAAT 1920 

GTTCCCGACT GGAGAGGGGC CATTTGCAGT GGGTGCTATG CACAGATGTA CATTCAAGCC I960 

TACAAGACCA GTAACCTGCG AATGAAGATC ATCAAGAATG ACTTCCCCAG CCACCCTCTT 2040 

TACCTGGAGG GGGCGCTCAC CAGGAGCACC CATTACCAGC AATACCAACC GGTTGTCACC 2100 

CTGCAGAAGG GCTACACCAT CCACTGGGAC CAGACGGCCC CCGCCGAACT CGCCATCTGG 2160 

CTCATCAACT TCAACAAGGG CGACTGGATC CGAGTGGGGC TCTGCTACCC GCGAGGCACC 2220 

ACATTCTCCA TCCTCTCGGA TGTTCACAAT CGCCTGCTGA AGCAAACGTC CAAGACGGGC 2280 

GTCTTCGTGA GGACCTTGCA GATGGACAAA GTGGAGCAGA GCTACCCTGG CAGGAGCCAC 2340 

TACTACTGGG ACGAGGACTC AGGGCTGTTG TTCCTGAAGC TGAAAGCTCA GAACGAGAGA 2400 

GAGAAGTTTG CTTTCTGCTC CATGAAAGGC TGTGAGAGGA TAAAGATTAA AGCTCTGATT 2460 

CGAAAGAACG CAGGCGTCAG TGACTGCACA GCCACAGCTT ACCCCAAGTT CACCGAGAGG 2520 

GCTGTCGTAG ACGTGOCGAT GCCCAAGAAG CTCTTTGGTT CTCAGCTGAA AACAAAGGAC 2580 

CATTTCTTGG AGGTGAAGAT GGAGAGTTCC AAGCAGCACT TCTTCCACCT CTGGAACGAC 2640 

TTCGCTTACA TTGAAGTGGA TGGGAAGAAG TACCCCAGTT OGGAGGATGG CATCCAGGTG 2700 

GTGGTGATTG ACGGGAACCA AGGGCGCGTG GTGAGCCACA CGAGCTTCAG GAACTCCATT 2760 

CTGCAAGGCA TACCATGGCA GCTTTTCAAC TATGTGGCGA CCATCCCTGA CAATTCCATA 2820 

GTGCTTATGG CATCAAAGGG AAGATACGTC TCCAGAGGCC CATGGACCAG AGTGCTGGAA 2880 

AAGCTTGGGG CAGACAGGGG TCTCAAGTTG AAAGAGCAAA TGGCATTCGT TGGCTTCAAA 2940 

GGCAGCTTCC GGCCCATCTG GGTGACACTG GACACTGAGG ATCACAAAGC CAAAATCTTC 3000 

CAAGTTGTGC CCATCCCTGT GGTGAAGAAG AAGAAGTTGT GAGGACAGCT GCCGCCCGGT 3060 

GCCACCTCGT GGTAGACTAT GACGGTGACT CTTGGCAGCA GACCAGTGGG GGATGGCTGG 3120 

GTCCCCCAGC CCCTGCCAGC AGCTGCCTGG GAAGGCCGTG TTTCAGCCCT GATGGGCCAA 3180 

GGGAAGGCTA TCAGAGACCC TGGTGCTGCC ACCTGCCCCT ACTCAAGTGT CTACCTGGAG 3240 

CCCCTGGGGC GGTGCTGGCC AATGCTGGAA ACATTCACTT TCCTGCAGCC TCTTGGGTGC 3300 

TTCTCTCCTA TCTGTGCCTC TTCAGTGGGG GTTTGGGGAC CATATCAGGA GACCTGGGTT 3360 

GTGCTGACAG CAAAGATCCA CTTTGGCAGG AGCCCTGACC CAGCTAGGAG GTAGTCTGGA 3420 

GGGCTGGTCA TTCACAGATC CCCATGGTCT TCAGCAGACA AGTGAGGGTG GTAAATGTAG 3480 

GAGAAAGAGC CTTGGCCTTA AGGAAATCTT TACTCCTGTA AGCAAGAGCC AACCTCACAG 3540 

GATTAGGAGC TGGGGTAGAA CTGGCTATCC TTGGGGAAGA GGCAAGCCCT GCCTCTGGCC 3600 

GTGTCCACCT TTCAGGAGAC TTTGAGTGGC AGGTTTGGAC TTGGACTAGA TGACTCTCAA 3660 

AGGCCCTTTT AGTTCTGAGA TTCCAGAAAT CTGCTGCATT TCACATGGTA CCTGGAACCC 3720 

AACAGTTCAT GGATATCCAC TGATATCCAT GATGCTGGGT GCCCCAGCGC ACACGGGATG 3780 

GAGAGGTGAG AACTAATGCC TAGCTTGAGG GGTCTGCAGT CCAGTAGGGC AGGCAGTCAG 3840 

GTCCATGTGC ACTGCAATGC CAGGTGGAGA AATCACAGAG AGGTAAAATG GAGGCCAGTG 3900 

CCATTTCAGA GGGGAGGCTC AGGAAGGCTT CTTGCTTACA GGAATGAAGG CTGGGGGCAT 3960 

TTTGCTGGGG GGAGATGAGG CAGCCTCTGG AATGGCTCAG GGATTCAGCC CTCCCTGCCG 4020 

CTGCCTGCTG AAGCTGGTGA CTACGGGGTC GCCCTTTGCT CACGTCTCTC TGGCCCACTC 4080 

ATGATGGAGA AGTGTGGTCA GAGGGGAGCA ATGGGCTTTG CTGCTTATGA GCACAGAGGA 4140 

ATTCAGTCCC CAGGCAGCCC TGCCTCTGAC TCCAAGAGGG TGAAGTCCAC AGAAGTGAGC 4200 

TCCTGCCTTA GGGCCTCATT TGCTCTTCAT CCAGGGAACT GAGCACAGGG GGCCTCCAGG 4260 

AGACCCTAGA TGTGCTCGTA CTCCCTGGGC CTGGGATTTC AGAGCTGGAA ATATAGAAAA 4320 

5 J TATCTAGCCC AAAGCCTTCA TTTTAACAGA TGGGGAAAGT GAGCCCCCAA GATGGGAAAG 4380 

AACCACACAG CTAAGGGAGG GCCTGGGGAG CCCCACCCTA GCCCTTGCTG CCACACCACA 4440 

TTGCCTCAAC AACCGGCCCC AGAGTGCCCA GGCACTCCTG AGGTAGCTTC TGGAAATGGG 4500 

GACAAGTCCC CTCGAAGGAA AGGAAATGAC TAGAGTAGAA TGACAGCTAG CAGATCTCTT 4560 

CCCTCCTGCT CCCAGCGCAC ACAAACCCGC CCTCOCCTTG GTGTTGGCGG TCCCTGTGGC 4620 

CTTCACTTTG TTCACTACCT GTCAGCCCAG CCTGGGTGCA CAGTAGCTGC AACTCCCCAT 4680 

TGGTGCTACC TGGCTCTCCT GTCTCTGCAG CTCTACAGGT GAGGCCCAGC AGAGGGAGTA 4740 

GGGCTCGCCA TGTTTCTGGT GAGCCAATTT GGCTGATCTT GGGTGTCTGA ACAGCTATTG 4800 

GGTCCACCCC AGTCCCTTTC AGCTGCTGCT TAATGCCCTG CTCTCTCCCT GGCCCACCTT 4860 

ATAGAGAGCC CAAAGAGCTC CTGTAAGAGG GAGAACTCTA TCTGTGGTTT ATAATCTTGC 4920 

ACGAGGCACC AGAGTCTCCC TGGGTCTTGT GATGAACTAC ATTTATCCCC TTTCCTGCCC 4980 

CAACCACAAA CTCTTTCCTT CAAAGAGGGC CTGCCTGGCT CCCTCCACCC AACTGCACCC 5040 

ATGAGACTCG GTCCAAGAGT CCATTCCCCA GGTGGGAGCC AACTGTCAGG GAGGTCTTTC 5100 

CCACCAAACA TCTTTCAGCT GCTGGGAGGT GACCATAGGG CTCTGCTTTT AAAGATATGG 5160 

CTGCTTCAAA GGCCAGAGTC ACAGGAAGGA CTTCTTCCAG GGAGATTAGT GGTGATGGAG 5220 

AGGAGAGTTA AAATGACCTC ATGTCCTTCT TGTCCACGGT TTTGTTGAGT TTTCACTCTT 5280 

CTAATGCAAG GGTCTCACAC TGTGAACCAC TTAGGATGTG ATCACTTTCA GGTGGCCAGG 5340 

AATGTTGAAT GTCTTTGGCT CAGTTCATTT AAAAAAGATA TCTATTTGAA AGTTCTCAGA 5400 

GTTGTACATA TGTTTCACAG TACAGGATCT GTACATAAAA GTTTCTTTCC TAAACCATTC 5460 

ACCAAGAGCC AATATCTAGG CATTTTCTTG GTAGCACAAA TTTTCTTATT GCTTAGAAAA 5520 

TTGTCCTCCT TGTTATTTCT GTTTGTAAGA CTTAAGTGAG TTAGGTCTTT AAGGAAAGCA 5580 

ACGCTCCTCT GAAATGCTTG TCTTTTTTCT GTTGCCGAAA TAGCTGGTCC TTTTTCGGGA 5640 

GTTAGATGTA TAGAGTGTTT GTATGTAAAC ATTTCTTGTA GGCATCACCA TGAACAAAGA 5700 

TATATTTTCT ATTTATTTAT TATATGTGCA CTTCAAGAAG TCACTGTCAG AGAAATAAAG 5760 

AATTGTCTTA AATGTCAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAA 5808 
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I I I I I I 

GTGGCCTCGA GGTGGTGGCA GGGCCGCCCC CTGCAGTCCG GAGAOGAACG CACGGACCGG 60 

GCCTCCGGAG GCAGGTTCGG CTGGAAGGAA CCGCTCTCGC TTCGTCCTAC ACTTGCGCAA 120 

ATGTCTCCGA GCTTACTCAC AT AG CAT ATT GGTATATCAA AATGAAATGC AAGGAACCAA 180 

AAATAACATA ATTGAAGGCA GTAAAAGTGA AATTAAATAG GAAGATCATC AGTCAAGGAA 240 

GACCCACTGG AGAGGACAGA AAATGAAGCA GTGTTTTATC ATGTGTATTT CAGCAGGTCT 300 

TCTTGAAATT TAACTAAAAA TATGACTGCT CTCTCTTCAG AGAACTGCTC TTTTCAGTAC 3 GO 

CAGTTACGTC AAACAAACCA GCCCCTAGAC GTTAACTATC TGCTATTCTT GATCATACTT 420 

GGGAAAATAT TATTAAATAT CCTTACACTA GGAATGAGAA QAAAAAACAC CTGTCAAAAT 480 

TTTATGGAAT ATTTTTGCAT TTCACTAGCA TTCGTTGATC TTTTACTTTT GGTAAACATT 540 

TCCATTATAT TGTATTTCAG GGATTTTGTA CTTTTAAGCA TTAGGTTCAC TAAATACCAC 600 

ATCTGCCTAT TTACTCAAAT TATTTCCTTT ACTTATGGCT TTTTGCATTA TCCAGTTTTC 660 

CTGACAGCTT GTATAGATTA TTGCCTGAAT TTCTCTAAAA CAACCAAGCT TTCATTTAAG 720 

TGTCAAAAAT TATTTTATTT CTTTACAGTA ATTTTAATTT GGATTTCAGT CCTTGCTTAT 780 

GTTTTGGGAG ACCCAGCCAT CTACCAAAGC CTGAAGGCAC AGAATGCTTA TTCTCGTCAC 840 

TGTCCTTTCT ATGTCAGCAT TCAGAGTTAC TGGCTGTCAT TTTTCATGGT GATGATTTTA 900 

TTTGTAGCTT TCATAACCTG TTGGGAAGAA GTTACTACTT TGGTACAGGC TATCAGGATA 960 

ACTTCCTATA TGAATGAAAC TATCTTATAT TTTCCTTTTT CATCCCACTC CAGTTATACT 1020 

GTGAGATCTA AAAAAATATT CTTATCCAAG CTCATTGTCT GTTTTCTCAG TACCTGGTTA 10 GO 

CCATTTGTAC TACTTCAGGT AATCATTGTT TTACTTAAAG TTCAGATTCC AGCATATATT 1140 

GAGATGAATA TTCCCTGGTT ATACTTTGTC AATAGTTTTC TCATTGCTAC AGTGTATTGG 1200 

TTTAATTGTC ACAAGCTTAA TTTAAAAGAC ATTGGATTAC CTTTGGATCC ATTTGTCAAC 1260 

TGGAAGTGCT GCTTCATTCC ACTTACAATT CCTAATCTTG AGCAAATTGA AAAGCCTATA 1320 

TCAATAATGA TTTGTTAATA TTATTAATTA AAAGTTACAG CTGT CAT AAG ATCATAATTT 1380 

TATGAACAGA AAGAACTCAG GACATATTAA AAAATAAACT GAACTAAAAC AACTTTTGCC 1440 

CCCTGACTGA TAGCATTTCA GAATGTGTCT TTTGAAGGGC TATACCAGTT ATTAAATAGT 1500 

GTTTTATTTT AAAAACAAAA TAATTCCAAG AAGTTTTTAT AGTTATTCAG GGACACTATA 1560 

TTACAAATAT TACTTTGTTA TTAACACAAA AAGTGATAAG AGTTAACATT TGGCTATACT 1620 

GATGTTTGTG TTACTCAAAA AAACTACTGG ATGCAAACTG TTATGTAAAT CTGAGATTTC 1680 

ACTGACAACT TTAAGATATC AACCTAAACA TTTTTATTAA ATGTTCAAAT GTAAGCAAGA 1740 

AAAAAAAAA 1749 

Seq 10 NO: C40 DMA Sequence 
Kucleic Acid Accession #: BC012089 
Coding sequence: 1..2571 



1 11 21 31 41 51 

I I I I I I 

ATGGCCCTCG TACTCGGCTC CCTGTTGCTG CTGGGGCTGT GCGGGAACTC CTTTTCAGGA 60 

GGGCAGCCTT CATCCACAGA TGCTCCTAAG GCTTGGAATT ATGAATTGCC TGCAACAAAT 120 

TATGAGACCC AAGACTCCCA TAAAGCTGGA CCCATTGGCA TTCTCTTTGA ACTAGTGCAT 180 

ATCTTTCTCT ATGTGGTACA GCCGCGTGAT TTCCCAGAAG ATACTTTGAG AAAATTCTTA 240 

CAGAAGGCAT ATGAATCCAA AATTGATTAT GACAAGATTG TCTACTATGA AGCAGGGATT 300 

ATTCTATGCT GTGTCCTGGG GCTGCTGTTT ATTATTCTGA TGCCTCTGGT GGGGTATTTC 360 

TTTTGTATGT GTCGTTGCTG TAACAAATGT GGTGGAGAAA TGCACCAGCG ACAGAAGGAA 420 

AATGGGCCCT TCCTGAGGAA ATGCTTTGCA ATCTCCCTGT TGGTGATTTG TATAATAATA 480 

AGCATTGGCA TCTTCTATGG TTTTGTGGCA AATCACCAGG TAAGAACCCG GATCAAAAGG 540 

AGTCGGAAAC TGGCAGATAG CAATTTCAAG GACTTGCGAA CTCTCTTGAA TGAAACTCCA 600 

GAGCAAATCA AATATATATT GGCCCAGTAC AACACTACCA AGGACAAGGC GTTCACAGAT 660 

CTGAACAGTA TCAATTCAGT GCTAGGAGGC GGAATTCTTG ACCGACTGAG ACCCAACATC 720 

ATCCCTGTTC TTGATGAGAT TAAGTCCATG GCAACAGCGA TCAAGGAGAC CAAAGAGGCG 780 

TTGGAGAACA TGAACAGCAC CTTGAAGAGC TTGCACCAAC AAAGTACACA GCTTAGCAGC 840 

AGTCTGACCA GCGTGAAAAC TAGCCTGCGG TCATCTCTCA ATGACCCTCT GTGCTTGGTG 900 

CATCCATCAA GTGAAACCTG CAACAGCATC AGATTGTCTC TAAGCCAGCT GAATAGCAAC 960 

CCTGAACTGA GGCAGCTTCC ACCCGTGGAT GCAGAACTTG ACAACGTTAA TAACGTTCTT 1020 

AGGACAGATT TGGATGGCCT GGTCCAACAG GGCTATCAAT CCCTTAATGA TATACCTGAC 1080 

AGAGTACAAC GCCAAACCAC GACTGTCGTA GCAGGTATCA AAAGGGTCTT GAATTCCATT 1140 

GGTTCAGATA TCGACAATGT AACTCAGCGT CTTCCTATTC AGGATATACT CTCAGCATTC 1200 

TCTGTTTATG TTAATAACAC TGAAAGTTAC ATCCACAGAA ATTTACCTAC ATTGGAAGAG 1260 

TATGATTCAT ACTGGTGGCT GGGTGGCCTG GTCATCTGCT CTCTGCTGAC CCTCATCGTG 1320 

ATTTTTTACT ACCTGGGCTT ACTGTGTGGC GTGTGCGGCT ATGACAGGCA TGCCACCCCG 1380 

ACCACCCGAG GCTGTGTCTC CAACACCGGA GGCGTCTTCC TCATGGTTGG AGTTGGATTA 1440 

AGTTTCCTCT TTTGCTGGAT ATTGATGATC ATTGTGGTTC TTACCTTTGT CTTTGGTGCA 1500 

AATGTGGAAA AACTGATCTG TGAACCTTAC ACGAGCAAGG AATTATTCCG GGTTTTGGAT 1560 

ACACCCTACT TACTAAATGA AGACTGGGAA TACTATCTCT CTGGGAAGCT ATTTAATAAA 1620 

TCAAAAATGA AGCTCACTTT TGAACAAGTT TACAGTGACT GCAAAAAAAA TAGAGGCACT 1680 

TACGGCACTC TTCACCTGCA GAACAGCTTC AATATCAGTG AACATCTCAA CATTAATGAG 1740 

CATACTGGAA GCATAAGCAG TGAATTGGAA AGTCTGAAGG TAAATCTTAA TATCTTTCTG 1800 

TTGGGTGCAG CAGGAAGAAA AAACCTTCAG GATTTTGCTG CTTGTGGAAT AGACAGAATG 1860 
AATTATGACA GCTACTTGGC TCAGACTGGT AAATCCCCCG CAGGAGTGAA TCTTTTATCA % 1920 
TTTGCATATG ATCTAGAAGC AAAAGCAAAC AGTTTGCCCC CAGGAAATTT GAGGAACTCC % 1980 

CTGAAAAGAG ATGCACAAAC TATTAAAACA ATTCACCAGC AACGAGTCCT TCCTATAGAA 2040 

CAATCACTGA GCACTCTATA CCAAAGCGTC AAGATACTTC AACGCACAGG GAATGGATTG 2100 

IJ TTGGAGAGAG TAACTAGGAT TCTAGCTTCT CTGGATTTTG CTCAGAACTT CATCACAAAC 2160 

AATACTTCCT CTGTTATTAT TGAGGAAACT AAGAAGTATG GGAGAACAAT AATAGGATAT 2220 

TTTGAACATT ATCTGCAGTG GATCGAGTTC TCTATCAGTG AGAAAGTGGC ATCGTGCAAA 2280 

CCTGTGGCCA CCGCTCTAGA TACTGCTGTT GATGTCTTTC TGTGTAGCTA CATTATCGAC 2340 

CCCTTGAATT TGTTTTGGTT TGGCATAGGA AAAGCTACTG TATTTTTACT TCCGGCTCTA 2400 

OU ATTTTTGCGG TAAAACTGGC TAAGTACTAT CGTCGAATGG ATTCGGAGGA CGTGTACGAT 2460 

GATGTTGAAA CTATACCCAT GAAAAATATG GAAAATGGTA ATAATGGTTA TCATAAAGAT 2520 

CATGTATATG GTATTCACAA TCCTGTTATG ACAAGCCCAT CACAACATTG A 2S71 

Seq ID NO: C41 DMA Sequence 
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CCACGCGTCC 
CTCCTTTCTG 
GAAACTGCGA 
GAAACTGCTA 
CCCATAATTA 
ACACATAGTT 
GTAAATTCAT 
ACAATGGTTC 
CAATCATCAG 
GGTCCCAGCA 
CATAATACAA 
AAAGGAAAGG 
GAGAAACATT 
GTATTTGGCA 
CCAAGATCTG 
GCAGAAACCA 
AGTAGCTCAA 
AACCAGACTG 
AGGCCTAACC 
AACGCACAGC 
GTGCCCGGCT 
GGACTCGACT 
ATTGTCATTC 
AAGCATATTO 
ACAGGCTTCA 
TCCAGAGACA 
TATTAGAATC 
TTAGAGTGTT 

AATCCAGCTC 
CTTGTCAGAG 
TTTTCTTAAT 
CCCAGCATTG 
GGAGACATAC 
CAAGGGTCAC 
ATGGAGAGAT 
AGACTGGAAA 
TTTTTGATGC 
TTTTTGAAAT 
AGAGAGGCAA 
CTTCTTCCTT 
GAAGACTGGC 
TTTGAAATTG 
CCAGCCGTCC 
CTGGTGACAG 
ACAAATGGGG 
ATTTCTAATG 
TGTTAATTGT 
AAAAAAA 



11 
I 

GAGCAAGAAC 
TAAACACAGC 
CTAGTGGTCC 
GCACCACAGC 
GTACACATAG 
CCTCCACAAT 
TAGCTACCTC 
CTTCTGAAAC 
GGCCTCCCAC 
ATCCTTGCCA 
GTTTTTGCCT 
TATTCCCTGG 
CCATGGCCTA 
CATCTGTTTA 
AAATGCGTGC 
CAAGTGACAA 
GCAACTTTCT 
CGGATGACTG 
CACAGAGCCC 
ACAAGCAATG 
ACCAGGAAGA 
GTAAGGACAA 
TCAGCATGAT 
AAGAAGAGAA 
CCAATCTTGG 
GCCAGATGCA 
ATAAGAATGT 
TAGAAAGACT 
ATCTGACATC 
ACTTGCTAAA 
AGGTGGTTTT 
TCTTTTCCTG 
CTTGCTCCTG 
GCATCAACTG 
AAGCTCACTC 
TCTGGCAACC 
GAGACATATC 
TAATTAGAAG 
GCTAGGAGGC 
AGGAAACTGG 
AGGTCCCCTC 
TGGGAGAAGG 
GAGACAAACT 
AGCCTCGGGG 
AGGCTCAGCT 
GCAACCACAT 
CCTTGTCCTC 
GAATGGTAAT 



21 
I 

AGCTAAAATG 
CACCAACCAA 
TACAGTAGCT 
AAATACACCT 
TTCCTCCACA 
TCCTATACCT 
TGACATAATC 
ACAAAGTAAC 
TGGCACCGCT 
AGATGATCCC 
GTGTTTAGAA 
GAAGATTTCA 
TCAAGACTTQ 
TGGACAGACT 
TGATGACAAG 
TGAGAAGACT 
AAACTATGAT 
CCTCAATGGT 
TTTCTGOGTT 
CTTAATAAAG 
TGCTAATGGG 
ATTTCAGCTG 
AATTGCATTG 
CTTGATTGAC 
AGCAGAAGGG 
AAATCCCTAT 
GGAACCCGCC 
GATGGAGAAG 
TGCCAGCCTC 
TAAGAATCTA 
CTTCAATCAG 
GTAGGGCAAC 
GGCAAACATT 
TGGAGGTCCG 
TCTGACAAGT 
TTTGAACAGC 
TCTCAGCTTT 
GCCTGGACTA 
AGAAGGGGCA 
GTGGGGAGGA 
CTCCATCAGC 
TTTAAAAACA 
GTCTGGCAAA 
TGTAGGTTTC 
GTGGCCACCA 
CCAGTACAAG 
TTAAAAGTTA 
GCAATAAAGT 



3X 
I 

AAAGCCATCA 
GGCAACTCAG 
GCAGCTGATA 
TCTTTCCCAA 
ATTCCTACAC 
ACTGCTGCAG 
ACCGCTTCAT 
AATGAAATGT 
TTATTGGAGA 
TGTGCAGATA 
GGGTATTACT 
GTGACAGTAT 
CATAGTGAAA 
GTAATTCTTA 
TTTGTTAATG 
GTGACTGAGA 
TTGACCCTTC 
TTAGCATGCG 
GCTTCCAGTC 
AAGAGTGGTG 
AACTGCCAAA 
ATCCTCACTA 
ATTGTCACAG 
GAAGACTTTC 
AGCGTCTTTC 
TCAAGACACA 
ATGGCCCCCA 
TGAGCACCAG 
TCTGAATGGA 
TGACATTAAA 
TACAAAGTAC 
AAGAACCATT 
GCTCTTGAGT 
AGGGGATGAG 
CAGAATAGGG 
CCAGAGCTTG 
TTCAGGAGGC 
AAAATGTCCA 
GAGAGTAAAA 
TCAATTAGAG 
AAAGGAGCAC 
AAAAATGCAG 
GGGTGCGAGA 
TGAGGTGTGC 
ACACACAACC 
CTTTTACAAA 
TTTTATTTGT 
GCCTTTGTTA 



41 



I 

TTCATCTTAC 
CTGATGCTGT 
CCACTGAAAC 
CAGCTACTTC 
CTGCTCCCCC 
ACAGTGAGTC 
CTCCAAATGA 
CCCCCACCAC 
CCAGCACCCT 
ATTCGTTATG 
ACAACTCTTC 
CAGAAACATT 
TTACTAGCTT 
CTGTAAGCAC 
TAACAATAGT 
AAATTAATAA 
GGTGTGATTA 
ATTGCAAATC 
TCAAGTGTCC 
GGGCCCCTGA 
AGTGTGCATT 
TTGTGGGCAC 
CAAGATCAAA 
AAAATCTAAA 
CTAAGGTCAG 
GCAGCATGCC 
ACCAATGTAC 
TAAAGATCTG 
AGTTGTGAAT 
TGTAGTAGAT 
TGAGACAATG 
TCCAATCTAG 
TAAGTGACCT 
AAGGGATACC 
ACACTGCTTC 
CAACCTAGCC 
GTGCCTGGGA 
CTATGGGGTG 
AACATGACCT 
AGGAGGCACC 
TTCTCTAATC 
GAGTAAGAGC 
GGGAGCTTGT 
CATTGGGGCC 
ACACACACAC 
TGTTATTAGT 
TATTATTATT 
GATGGTGAAA 



Seq ID NO: C42 dna Sequence 

Nucleic Acid Accession #: NM_001432.1 

Coding sequence: 167.. 676 



1 
I 

TCACTTGCCT 
TGACAGCCGC 
TCCGCAGCCG 
GAGGATGGAG 
TCTTCTACAG 
TGATAACTGC 
AACAAAGTGT 
GGACATGAGT 
CTTCTTTTTA 
TATTATTTTG 
AAATCGAAAA 
GTTGCCGCAA 
GAATGGCGCC 
CATTTTATTA 
GTACTTGAAA 
TGTGCAGAAA 
GCTAAATGCT 
AGCAATTCCC 
ATATATAGTC 
GGTTTGACTG 
TAATGCAATT 
AAACAAGCTA 
TATATGAGGG 



11 
I 

GATATTTCCA 
TCTCCAGCCA 
CCCTCCGCCA 
ATGCTCTGTG 
GCAGTCCTCA 
ACAGCTTTAG 
AGCTCTGACA 
CAAAACTACT 
ACCGTCCACC 
TTTCTTATCA 
AGTAAAGAAC 
GTCTGAATAT 
ATCAAACTTA 
ATAATATTTA 
AATGTTTTTA 
ATATTTAATA 
TCATTGAAAG 
AGGTCATAGC 
AAATCGATTT 
TACCATGGTT 
AGAATTTGGG 
ACCCTGGGGA 
GCAGTGGACA 



21 
I 

GTGTCAGAGG 
CTGCCGCGAG 
AGCCCCAGCG 
CCGGCAGGGT 
GTACAACTGT 
TTCAGACAGA 
TGAATGGCTA 
GCAGGTGTGA 
AACCTTTAAG 
CAGTOGTCGG 
CAAAGAAGGA 
GAGAGAGTTA 
TGGGCAGGGA 
TGTTGGGTCA 
TTTTTGTTTT 
TCAAAAGAAA 
CTTCAAAGTT 
TCAAGAATTG 
AGTAAGTATG 
TGATATGTAG 
AGAAGCAAAT 
GTCTATGGTC 
GTTCCCTATG 



31 
I 

GACACAGCCA 
CCOGTCTGCT 
CCCGCTCCCA 
CCCTGOGCTG 
GATTCCATCA 
AGACAATCCA 
TTGTTTGCAT 
AGTGGGTTAT 
CAAAGAGTAT 
TTCCACATAT 
ATATGAGAGA 
CCTCAGGGGA 
TAACAGTGTG 
AGTGTTAGGT 
ATTTTTGACA 
ATTGATATTT 
TATATGCCTG 
TTAGCAAATG 
TTTTTTATGT 
TTGGCACCAT 
ATAGGTCCTG 
TCTTCACTCA 
CCAACTCACG 




TCGCCGATGA 
CTGCTCTGCC 
TGTATCCCAG 
OGTGTGGCTC 
GGACAGTGCA 
ACTGGTGTCC 
GTGGCTTTGA 
TATTTCTGCA 
GTTACCTCAG 
TCCAGAGTTG 
CCTGGTTAAT 
CAATAACACT 
GACTATTTGC 
TTATACAAGT 
GTGCACAGTG 
ACAGATTTCT 
TCCTCAAATC 
GGTATCATAT 
TGTTAAACAC 
GGTCTCAGCT 
ACTCCTACAG 



51 
I 

TCTTCTTGCT 
AACAACCACA 
TAATTTCCCT 
ACCTGCTCCC 
CATAATTAGT 
AACCACAAAT 
TGGATTAATC 
AGAAGACAAT 
AAACAGCACA 
TGTTAAGCTG 
TACATGTAAG 
TGACCCAGAA 
GTTTAAAGAT 
ATCTCTGTCA 
AACAATTTTQ 
AGCAATTAGA 
TTATGGCTGT 
TGACCTGCAA 
TGATGCCTGC 
GTGTGCGTGC 
TGGCTACAGT 
CATCGCTGGC 
TAACAAAACG 
ACTGCGGTCG 
GATAACGGCC 
CCGCCCTGAC 
AAGCTATTAT 
GCCTCCGGGG 
GTTTGCAACG 
GCTATTAGOG 
GTTAGGGTTG 
AGGAAAGCTC 
AATTCCCCTG 
CACCACCTTT 
TATCCCTCCA 
TCACCCAAGA 
ATCCAGGAAC 
CACTCTACAG 
GGTAGAAGGA 
TGGGATCCAC 
ATGCCCTCCC 
CTTAGGTCAG 
GCTCAGGAGT 
TCAGCCTTCT 
AACCACACAC 
GTCCTTTTTT 
TGTTCTTGAC 
AAAAAAAAAA 



51 
I 

CCTTCTAGGC 
CCGTGCACTC 
CCGCGGGGAG 
TGGGTTTCCA 
GAGAGTCCAG 
AAGTGTCAAT 
TCTATCTGGT 
GATGTGAACA 
CCGTGATTCT 
GATGGTACAG 
GGGATGCAGA 
CCGCAAGTCT 
ATTAATATTC 
GTATTTTAAT 
TAATGTATAA 
AATTTCCTGA 
CTTAGAAGTA 
GTAAGCCTAT 
AGTGATAATT 
ATTAAAACAA 
TACACATTTG 
ATAATTCTGT 
GTACTAGTCA 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1360 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

25B0 

2640 

2700 

2760 

2820 

2880 

2887 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 
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5 

10 
15 
20 



30 



35 



40 



CTCATCTACC AGATTCTGCC TATGTAAAAT GAATTGAAAA ACAATTTTCT GTAATCTTTT 1440 

ATTTAAGTAG TGGGCATTTC ATAGCTTCAC AATGTTCCTT TTTTGTATAT TACAACATTT 1500 

ATGTGAGGTA ATTATTGCTC AACAGACAAT TAGAAAAAAG TCCACACTTG AAGCCTAAAT 15G0 

TTGTGCTTTT TAAGAATATT TTTAGACTAT TTCTTTTTAT AGGGGCTTTG CTGAATTCTA 1620 

ACATTAAATC ACAGCCCAAA ATTTGATGGA CTAATTATTA TTTTAAAATA TATGAAGACA 16B0 

ATAATTCTAC ATGTTGTCTT AAGATGGAAA TACAGTTATT TCATCTTTTA TTCAAGGAAG 1740 

TTTTAACTTT AATACAG CTC AGTAAATGGC TTCTTCTAGA ATGTAAAGTT ATGTATTTAA 1800 

AGTTGTATCT TGACACAGGA AATGGGAAAA AACTTAAAAA TTAATATGGT GTATTTTTCC 1860 

AAATGAAAAA TCTCAATTGA AAGCTTTTAA AATGTAGAAA CTTAAACACA CCTTCCTGTG 1920 

GAGGCTGAGA TGAAAACTAG GGCTCATTTT CCTGACATTT GTTTATTTTT TGGAAGAGAC 1980 

AAAGATTTCT TCTGCACTCT GAGCCCATAG GTCTCAGAGA GTTAATAGGA GT ATTTTT GG 2040 

GCTATTGCAT AAGGAGCCAC TGCTGCCACC ACTTTTGGAT TTTATGGGAG GCTCCTTCAT 2100 

CGAATGCTAA ACCTTTGAGT AGAGTCTCCC TGGATCACAT ACCAGGTCAG GGAGGATCTG 2160 

TTCTTCCTCT ACGTTTATCC TGGCATGTGC TAGGGTAAAC GAAGGCATAA TAAGCCATGG 2220 

CTGACCTCTG GAGCACCAGG TGCCAGGACT TGTCTCCATG TGTATCCATG CATTATATAC 2280 

CCTGGTGCAA TCACACGACT GTCATCTAAA GTCCTGGCCC TGGCCCTTAC TATTAGGAAA 2340 

ATAAACAGAC AAAAACAAGT AAATATATAT GGTCCTATAC ATATTGTATA TATATTCATA 2400 

TACAAACATG TATGTATACA TGACCTTAAT GGATCATAGA ATTGCAGTCA TTTGGTGCTC 2460 

TGCTAACCAT TTATATAAAA CTTAAAAACA AGAGAAAAGA AAAATCAATT AGATCTAAAC 2520 

AGTTATTTCT GTTTCCTATT TAATATAGCT GAAGTCAAAA TATGTAAGAA CACATTTTAA 2580 

ATACTCTACT TACAGTTGGC CCTCTGTGGT TAGTTCCACA TCTGTGGATT CAACCAACCA 2640 

AGGACG GAAA ATGCTTAAAA AATAATACAA CAACAACAAA AAATACATTA TAACAACTAT 2700 

TTACTTTTTT TTTTTTCTTT TTGAGATGGA GTCTCGCTCT GTTGCCCAGG TTGGAGTGCA 2760 

GTGGCACGAT CTCGGCTCAC TGCAACCTCA CCTOCOGGGT TCAAGAGATC CTCCTGCCTC 2820 

Zj AGCCTCCTGA GCAGCTGGGA CTACAGGCGC ATGCCACCAT GCCCAGCTAA TTTTTGTATT 2880 

TTTAGTAGAG GCGGGGTTTC ACCATGTTGG CCAGGATGGT CTCAATCTCC TAACCTTGAG 2940 

ATCCACCCTC CACAGCCTCC CAAACTGCTG GGATTACAGG CGTGAGCCAC CGCACGTAGC 3000 

ATTTACATTA GGTATTACAA GTAATGTAAA GATGATTTAA GTATACAGGA GGATGTGAAT 3060 

AGGTTATATG CAAGCACTAT GCCCTTTTAT ATAAGTGACT TGAACATCTG TGCCCGATTT 3120 

TAGTATGTGC AGGGGGGCGA TCTGGGAATC AGTCCCCTGT GGATACCAAG GTACAACTGT 3180 

ATTTATTAAC GCTTACTAGA TGTGAGGAGA GTCTGAATAT TTTCAGTGAT CTTGGCTGTT 3240 

TCAAAAAAAT CTATTGACTT TTCAATAAAT CAGCTGCAAT CCATTTATTT CATTTACAAA 3300 

AGATTTATTG TAAGCCTCTC AATCTTGGTT TTTCAGTTGA TCTTAAGCAT GTCAATTCAT 3360 

AAAAACAAGT CATTTTTGTA TTTTTCATCT TTAAGAATGC TTAAAAAAGC TAATCCCTAA 3420 

AATAGTTAGA TCTTTGTAAA TGCATATTAA ATAATAAAGT ATGACCCACA TTACTTTTTA 3480 

TGGGTGAAAA TAAGACAAAA ATAATAGTTT TAGTGAGGAT GGTGCTGAGT AAACATAAAA 3540 

ACTGATTTGC TCTCAGCTGA TGTGTCCTGT ACACAGTGGG AAGATTTTAG TTCACACTTA 3600 

GTCTAACTCC CCCATTTTAC AGATTTCTCA CTATATATAT TTCTAGAAGG GGCTATGCAT 3660 

ATTCAATGTA TTGAGAACCA AAGCAACCAC AAATGCATAA ATGCATAATT TATGGTCTTC 3720 

AACCAAGGCC ACATAATAAC CCAGTTAACT TACTCTTTAA CCAGGAATAT TAAGTTCTAT 3780 

AACTAGTACT CAAGGTTTAA CCTTAAAATT AAGATTTCCT TAACCTTAAC CTTAAAATTG 3840 

ATATTATATT AAACATACAT AATACAATGT AACTCCACTG TTCTCCTGAA TATTTTTTGC 3900 

TCTAATCTCT CTGCCGAAAG TCAAAGTGAT GGGAGAATTG GTATACTGGT ATGACTACGT 3960 

CTTAAGTCAG ATTTTTATTT ATGAGTCTTT GAGACTAAAT TCAATCACCA CCAGGTATCA 4020 

4 J AATCAACTTT TATGCAGCAA ATATATGATT CTAGTGTCTG ACTTTTGTTA AATTCAGTAA 4080 

TGCAG TTTTT AAAAACCTGT ATCTGACCCA CTTTGTAATT TTTGCTCCAA TATCCATTCT 4140 

GTAGACTTTT GAAAAAAAAG TTTTTAATTT GATGCCCAAT ATATTCTGAC CGTTAAAAAA 4200 

TTCTTGTTCA TATGGGAGAA GGGGGAGTAA TGACTTGTAC AAACAGTATT TCTGGTGTAT 4260 

ATTTTAATGT TTTTAAAAAG AGTAATTTCA TTTAAATATC TGTTATTCAA ATTTGATGAT 4320 

GTTAAATGTA ATATAATGTA TTTTCTTTTT ATTTTGCACT CTGTAATTGC ACTTTTTAAG 4380 

TTTGAAGAGC CATTTTGGTA AACGGTTTTT ATTAAAGATG CTATGGAACA TAAAGTTGTA 4440 

TTGCAT GCAA TTTAAAGTAA CTTATTTGAC TATGAATATT ATCGGATTAC TGAATTGTAT 4500 

CAATTTGTTT GTGTTCAATA TCAGCTTTGA TAATTGTGTA CCTTAAGATA TTGAAGGAGA 4560 

AAATAGATAA TTTACAAGAT ATTATTAATT TTTATTTATT TTTCTTGGGA ATTGAAAAAA 4620 

J J ATTGAAATAA ATAAAAATGC ATTGAACATC TTGCATTCAA AATCTTCACT GAC 4673 

Seq ID NO: C43 UNA Sequence 
- n Nucleic Acid Accession #: AF011468.1 
OU Coding sequence : 2 57 . . 14 6 a 

1 11 21 31 41 51 

I 1 I I I I 

GGAAGACTTG GGTCCTTGGG TCGCAGGTGG GAGCCGACGG GTGGGTAGAC CGTGGGGGAT 60 

CO ATCTCAGTGG CGGACGAGGA CGGCGGGGAC AAGGGGCGGC TGGTCGGAGT GGCGGAGCGT 120 

CAAGTCCCCT GTCGGTTCCT CCGTCCCTGA GTGTCCTTGG CGCTGCCTTG TGCCCGCCCA 180 

GCGCCTTTGC ATCCGCTCCT GGGCACCGAG GCGCCCTGTA GGATACTGCT TGTTACTTAT 240 

TACAGCTAGA GGCATCATGG ACCGATCTAA AGAAAACTGC ATTTCAGGAC CTGTTAAGGC 300 

qp. TACAGCTCCA GTTGGAGGTC CAAAACGTGT TCTCGTGACT CAGCAAATTC CTTGTCAGAA 360 

l\J TCCATTACCT GTAAATAGTG GCCAGGCTCA GCGGGTCTTG TGTCCTTCAA ATTCTTCCCA 420 

GCGCGTTCCT TTGCAAGCAC AAAAGCTTGT CTCCAGTCAC AAGCCGGTTC AGAATCAGAA 480 

GCAGAAGCAA TTGCAGGCAA CCAGTGTACC TCATCCTGTC TCCAGGCCAC TGAATAACAC 540 

CCAAAAGAGC AAGCAGCCCC TGCCATCGGC ACCTGAAAAT AATCCTGAGG AGGAACTGGC S00 

ATCAAAACAG AAAAATGAAG AATCAAAAAA GAGGCAGTGG GCTTTGGAAG ACTTTGAAAT 660 

I J TGGTCGCCCT CTGGGTAAAG GAAAGTTTGG TAATGTTTAT TTGGCAAGAG AAAAGCAAAG 720 

CAAGTTTATT CTGGCTCTTA AAGTGTTATT TAAAGCTCAG CTGGAGAAAG CCGGAGTGGA 780 

GCATCAGCTC AGAAGAGAAG TAGAAATACA GTCCCACCTT CGGCATCCTA ATATTCTTAG 840 

ACTGTATGGT TATTTCCATG ATGCTACCAG AGTCTACCTA ATTCTGGAAT ATGCACCACT 900 

~~ TGGAACAGTT TATAGAGAAC TTCAGAAACT TTCAAAGTTT GATGAGCAGA GAACTGCTAC 960 

OU TTATATAACA GAATTGGCAA ATGCCCTGTC TTACTGTCAT TCGAAGAGAG TT ATT CAT AG 1020 

AGACATTAAG CCAGAGAACT TACTTCTTGG ATCAGCTGGA GAGCTTAAAA TTGCAGATTT 1080 

TGGGTGGTCA GTACATGCTC CATCTTCCAG GAGGACCACT CTCTGTGGCA CCCTGGACTA 1140 

CCTGCCCCCT GAAATGATTG AAGGTCGGAT GCATGATGAG AAGGTGGATC TCTGGAGCCT 1200 

TGGAGTTCTT TGCTATGAAT TTTTAGTTGG GAAGCCTCCT TTTGAGGCAA ACACATACCA 1260 
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AGAGACCTAC AAAAGAATAT CACGGGTTGA ATTCACATTC CCTGACTTTG TAACAGAGGG 1320 

AGCCAGGGAC CTCATTTCAA GACTGTTGAA GCATAATCCC AGCCAGAGGC CAATGCTCAG 1380 

AGAAGTACTT GAACACCCCT GGATCACAGC AAATTCATCA AAACCATCAA ATTGCCAAAA 1440 

CAAAGAATCA GCTAGCAAAC AGTCTTAGGA ATCGTGCAGG GGGAGAAATC CTTGAGCCAG 1500 

J GGCTGCCATA TAACCTGACA GGAACATGCT ACTGAAGTTT ATTTTACCAT TGACTGCTGC 1560 

CCTCAATCTA GAACGCTACA CAAGAAATAT TTGTTTTACT CAGCAGGTGT GCCTTAACCT 1620 

CCCTATTCAG AAAGCTCCAC ATCAATAAAC ATGACACTCT GAAGTGAAAG TAGCCACGAG 1680 

AATTGTGCTA CTTATACTGG TTCATAATCT GGAGGCAAGG TTCGACTGCA GCCGCCCCGT 1740 

CAGCCTGTGC TAGGCATGGT GTCTTCACAG GAGGCAAATC CAGAGCCTGG CTGTGGGGAA 1800 

1U AGTGACCACT CTGCCCTGAC CCCGATCAGT TAAGGAGCTG TGCAATAACC TTCCTAGTAC I860 

CTGAGTGAGT GTGTAACTTA TTGGGTTGGC GAAGCCTGGT AAAGCTGTTG GAATGAGTAT 1920 

GTGATTCTTT TTAAGTATGA AAATAAAGAT ATATGTACAG ACTTGTATTT TTTCTCTGGT 1980 

GGCATTCCTT TAGGAATGCT GTGTGTCTGT CCGGCACCCC GGTAGGCCTG ATTGGGTTTC 2040 

TAGTCCTCCT TAACCACTTA TCTCCCATAT GAGAGTGTGA AAAATAGGAA CACGTGCTCT 2100 

13 ACCTCCATTT AGGGATTTGC TTGGGATACA GAAGAGGCCA TGTGTCTCAG AGCTGTTAAG 2160 

GGCTTATTTT TTTAAAACAT TGGAGTCATA GCATGTGTGT AAACTTTAAA TATGCAAATA 2220 

AATAAGTATC TATGTCTAAA AAAAAAAAAA AAA 2253 

Seq ID NO: C44 DNA Sequence 
2U Nucleic Acid Accession #: NM_013372 
Coding sequences 63.. 617 

1 11 21 31 41 51 

or I I I I 1 I 

Z,J GCGGCCGCAC TCAGCGCCAC GCGTCGAAAG CGCAGGCCCC GAGGACCCGC CGCACTGACA 60 

GTATGAGCCG CACAGCCTAC ACGGTGGGAG CCCTGCTTCT CCTCTTGGGG ACCCTGCTGC 120 

CGGCTGCTGA AGGGAAAAAG AAAGGGTCCC AAGGTGCCAT CCCCCCGCCA GACAAGGCCC 180 

AGCACAATGA CTCAGAGCAG ACTCAGTCGC CCCAGCAGCC TGGCTCCAGG AACCGGGGGC 240 

n GGGGCCAAGG GCGGGGCACT GCCATGCCCG GGGAGGAGGT GCTGGAGTCC AGCCAAGAGG 300 

J\) CCCTGCATGT GACGGAGOGC AAATACCTGA AGCGAGACTG GTGCAAAACC CAGCCGCTTA 360 

AGCAGACCAT CCACGAGGAA GGCTGCAACA GTCGCACCAT CATCAACCGC TTCTGTTACG 420 

GCCAGTGCAA CTCTTTCTAC ATCCCCAGGC ACATCCGGAA GGAGGAAGGT TCCTTTCAGT 480 

CCTGCTCCTT CTGCAAGCCC AAGAAATTCA CTACCATGAT GGTCACACTC AACTGCCCTG 540 

AACTACAGCC ACCTACCAAG AAGAAGAGAG TCACACGTGT GAAGCAGTGT CGTTGCATAT 600 

J J CCATCGATTT GGATTAAGCC AAATCCAGGT GCACCCAGCA TGTCCTAGGA ATGCAGCCCC 660 

AGGAAGTCCC AGACCTAAAA CAACCAGATT CTTACTTGGC TTAAACCTAG AGGCCAGAAG 720 

AACCCCCAGC TGCCTCCTGG CAGGAGCCTG CTTGTGCGTA GTTCGTGTGC ATGAGTGTGG 780 

ATGGG TGCCT GTGGGTGTTT TTAGACACCA GAGAAAACAC AGTCTCTGCT AGAGAGCACT 840 

CCCTATTTTG TAAACATATC TGCTTTAATG GGGATGTACC AGAAACCCAC CTCACCCCGG 900 

4U CTCACATCTA AAGGGGCGGG GCCGTGGTCT GGTTCTGACT TTGTGTTTTT GTGCCCTCCT 960 

GGGGAC CAGA ATCTCCTTTC GGAATGAATG TTCATGGAAG AGGCTCCTCT GAGGGCAAGA 1020 

GACCTGTTTT AGTGCTGCAT TCGACATGGA AAAGTCCTTT TAACCTGTGC TTGCATCCTC 10 80 

CTTTCCTCCT CCTCCTCACA ATCCATCTCT TCTTAAGTTG ATAGTGACTA TGTCAGTCTA 1140 

ATCTCTTGTT TGCCAAGGTT CCTAAATTAA TTCACTTAAC CATGATGCAA ATGTTTTTCA 1200 

4 J TTTTGTGAAG ACCCTCCAGA CTCTGGGAGA GGCTGGTGTG GGCAAGGACA AGCAGGATAG 1260 

TGGAGTGAGA AAGGGAGGGT GGAGGGTGAG GCCAAATCAG GTCCAGCAAA AGTCAGTAGG 1320 

GACATTGCAG AAGCTTGAAA GGCCAATACC AGAACACAGG CTGATGCTTC TGAGAAAGTC 1380 

TTTTCCTAGT ATTTAACAGA ACCCAAGTGA ACAGAGGAGA AATGAGATTG CCAGAAAGTG 1440 

ATTAACTTTG GCCGTTGCAA TCTGCTCAAA CCTAACACCA AACTGAAAAC ATAAATACTG 1500 

ACCACTCCTA TGTTCGGACC CAAGCAAGTT AGCTAAACCA AACCAACTCC TCTGCTTTGT 1560 

CCCTCAGGTG GAAAAGAGAG GTAGTTTAGA ACTCTCTGCA TAGGGGTGGG AATTAATCAA 1620 

AAACCKCAGA GGCTGAAATT CCTAATACCT TTCCTTTATC GTGGTTATAG TCAGCTCATT 1680 

TCCATTCCAC TATTTCCCAT AATGCTTCTG AGAGCCACTA ACTTGATTGA TAAAGATCCT 1740 

GCCT CTGCTG AGTGTACCTG ACAGTAAGTC TAAAGATGAR AGAGTTTAGG GACTACTCTG 1800 

OD TTTTAGCAAG ARATATTKTG GGGGTCTTTT TGTTTTAACT ATTGTCAGGA GATTGGGCTA 1860 

RAGAGAAGAC GACGAGAGTA AGGAAATAAA GGGRATTGCC TCTGGCTAGA GAGTAAGTTA 1920 

GGTGTTAATA CCTGGTAGAA ATGTAAGGGA TATGACCTCC CTTTCTTTAT GTGCTCACTG 1980 

AGGATCTGAG GGGACCCTGT TAGGAGAGCA TAGCATCATG ATGTATTAGC TGTTCATCTG 2040 

^_ CTACTGGTTG GATGGACATA ACTATTGTAA CTATTCAGTA TTTACTGGTA GGCACTGTCC 2100 

OU TCTGATTAAA CTTGGCCTAC TGGCAATGGC TACTTAGGAT TGATCTAAGG GCCAAAGTGC 2160 

AGGGTGGGTG AACTTTATTG TACTTTGGAT TTGGTTAACC TCTTTTCTTC AAGCCTGAGG 2220 

TTTTATATAC AAACTCCCTG AATACTCTTT TTGCCTTGTA TCTTCTCAGC CTCCTAGCCA 2280 

AGTCCTATGT AATATGGAAA ACAAACACTG CAGACTTGAG ATTCAGTTGC OGATCAAGGC 2340 

TCTGGCATTC AGAGAACCCT TGCAACTCGA GAAGCTGTTT TTATTTCGTT TTTGTTTTGA 2400 

OJ TCCAGTGCTC TCCCATCTAA CAACTAAACA GGAGCCATTT CAAGGCGGGA GATATTTTAA 2460 

ACACCCAAAA TGTTGGGTCT GATTTTCAAA CTTTTAAACT CACTACTGAT GATTCTCACG 2S20 

CTAGGCGAAT TTGTCCAAAC ACATAGTGTG TGTGTTTTGT ATACACTGTA TGACCCCACC 2580 

CCAAATCTTT GTATTGTCCA CATTCTCCAA CAATAAAGCA CAGAGTGGAT TTAATTAAGC 2640 

_^ ACACAAATGC TAAGGCAGAA TTTTGAGGGT GGGAGAGAAG AAAAGGGAAA GAAGCTGAAA 2700 

i\) ATGTA AAACC ACACCAGGGA GGAAAAATGA CATTCAGAAC CAGCAAACAC TGAATTTCTC 2760 

TTGTTGTTTT AA CTCTGC CA CAAGAATGCA ATTTCGTTAA TGGAGATGAC TTAAGTTGGC 2820 

AGCAGTAATC TTCTTTTAGG AGCTTGTACC ACAGTCTTGC ACATAAGTGC AGATTTGGCT 2880 

CAAGTAAAGA GAATTTCCTC AACACTAACT TCACTGGGAT AATCAGCAGC GTAACTACCC 2940 

TAAAAGCATA TCACTAGCCA AAGAGGGAAA TATCTGTTCT TCTTACTGTG CCTATATTAA 3000 

/*> GACTAGTACA AATGTGGTGT GTCTTCCAAC TTTCATTGAA AATGCCATAT CTATACCATA 30 60 

TTTTATTCGA GTCACTGATG ATGTAATGAT ATATTTTTTC ATTATTATAG TAGAATATTT 3120 

TTATGGCAAG ATATTTGTGG TCTTGATCAT ACCTATTAAA ATAATGCCAA ACACCAAATA 3180 

TGAATTTTAT GATGTACACT TTGTGCTTGG CATTAAAAGA AAAAAACACA CATCCTGGAA 3240 

GTCTGTAAGT TGTTTTTTGT TACTGTAGGT CTTCAAAGTT AAGAGTGTAA GTGAAAAATC 3300 

OU TGGAGGAGAG GATAATTTCC ACTGTGTGGA ATGTGAATAG TTAAATGAAA AGTTATGGTT 3360 

ATTTAATGTA ATTATTACTT CAAATCCTTT GGTCACTGTG ATTTCAAGCA TGTTTTCTTT 3420 

TTCTCCTTTA TATGACTTTC TCTGAGTTGG GCAAAGAAGA AGCTGACACA CCGTATGTTG 3480 

TTAGAGTCTT TTATCTGGTC AGGGGAAACA AAATCTTGAC CCAGCTGAAC ATGTCTTCCT 3540 

GAGTCAGTGC CTGAATCTTT ATTTTTTAAA TTGAATGTTC CTTAAAGGTT AACATTTCTA 3600 
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AAGCAATATT AAGAAAGACT TTAAATGTTA TTTTGGAAGA CTTACGATGC ATGTATACAA 3660 

ACGAATAGCA GATAATGATG ACTAGTTCAC ACATAAAGTC CTTTTAAGGA GAAAATCTAA 3720 

AATGAAAAGT GGATAAACAG AACATTTATA AGTGATCAGT TAATGCCTAA GAGTGAAAGT 3780 

AGTTCTATTG ACATTCCTCA AGATATTTAA TATCAACTGC ATTATGTATT ATGTCTGCTT 3840 

5 AAATCATTTA AAAACGGCAA AGAATTATAT AGACTATGAG GTACCTTGCT GTGTAGGAGG 3900 

ATGAAAGGGG AGTTGATAGT CTCATAAAAC TAATTTGGCT TCAAGTTTCA TGAATCTGTA 3960 

ACTAGAATTT AATTTTCACC CCAATAATGT TCTATATAGC CTTTGCTAAA GAGCAACTAA 4020 

TAAATTAAAC CTATTCTTTC AAAAAAAAA 4049 

10 Seq ID NO: C4S DMA Sequence 

Nucleic Acid Accession 8: Eos sequence 
Coding sequence: 200.. 2932 

1 11 21 31 41 51 

15 I | | | | | 

ATTGCTGATG GATCAGTGAG CCTGTGTTCA TGCCAGTGAG CTGCTGTGGC TCAGATACTG 60 

ATACTTTCTT TCCAAACAGC ATAAGAAGTG ATTGAGCCAC AAGTATACTG AAGGAAGGGC 120 

TCCCTCGAGT TCTGGTGTGA AGAGATAAAT CACCAGTCAC AGACTATGCA CCCGACTGCT 180 

GCTGTTCAGT CCAGGGAAAA TGAAAGTTGG AGTGCTGTGG CTCATTTCTT TCTTCACCTT 240 

CACTGAGGGC CACGGTGGCT TCCTGGGGAA AAATGATGGC ATCAAAACAA AAAAAGAACT 300 

CATTGTGAAT AAGAAAAAAC ATCTAGGCCC AGTCGAAGAA TATCAGCTGC TGCTTCAGGT 360 

GACCTATAGA GATTCCAAGG AGAAAAGAGA TTTGAGAAAT TTTCTGAAGC TCTTGAAGCC 420 

TCCATTATTA TGGTCACATG GGCTAATTAG AATTATCAGA GCAAAGGCTA CCACAGACTG 480 

CAACAGCCTG AATGGAGTCC TGCAGTGTAC CTGTGAAGAC AGCTACACCT GGTTTCCTCC 540 

CTCATGCCTT GATCCCCAGA ACTGCTACCT TCACACGGCT GGAGCACTCC CAAGCTGTGA 600 

ATGTCATCTC AACAACCTCA GCCAGAGTGT CAATTTCTGT GAGAGAACAA AGATTTGGGG 660 

CACTTTCAAA ATTAATGAAA GGTTTACAAA TGACCTTTTG AATTCATCTT CTGCTATATA 720 

CTCCAAATAT GCAAATGGAA TTGAAATTCA ACTTAAAAAA GCATATGAAA GAATTCAAGG 780 

TTTTGAGTCG GTTCAGGTCA CCCAATTTCG AAATGGAAGC ATCGTTGCTG GGTATGAAGT 840 

TGTTGGCTCC AGCAGTGCAT CTGAACTGCT GTCAGCCATT GAACATGTTG CCGAGAAGGC 900 

TAAGACAGCC CTTCACAAGC TGTTTCCATT AGAAGACGGC TCTTTCAGAG TGTTCGGAAA 960 

AGCCCAGTGT AATGACATTG TCTTTGGATT TGGGTCCAAG GATGATGAAT ATACCCTGCC 1020 

CTGCAGCAGT GGCTACAGGG GAAACATCAC AGCCAAGTGT GAGTCCTCTG GGTGGCAGGT 1080 

CATCAGGGAG ACTTGTGTGC TCTCTCTGCT TGAAGAACTG AACAAGAATT TCAGTATGAT 1140 

TGTAGGCAAT GCCACTGAGG CAGCTGTGTC ATCCTTCGTG CAAAATCTTT CTGTCATCAT 1200 

TCGGCAAAAC CCATCAACCA CAGTGGGGAA TCTGGCTTCG GTGGTGTCGA TTCTGAGCAA 1260 

TATTTCATCT CTGTCACTGG CCAGCCATTT CAGGGTGTCC AATTCAACAA TGGAGGATGT 1320 

CATCAGTATA GCTGACAATA TCCTTAATTC AGCCTCAGTA ACCAACTGGA CAGTCTTACT 1380 

GCGGGAAGAA AAGTATGCCA GCTCACGGTT ACTAGAGACA TTAGAAAACA TCAGCACTCT 1440 

GGTGCCTCCG ACAGCTCTTC CTCTGAATTT TTCTCGGAAA TTCATTGACT GGAAAGGGAT 1500 

TCCAGTGAAC AAAAGCCAAC TCAAAAGGGG TTACAGCTAT CAGATTAAAA TGTGTCCCCA 1560 

AAATACATCT ATTCCCATCA GAGGCCGTGT GTTAATTGGG TCAGACCAAT TCCAGAGATC 1620 

CCTTCCAGAA ACTATTATCA GCATGGCCTC GTTGACTCTG GGGAACATTC TACCCGTTTC 1680 

CAAAAATGGA AATGCTCAGG TCAATGGACC TGTGATATCC ACGGTTATTC AAAACTATTC 1740 

CATAAATGAA GTTTTCCTAT TTTTTTCCAA GATAGAGTCA AACCTGAGCC AGCCTCATTG 1800 

TGTGTTTTGG GATTTCAGTC ATTTGCAGTG GAACGATGCA GGCTGCCACC TAGTGAATGA 1860 

AACTCAAGAC ATCGTGACGT GCCAATGTAC TCACTTGACC TCCTTCTCCA TATTGATGTC 1920 

ACCTTTTGTC CCCTCTACAA TCTTCCCCGT TGTAAAATGG ATCACCTATG TGGGACTGGG 1980 

TATCTCCATT GGAAGTCTCA TTTTATGCCT GATCATCGAG GCTTTGTTTT GGAAGCAGAT 2040 

TAAAAAAAGC CAAACCTCTC ACACACGTCG TATTTGCATG GTGAACATAG CCCTGTCCCT 2100 

CTTGATTGCT GATGTCTGGT TTATTGTTGG TGCCACAGTG GACACCACGG TGAACCCTTC 2160 

TGGAGTCTGC ACAGCTGCTG T GTT CT TT AC ACACTTCTTC TACCTCTCTT TGTTCTTCTG 2220 

GATGCTCATG CTTGGCATCC TGCTGGCTTA CCGGATCATC CTCGTGTTCC ATCACATGGC 2280 

CCAGCATTTG ATGATGGCTG TTGGATTTTG CCTGGGTTAT GGGTGCCCTC TCATTATATC 2340 

TGTCATTACC ATTGCTGTCA CGCAACCTAG CAATACCTAC AAAAGGAAAG ATGTGTGTTG 2400 

GCTTAACTGG TCCAATGGAA GCAAACCACT CCTGGCTTTT GTTGTCCCTG CACTGGCTAT 2460 

TGTGGCTGTG AACTTCGTTG TGGTGCTGCT AGTTCTCACA AAGCTCTGGA GGCCGACTGT 2520 

TGGGGAAAGA CTGAGTCGGG ATGACAAGGC CACCATCATC CGCGTGGGGA AGAGCCTCCT 2580 

CATTCTGACC CCTCTGCTAG GGCTCACCTG GGGCTTTGGA ATAGGAACAA TAGTGGACAG 2640 

CCAGAATCTG GCTTGGCATG TTATTTTTGC TTTACTCAAT GCATTCCAGG GATTTTTTAT 2700 

CTTATGCTTT GGAATACTCT TGGACAGTAA GCTGCGACAA CTTCTGTTCA ACAAGTTGTC 2760 

TGCCTTAAGT TCTTGGAAGC AAACAGAAAA GCAAAACTCA TCAGATTTAT CTGCCAAACC 2820 

CAAATTCTCA AAGCCTTTCA ACCCACTGCA AAACAAAGGC CATTATGCAT TTTCTCATAC 2680 

TGGAGATTCC TCCGACAACA TCATGCTAAC TCAGTTTGTC TCAAATGAAT AAGGCAAGGA 2940 

ATCATAAAAT CAAGAAAAAA TTTCCAGAAC AACTTGACAT TTAGAGACAA ATGTCAATGA 3000 

AGAAATTATG CTCAGTATTC GATCGGGTTT TCTGATTTAG GGGTCTGGGA ATAAAACAAG 3060 

AATGTCTCAG TGGCTTCA 3078 
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_ A Seq ID NO: C46 DNA Sequence 

70 Nucleic Acid Accession #: NM 0005B4.1 



75 
80 



Coding sequence : 75 . . 374 

1 11 21 31 41 51" 

I I I I I I 

AGCAGAGCAC ACAAGCTTCT AGGACAAGAG CCAGGAAGAA ACCACCGGAA GGAACCATCT 60 

CACTGTGTGT AAACATGACT TCCAAGCTGG CCGTGGCTCT CTTGGCAGCC TTCCTGATTT 120 

CTGCAGCTCT GTGTGAAGGT GCAGTTTTGC CAAGGAGTGC TAAAGAACTT AGATGTCAGT 180 

GCATAAAGAC ATACTCCAAA CCTTTCCACC CCAAATTTAT CAAAGAACTG AGAGTGATTG 240 

AGAGTGGACC ACACTGCGCC AACACAGAAA TTATTGTAAA GCTTTCTGAT GGAAGAGAGC 300 

TCTGTCTGGA CCCCAAGGAA AACTGGGTGC AGAGGGTTGT GGAGAAGTTT TTGAAGAGGG 360 

CTGAGAATTC ATAAAAAAAT TCATTCTCTG TGGTATCCAA GAATCAGTGA AGATGCCAGT 420 

GAAACTTCAA GCAAATCTAC TTCAACACTT CATGTATTGT GTGGGTCTGT TGTAGGGTTG 480 

CCAGATGCAA TACAAGATTC CTGGTTAAAT TTGAATTTCA GTAAACAATG AATAGTTTTT 540 

CATTGTACCA TGAAATATCC AGAACATACT TATATGTAAA GTATTATTTA TTTGAATCTA 600 
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CAAAAAACAA CAAATAATTT TTAAATATAA GGATTTTCCT AGATATTGCA CGGGAGAATA 660 

TACAAATAGC AAAATTGAGC CAAGGGCCAA GAGAATATCC GAACTTTAAT TTCAGGAATT 720 

GAATGGGTTT GCTAGAATGT GATATTTGAA GCATCACATA AAAATGATGG GACAATAAAT 780 

TTTGCCATAA AGTCAAATTT AGCTGGAAAT CCTGGATTTT TTTCTGTTAA ATCTGGCAAC 840 

CCTAGTCTGC TAGCCAGGAT CCACAAGTCC TTGTTCCACT GTGCCTTGGT TTCTCCTTTA 900 

TTTCTAAGTG GAAAAAGTAT TAGCCACCAT CTTACCTCAC AGTGATGTTG TGAGGACATG 960 

TGGAAGCACT TTAAGTTTTT TCATCATAAC ATAAATTATT TTCAAGTGTA ACTTATTAAC 1020 

CTATTTATTA TTTATGTATT TATTTAAGCA TCAAATATTT GTGCAAGAAT TTGGAAAAAT 1080 

AGAAGATGAA TCATTGATTG AATAGTTATA AAGATGTTAT AGTAAATTTA TTTTATTTTA 1140 

GATATTAAAT GATGTTTTAT TAGATAAATT TCAATCAGGG TTTTTAGATT AAACAAAGAA 1200 

ACAATTGGGT ACCCAGTTAA ATTTTCATTT CAGATAAACA ACAAATAATT TTTTAGTATA 1260 

AGTACATTAT TGTTTATCTG AAAGTTTTAA TTGAACTAAC AATCCTAGTT TGATACTCCC 1320 

AGTCTTGTCA TTGCCAGCTG TGTTGGTAGT GCTGTGTTGA ATTACGGAAT AATGAGTTAG 1380 

AACTATTAAA ACAGCCAAAA CTCCACAGTC AATATTAGTA ATTTCTTGCT GGTTGAAACT 1440 

TGTTTATTAT GTACAAATAG ATTCTTATAA TATTATTTAA ATGACTGCAT TTTTAAATAC 1500 

AAGGCTTTAT ATTTTTAACT TTAAGATGTT TTTATGTGCT CTCCAAATTT TTTTTACTGT 1560 

TTCTGATTGT ATGGAAATAT AAAAGTAAAT ATGAAACATT TAAAATATAA TTTGTTGTCA 1620 

AAGTAAAAAA AAAAAAAAA 1639 

Seq ID NO: C47 DNA Sequence 

Nucleic Acid Accession #: NM_005603.1 

Coding sequence: 1..3756 

25 i 11 21 31 41 SI 

I I I I I I 

ATGAGTACAG AAAGAGACTC AGAAACGACA TTTGACGAGG ATTCTCAGCC TAATGACGAA 60 

GTGGTTCCCT ACAGTGATGA TGAAACAGAA GATGAACTTG ATGACCAGGG GTCTGCTGTT 120 

GAACCAGAAC AAAACCGAGT CAACAGGGAA GCAGAGGAGA ACCGGGAGCC ATTCAGAAAA 180 

J\J GAATGTACAT GGCAAGTCAA AGCAAACGAT CGCAAGTACC ACGAACAACC TCACTTTATG 240 

AACACAAAAT TCTTGTGTAT TAAGGAGAGT AAATATGCGA ATAATGCAAT TAAAACATAC 300 

AAGTACAACG CATTTACCTT TATACCAATG AATCTGTTTG AGCAGTTTAA GAGAGCAGCC 360 

AATTTATATT TCCTGGCTCT TCTTATCTTA CAGGCAGTTC CTCAAATCTC TACCCTGGCT 420 

TGGTACACCA CACTAGTGCC CCTGCTTGTG GTGCTGGGCG TCACTGCAAT CAAAGACCTG 480 

D J GTGGACGATG TGGCTCGCCA TAAAATGGAT AAGGAAATCA ACAATAGGAC GTGTGAAGTC 540 

ATTAAGGATG GCAGGTTCAA AGTTGCTAAG TGGAAAGAAA TTCAAGTTGG AGACGTCATT 600 

CGTCTGAAAA AAAATGATTT TGTTCCAGCT GACATTCTCC TGCTGTCTAG CTCTGAGCCT 660 

AACAGCCTCT GCTATGTGGA AACAGCAGAA CTGGACGGAG AAACCAATTT AAAATTTAAG 720 

Af\ ATGTCACTTG AAATCACAGA CCAGTACCTC CAAAGAGAAG ATACATTGGC TACATTTGAT 780 

4U GGTTTTATTG AATGTGAAGA ACCCAATAAC CGACTAGATA AGTTTACAGG AACACTATTT 840 

TGGAGAAACA CAAGTTTTCC TTTGGATGCT GATAAAATTT TGTTAOGTGG CTGTGTAATT 900 

AGGAACACCG ATTTCTGCCA CGGCTTAGTC ATTTTTGCAG GTGCTGACAC TAAAATAATG 960 

AAGAATAGTG GGAAAACCAG ATTTAAAAGA ACTAAAATTG ATTACTTGAT GAACTACATG 1020 

GTTTACACGA TCTTTGTTGT TCTTATTCTG CTTTCTGCTG GTCTTGCCAT CGGCCATGCT 1080 

40 TATTGGGAAG CACAGGTGGG CAATTCCTCT TGGTACCTCT ATGATGGAGA AGACGATACA 1140 

CCCTCCTACC GTGGATTCCT CATTTTCTGG GGCTATATCA TTGTTCTCAA CACCATGGTA 1200 

CCCATCTCTC TCTATGTCAG CGTGGAAGTG ATTCGTCTTG GACAGAGTCA CTTCATCAAC 1260 

TGGGACCTGC AAATGTACTA TGCTGAGAAG GACACACCCG CAAAAGCTAG AACCACCACA 1320 

n CTCAATGAAC AGCTCGGGCA GATCCATTAT ATCTTCTCTG ATAAGACGGG GACACTCACA 1380 

DU CAAAATATCA TGACCTTTAA AAAGTGCTGT ATCAACGGGC AGATATATGG GGACCATCGG 1440 

GATGCCTCTC AACACAACCA CAACAAAATA GAGCAAGTTG ATTTTAGCTG GAATACATAT 1500 

GCTGATGGGA AGCTTGCATT TTATGACCAC TATCTTATTG AGCAAATCCA GTCAGGGAAA 1560 

GAGCCAGAAG TACGACAGTT CTTCTTCTTG CTCGCAGTTT GCCACACAGT CATGGTGGAT 1620 

. AGGACTGATG GTCAGCTCAA CTACCAGGCA GCCTCTCCCG ATGAAGGTGC CCTGGTAAAC 1680 

GCTGCCAGGA ACTTTGGCTT TGCCTTCCTC GCCAGGACCC AGAACACCAT CACCATCAGT 1740 

GAACTGGGCA CTGAAAGGAC TTACAATGTT CTTGCCATTT TGGACTTCAA CAGTGACCGG 1800 

AAGCGAATGT CTATCATTGT AAGAACCCCA GAAGGCAATA TCAAGCTTTA CTGTAAAGGT 1860 

GCTGACACTG TTATTTATGA ACGGTTACAT CGAATGAATC CTACTAAGCA AGAAACACAG 1920 

GATGCCCTGG ATATCTTTGC AAATGAAACT CTTAGAACCC TATGCCTTTG CTACAAGGAA 1980 

OU ATTGAAGAAA AAGAATTTAC AGAATGGAAT AAAAAGTTTA TGGCTGCCAG TGTGGCCTCC 2040 

ACCAACCGGG ACGAAGCTCT GGATAAAGTA TATGAGGAGA TTGAAAAAGA CTTAATTCTC 2100 

CTGGGAGCTA CAGCTATTGA AGACAAGCTA CAGGATGGAG TTCCAGAAAC CATTTCAAAA 2160 

CTTGCAAAAG CTGACATTAA GATCTGGGTG CTTACTGGAG ACAAAAAGGA AACTGCTGAA 2220 

£~ AATATAGGAT TTGCTTGTGA ACTTCTGACT GAAGACACCA CCATCTGCTA TGGGGAGGAT 2280 

CO ATTAATTCTC TTCTTCATGC AAGGATGGAA AACCAGAGGA ATAGAGGTGG CGTCTACGCA 2340 

AAGTTTGCAC CTCCTGTGCA GGAATCTTTT TTTCCACCCG GTGGAAACCG TGCCTTAATC 2400 

ATCACTGGTT CTTGGTTGAA TGAAATTCTT CTGGAGAAAA AGACCAAGAG AAATAAGATT 2460 

CTGAAGCTGA AGTTCCCAAG AACAGAAGAA GAAAGACGGA TGCGGACCCA AAGTAAAAGG 2520 

AGGCTAGAAG CTAAGAAAGA GCAGCGGCAG AAAAACTTTG TGGACCTGGC CTGCGAGTGC 2580 

/U AGCGCAGTCA TCTGCTGCCG CGTCACCCCC AAGCAGAAGG CCATGGTGGT GGACCTGGTG 2640 

AAGAGGTACA AGAAAGCCAT CACGCTGGCC ATCGGAGATG GGGCCAATGA CGTGAACATG 2700 

ATCAAAACTG CCCACATTGG CGTTGGAATA AGTGGACAAG AAGGAATGCA AGCTGTCATG 2760 

TCGAGTGACT ATTCCTTTGC TCAGTTCCGA TATCTGCAGA GGCTACTGCT GGTGCATGGC 2820 

CGATGGTCTT ACATAAGGAT GTGCAAGTTC CTACGATACT TCTTTTACAA AAACTTTGCC 2880 

O TTTACTTTGG TTCATTTCTG GTACTCCTTC TTCAATGGCT ACTCTGCGCA GACTGCATAC 2940 

GAGGATTGGT TCATCACCCT CTACAACGTG CTGTACACCA GCCTGCCCGT GCTCCTCATG 3000 

GGGCTGCTCG ACCAGGATGT GAGTQACAAA CTGAGCCTCC GATTCCCTGG GTTATACATA 3060 

GTGGGACAAA GAGACTTACT ATTCAACTAT AAGAGATTCT TTGTAAGCTT GTTGCATGGG 3120 

GTCCTAACAT CGATGATCCT CTTCTTCATA CCTCTTGGAG CTTATCTGCA AACCGTAGGG 3180 

(SU CAGGATGGAG AGGCACCTTC CGACTACCAG TCTTTTGCCG TCACCATTGC CTCTGCTCTT 3240 

GTAATAACAG TCAATTTCCA GATTGGCTTG GATACTTCTT ATTGGACTTT TGTGAATGCT 3300 

TTTTCAATTT TTGGAAGCAT TGCACTTTAT TTTGGCATCA TGTTTGACTT TCATAGTGCT 3360 

GGAATACATG TTCTCTTTCC ATCTGCATTT CAATTTACAG GCACAGCTTC AAACGCTCTG 3420 

AGACAGCCAT ACATTTGGTT AACTATCATC CTGACTGTTG CTGTGTGCTT ACTACCCGTC 3480 
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GTTGCCATTC GATTCCTGTC AATGACCATC TGGCCATCAG AAAQTGATAA GATCCAGAAG 3S40 

CATCCCAAGC GGTTGAAGGC GGAGGAGCAG TGGCAGCGAC GGCAGCAGGT GTTCCGCCGG 3600 

GOCGTGTCAA OGCGGCGCTC GGCCTACGCC TTCTCGCACC AGCGGGGCTA CGCGGACCTC 3660 

ATCTCCTCCG GGCGCAGCAT CCGCAAGAAG CGCTCGCCGC TTGATGCCAT CGTGGCGGAT 3720 

J GGCACCGCGG AGTACAGGCG CACCGGGGAC AGCTGA 37S6 
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Seq ID NO: C4B DNA Sequence 
Nucleic Acid Accession #: XM_044533 
Coding sequence: 238.. 2751 



1 11 21 31 41 51 

I I I I I I 

GCTCTGCCCA AGCCGAGGCT GCGGGGCCGG CGCCGGCGGG AGGACTGCGG TGCCCOGCGG 60 

AGGGGCTGAG TTTGCCAGGG CCCACTTGAC CCTGTTTCCC ACCTCCCGCC CCCCAGGTCC 120 

GGAGGCGGGG GCCCCCGGGG CGACTCGGGG GCGGACCGCG GGGCGGAGCT GCCGCCCGTG 180 

AGTCCGGCCG AGCCACCTGA GCCCGAGCCG CGGGACACCG TCGCTCCTGC TCTCCGAATG 240 

CTGCGCACCG CGATGGGCCT GAGGAGCTGG CTCGCCGCCC CATGGGGCGC GCTGCCGCCT 300 

CGGCCACCGC TGCTGCTGCT CCTGCTGCTG CTGCTCCTGC TGCAGCCGCC GCCTCCGACC 360 

TGGGCGCTCA GCCCCCGGAT CAGCCTGCCT CTGGGCTCTG AAGAGCGGCC ATTCCTCAGA 420 

TTCGAAGCTG AACACATCTC CAACTACACA GCCCTTCTGC TGAGCAGGGA TGGCAGGACC 480 

CTGTACGTGG GTGCTCGAGA GGCCCTCTTT GCACTCAGTA GCAACCTCAG CTTCCTGCCA 540 

GGCGGGGAGT ACCAGGAGCT GCTTTGGGGT GCAGACGCAG AGAAGAAACA GCAGTGCAGC 600 

TTCAAGGGCA AGGACCCACA GCGCGACTGT CAAAACTACA TCAAGATCCT CCTGCCGCTC 660 

AGCGGCAGTC ACCTGTTCAC CTGTGGCACA GCAGCCTTCA GCCCCATGTG TACCTACATC 720 

AACATGGAGA ACTTCACCCT GGCAAGGGAC GAGAAGGGGA ATGTCCTCCT GGAAGATGGC 780 

AAGGGCCGTT GTCCCTTCGA CCCGAATTTC AAGTCCACTG CCCTGGTGGT TGATGGCGAG 840 

CTCTACACTG GAACAGTCAG CAGCTTCCAA GGGAATGACC CGGCCATCTC GCGGAGCCAA 900 

AGCCTTCGCC CCACCAAGAC CGAGAGCTCC CTCAACTGGC TGCAAGACCC AGCTTTTGTG 960 

GCCTCAGCCT ACATTCCTGA GAGCCTGGGC AGCTTGCAAG GCGATGATGA CAAGATCTAC 1020 

JU TTTTTCTTCA GCGAGACTGG CCAGGAATTT GAGTTCTTTG AGAACACCAT TGTGTCCCGC 1080 

ATTGCCCGCA TCTGCAAGGG CGATGAGGGT GGAGAGCGGG TGCTACAGCA GCGCTGGACC 1140 

TCCTTCCTCA AGGCCCAGCT GCTGTGCTCA CGGCCCGACG ATGGCTTCCC CTTCAACGTG 1200 

CTGCAGGATG TCTTCACGCT GAGCCCCAGC CCCCAGGACT GGCGTGACAC CCTTTTCTAT 1260 

GGGGTCTTCA CTTCCCAGTG GCACAGGGGA ACTACAGAAG GCTCTGCCGT CTGTGTCTTC 1320 

ACAATGAAGG ATGTGCAGAG AGTCTTCAGC GGCCTCTACA AGGAGGTGAA CCGTGAGACA 1380 

CAGCAGTGGT ACACCGTGAC CCACCCGGTG CCCACACCCC GGCCTGGAGC GTGCATCACC 1440 

AACAGTGCCC GGGAAAGGAA GATCAACTCA TCCCTGCAGC TCCCAGACCG CGTGCTGAAC 1500 

TTCCTCAAGG ACCACTTCCT GATGGACGGG CAGGTCCGAA GCCGCATGCT GCTGCTGCAG 1S60 

CCCCAGGCTC GCTACCAGCG OGTGGCTGTA CACCGCGTCC CTGGCCTGCA CCACACCTAC 1620 

GATGTCCTCT TCCTGGGCAC TGGTGACGGC CGGCTCCACA AGGCAGTGAG CGTGGGCCCC 1680 

CGGGTGCACA TCATTGAGGA GCTGCAGATC TTCTCATCGG GACAGCCCGT GCAGAATCTG 1740 

CTCCTGGACA CCCACAGGGG GCTGCTGTAT GCGGCCTCAC ACTCGGGCGT AGTCCAGGTG 1800 

CCCATGGCCA ACTGCAGCCT GTACAGGAGC TGTGGGGACT GCCTCCTCGC CCGGGACCCC 1860 

TACTGTGCTT GGAGCGGCTC CAGCTGCAAG CAOGTCAGCC TCTACCAGCC TCAGCTGGCC 1920 

ACCAGGCCGT GGATCCAGGA CATCGAGGGA GCCAGCGCCA AGGACCTTTG CAGCGCGTCT 1980 

TCGGTTGTGT CCCCGTCTTT TGTACCAACA GGGGAGAAGC CATGTGAGCA AGTCCAGTTC 2040 

CAGCCCAACA CAGTGAACAC TTTGGCCTGC CCGCTCCTCT CCAACCTGGC GACCCGACTC 2100 

TGGCTACGCA ACGGGGCCCC CGTCAATGCC TCGGCCTCCT GCCACGTGCT ACCCACTGGG 2160 

GACCTGCTGC TGGTGGGCAC CCAACAGCTG GGGGAGTTCC AGTGCTGGTC ACTAGAGGAG 2220 

GGCTTCCAGC AGCTGGTAGC CAGCTACTGC CCAGAGGTGG TGGAGGACGG GGTGGCAGAC 2280 

CAAACAGATG AGGGTGGCAG TGTACCCGTC ATTATCAGCA CATCGCGTGT GAGTGCACCA 2340 

GCTGGTGGCA AGGCCAGCTG GGGTGCAGAC AGGTCCTACT GGAAGGAGTT CCTGGTGATG 2400 

TGCACGCTCT TTGTGCTGGC CGTGCTGCTC CCAGTTTTAT TCTTGCTCTA CCGGCACCGG 2460 

AACAGCATGA AAGTCTTCCT GAAGCAGGGG GAATGTGCCA GCGTGCACCC CAAGACCTGC 2520 

CCTGTGGTGC TGCCCCCTGA GACCCGCCCA CTCAAOGGCC TAGGGCCCCC TAGCACOCCG 2580 

CTCGATCACC GAGGGTACCA GTCCCTGTCA GACAGCCOCC CGGGGTCOCG AGTCTTCACT 2640 

GAGTCAGAGA AGAGGCCACT CAGCATCCAA GACAGCTTCG TGGAGGTATC CCCAGTGTGC 2700 

CCCCGGCCCC GGGTCCGCCT TGGCTCGGAG ATCCGTGACT CTGTGGTGTG AGAGCTGACT 2760 

TCCAGAGGAC GCTGCCCTGG CTTCAGGGGC TGTGAATGCT CGGAGAGGGT CAACTGGACC 2820 

TCCCCTCCGC TCTGCTCTTC GTGGAACACG ACCGTGGTGC CCGGCCCTTG GGAGCCTTGG 2880 

GGCCAGCTGG CCTGCTGCTC TCCAGTCAAG TAGCGAAGCT CCTACCACCC AGACACCCAA 2940 

ACAGCCGTGG CCCCAGAGGT CCTGGCCAAA TATGGGGGCC TGCCTAGGTT GGTGGAACAG 3000 

TGCTCCTTAT GTAAACTGAG CCCTTTGTTT AAAAAACAAT TCCAAATGTG AAACTAGAAT 3060 

GAGAGGGAAG AGATAGCATG GCATGCAGCA CACACGGCTG CTCCAGTTCA TGGCCTCCCA 3120 

GGGGTGCTGG GGATGCATCC AAAGTGGTTG TCTGAGACAG AGTTGGAAAC CCTCACCAAC 3180 

TGGCCTCTTC ACCTTCCACA TTATCCCGCT GCCACCGGCT GCCCTGTCTC ACTGCAGATT 3240 

CAGGACCAGC TTGGGCTGCG TGCGTTCTGC CTTGCCAGTC AGCCGAGGAT GTAGTTGTTG 3300 

CTGCCGTCGT CCCACCACCT CAGGGACCAG AGGGCTAGGT TGGCACTGCG GCCCTCACCA 3360 

GGTCCTGGGC TCGGACCCAA CTCCTGGACC TTTCCAGCCT GTATCAGGCT GTGGCCACAC 3420 

GAGAGGACAG CGCGAGCTCA GGAGAGATTT CGTGACAATG TACGCCTTTC CCTCAGAATT 3480 

CAGGGAAGAG ACTGTCGCCT GCCTTCCTCC GTTGTTGCGT GAGAACCCGT GTGCCCCTTC 3540 

CCACCATATC CACCCTCGCT CCATCTTTGA ACTCAAACAC GAGGAACTAA CTGCACCCTG 3600 

GTCCTCTCCC CAGTCCCCAG TTCACCCTCC ATCCCTCACC TTCCTCCACT CTAAGGGATA 3660 

TCAACACTGC CCAGCACAGG GGCCCTGAAT TTATGTGGTT TTTATACATT TTTTAATAAG 3720 

ATGCACTTTA TGTCATTTTT TAATAAAGTC TGAAGAATTA CTGTTT 3766 



Seq ID NO? C49 DNA Sequence 
Nucleic Acid Accession #: NM_007019.1 
oU Coding sequence: 41.. 580 ~ 

1 11 21 31 41 51 

I I I I I I 

GGCACGAGCG AGTTCCTGTC TCTCTGCCAA CGCCGCCCGG ATGGCTTCCC AAAACCGCGA 60 
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CCCAGCCGCC ACTAGCGTCG CCGCCGCCCG TAAAGGAGCT GAGCCGAGCG GGGGCGCCGC 120 

CCGGGGTCCG GTGGGCAAAA GGCTACAGCA GGAGCTGATG ACCCTCATGA TGTCTGGCGA 180 

TAAAGGGATT TCTGCCTTCC CTGAATCAGA CAACCTTTTC AAATGGGTAG GGACCATCCA 240 

TGGAGCAGCT GGAACAGTAT ATGAAGACCT GAGGTATAAG CTCTCGCTAG AGTTCCCCAG 300 

TGGCTACCCT TACAATGCGC CCACAGTGAA GTTCCTCACG CCCTGCTATC ACCCCAACGT 360 

GGACACCCAG GGTAACATAT GCCTGGACAT CCTGAAGGAA AAGTGGTCTG CCCTGTATGA 420 

TGTCAGGACC ATTCTGCTCT CCATCCAGAG CCTTCTAGGA GAACCCAACA TTGATAGTCC 480 

CTTGAACACA CATGCTGCCG AGCTCTGGAA AAACCCCACA GCTTTTAAGA AGTACCTGCA 540 

AGAAACCTAC TCAAAGCAGG TCACCAGCCA GGAGCCCTGA CCCAGGCTGC CCAGCCTGTC 600 

CTTGTGTCGT CTTTTTAATT TTTCCTTAGA TGGTCTGTCC TTTTTGTGAT TTCTGTATAG 660 

GACTCTTTAT CTTGAGCTGT GGTATTTTTG TTTTGTTTTT GTCTTTTAAA TTAAGCCTCG 720 

GTTGAGCCCT TGTATATTAA ATAAATGCAT TTTTGTCCTT TTTTAAAAAA AAAAAAAAAA 780 

AAA 7B3 

15 Seq ID NO: C50 DNA Sequence 

Nucleic Acid Accession #: NM_014584.1 
Coding sequence: 227.. 1633 

0 ~ 1 11 21 31 41 51 

20 | I | | | | 

GCACGAGCCC CGGGCTGCCG GCGCGGGCGC CGCGGCACGT CCACAGGCTG GGTCGCGAGG 60 

TGGCGATCGC TGAGAGGCAG GAGGGCCGAG GCGGGCCTGG GAGGCGGCCC GGAGGTGGGG 120 

CGCCGCTGGG GCCGGCCCGC ACGGGCTTCA TCTGAGGGCG CACGGCCCGC GACCGAGCGT 180 

GCGGACTGGC CTCCCAAGCG TGGGGCGACA AGCTGCCGGA GCTGCAATGG GCCGCGGCTG 240 

GGGATTCTTG TTTGGCCTCC TGGGCGCCGT GTGGCTGCTC AGCTCGGGCC ACGGAGAGGA 300 

GCAGCCCCCG GAQACAGCGG CACAGAGGTG CTTCTGCCAG GTTAGTGGTT ACTTGGATGA 360 

TTGTACCTGT GATGTTGAAA CCATTGATAG ATTTAATAAC TACAGGCTTT TCCCAAGACT 420 

ACAAAAACTT CTTGAAAGTG ACTACTTTAG GTATTACAAG GTAAACCTGA AGAGGCCGTG 480 

„ TCCTTTCTGG AATGACATCA GCCAGTGTGG AAGAAGGGAC TGTGCTGTCA AACCATGTCA 540 

3U ATCTGATGAA GTTCCTGATG GAATTAAATC TGCGAGCTAC AAGTATTCTG AAGAAGCCAA 600 

TAATCTCATT GAAGAATGTG AACAAGCTGA ACGACTTGGA GCAGTGGATG AATCTCTGAG 660 

TGAGGAAACA CAGAAGGCTG TTCTTCAGTG GACCAAGCAT GATGATTCTT CAGATAACTT 720 

CTGTGAAGCT GATGACATTC AGTCCCCTGA AGCTGAATAT GTAGATTTGC TTCTTAATCC 780 

TGAGCGCTAC ACTGGTTACA AGGGACCAGA TGCTTGGAAA ATATGGAATG TCATCTAOGA 840 

AGAAAACTGT TTTAAGCCAC AGACAATTAA AAGACCTTTA AATCCTTTGG CTTCTGGTCA 900 

AGGGACAAGT GAAGAGAACA CTTTTTACAG TTGGCTAGAA GGTCTCTGTG TAGAAAAAAG 960 

AGCATTCTAC AGACTTATAT CTGGCCTACA TGCAAGCATT AATGTGCATT TGAGTGCAAG 1020 

ATATCTTTTA CAAGAGACCT GGTTAGAAAA GAAATGGGGA CACAACATTA CAGAATTTCA 1080 

ACAGCGATTT GATGGAATTT TGACTGAAGG AGAAGGTCCA AGAAGGCTTA AGAACTTGTA 1140 

TTTTCTCTAC TTAATAGAAC TAAGGGCTTT ATCCAAAGTG TTACCATTCT TCGAGCGCCC 1200 

AGATTTTCAA CTCTTTACTG GAAATAAAAT TCAGGATGAG GAAAACAAAA TGTTACTTCT 1260 

GGAAATACTT CATGAAATCA AGTCATTTCC TTTGCATTTT GATGAGAATT CATTTTTTGC 1320 

TGGGGATAAA AAAGAAGCAC ACAAACTAAA GGAGGACTTT CGACTGCATT TTAGAAATAT 1380 

TTCA AGAATT ATGGATTGTG TTGGTTGTTT TAAATGTCGT CTGTGGGGAA AGCTTCAGAC 1440 

TCAGGGTTTG GGCACTGCTC TGAAGATCTT ATTTTCTGAG AAATTGATAG CAAATATGCC 1500 

AGAAAGTGGA CCTAGTTATG AATTCCATCT AACCAGACAA GAAATAGTAT CATTATTCAA 1560 

CGCATTTGGA AGAATTTCTA CAAGTGTGAA AGAATTAGAA AACTTCAGGA ACTTGTTACA 1620 

GAATATTCAT TAAAGAAAAC AAGCTGATAT GTGCCTGTTT CTGGACAATG GAGGCGAAAG 1680 

AGTGGA ATTT CATTCAAAGG CATAATAGCA ATGACAGTCT TAAGCCAAAC ATTTTATATA 1740 

AAGTTGCTTT TGTAAAGGAG AATTATATTG TTTTAAGTAA ACACATTTTT AAAAATTGTG 1800 

TTAAGTCTAT GTATAATACT ACTGTGAGTA AAAGTAATAC TTTAATAATG TGGTACAAAT 1860 

TTTAAAGTTT AATATTGAAT AAAAGGAGGA TTATCAAATT CATATATGAT AAAAGTGAAT 1920 

GTTCTAAGTC TCTCAAACTA GCGTTTTATG TAATAATATG TAATATAAAT AAAACTATGG 1980 

TAAATGTGAC AAGCATTTAA TAGGAAAATG CTAAGGAGGC CTCATAAATG ACCCATAATT 2040 

ACCAACGTAG AATTTTTCAG TACATTTAGG GTTGCTGGAT TTAGCAAATA AAAATAAAGA 2100 

TTGCCCAGTT AGATTTGAAT TTCAGATAAA CAATTAGTTT TTTAATATTT TACATGGAAT 2160 

ATTT GGAAA A TACTTATACT AAAAAATTAT TTGTTTGAAA TTCACATTTA ACTGGGAGTC 2220 

TTGTATTTTA TCTGGCAATC CTAAAATACA TTGGTATGAA ACAAATCACT TTTAGAAGTA 2280 

TATTGCTATT TTGATTGGGT TGTTTTTGTG TGTAGAAACG TACAATAACA ACTCAAAGGC 2340 

ACAGGAGATT TCTAAACATT GTGAAAAGTT GAATAGATTA TATATTTATT CTCATAATAC 2400 

TTTCACTAAT ACTAAATAAA ATTTGGGGAA CACTTTTTAT TTTTATATAA TTTCCAATTT 2460 

ACAGAAAAGT TTCAAAAATA GTACAAAGAG CTCTCTTACC CAGATTCACT AATTGTTCAT 2520 

ACGTGCTTTA TCTTTCATGC TTTCTCTGTA CACACACACA CACACACAAA TTTTTCCTCA 2580 

ATCATTTGAA AGTCAGTTAT AGGCATCATG CCCCTTAAAC CCTAAATACT TCAGTGTGTA 2640 

ATACTGAATA ATTACTAAAA ATGATTTTCT CAGAAAAAAA AACTCCCACA ATTCTGGAAC 2700 

TA TAATA CTG TAAGCCTTAG AATAAATAAT ACTTTCAAGT TCCAATCTAA AGTTCTTTTT 2760 

GAGTTTTGTT GCCCGTTTTA TGCTTGATGT GTATAGTAAT AGGGTAGGCT ATTTATTTTA 2820 

TTAAAATTTT TTTTAGAGAC AAGGTTTTGC TGTGTTGCCC AAGCTGGAAC TTGAACGACT 2880 

GGGCTGAAGT GATCTTCCCA CCTCAGCCTC CCAAGTAGCT GGGAATACAG GTGTCTGCCA 2940 

CCATACCCAG TTTCATTTTT GTTTTTTATA CCCGAAGTTC ATTTCCTTTG TCTCCCTAAA 3000 

ACTGAACTGT AATTTTGGGA GGTTTTCATT AGTGGAAGCT CTTCATTTAT AAAGCTATTT 3060 

GAAGGGGTTT AGGAATTTAT ATCACATGGT AATTGTAGAG AAAAAGAAGC TATATACCTC 3120 

AAAATCGTGC CCTCTTTACA TATGTCTTAT CAGGTATAAC ATGTTGAAAT GTCACATTAG 3180 

TAGTAAAGTG GGGTTTATTT ATATAGTGGT TAAGAAATGT CAGTTTACAC TGCTGTATAC 3240 

TTCTTCTTCT GTGTCCCTAA GGCCTGGTAC AGTGCCAAGC ACATACTTGG TATCCAATAA 3300 

ATATTTGTTG GATGAAAAAA AAAAAAAAAA AAAA 3334 
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Seq ID NO: CS1 DMA Sequence 
oU Nucleic Acid Accession »: NM_002888.1 
Coding sequence: 37.. 723 

1 11 21 31 41 SI 

I I I I I I 
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CCACGTCCGG GGTGCCGAGC CAACTTTCCT GCGTCCATQC AGCCCCGCCG GCAACGGCTG 60 

CCCGCTCCCT GGTCCGGGCC CAGGGGCCCG CGCCCCACCG CCCCGCTGCT CGCGCTGCTG 120 

CTGTTGCTCG CCCCGGTGGC GGCGCCCGCG GGGTCCGGGG GCCCCGACGA CCCTGGGCAG 180 

CCTCAGGATG CTGGGGTCCC GCGCAGGCTC CTGCAGCAGA AGGCGCGCGC GGCGCTTCAC 240 

TTCTTCAACT TCCGGTCCGG CTCGCCCAGC GCGCTGCGAG TGCTGGCCGA GGTGCAGGAG 300 

GGCCGCGCGT GGATTAATCC AAAAGAGGGA TGTAAAGTTC ACGTGGTCTT CAGCACAGAG 360 

CGCTACAACC CAGAGTCTTT ACTTCAGGAA GGTGAGGGAC GTTTGGGGAA ATGTTCTGCT 420 

CGAGTGTTTT TCAAGAATCA GAAACCCAGA CCAACCATCA ATGTAACTTG TACACGGCTC 480 

ATCGAGAAAA AGAAAAGACA ACAAGAGGAT TACCTGCTTT ACAAGCAAAT GAAGCAACTG 540 

AAAAACCCCT TGGAAATAGT CAGCATACCT GATAATCATG GACATATTGA TCCCTCTCTG 600 

AGACTCATCT GGGATTTGGC TTTCCTTGGA AGCTCTTACG TGATGTGGGA AATGACAACA 660 

CAGGTGTCAC ACTACTACTT GGCACAGCTC ACTAGTGTGA GGCAGTGGGT AAGAAAAACC 720 

TGAAAATTAA CTTGTGCCAC AAGAGTTACA ATCAAAGTGG TCTCCTTAGA CTGAATTCAT 780 

GTGAACTTCT AATTTCATAT CAAGAGTTGT AATCACATTT ATTTCAATAA ATATGTGAGT 840 

TCCTGC 846 

Seq ID NO: C52 DNA Sequence 

Nucleic Acid Accession #: NM_005409.3 

Coding sequence: 94.. 378 

1 11 21 31 41 51 

I I I 1 I I 

TTCCTTTCAT GTTCAGCATT TCTACTCCTT CCAAGAAGAG CAGCAAAGCT GAAGTAGCAG 60 

CAACAGCACC AGCAGCAACA GCAAAAAACA AACATGAGTG TGAAGGGCAT GGCTATAGCC 120 

TTGGCTGTGA TATTGTGTGC TACAGTTGTT CAAGGCTTCC CCATGTTCAA AAGAGGACGC 180 

TGTCTTTGCA TAGGCCCTGG GGTAAAAGCA GTGAAAGTGG CAGATATTGA GAAAGCCTCC 240 

ATAATGTACC CAAGTAACAA CTGTGACAAA ATAGAAGTGA TTATTACCCT GAAAGAAAAT 300 

AAAGGACAAC GATGCCTAAA TCCCAAATCG AAGCAAGCAA GGCTTATAAT CAAAAAAGTT 360 

GAAAGAAAGA ATTTTTAAAA ATATCAAAAC ATATGAAGTC CTGGAAAAGG GCATCTGAAA 420 

AACCTAGAAC AAGTTTAACT GTGACTACTG AAATGACAAG AATTCTACAG TAGGAAACTG 480 

AGACTTTTCT ATGGTTTTGT GACTTTCAAC TTTTGTACAG TTATGTGAAG GATGAAAGGT 540 

GGGTGAAAGG ACCAAAAACA GAAATACAGT CTTCCTGAAT GAATGACAAT CAGAATTCCA 600 

CTGCCCAAAG GAGTCCAGCA ATTAAATGGA TTTCTAGGAA AAGCTACCTT AAGAAAGGCT 660 

GGTTACCATC GGAGTTTACA AAGTGCTTTC ACGTTCTTAC TTGTTGTATT ATACATTCAT 720 

GCATTTCTAG GCTAGAGAAC CTTCTAGATT TGATGCTTAC AACTATTCTG TTGTGACTAT 780 

GAGAACATTT CTGTCTCTAG AAGTTATCTG TCTGTATTGA TCTTTATGCT ATATTACTAT 840 

CTGTGGTTAC AGTGGAGACA TTGACATTAT TACTGGAGTC AAGCCCTTAT AAGTCAAAAG 900 

CATCTATGTG TCGTAAAGCA TTCCTCAAAC ATTTTTTCAT GCAAATACAC ACTTCTTTCC 960 

CCAAATATCA TGTAGCACAT CAATATGTAG GGAAACATTC TTATGCATCA TTTGGTTTGT 1020 

TTTATAACCA ATTCATTAAA TGTAATTCAT AAAATGTACT ATGAAAAAAA TTATACGCTA 1080 

TGGGATACTG GCAACAGTGC ACATATTTCA TAACCAAATT AGCAGCACCG GTCTTAATTT 1140 

GATGTTTTTC AACTTTTATT CATTGAGATG TTTTGAAGCA ATTAGGATAT GTGTGTTTAC 1200 

TGTACTTTTT GTTTTGATCC GTTTGTATAA ATGATAGCAA TATCTTGGAC ACATTTGAAA 1260 

TACAAAATGT TTTTGTCTAC CAAAGAAAAA TGTTGAAAAA TAAGCAAATG TATACCTAGC 1320 

AATCACTTTT ACTTTTTGTA ATTCTGTCTC TTAGAAAAAT ACATAATCTA ATCAATTTCT 1380 

TTGTTCATGC CTATATACTG TAAAATTTAG GTATACTCAA GACTAGTTTA AAGAATCAAA 1440 

GTCATTTTTT TCTCTAATAA ACTACCACAA CCTTTCTTTT TTAAAAAAAA AAA 1493 



A Seq ID NO: C53 DNA Sequence 
J\J Nucleic Acid Accession #: FGENESH predicted 
Coding sequence : 1 . . 609 



1 11 21 31 41 51 

I I I I I I 

ATGCTGCGGC AGGTGCTTCG CAGAGGGCTC CAGTCGTTCT GCCACAGGCT GGGTTTGTGC 60 

GTGAGCCGGC ACCCGGTCTT TTTCCTCACC GTGCCCGCAG TCCTGACAAT CACCTTCGGC 120 

CTCAGCGCGC TCAACCGCTT CCAGCCCGAG GGCGACCTGG AGCGCCTGGT CGCTCCCAGC 180 

CACAGCCTGG CCAAGATCGA GCGCAGCCTG GCCAGCAGCC TTTTCCCCCT GGACCAGTCC 240 

AAAAGCCAGC TCTATTCGGA CTTACACACC CCTGGGAGGT ATGGCAGGGT GATCCTCCTC 30O 

TCCCCAACOG GGGACAATAT TTTGCTCCAG GCTGAGGGGA TCCTGCAGAC CCACCGAGCC 360 

GTGCTGGAAA TGAAGGTGAA CCACAAGGGC TATAATTATA CTTTTTCCCA TCTGTGTGTG 420 

TTGAGAAATC AGGATAAGAA ATGCGTGCTG GATGATATTA TTTCAGTGCT AGAGGATCTC 480 

AGGCAGGCTG CCGTCTCCAA TAAGACAACA GCCAGGGTGC AAGTGAGGTA TCCCAACACT 540 

AAATTAAAGG TATGCTCCTT CTGCATGCTT CTGCCAATTA AAGAGGCAGC ACTTCATTTC 600 

TTGCCCTAA 609 

Seq ID NO: CS4 DNA Sequence 

Nucleic Acid Accession #: NMJJ02438.1 

Coding sequence: 104.. 4474 " 

1 11 21 31 41 51 

I I I I J I 

GGGAACTTGG ATTAGGTGGA GAGGCAGTTG GGGGGCCTCG TTGTTTTGCG TCTTAGTTCC 60 

GCCCTCCTGT CCATCAGGAG AAGGAAAGGA TAAACCCTGG GCCATGAGGC TACCCCTGCT 120 

CCTGGTTTTT GCCTCTGTCA TTCCGGGTGC TGTTCTCCTA CTGGACACCA GGCAATTTTT 180 

AATCTATAAT GAAGATCACA AGCGCTGCGT GGATGCAGTG AGTCCCAGTG CCGTCCAAAC 240 

CGCAGCTTGC AACCAGGATG CCGAATCACA GAAATTCCGA TGGGTGTCCG AATCTCAGAT 300 

TATGAGTGTT GCATTTAAAT TATGCCTGGG AGTGCCATCA AAAACAGACT GGGTTGCTAT 360 

CACTCTCTAT GCCTGTGACT CAAAAAGTGA ATTTCAGAAA TGGGAGTGCA AAAATGACAC 420 

ACTTTTGGGG ATCAAAGGAG AAGATTTATT TTTTAACTAC GGCAACAGAC AAGAAAAGAA 480 

TATTATGCTC TACAAGGGAT CGGGTTTATG GAGCAGGTGG AAGATCTATG GAACCACAGA 540 

CAATCTGTGC TCCAGAGGTT ATGAAGCCAT GTATACGCTA CTAGGCAATG CCAATGGAGC 600 

AACCTGTGCA TTCCCGTTCA AGTTTGAAAA CAAGTGGTAC GCAGATTGCA CGAGTGCTGG 660 

GCGGTCGGAT GGATGGCTCT GGTGCGGAAC CACTACTGAC TATGACACAG ACAAGCTATT 720 
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TGGATATTGT CCATTGAAAT TTGAGGGCAG TGAAAGCTTA TGGAATAAAG ACCCGCTGAC 780 

CAGCGTTTCC TACCAGATAA ACTCCAAATC CGCTTTAACG TGGCACCAAG CGAGGAAAAG 840 

CTGCCAACAA CAGAACGCTG AGCTCCTGAG CATCACAGAG ATACATGAGC AAACATACCT 900 

GACAGGATTA ACCAGTTCCT TGACCTCAGG ACTCTGGATT GGACTTAACA GTCTGAGCTT 960 

CAACAGCGGT TGGCAGTGGA GTGACCGCAG TCCTTTCCGA TATTTGAACT GGTTACCAGG 1020 

AAGTCCATCA GCTGAACCTG GAAAAAGCTG TGTGTCACTA AATCCTGGAA AAAATGCTAA 1080 

ATGGGAAAAT CTGGAATGTG TTCAGAAACT GGGCTATATT TGCAAAAAGG GCAACACCAC 1140 

TTTAAATTCT TTTGTTATTC CCTCAGAAAG TGATGTGCCT ACTCACTGTC CTAGTCAGTG 1200 

GTGGCCGTAT GCCGGTCACT GTTACAAGAT TCACAGAGAT GAGAAAAAAA TCCAGAGGGA 1260 

TGCTCTGACC ACCTGCAGGA AGGAAGGCGG TGACCTCACA AGTATCCACA CCATCGAGGA 1320 

ATTGGACTTT ATTATCTCCC AGCTAGGATA TGAGCCAAAT GACGAATTGT GGATCGGCTT 1380 

AAATGACATT AAGATTCAAA TGTACTTTGA GTGGAGTGAT GGGACCCCTG TAACGTTTAC 1440 

CAAATGGCTT CGTGGAGAAC CAAGCCATGA AAACAACAGA CAGGAGGATT GTGTGGTGAT 1500 

GAAAGGCAAG GATGGGTACT GGGCAGATCG GGGCTGTGAG TGGCCTCTTG GCTACATCTG 1560 

CAAGATGAAA TCACGAAGCC AAGGTCCAGA AATAGTGGAA GTCGAAAAAG GCTGCAGGAA 1620 

AGGCTGGAAA AAACATCACT TTTACTGCTA TATGATTGGA CATACGCTTT CAACATTTGC 1680 

AGAAGCAAAC CAAACCTGTA ATAATGAGAA TGCTTATTTA ACAACTATTG AAGACAGATA 1740 

TGAACAAGCC TTCCTGACTA GTTTCGTTGG CTTAAGGCCT GAAAAATATT TCTGGACAGG 1800 

ACTTTCAGAT ATACAAACCA AAGGGACTTT TCAGTGGACC ATCGAGGAAG AGGTTCGGTT 1860 

1\J CACCCACTGG AATTCAGATA TGCCAGGGCG AAAGCCAGGG TGTGTTGCCA TGAGAACCGG 1920 

GATTGCAGGG GGCTTATGGG ATGTTTTGAA ATGTGATGAA AAGGCAAAAT TTGTGTGCAA 1980 

GCACTGGGCA GAAGGAGTAA CCCACCCACC GAAGCCCACG ACGACTCCCG AACCCAAATG 2040 

TCCGGAGGAT TGGGGCGCCA GCAGTAGAAC AAGCTTGTGT TTCAAGCTGT ATGCAAAAGG 2100 

AAAACATGAG AAGAAAACGT GGTTTGAATC TCGAGATTTT TGTCGAGCTC TGGGTGGAGA 2160 

CTTAGCTAGC ATCAATAACA AAGAGGAACA GCAAACAATA TGGOGATTAA TAACAGCTAG 2220 

TGGAAGCTAC CACAAACTGT TTTGGTTGGG ATTGACATAT GGAAGCCCTT CAGAAGGTTT 2280 

TACTTGGAGT GATGGTTCTC CTGTTTCATA TGAAAACTGG GCTTATGGAG AACCTAATAA 2340 

TTATCAAAAT GTTGAATACT GTGGTGAGCT GAAAGGTGAC CCTACTATGT CTTGGAATGA 2400 

TATTAATTGT GAACACCTTA ACAACTGGAT TTGCCAGATA CAAAAAGGAC AAACACCAAA 2460 

ACCTGAGCCA ACACCAGCTC CTCAAGACAA TCCACCAGTT ACTGAAGATG GGTGGGTTAT 2520 

TTACAAAGAC TACCAGTATT ATTTCAGCAA AGAGAAGGAA ACCATGGACA ATGCGCGAGC 2S80 

GTTTTGCAAG AGGAATTTTG GTGATCTTGT TTCTATTCAA AGTGAAAGTG AAAAGAAGTT 2640 

TCTATGGAAA TATGTAAACA GAAATGATGC ACAGTCTGCA TATTTTATTG GTTTATTGAT 2700 

CAGCTTGGAT AAAAAGTTTG CTTGGATGGA TGGAAGCAAA GTGGATTACG TGTCTTGGGC 2760 

CACAGGTGAA CCCAATTTTG CAAATGAAGA TGAAAACTGT GTGACCATGT ATTCAAATTC 2820 

AGGGTTTTGG AATGACATTA ACTGTGGCTA TCCAAACGCC TTCATTTGCC AGCGACATAA 2880 

CAGTAGTATC AATGCTACCA CAGTTATGCC TACCATGCCC TCGGTCCCAT CAGGGTGCAA 2940 

GGAAGGTTGG AATTTCTACA GCAACAAGTG TTTCAAAATC TTTGGATTTA TGGAAGAAGA 3000 

AAGAAAAAAT TGGCAAGAGG CACGAAAAGC TTGTATAGGC TTTGGAGGGA ATCTGGTCTC 3060 

CATACAAAAT GAAAAAGAGC AAGCATTTCT TACCTATCAC ATGAAGGACT CCACTTTCAG 3120 

TGCCTGGACT GGGCTGAATG ATGTCAATTC AGAACACACG TTCCTTTGGA CGGATGGACG 3180 

AGGAGTCCAT TACACAAACT GGGGGAAAGG TTACCCTGGT GGAAGAAGAA GCAGTCTTTC 3240 

TTATGAAGAT GCTGACTGTG TTGTTATTAT TGGAGGTGCA TCAAATGAAG CAGGAAAATG 3300 

GATGGATGAT ACCTGCGACA GTAAACGAGG CTACATATGC CAGACACGAT CCGACCCTTC 3360 

4 J CTTGACTAAT CCTCCAGCAA CGATTCAAAC AGATGGCTTT GTTAAATATG GCAAAAGCAG 3420 

CTATTCACTC ATGAGACAAA AATTTCAATG GCATGAAGOG GAGACATACT GCAAGCTTCA 3480 

CAATTCCCTT ATAGCCAGCA TTCTGGATCC CTACAGTAAT GCATTTGCGT GGCTGCAGAT 3540 

GGAAACATCT AATGAACGTG TGTGGATCGC CCTGAACAGT AACTTGACTG ATAATCAATA 3600 

CACTTGGACT GATAAGTGGA GGGTGAGGTA CACTAACTGG GCTGCTGATG AGCCCAAATT 3660 

J\) GAAATCAGCA TGTGTTTATC TGGATCTTGA TGGCTACTGG AAGACAGCAC ATTGCAATGA 3720 

AAGTTTTTAC TTTCTCTGTA AAAGATCAGA TGAAATCCCT GCTACTGAAC CCCCACAACT 3780 

GCCTGGCAGA TGCCCGGAGT CAGATCACAC AGCATGGATT CCTTTCCATG GTCACTGTTA 3840 

CTATATTGAG TCCTCATATA CAAGAAACTG GGGCCAAGCT TCTCTGGAAT GTCTTCGAAT 3900 

GGGTTCCTCT CTGGTTTCCA TTGAAAGTGC TGCAGAATCC AGTTTTCTGT CATATCGGGT 3960 

J J TGAGCCACTT AAAAGTAAAA CCAATTTTTG GATAGGATTG TTCAGAAATG TTGAAGGGAC 4020 

GTGGCTGTGG ATAAATAACA GTCCGGTCTC CTTTGTCAAC TGGAACACAG GAGATCCCTC 4080 

TGGTGAACGG AATGATTGTG TAGCTTTACA TGCGTCTTCT GGGTTTTGGA GTAATATTCA 4140 

CTGTTCTTOC TACAAAGGAT ATATTTGTAA AAGACCAAAA ATTATTGATG CTAAACCTAC 4200 

TCATGAATTA CTTACAACAA AAGCTGACAC AAGGAAGATG GACCCTTCTA AACCGTCTTC 4260 

CAACGTGGCC GGAGTAGTCA TCATTGTGAT CCTCCTGATT TTAACGGGTG CTGGCCTTGC 4320 

CGCCTATTTC TTTTATAAGA AAAGACGTGT GCACCTACCT CAAGAGGGCG CCTTTGAAAA 4380 

CACTCTGTAT TTTAACAGTC AGTCAAGCCC AGGAACTAGT GATATGAAAG ATCTCGTGGG 4440 

CAATATTGAA CAGAATGAAC ACTCGGTCAT CTAGTACCTC AATGCGATTC TGAGATATTT 4500 

GAATTTCATA AAATTGTAAC TGAAATTTAA AATTTTTAGT TCAATGTGAT TGTTTTCTTT 4560 

AAAATGAGTA CTGAATTGTA CTGGTCTGTC CTTTTTTCCT TTGCCTAATT GAAGAAATAA 4620 

TTGCTTGTTT TCTAGCCTGG CAAGATATTT TCATAAAAGA GGGATAACAA TGCTGATTAC 4680 

TACCTTTTAA AATATTTTAG ATAAATGCAC AGCACCACAG CACCACATCT AAGCATTAGT 4740 

GATGGGTAGC TGATGTCAGC TTCATGTGGA TTTTAAGCAC TCTAGAAACA ATGAAGCTTC 4800 

TTGGCATATT TTAAGGAGCT CCCAAAATGT GTTACCTATT AAATTGTAAC TCAGCAAGTA 4860 

GAAGACCATT TGAAAAGTCA GGTACAAATT TCCTCAAGTG GCATAAAAAT GTAGTCAGTT 4920 

TTCTCTTTTA CCAGTTTTTA TTTCCACTCC AATTATTTAG AACTTTATTT GTACATGTGC 4980 

AGAAGAATAA GGCAGCTGAG AATCTTGTTT CCCCCAAGAG AGTTTTACAG GCTGAGTGTT 5040 

GCAAATGTGT TCTTTGTCCT GTTATATGTA TATCAGGAAT ACAAGGATGT GAAATAAAAC 5100 

TGTAAATTTG CATAACTGGA TGTACTTAGA TAATGTGAAA TAAACATTAA AGACAAGGTC 5160 

TATTTTTAAT AAAAAAAAAA AAAAA 5185 
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Seq ID NO: C55 DNA Sequence 

Nucleic Acid Accession #: NM_024574.2 

Coding sequence: 424.. 2 130 

1 11 21 31 41 51 

I 1 I I I I 

AGTGCGACTA GCGGCGGCCG GCGAAGCCCG CGCCAGCCCC GCGGGTCGCT GCGGCATCCT 60 
CCCGCCGCCC AGCCCCCGCA CGCTCCCTGC AGTTTAAAAG GACCTCCCGC CGCTTCTCGG 120 
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CGCTGCCCCG GGATTCCCCA GCCCCGCGCG GCTCCCTACT CCACTTCGCA GCAACTTCGG 180 

CGACCGCGCG CCGCCCGCCC TCGCCCGCCT TTGAAGTTTG CTGTGCCGAC CGCAAAGTTG 240 

GGACACTTCA GCGGATTGAA TTTTTCTCTT TTATCTGCCT CCGTCCCCGC CCTCCAGGCT 300 

TCTCGTTCCT GGATATTGGT GCTTAGCATC TTGGCAGGGT CCGGGGACGT GGACTATTTC 360 

GCACACCACA CCACGGGGAG GGATTTTTTT CTATTTTCCC TACGAAAAAC AGATCTTTTT 420 

AAGGATGGTG CTGCTCCACT GGTGCCTGCT GTGGCTCCTG TTTCCACTCA GCTCAAGGAC 480 

CCAGA AGTTA CCCACCCGGG ATGAGGAACT TTTTCAGATG CAGATCCGGG ACAAGGCATT 540 

TTTTCATGAT TCGTCAGTAA TTCCAGATGG AGCTGAAATT AGCAGTTATC TCTTTAGAGA 600 

TACACCTAAA AGGTATTTCT TTGTGGTTGA AGAAGACAAT ACTCCATTAT CAGTCACAGT 660 

GACGCCCTGT GATGCGCCTT TGGAGTGGAA GCTGAGCCTC CAGGAGCTGC CAGAGGACAG 720 

GAGCGGGGAA GGCTCAGGTG ATCTGGAACC TCTTGAGCAG CAGAAGCAGC AGATCATTAA 780 

TGAGGAAGGC ACTGAGTTAT TCTCCTACAA AGGCAATGAT GTTGAGTATT TTATATCGTC 840 

TAGTTCCCCA TCCGGTTTAT ATCAGTTGGA TCTTCTTTCA ACAGAGAAAG ACACACATTT 900 

CAAAGTATAT GCCACCACAA CTCCAGAATC TGATCAGCCA TACCCTGAGT TACCCTATGA 960 

CCCAAGAGTA GATGTGACCT CACTGGGGCG CACCACGGTC ACTTTGGCCT GGAAACCAAG 1020 

CCCCACTGCC TCTTTGCTGA AACAACCCAT TCAGTACTGT GTGGTCATCA ACAAAGAGCA 1080 

CAATTTCAAA AGTCTCTGTG CAGTGGAAGC AAAACTGAGT GCAGATGATG CTTTTATGAT 1140 

GGCACCGAAA CCTGGTCTGG ACTTCAGCCC CTTTGACTTT GCCCACTTTG GATTTCCTTC 1200 

TGATAATTCA GGTAAAGAAC GCAGTTTCCA GGCAAAGCCT TCTCCAAAAC TGGGGCGTCA 1260 

TGTCTACTCC AGGCCCAAGG TTGATATTCA GAAAATCTGC ATAGGAAACA AGAACATCTT 1320 

CACCGTCTCT GATCTGAAAC CCGACACGCA GTACTACTTT GACGTATTTG TGGTCAACAT 1380 

CAACAGCAAC ATGAGCACCG CTTATGTAGG TACCTTTGCC AGGACCAAGG AAGAAGCCAA 1440 

ACAGAAGACA GTCGAGCTCA AAGATGGGAA GATAACAGAT GTATTTGTTA AAAGGAAGGG 1500 

AGCAAAGTTT CTACGGTTTG CTCCAGTCTC TTCTCACCAA AAAGTCACCT TCTTTATTCA 1560 

CTCTTGTCTG GATGCTGTCC AAATCCAAQT GAGAAGAGAT GGGAAACTTC TTCTGTCTCA 1620 

GAATGTGGAA GGCATTCAGC AGTTTCAGCT TAGAGGAAAA CCTAAAGCTA AATACCTCGT 1680 

TCGACTGAAA GGAAACAAGA AAGGAGCATC TATGTTGAAA ATTCTAGCTA CCACAAGGCC 1740 

TACTAAGCAG TCATTTCCCT CTCTTCCTGA AGACACAAGA ATCAAAGCCT TTGACAAGCT 1800 

CCGTACCTGT TCCTCGGCCA CCGTGGCTTG GCTAGGCACT CAGGAAAGGA ACAAGTTTTG 1860 

CATCTACAAA AAAGAAGTGG ATGATAACTA CAATGAAGAC CAGAAGAAAA GAGAGCAAAA 1920 

CCAATGTCTA GGACCAGATA TAAGGAAGAA GTCAGAAAAG GTCCTCTGTA AATATTTCCA 1980 

CAGTCAAAAC CTGCAGAAAG CAGTGACCAC AGAAACAATT AAAGGTCTTC AGCCTGGCAA 2040 

ATCTTACCTG CTGGATGTTT ATGTCATAGG ACATGGGGGG CACTCTGTAA AGTATCAGAG 2100 

TAAGGTTGTG AAAACTAGAA AGTTCTGTTA GTTACCTTCT TATAGAGATA TATTATGTAG 2160 

AACTCCAGGA GGGACATTAA ATCACTTTAA GTATAAACTG ACTACTCCCA CAGTTGAGAG 2220 

AAGTTGTGAC CTGTACTTGT ACTATGGAAG GAAGGATATC AACGTGTGTA TATTGATGTT 2280 

TATATAAGTA ACTCTTGAAG GAGACTTGTT CTAGCGTGCC CCATGGTACC TAGTGTGTGT 2340 

CTGATGCOGG TTGGTGTCAA AGATAGAGGG CTTCTTGAAG GAACTTGCCA TTCCTTGCTT 2400 

TGACCACTGC ATGAACTGCT TCTAAATTAT TTTATTACCT AAAAATTTAA AATATGCCAT 2460 

TCATTGCACA CACCCACAAA TGCAAATCAT TCCTCTCTAT AGATGCTAGG ATATATATAA 2520 

ATTATTTTAT AAATTCTTGT TTTAAATGTC AGTGTTTCTA TGATTGTAAA CTATTAAATT 2580 

CTTTTCCTAT TAAAGTACAG ATCTAATCTA AGTATTATTA AGTTGATAGC CCTCTAGTCA 2640 

GTTATATTGC TATTGTAAAT TCTTGTTTGT TGAGTAAAAT GTTTAAATAC TATATGTATC 2700 

TCATGTACAA AGTTGACATA CATTATATTC ATGTACATAA AATTAAAGAG ATTAGATTAT 2760 

ATACTGTTAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAA 2808 



Seq ID MO: C56 DNA Sequence 
_ n Nucleic Acid Accession #: BC03 4229.1 
Jv Coding sequence: 373.. 1422 

1 11 21 31 41 51 

I I I I I I 

ATCCGGCGGT GGTGACGCGA GAGGCTGGGG TCTCCAGGAC CAACTCCTCT TCATCTTCGT 60 

J J CTTCCTC AGC CTGCTCAATG TGAAGCCCCT GATCATGATT CACTTCCACT TAATAAATAA 120 

AGTGTTTACA AATCAGAATA ACTTTTAGAC AATATTAAGG TGGTAATCAT GAACAGAAAA 180 

GATTTTGTAG TTCTTCCATG GGGAAAACCT GGAAATTCTG TAAAGCTAAA ATATAGCAAT 240 

GTAATAATTA AAACAAAAGT CTAAGATTTG AAGAGATAAT TTGCTTCAGG ATTTTGATGG 300 

- AAGGCAAATG CTAACTTTAA AAACCAGATT TCGGAGAAGT ACAAAAGAAA TAGAAATGCT 360 

OU CAAGAACTGC GAATGGAGAA AGTACAGTTA GAGTTTGAGA ACCAAGAGAT GGAGAAGAAA 420 

CTGCAAGAAT TCCGATCCAC AAGAAACAAA GAAAAGGAAG ATAGAGAGTC AAGCGAGTAT 480 

TACTGGAAAT CTGGAAAAGT GGGCAAATTG GTCAATCAAT CATATATGAT GTCACAAAAT 540 

AAAGGAAATG TTGTTAAGTT TTCTGCTGGA AAAGTGAAAT TAAAATTGCT GAAGGAACAG 600 

ATTCAAGAGC CAGTGAAACC AACAGTTAAT TATAAAATGG CAAATTCTTC AGAATGTGAA 660 

OJ AAACCCAAGA TAAATGGGAA AGTTTGTGGA CAGTGTGAGA ACAAAGCTGC TCTACTGGTA 720 

- TGCCTTGAAT GTGGAGAAGA TTATTGTTCA GGATGCTTTG CTAATGTTCA CCAGAAAGGG 780 

GCACTA AAG C TCCACAGAAC AACTCTTTTG CAGGCAAGAT CTCAAATATT ATTCAATGTA 840 

TTGGATGTTG CCCATCAGTT TATAAAGGAT GTTAATCCAG ATGAACCCAA AGAGGAGAAT 900 

AATTCTACAA AGGAAACCAG TAAAATTCAA CATAAACCCA AATCTGTACT TCTCCAGAGG 960 

/U AGCAGCTCTG AGGTAGAAAT TACAACGATG AAAAGAGCAC AACGTACAAA ACCAAGAAAG 1020 

AGTCTGTTGT GTGAAGGGTC ATTCGATGAA GAAGCTTCTG CACAGTCCTT TCAGGAAGTG 1080 

TTAAGTCAAT GGAGAACCGG AAATCATGAT GACAACAAGA AACAGAATTT ACATGCAGCA 1140 

GTAAAAGACT CATTGGAAGA ATGCGAAGTA CAGACTAATC TGAAAATTTG GAGAGAACCA 1200 

rjr CTTAATATTG AACTTAAAGA AGACATTCTA TCCTATATGG AAAAATTATG GCTTAAAAAA 1260 

(J CACAGGAGAA CTCCACAAGA GCAACTTTTT AAATGCTACC AGATACGTTC CCACATCCAC 1320 

ATGAAACCAC TGGTGATGCA CAGTGTTCTC AAAATGAAAA CGATGAAGAT AGTGATGGTG 1380 

AGGAGACCAA AGTACAACAC ACAGCTCTTT TATTGCCAGT AGAAACATTA AACATAGAGA 1440 

GACCTGAACC ATCTCTGAAG ATAGTCGAAC TGGATGATAC TTATGAAGAG GAATTTGAAG 1500 

R ~ AAGCAGAAAA TATTGTGCCT TACAAAGTTA AATTAGCTGA TGCAGACAGT CAACGAAGTT 1560 

OU GTGCTTTTCA TGATTGTCAG AAGAATAGCT TTCCATATGA AAATGGCATC CATCAACATC 1620 

ATGTTTTCGA TAAGGGAAAG AGAGACTTCT TAAATCTTTG TCTGAGAAAC AGCTCTACTT 1680 

ATTATAAAGA TAATTCAAAA GGAGAAACTT CAAACACAGA TTTTGACAAC ATCGTGGATC 1740 

CTGATGTGTA TTCTTCTGAC ATTGAAAAAA TTGAGGAAAG CACCTCCTTT GAAAGAAATT 1800 

TAAAGGAGAA AAATATAGGT TTAGAAAGTA ATCAAAAGTC TGATGATTCC TGTGTATCAC 1860 

1232 



WO 03/042661 



PCT/US02/36810 



TTGAAAGCAA GGACACTTTG CTAGGTAGAG ATTTAGAAAA AGCTCCCATT GAGGAGAAAT 1920 

TATCTCAAGA CATCAAAGAA TCCTTGGAAT TGAGCAATCT GTATAAGAGG CCAAGCTTTG 1980 

AAGAATCAAA AACTACAAAG TCATCACTGT TGTTACAAGA AATAGCCTGC AGAAGTAAGC 2040 

r CTATAACAAA ACAATATCAA GGACTTGAGA GATTCTTTAT TTTTGATACA AATGAAAGAC 2100 

-> TCAACTTACT TCCTTCTCAT CGTTTAGAAT GCAACAATTC CAGTACTAGG ATTACACTTG 2160 

CAGGTCAGAA ATCACAGAGA CCTTCAACAG CAAATTTTCC ACTTTCCAAC TCTGTTAAAG 2220 

AAAGCTCCAG TTGCCTTTCA TCCTCTCATC CTCGATCAAG AAGTGCAGCT GCTCAATCAT 2280 

CATCTAGAGC TGCTTCTGAA ATTTCAGAAA TTGAATATAT TGATATTACT GACCAGAATG 2340 

AGCTTTCCTT AGATGACACT ACTGATCAAC ATACTTTAGA CAATTTGGAA AAAGAATTAC 2400 

IV AAGTGCTGAG ATCTCTTGCA GATACTTCAG AAAAGCTTTA CAGCTTAACC TCAGAAGAGT 2460 

TCCCAGATTT CAGCAGCCAA TCACTGAATA TAAGTCAGAT TTCCACAGAT TTCCTTAAGA 2520 

CCTCACATGT GAGGGGTCCC TGTGGAGTTG AGGAATTGAG CTGTTCTGGA AGAGATACCA 25B0 

AAATTCAGTC TTTGCTGTCA CTTTCTGAGA GCAGTACAGA TGAGGAGGAG GAAGATTTTC 2640 

TCAACAAGCA ACA TGTCA TC ACACTACCGT GGTCAAAGAG TACTTAAAGA TTATTTGTTC 2700 

?P^m?T A CCCAGAGTAA AGCAAACAAC TGAGAAAAGT AACCAAGTGA 2760 

20 



50 



80 



v>.wi«n«inn nuwwiUWL 1 urns A/U\MVa 1 iUV*. \_HAli TliA 2750 

TTACCTATCC AAGTGCTGGA GATTTTGATT ACTAATGTCT TTGATGTTTC AAGGCTACAA 2820 
ACTAATAAAA GTAAAATTAT AAGTTCAAAA AAATTTTTAA AAAAAAAAAT AAAAAA 2876 



Seq ID NO: C57 DNA Sequence 
Nucleic Acid Accession #: NM_024 687.1 
Coding sequencer 138.. 1706 

OC J " 21 31 41 51 

AAAAACATGA TGACAACAAG AAACAGAATT TACATGCAGC AGTAAAAQAC TCATTGGAAG 60 

AATGCGAAGT ACAGACTAAT CTGAAAATTT GGAGAGAACC ACTTAATATT GAACTTAAAG 120 

AAGACATTCT ATCCTATATG GAAAAATTAT GGCTTAAAAA ACACAGGAGA ACTCCACAAG 180 

AGCAACTTTT TAAAATGCTA TCAGATACGT TCCCACATCC ACATGAAACC ACTGGTGATG 240 

JU CACAGTGTTC TCAAAATGAA AACGATGAAG ATAGTGATGG TGAGGAGACC AAAGTACAAC 300 

ACACAGCTCT TTTATTGCCA GTAGAAACAT TAAACATAGA GAGACCTGAA CCATCTCTGA 360 

AGATAGTCGA ACTGGATGAT ACTTATGAAG AGGAATTTGA AGAAGCAGAA AATATTGTGC 420 

CTTACAAAGT TAAATTAGCT GATGCAGACA GTCAACGAAG TTGTGCTTTT CATGATTGTC 480 

AGAAGAATAG CTTTCCATAT GAAAATGGCA TCCATCAACA TCATGTTTTC GATAAGGGAA 540 

JJ AGAGAGACTT CTTAAATCTT TGTCTGAGAA ACAGCTCTAC TTATTATAAA GATAATTCAA 600 

AAGGAGAAAC TTCAAACACA GATTTTGACA ACATCGTGGA TCCTGATGTG TATTCTTCTG 660 

ACATTGAAAA AATTGAGGAA AGCACCTCCT TTGAAAGAAA TTTAAAGGAG AAAAATATAG 720 

GTTTAGAAAG TAATCAAAAG TCTGATGATT CCTGTGTATC ACTTGAAAGC AAGGACACTT 780 

A(\ TGCTAGGTAG AGATTTAGAA AAAGCTCCCA TTGAGGAGAA ATTATCTCAA GACATCAAAG 840 

HK) AATCCTTGGA ATTGAGCAAT CTGTATAAGA GGCCAAGCTT TGAAGAATCA AAAACTACAA 900 

AGTCATCACT GTTGTTACAA GAAATAGCCT GCAGAAGTAA GCCTATAACA AAACAATATC 960 

AAGGACTTGA GAGATTCTTT ATTTTTGATA CAAATGAAAG ACTCAACTTA CTTCCTTCTC 1020 

ATCGTTTAGA ATGCAACAAT TCCAGTACTA GGATTACACT TGCAGAAGAC AGAGAATGGA 1080 

Ar TTCCAGACCA TAGCTTAAGT GAATATGCTG ATAATGCAAT TGTCTTGGGT GTTCTGCAGG 1140 

GTGCTCAGAG TCC ATCA TCA AGTAGAAAAC AGCAAAAGAT GGGTCAGAAA TCACAGAGAC 1200 

CTTCAACAGC AAATTTTCCA CTTTCCAACT CTGTTAAAGA AAGCTCCAGT TGCCTTTCAT 1260 

CCTCTCATCC TCGATCAAGA AGTGCAGCTG CTCAATCATC ATCTAGAGCT GCTTCTGAAA 1320 

TTTCAGAAAT TGAATATATT GATATTACTG ACCAGAATGA GCTTTCCTTA GATGACACTA 1380 

CTGATCAACA TACTTTAGAC AATTTGGAAA AAGAATTACA AGTGCTGAGA TCTCTTGCAG 1440 

ATACTTCAGA AAAGCTTTAC AGCTTAACCT CAGAAGAGTT CCCAGATTTC AGCAGCCAAT 1500 

CACTGAATAT AAGTCAGATT TCCACAGATT TCCTTAAGAC CTCACATGTG AGGGGTCCCT 1560 

GTGGAGTTGA GGAATTGAGC TGTTCTGGAA GAGATACCAA AATTCAGTCT TTGCTGTCAC 1620 

TTTCTGAGAG CAGTACAGAT GAGGAGGAGG AAGATTTTCT CAACAAGCAA CATCTCATCA 1680 

- - CACTACCGTG GTCAAAGAGT ACTTAAAGAT TATTTGTTCA TTACTGTTTC CATTTTGTAC 1740 

CCAGAGTAAA GCAAACAACT GAGAAAAGTA ACCAAGTGAT TACCTATCCA AGTGCTGGAG 1800 

ATTTTGATTA CTAATGTCTT TGATGTTTCA AGGCTACAAA CTAATAAAAG TAAAATTATA 1860 
AGTTCAAAAA AAAAAAAAAA AAAA 



Seq ID NO: C59 DNA Sequence 

Nucleic Acid Accession #: AK097746.1 

Coding sequence: 18S..2224 



1884 



Seq ID NO: C5B DNA Sequence 
OU Nucleic Acid Accession #: NM_O05408.l 
Coding sequence: 7 6.. 372 

I J 1 I 1 f 41 51 

65 AAAAGGCCGG CGGAACAGCC AGAGGAGCAG AGAGGGAAAG AAACATTGTG AAATCTCCAA 60 

. CTCTTAACCT TCAACATGAA AGTCTCTGCA GTGCTTCTGT GCCTGCTGCT CATGACAGCA 120 

GCTTTCAACC CCCAGGGACT TGCTCAGCCA GATGCACTCA ACGTCCCATC TACTTGCTGC 180 

TTCACATTTA GCAGTAAGAA GATCTCCTTG CAGAGGCTGA AGAGCTATGT GATCACCACC 240 

70 ^5^3, CCCAGAAGGC TGTCATCTTC AGAACCAAAC TGGGCAAGGA GATCTGTGCT 300 

/U GACCCAAAGG AGAAGTGGGT CCAGAATTAT ATGAAACACC TGGGCCGGAA AGCTCACACC 360 

™ A ^ A 5TI GAACTCTGCT ACCCCTACTG AAATCAAGCT GGAGTACGTG AAATGACTTT 420 

TCCATTCTCC TCTGGCCTCC TCTTCTATGC TTTGGAATAC TTCTACCATA ATTTTCAAAT 480 

AGGATGCATT CGGTTTTGTG ATTCAAAATG TACTATGTGT TAAGTAATAT TGGCTATTAT 540 

7C IIS^IT^I GCTGGTTT GG AGTTTATTTG AGTATTGCTG ATCTTTTCTA AAGCAAGGCC 600 

T^^f* AGGTTGCTGT CTCTAAGCCC CCTTCCCTTC CACTATGAGC TGCTGGCAGT 660 

GGGTTTGTAT TCGGTTCCCA GGGGTTGAGA GCATGCCTGT GGGAGTCATG GACATGAAGG 720 

SSIS^?^ TGTAGGAAG G AGAGCTCTTT GTGAATGTGA GGTGTTGCTA AATATGTTAT 780 

3 GTGGAAA i A TGAATGCAAT AGTAGGACTG CTGACATTTT GCAGAAAATA CATTTTATTT 840 

AAAATCTCCA AAAAAAAAAA a60 
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10 
15 
20 
25 
30 
35 
40 



50 
55 
60 
65 
70 
75 
80 



CTTTCATGAC 
CTTCACATAC 
ATTTTCCTTT 
ACTGATGTTT 
AACTTGGCTA 
CAGAGGACTC 
TAAAGAGCCA 
GAAAATAATA 
TGTAACTGAC 
TTACTTGGAT 
TATGGCCAGC 
TATTTCACTG 
AGAAGGAACT 
GGAAAAAATA 
GACAAGCTAT 
TAATGAACCT 
TTCTGATCCT 
TGATACAATT 
AGTGACAGAC 
GTACATAGTT 
TAAAGGCACT 
TGAGATATTT 
CCTCTTTGAG 
TACTGATCAG 
CGACATTGAA 
GTTAGTACAA 
GGACCTTGAA 
CAGTAGCTTA 
TAAGCCCCTG 
GTATAATTCA 
TTTAACTGGA 
ATATGAATTT 
TATCCACGGA 
ACAATATCCC 
ATCTCGGATT 
AGGAACTCTT 
AGACCAACCT 
CTAAATTGGA 



ATTCTGTTAG 
ATGGCACTAC 
GAAATGGAAA 
AAAGCTATTA 
CTTGTTGGAA 
GGAAGTTACA 
GGAAAACCTT 
GCTATGCAGA 
GAGGTTATCC 
CTGTATCTCG 
AAACTTCTGT 
ATAAAGTCGG 
TCCACTACGG 
ACTCGGCACT 
CAAATTTATA 



21 
I 

CCAAGATTTT 
GTAATATATG 
CCAATCTTCT 
GAGGAGTAAG 
GCTGGGAGGA 
TTTGTGAACA 
AATTTCCAGC 
GTTTAAGACC 
AAAAGTTTGT 
CCATTCCCTT 
TTGCAAATGA 
AAGGACCGAT 
TACAGAATTG 
TTACCTCTGA 
AATTCCCAGT 
TTCGGCTGAA 
AGGGATGCCG 
CTATATCTTA 
GACGTCTTCT 
ATTATAAGTT 
ACATTGAATT 
AAAACGTTGA 
TCACCCAGGG 
AAATTACCAA 
GGAAGTATCC 
GATTTAACAA 
AGGGTGTGGT 
AGGTTCCAGA 
TCACAGATTT 
GTGTGTTTTG 
ATTATGCCAG 
CATCTGATAC 
ATGGCGCACG 
TTGACCTGAT 
ATGCCTATGT 
GACATTCTAC 
GGATCAAGCG 
AAACATCCAA 



31 
I 

GGAAAAGCGC 
CCGATCACTA 
TCTGGCAAGG 
TCTTAAAAGT 
AATCTGTCGG 
TATATATGAA 
ACCAATGGAT 
TGATAAGATA 
AGAGCCTCCA 
AATTTTTGTT 
TAAATCTATG 
TGCAGCAAAA 
CCATCTTGCA 
AACCTGTAAC 
AACAATTCTA 
TCTCCTTCAA 
TGGAAAGGAA 
CCTGACTGGG 
ATTAACCATG 
TTCTCCCAGT 
CATTAAGAAA 
CATCTCCAAG 
AGGCTCCAAA 
AGATATCCTC 
TGTGAGATAT 
TTTAATTATA 
TGTGATGGAT 
AATATGGGCC 
CCTAGCCCGG 
GCTGTCAGGT 
AAAATATACC 
ATCTGACACA 
CTGGGACCGA 
GCCCATCATA 
CTGTCCCCTC 
TAACTTTGTC 
CGGGGTTGCT 
AAGTTT 



41 
I 

CTACGATATT 
TTTGAGAAGG 
AAAGAGATTG 
GCTGAGAAAA 
GCAAGTGAAT 
TGGCGAGAAA 
AAGAACCTAA 
ACCCCAGCTA 
CCATTTGATT 
CTATCTCCAG 
TCTGGAAATA 
ATGATTAAAG 
GTGTCCTGGA 
TCATCCTTTA 
CAGAATGGAG 
TCATATCTCA 
CTGTTATTTA 
GAGTGTAATT 
CTGGCTGACT 
GGAAACTATT 
CTTCCATTTA 
GATCTTCAAC 
CAGACAGGAG 
AACAAGCTCC 
GAAGAAAGCA 
ACTATACGTA 
TCTGCATTGG 
AAACGTTCAT 
TTGAACTTTT 
TTCTTTTTCA 
ACCCCTATTG 
TCACCAGAAG 
GAAAGTGGAT 
TGGATAAAAC 
TACAAGACAA 
ATTGCAATGT 
TTGCTTTGTC 



51 
I 

TAAATGACCA 
ACAAGCTGTT 
AATACCAGGA 
ATCCTGATCC 
TTCCTGCCTT 
TCTATGACAG 
ATGAACTACA 
TAACAAACTA 
TGACAAAGAG 
GAGCAGATCC 
AGTTTCAAGC 
CAGCAATTGA 
TGCCCATGTT 
GGCTTTGGCT 
TAAAAATGAC 
CTGATCCAGT 
TCAATGAATA 
ATGGAGGAAG 
TTTATAATCT 
TTGCACCTCC 
CTCAACACCC 
AAACAAAAAC 
CCTCAGGAAG 
CTAGTGATTT 
TGAATACTGT 
ACACTCTACG 
AGGCACTCTC 
ACCCAAGCCT 
TACAGGACTG 
CTCAGGCCTT 
ATTTGCTAGG 
ATGGTGTTTA 
TGCTTGCTGA 
CAACTCAAAA 
GTGAACGTAA 
TGTTAAAAAC 
AGTTGGATGA 



Seq ID NO: C60 DNA Sequence 
Nucleic Acid Accession #: J02761. 
45 Coding sequence: 14.. 1159 



1 
I 

GAATTCCGGT 
GCTCTGTGGC 
TGAGTTCTGG 
ACAGGAAGTC 
CCACATCCTT 
GGAGCAGGAG 
TGACGACTAC 
CTGTATGCAC 
GTCAGACCCC 



CTCCGAGCAG 
GCGGATCCAA 
CGTGGTACCT 
CCTGCTCGAC 
GTGCTCCATG 
CTCTGAGTGC 
CATACCACAG 
GCAATTTGTG 
CCACACCACC 
CCACAGCCCC 
GACGGGCTCT 
CCAGGCTGAA 
CACTTCCACT 
ACCACAGTCT 
GCATCTGGAG 
AAAAACCAGC 
CCCCATGCAC 
AAAATGGGAA 
TCAAAAAATC 
GGGATGGCGA 
TTGTTAAGAT 
GATTGAAAGT 
GGGTTGTGCA 
AGTCCTATTC 



11 
I 

GCCATGGCTG 
CCAGGCACTG 
TGCCAAAGCC 
TGGGGACATG 
AACAAGATGG 
TGCAACGTCC 
TTCCCCCTGG 
CTGGGCCTGT 
CTGCCCAAAC 
GTGCTGCCCG 
CAATTCCCCA 
GCCATGATTC 
CTGGTGGCGG 
ACGCTGCTGG 
GATGACAGCG 
CACCTCTGCA 
GCAATGCTCC 
GAGCAGCACA 
TGCCAGGCCC 
GACCTTTGAT 
GGGACCATGG 
AAGAAGCCTC 
GGTGGACCAC 
GACACCAGAG 
GAGCTCTGCA 
CACTGCTTTA 
TCAAAGATTG 
CATACAGAAC 
AGCAATTCCC 
CCGGGCAAGC 
ATAAAAAGGG 
CCTAGGGTGA 
AGCTAGGGAC 
TCTTTTAAAA 



21 
I 

AGTCACACCT 
CTGCCTGGAC 
TGGAGCAAGC 
TGGGAGCCGA 
CCAAGGAGGC 
TCCCCTTGAA 
TCATCGACTA 
GCAAATCCCG 
CTCTGCGGGA 
GGGCCCTCCA 
TTCCTCTCCC 
CCAAGGGTGC 
GOGGCATCTG 
GCCGCATGCT 
CTGGCCCAAG 
TGTCCGTGAC 
AGGCCTGTGT 
CGCCCCAGCT 
TCGGGGTGTG 
GAGAACTCAG 
TGACCAGGCT 
AGCTCCCACA 
GGGCCCCCAG 
CCCACTTCCA 
GCCTCCACAC 
CAGGACAGGG 
GATTTTACAG 
TCTAAAAGAT 
CAGCGTAGTC 
TTTCTTCCTC 
GTTTCTTTTT 
TTCTATTTCT 
CCATTCCTGT 
AAAAAAAAAA 



31 
I 

GCTGCAGTGG 
CACCTCATCC 
ATTGCAGTGC 
TGACCTATGC 
CATTTTCCAG 
GCTGCTCATG 
CTTCCAGAAC 
GCAGCCAGAG 
CCCTCTGCCA 
GGCGAGGCCT 
CTATTGCTGG 
GCTAGCTGTG 
CCAGTGCCTG 
GCCCCAGCTG 
GTCGCCGACA 
CACCCAGGCC 



GCTGACCCTG 
TGGGACCATG 
CTGTCCAGCT 
CTTCCCCTGC 
CCGCCCTCCT 
CCCTGTGTCG 

CTACCACGAC 
GGTTGAAGCT 
CTACTTGCAA 
AGACATCAGA 
AAGGGTGGAC 
GAGATGCTCT 
GTCTTTCTGT 
GCTGTGATTT 
GTAATACAAT 
AAAAAAAACG 



41 
I 

CTGCTGCTGC 
TTGGCCTGTG 
AGAGCCCTAG 
CAAGAGTGTG 
GACACGATGA 
CCCCAGTGCA 
CAGACTGACT 
CCAGAGCAGQ 
GACCCTCTGC 
GGGCCTCACA 
CTCTGCAGGG 
GCAGTGGCCC 
GCTGAGCGCT 
GTCTGCCGCC 
GGAGAATGGC 
GGQAACAGCA 
CTGGACAGGG 
GTGCCCAGGG 
TCCAGCCCTC 
GCAAAGGAAA 
TCCCTGGCCC 
CACCTCCCTT 
GCCTTGTCTG 
TGTGAGGCAC 
CTCCCAGGGC 
GAGCCCCGCC 
TTCAAAATTC 
AATTGTTAAG 
ACTGCACGCT 
GCTGCTTGAG 
AAGGTGGACT 
ATCTGCTGAA 
GTCTGCACCA 
GAATTC 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2256 



51 
! 

TGCTGCCCAC 
CCCAGGGCCC 
GGCATTGCCT 
AGGACATCGT 
GGAAGTTCCT 
ACCAAGTGCT 
CAAACGGCAT 
AGCGAGGGAT 
TGGACAAGCT 
CACAGGATCT 
CTCTGATCAA 
AGGTGTGCCG 
ACTCCGTCAT 
TCGTCCTCCG 
TGCCGCGAGA 
GCGAGCAGGC 
AAAAGTGCAA 
GCTGGGATGC 
TCCAGTGTAT 
AGCCAAGTGA 
TCGCCAGCTG 
CCTCGGCAGT 
TCTCAGCTCA 
AGCGAGGGCA 
TGGGCTCAGG 
TCACACCCAC 
AGAAGAATAA 
TTAAGCTTTT 
CTGGCATGAT 
AGCTATTGCT 
TCCAGATTTT 
AGCTCAGCTG 
ATGCTAATAA 



60 

120 

160 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2026 



Seq ID NO: C61 DNA Sequence 
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Nucleic Acid Accession #: NM_139172.1 
Coding sequence: 19..S52 



5 

10 

15 

20 

25 

30 

35 

40 

45 
50 
55 
60 
65 
70 
75 
80 



i 
I 

GGGGTCTGGG 
CTGCAGGGCT 
GGTGACATAT 
TGCCTGCGTC 
AACGTGCGGA 
AGCTGCAGCA 
TTCCTGGCGG 
ACCAAGAAGA 
GTGGAGGGAG 
GAGGAGGATT 



11 
I 

GAGGTGACAT 
CGGCAGACGG 
GGGACCGGGA 
TCCGGTGCTG 
GGAAGCACAT 
TCTGCTTGTT 
GTCCGTGTGA 
CGCCGTCCAC 
GCACOGAGGG 
AGGGGAGTCC 



21 
I 

GTTGGGCTGT 
AAATGGAATC 
GAGCTGTGGG 
CTACCGCAAT 
GTGGGCGCTG 
CTGGTGGGCC 
CATGTCCAAG 
GGGCAGCGTG 
GGAAGGGACG 
CCGGGGGACT 



31 
I 

GGGATCCCAG 
CAGGGATTCT 
GGCCAGGCGG 
GGGGTCTGCT 
GTCTGGACGT 
AAGCGCCGGG 
TCCGTCTCGC 
CCAGTCGCCC 
GAGGAGGGTG 
GCTCAATACA 



41 
I 

CGCTGGGCCT 
TCTACCCATG 
CCATCGATAG 
ACCACCAGCG 
GCAGCGGCCT 
ACGTGCTGCA 
TGCTCTCCAA 
TGTCCAAAGA 
AGGAGACAGA 
GATACGGTGG 



51 
I 

GCTCCTGCTG 
GAGCTGTGAG 
CCCCAACCTC 
TCCAGACGAA 
CCTCCTCCTG 
TATGCCCGGT 
GCACCGAGGG 
GTCCAGGGAT 
GGGCGAGGAA 
ACG 



60 

120 

180 

240 

300 

360 

420 

480 

540 

593 



Seq ID NO: C62 DNA Sequence 

Nucleic Acid Accession ft: NM_054023. 

Coding sequence: 98.. 3 79 



1 
1 

GGGGACACTT 
ATTTTTAAAC 
GCTGGTGACC 
CCTTCCTGTT 
ATTAAAGCTT 
GAAGTGTGTA 
GCTATCACAC 
TGCTCCTATC 
GTAGTGACCC 
AAAAAAAAAA 



11 
I 

TGTATGGCAA 
TCCTGAAAAA 
ATCAGCCTTT 
GACAAGTTGG 
CTTCTGAAAA 
AATGAGCTGG 
TTGGTGTGAC 
CTCCCTGCCT 
GTGAAAAGGA 



21 
I 

GTGGAACCAC 
TATCCCAGAT 
GTAGTTACTC 
CACCTTTACC 
CTCTGGGCAT 
GACCAGAGGC 
ATCAAGATAA 
GAAACCTGTT 
CAAATAAAGC 



31 
I 

TGGCTTGGTG 
AACTGTCATG 
TGCTACTGCC 
TCTGGACAAC 
TTCTGTTGAG 
TTCTGAAGCT 
AGAGCGGAGG 
CTACCAATTA 
AATGAATACT 



41 

1 

GATTTTGCTA 
AAGCTGGTAA 
TTCCTCATCA 
ATTCTTCCCT 
CACCTTGTGG 
GTGAAGAAAC 
TGGATGGGGA 
TAGATCAAAT 
AAAAAAAAAA 



51 
1 

GATTTTTCTG 
CTATCTTCCT 
ACAAAGTGCC 
TTATGGATCC 
AGGGGCTAAG 
TGCTGGAGGC 
TGGAAGATGA 
GCCCTAAAAT 
AAAAAAAAAA 



Seq ID NO: C63 DNA Sequence 

Nucleic Acid Accession #: FGENESH predicted 

Coding sequence : 1 . . 2874 



60 

120 

180 

240 

300 

360 

420 

480 

540 

550 



1 11 21 31 41 51 

t t I I I I 

ATGCCCCTGT CCTATGCCTA TAAAAACGCT GAGACCCTAG CAGGCAGACA CACAAGCAGC 60 

TGGATGTCGA GAGGAGCATA TCAGCGGAGG AACACACGGG CAGCTGGACG TCCAGAGGAA 120 

TGCACTGACA GAAACTGGCA TGCTGGCAGA ACACGTGGAA TTTGGCTGGG GCAGTTGGAG 180 

GAGAGATGTT CAGATGTGTT CGGAGTTTCT TTCTTCTGGT GGGTTCGTGG TCTCGCTGGC 240 

TCAGGAGCGA AGCTGCAGAC CTTCACGCCA GCCCAGGAAG GGGCTCCCAC AGTGCAGCGG 300 

CAGGCTGAAG CGCTCCTCAA GTGCCGCCAG AGTGGGCGTC CAGGCAGAGG AGGCGCCGAG 360 

AGCGAGCGAG CGAGGGATGC CAGCATGCTG TCACCTCTCA GTGCTGCCAT GCGAAACTAC 420 

CCAACGTCCT CTACCATCCC TCCAAGAAGA TCCTACTCTC CAACCGAAAT TGCTCACAAG 480 

AGTTACTCCT GCAGCCTTCC AGACATGAAA ATCTCCATGG CAGAATCTGG CCCCTCCTTG S40 

GATAGCCTTG ACATTCTGGA GGATGGCGAG TCTGGGTCAC CATTTCTTGT GACTCATTTG 600 

TACTTTCTGG GGGTTGTCAC CACTGGGATG GAACAACTAG ATTTTGAAAC AGGACCAAAC 660 

ATATTTGATT TGCAGATTTA TGTGAAGGAT GAGGTTGGTG TCACAGACCT GCAAGTCCTG 720 

ACTGTCCAGG TAACAGATGT GAACGAGCCA CCTCAGTTTC AAGGCAACTT GGCAGAAGAT 780 

CATCTCCGTG CAGACCAGCC ACATTTCAAT GCTCATAGTC ACACGTACGT GAGGGTAGTG 840 

GCTACTGCAT TGGCCAGGCA CAGGCTTAGA TCTAGCATTG GTTCCCCCTT CCTGGGCACC 900 

TTCTGTGTTG TGGTGGGCAT GCAGTATTTC CTGATTTCTC CCCCAAAGAG CTTCAGAATG 960 

TCTGCTAATG GCACCCTCTT CTCCACAACA GAATTGGACT TTGAAGCAGG ACACAGAAGT 1020 

TTCCATCTCA TCGTGGAGGT GAGGGACAGT GGAGGCCTCA AAGCCTCCAC AGAGCTCCAG 1080 

GTGAACATCG TGAACCTCAA CGACGAAGTC CCTCGCTTTA CCAGCCCGAC ACGAGTGTAC 1140 

ACAGTCCTGG AGGAACTGAG TCCAGGAACC ATCGTGGCCA ATATCACAGC GGAGGATCCT 1200 

GATGATGAAG GTTTTCCCAG CCACCTCCTC TACAGCATTA CCACTGTTAG CAAATATTTC 1260 

ATGATAAATC AGTTGACTGG TACAATCCAA GTGGCCCAAA GGATAGACCG AGATGCAGGT 1320 

GAATTGAGAC AAAATCCCAC CATTTCCCTG GAAGTTCTAG TGAAGGACAG ACCATATGGG 1380 

GGTCAGGAGA ATCGCATCCA GATAACCTTC ATTGTGGAAG ACGTCAACGA CAATCCTGCC 1440 

ACATGCCAAA AGTTCACCTT CAGATCCAGT CTCCACCCTG CTCTGTGCTC CAAGACGCTG 1500 

ACCTGGATGG ATACCGTATT AGACTGTTTT CATGCTGCTG ATAAAGATAT ACCTGTGACT 1560 

GGGCGATTTA CAAAAGAAAG AGGTTTAATT GGACTTACAG TTCCACATGG CTGGGGAAGC 1620 

CTCACAATCA TGGCAGAAGG CAAGGAGGAG CAAGTCACAT CTTACATGGA TGGCAGCAGG 1680 

CAAAGAGATA GAGCTTGTGT AGGGAAACTC CTCCTTATAA AGCCATCAGA TCTCATGAGA 1740 
CTTAGTCACT ATCACGAGAA CAACTCAGGA AAGACTTGCC CCCATGATTC CATTTCCTCC 1800 
TACCAGGTCC CTCCCACAAC ATGTAGGAAT TCAAGAATCC AGGCCACCAA CAACGAAGAC I860 
ACAAGCTCTG TCACTGTTAC TGTGAACATC CTTGAAGAAA ATGATGAAAA GCCAATTTGT 1920 
ACTCCAAACT CTTATTTCCT GGCCCTCCCA GTGGATCTGA AAGTTGGCAC AAATATTCAG 1980 
AATTTCAAGC TGACATGTAC CGACCTTGAT TCCAGCCCCA GATCTTTCCG TTATTCCATT 2040 
- GGCCCAGGTA ACGTCAACAA TCATTTCACC TTCTCTCCCA ATGCTGGTTC CAATGTCACA 2100 
CGCCTGCTGC TTACATCTCG CTTTGACTAT GCTGGTGGGT TTGATAAGAT CTGGGACTAC 2160 
AAGCTACTTG TCTACGTAAC TGATGACAAC TTGATGTCTG ACAGGAAGAA AGCGGAGGCT 2220 
CTTGTTGAGA CAGGAACAGT GACACTGAGT ATTAAAGTCA TTCCCCACCC AACCACTATC 2280 
ATCACCACGA CCCCCAGGCC CAGGGTCACC TATCAGGTCC TGAGGAAAAA CGTTTACTCT 2340 
CCATCTGCAT GGTACGTGCC GTTTGTCATC ACTTTGGGCT CCATATTGCT TCTGGGTCTC 2400 
CTCGTGTACC TGGTCGTCCT ATTGGCCAAA GCCATCCACA GACACTGCCC CTGCAAGACT 2460 
GGGAAGAACA AGGAACCTCT GACAAAGAAA GGAGAAACGA AGACTGCAGA GAGAGACGTC 2520 
GTGGTGGAAA CTATCCAGAT GAACACTATC TTTGATGGAG AAGCCATAGA TCCAGAGCCT 2580 
GAGCAAGCTT CACTCGAGCT CTATGCCCTG CTGCCCAGCT GCTGCGACCC TAGTCCAGTA 2640 
ACCCTAAGAA AGGTCCAGGT GTGTGGGGAG AGTGAAGAGA CCGGTCAGTG TTCCGGCCAC 2700 
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ATCACACTTC CCGGCAAGAT TCCAGTCGAT GACCCAAGGA AACAGGAAAC AGGCCTGCAG 2760 
GGTGATTTCG AGGTCTGGAC TCTATGCCCC GCTGTGAAGG TGGTTGTAGG CAGCCCTCAA 2820 
GCTGAACGGT GCATTCGATT GGCTCTCAGT CTGAAAAAGT ACAGTTCTGA TTAA 2874 

Seq ID NO: C64 DNA Sequence 

Nucleic Acid Accession #: XM_168S71.l 

Coding sequence: 155.. 988 " % 



I 

TACACAGTCC 
CCTGATGATG 
TTCATGATAA 
GGTGAATTGA 
GGGGGTCAGG 
GCCACATGCC 
TTGCTTCTTG 
TTCAACTTCA 
GGCTCTGGGA 
GCCGGCAATA 
AATAACGTCT 
AGGCCATCCT 
CCTGAGGAAA 
CTCCATATTG 
CAGACACTGC 
GAAGACTGCA 
AGAAGCCATA 
AAAGTGGAAA 
TGCCCCACGC 
AGAAGATGAG 
GGGTGGCAAG 
CAAACCACAC 
TGCCTCACCC 
TGAGGGGATT 
GAAAGGGGTT 
ATTTCAGAAC 
TGTGTTTCTT 
TAGGAATTCC 
CTGCCCTCGT 
AAGTCACCTC 
AAGAGTTGGT 
GGTGTCOGGA 
TCAGTTCTCT 
CTACACTTGG 
ACTACAGGCA 
ATCACCTAAG 
TAAAAAATAC 
AGGCTGAGGT 
GTGCCACTGC 
AAGTGAATTA 



11 
I 

TGGAGGAACT 
AAGGTTTTCC 
ATCAGTTGAC 
GACAAAATCC 
AGAATCGCAT 
AAAAGTTCAC 
ACCTAAACAA 
CCATGCCATC 
AGATTGTGCT 
AATATACGGT 
ACGTTTATAT 
ATGTATTTGA 
AACGTTTACT 
CTTCTGGGTC 
CCCTGCAAGA 
GAGAGAGACG 
GATCCAGTGA 
GATCCACTAA 
AGAGTCACTG 
CTGAGTGGCA 
CTGGGCAACC 
CCAGGAAAGT 
TAAATTCTAT 
CAGACATCCA 
TGATCACATA 
TACCTGTGCT 
CTTTGCATTG 
TGACATAAAT 
TTTAACATCA 
AAATCTTCTT 
CATCTGGAAA 
TCAGCATCCT 
TCATGGTTAT 
GTGGCAGCTG 
GGGCATGATG 
GTCAGGAGTT 
AAAAATTAGC 
AGGGAGAACT 
ACTCTAGCCT 
CAACACT 



21 
I 

GAGTCCAGGA 
CAGCCACCTC 
TGGTACAATC 
CACCATTTCC 
CCAGATAACC 
CTTCAGCATT 
GTTCTGCTTT 
TGGAGTGGGG 
GATTGGTGAT 
GATAATCCAG 
CCTAACAAGC 
TGTGT CAGAA 
CTCCATCTGC 
TCCTCGTGTA 
CTGGGAAGAA 
TCGTGGTGGA 
CCGGGGAAAC 
CCCAAATGCC 
CTGGGGAAGG 
AAGCGTGGGC 
CAAAGAACAG 
AAACGGGGTC 
GGGGATGGTG 
GGGTCAAACA 
GTTGCGTGTT 
TCTGATAAGC 
ACTGCTAGGA 
AGTGAAGACT 
CCCAGATTTC 
CAAAAGAAGC 
GAAGAAAACT 
GCATGTGAGA 
GTCTGGTTTC 
CTTCAGATTT 
GCTCATGCCT 
CAAGATCAGC 
TTGGTGTGGT 
GCTTAAACCT 
GGGTGACAAA 



31 
I 

ACCATCGTGG 
CTCTACAGCA 
CAAGTGGCCC 
CTGGAAGTTC 
TTCATTGTGG 
ATGGTGCCGG 
GATGATGACA 
AGCGGCAGCA 
CTAGACTACG 
GTGCAGGATG 
CCAGAAAATG 
AGAAGGCCCG 
ATGGTACGTG 
CCTGGTCGTC 
CAAGGAACCT 
AACTATCCAG 
ATATGAATTC 
AAAATGGAAA 
GATGGGGTCA 
TGAGGATGCT 
AAATCCAGCC 
TAAGGAGGGG 
TGGGCATGGT 
TGGGATGTTT 
CTGAAATGAT 
AAGACTGTTA 
AGCTCTATTC 
ATCCTTACAT 
TTCAGTTATA 
AGAACAGTGA 
CAGTAGGCAC 
TTCATCCACG 
ATTCTATGAT 
TTTACTTTTA 
GTAATGCCAG 
CTGGCCTAGA 
GGTGGGCACA 
GAGAGGTGGA 
GCAAGACTCC 



41 

I 

CCAATATCAC 
TTACCACTGT 
AAAGGATAGA 
TAGTGAAGGA 
AAGACGTCAA 
AAAGAACAGC 
GTGAGGCACC 
GATTTTTACA 
AAAATCCAAG 
TGGCCCCCCC 
AGTTTCCTCT 
CCCAGGGTCA 
CCGTTTGTCA 
CTATTGGCCA 
CTGACAAAGA 
ATGAACACTA 
AACTCAAAAA 
GAGTCCAGCC 
CTGAGAAGTG 
GGTCTGGGTT 
TTCATGAACA 
CCTGTCAATC 
GTAGGGGGGA 
GACAAATTTT 
ACAGGAACAT 
ACTTTGGGGT 
TGTTCACCAT 
CTGGTTTCCA 
AATATGCCAT 
AAAAAACAGA 
CTTCTTTTGT 
TTGTCCTGTC 
TATATCACAA 
AAAAATATAC 
CACTTTGCCA 
TGGCAAAACC 
TGTAATCCCA 
GGTTACAGTG 
ATCTCAGAAA 



51 
I 

AGCGGAGGAT 
TAGCAAATAT 
CCGAGATGCA 
CAGACCATAT 
CGACAATCCT 
CAAGGGGACG 
AAACAACAGA 
GGATCCAGCT 
TAACCTAGCA 
TTACTATAAA 
CATTTTTGAT 
CCTATCAGGT 
TCACTTTGGG 
AAGCCATCCA 
AAGGAGAAAC 
TCTTTGATGG 
CTGGAGCCAG 
ACCAGGGAGC 
CCAACTGGGA 
CCAGAAATGA 
GGGCTTACCC 
ACTGAGATGC 
AAATGTGGGC 
TAAACAAATA 
TTTCTATCAG 
GTGGAATTGT 
AGAAAGTTTG 
CCTTATTTTC 
ACACCTTTGT 
TGAGTAAGTT 
TTTTTCTTGT 
TAGCAGTAGT 
TTTATCTATT 
TTAAAAGTGA 
AGGTGGGCAG 
CTGTCTCTAC 
GCTACTTGGG 
AGTTGAGATT 
AAAAAAATAA 



Seq ID NO: C65 DNA Sequence 

Nucleic Acid Accession #: NM_005266.3 

Coding sequence: 122.. 1198 ~ 



GGCACGAGGC 
ACACAAGGCA 
GATGGGCGAT 
GGTAGGCAAG 
TGCTGAGTCT 
CTGCCAGAAT 
GCAGATCATC 
GCGCATGCAG 
TGGCTCTTAC 
TGGAAGGATT 
CACCACCATG 
CACCCTGCAT 
GCCCACAGAG 
CCTTAGCCTG 
ACCGCGGCAG 
CTGCACACCA 
CAATCCCTTC 
AGTACGAGGT 
GCCTGAGGTG 
CAAGCGACGT 
CTCCTTTATG 
GTCCCTTCTG 
TCTCAATGAC 
AGGTGACAAC 
TTCAGATTAC 
GGAAGGGATA 
GCGTTCTCTA 



11 
I 

CATTTTCAAA 
GCAAGCACTG 
TGGAGCTTCC 
GTCTGGCTCA 
TCCTGGGGGG 
GTCTGCTACG 
TTCGTCTCCA 
GAGAAGCGCA 
GAGTACCCGG 
GCCCTCCAGG 
GAGGTGGGCT 
GTCTGCCGCA 
AAGAATGTCT 
GCTGAACTCT 
CACATGGCTA 
CCCCCCGACT 
AGCAATAATA 
CAGGAGCAGA 
CCCAATGGAG 
CTTAGTAAGG 
GGAGGATCAG 
AACTGATGCT 
GTTGCTCATT 
CCACCCAGAC 
TCATGAAACA 
GCCAGAGGGA 
AGTTCCTACC 



21 
I 

CAGTCCCTCC 
GGAGACGAAA 
TGGGAAATTT 
CTGTCCTCTT 
ATGAGCAGGC 
ACCAGGCTTT 
CGCCCTCTCT 
AGCTACGGGA 
TGGCAGAGAA 
GCACTCTGCT 
TCATTGTGGG 
GGAGTCCCTG 
TCATTGTCTT 
ACCACCTGGG 
AGTGCCAGCT 
TTAATCAGTG 
TGGCCTCCCA 
CTCCTGGGGA 
TCTCACCAGG 
CCAGCAGCAA 
GACCAGGTGG 
TTCTCACTGT 
AATTCTAGAA 
TGCAGTTCCC 
GGGTAGAGGG 
TAGAATGACT 
TCCTTGACCT 



31 
I 

TGGGAGAACA 
GTTTTGGCAT 
CCTGQAGGAA 
CATATTCCGT 
TGATTTCCGG 
CCCCATCTCC 
GGTGTACATG 
GGCCGAGAGG 
GGCAGAACTG 
CAACACCTAT 
CCAGTACTTC 
TCCCCACCCG 
TATGCTGGCT 
CTGGAAGAAG 
TTCTGGCCCC 
CCTGGAGAAT 
ACAAAACACA 
AGGTTTCATC 
TCACCGCCTT 
GGCAAGGTCA 
GAACAAAGGA 
CATCACTGCT 
ACTATAACCA 
TCCCCACCCT 
AAAGAAGGGA 
CTCTCTCTAC 
GATCACCCTC 



41 
I 

CAGACAGGCA 
CTGTTCCCTG 
GTACACAAGC 
ATGCTCGTGC 
TGTGATACGA 
CACATTCGCT 
GGCCACGCCA 
GCCAAAGAGG 
TCCTGCTGGG 
GTGTGCAGCA 
ATCTACGGAA 
GTCAACTGTT 
GTGGCTGCAC 
ATCAGACAGC 
TCTGTGGGCA 
GGCCCTGGGG 
GACAACCTGG 
CAGGTTCGTT 
CCCCATGGCT 
GATGACCTAT 
GGCTCAGAGA 
TGGCTCCTTT 
GGGCTCTGGG 
CTACCCAGTA 
AGCATGGCAA 
ATACCAGCAG 
CCTCCTCCAA 



51 
I 

GAGGATTACA 
GCTGTGCCAA 
ACTCGACCGT 
TGGGCACAGC 
TTCAGCCTGG 
ACTGGGTGCT 
TGCACACTGT 
TCCGGGGCTC 
AGGAAGGGAA 
TCCTGATCCG 
TCTTCCTGAC 
ACGTATCCCG 
TGTCCCTCCT 
GATTTGTCAA 
TAGTCCAGAG 
GAAAATTCTT 
TCACCGAGCA 
ATGGCCAGAA 
ATCATAGTGA 
CAGTGTGACC 
GGAAAGACGT 
GAGCCCCGGG 
ATAGTAAGAG 
TACGAAGCCT 
AAGCTGGCCT 
CATACCAAAT 
GGAAGAGCTC 
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AAAGTTCCCA 
CTGTTGTCTC 
CTTCTCCTCC 
CTTTACAAAT 
GCCCCACCTT 
GGTGTCAGCC 
TCTGTCACCC 
CTCCCTTGGC 
GCTGTTGTCC 
CCAGGCCTTG 



GCCAATAGAC 
CATGGACCAT 
CTCTACTCAG 
GCCTTGGTAT 
GGTGATGTTG 
CTGCTAGGGA 
TGCAGCTCCT 
TGTTCACCCA 
TCTCTCGAAC 
TTAAAAAAAA 



AGCATGAATC 
TCCTCGGAGT 
CCTTAAAAAG 
GTACCTCTGG 
CAACCTTTCC 
GTCACTGTAC 
TTACAGTTCA 
GCCATTCCCT 
CCTGACCAGA 
AAAAAAAAAA 



AAGGAACTTG 
AGTGGTGAGA 
GGCTTCTTGG 
CAAATGCCCT 
TTCTGCTAGG 
ACACAAACTC 
ATCCAATGAT 
GAAGGCCTTA 
TCATCAGCCA 



CATTATATGT 
TGGCCTTGGG 
AACTTTACCA 
GGTATGTACC 
GTGTACACCT 
TACTGGAATT 
AGAAACCATC 
CCAACAGGAA 
CTGAGGCCAG 



GCTCTTGAAT 
TTGCCCTTGG 
GCAGCCTCAG 
TCTGGCAAAT 
AGCCTGTGCA 
CCTGCCAACA 
CCTTCCCTTT 
TATCCAAGAA 
TGGAATTTCC 



Seg ID NO: C66 DNA Sequence 

Nucleic Acid Accession tf: NM_0144S9.2 

Coding sequence: 73B..3407 
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1 11 21 31 41 51 

I I 1 I I I 

GTAGATGCAG TCCGCCGCCG CCGCTGCCTC AGCCAGCAAT GCAAGATTAG ATCTCTAAAT 
GCAGCAAAAC ACTGCCTGAA AACAGACCGG CCCGCGCAGC AAGCAGACAT TTCACGGTGC 
GCTGGGGAAG CTTCAAAATA TATCTGTGAC TCTGTCTTCG TTGCTCTTCA TCCCCATCAA 
TTTCATCACG GGAGGCGAGC AGCAAGTAAG AATTTCACTT TCGGATCTGC CTAGAGACAC 
ACCTCCCTGC TCCCTCCCCC ACTCGATGTG AAGAGTATTC CGGAGTCTCC GGGCGGGAGT 
AGATTTGCAG CACCCTAGCG GGAGCGAGGA AAACCTACTG ATTCTTTAGC TCATTATCAT 
CTCTCCCAGA CGAGATTTCC TTCTTATCGC CTGCCTCATC GCTCAAGTTT GAGCCTCCCG 
AAGTCCGGGC GGGAGAGACG AAACCCCTGG CTCACCCCCA GCCGCAGGAA GCCACCGCCT 
TGCTCCAAGC CCCTGCAGCT CTGCTGCACC GCAGCTTCTC ACCCAGTGCG GATGCTGTAG 
ATCAACAGGT TCAGGGAACT TGAGCAGAAT AAGGAGAGAC CACCGGGTGC CGCAGCTCGG 
GTGCAGAGGG AAAAAAGGAC CCATAGACTT GTGGCTCGCG TCGCGCGCGC ACGCTGCGCC 
AGGGCCCCAG GCTGGCGCGC ACTCCCTCTC TGGCTCCTCC AGTCCGATTG CTCCTGCCCC 
CACCTTACAG GTCTGGGATG TACCTTTCCA TCTGTTGCTG CTTTCTTCTA TGGGCCCCTG 
CCCTCACTCT CAAGAACCTC AACTACTCCG TGCCGGAGGA GCAAGGGGCC GGCACGGTGA 
TCGGGAACAT CGGCAGGGAT GCTCGACTGC AGCCTGGGCT TCCGCCTGCA GAGCGCGGCG 
GCGGAGGGCG CAGCAAGTCG GGTAGCTACC GGGTGCTGGA GAACTCCGCA CCGCACCTGC 
TGGACGTGGA CGCAGACAGC GGGCTCCTCT ACACCAAGCA GCGCATCGAC CGCGAGTCCC 
TGTGCCGCCA CAATGCCAAG TGCCAGCTGT CCCTCGAGGT GTTCGCCAAC GACAAGGAGA 
TCTGCATGAT CAAGGTAGAG ATCCAGGACA TCAACGACAA CGCGCCCTCC TTCTCCTCGG 
ACCAGATCGA AATGGACATC TCGGAGAACG CTGCTCCGGG CACCCGCTTC CCCCTCACCA 
GCGCACATGA CCCCGACGCC GGCGAGAATG GGCTCCGCAC CTACCTGCTC ACGCGCGACG 
ATCACGGCCT CTTTGGACTG GACGTTAAGT CCCGCGGCGA CGGCACCAAG TTCCCAGAAC 
TGGTCATCCA GAAGGCTCTG GACCGCGAGC AACAGAATCA CCATACGCTC GTGCTGACTG 
CCCTGGACGG TGGCGAGCCT CCACGTTCCG CCACCGTACA GATCAACGTG AAGGTGATTG 
ACTCCAACGA CAACAGCCCG GTCTTCGAGG CGCCATCCTA CTTGGTGGAA CTGCCCGAGA 
ACGCTCCGCT GGGTACAGTG GTCATCGATC TGAACGCCAC CGACGCCGAT GAAGGTCCCA 
ATGGTGAAGT GCTCTACTCT TTCAGCAGCT ACGTGCCTGA CCGCGTGCGG GAGCTCTTCT 
CCATCGACCC CAAGACCGGC CTAATCCGTG TGAAGGGCAA TCTGGACTAT GAGGAAAACG 
GGATGCTGGA GATTGACGTG CAGGCCCGAG ACCTGGGGCC TAACCCTATC CCAGCCCACT 
GCAAAGTCAC GGTCAAGCTC ATCGACCGCA ACGACAATGC GCCGTCCATC GGTTTCGTCT 
CCGTGCGCCA GGGGGCGCTG AGCGAGGCCG CCCCTCCCGG CACCGTCATC GCCCTGGTGC 
GGGTCACTGA CCGGGACTCT GGCAAGAACG GACAGCTGCA GTGTCGGGTC CTAGGCGGAG 
GAGGGACGGG CGGCGGCGGG GGCCTGGGCG GGCCCGGGGG TTCCGTCCCC TTCAAGCTTG 
AGGAGAACTA CGACAACTTC TACACGGTGG TGACTGACCG CCCGCTGGAC CGCGAGACAC 
AAGACGAGTA CAACGTGACC ATCGTGGCGC GGGACGGGGG CTCTCCTCCC CTCAACTCCA 
CCAAGTCGTT CGCGATCAAG ATTCTAGACG AGAACGACAA CCCGCCTCGG TTCACCAAAG 
GGCTCTACGT GCTTCAGGTG CACGAGAACA ACATCCCGGG AGAGTACCTG GGCTCTGTGC 
TCGCCCAGGA TCCCGACCTG GGCCAGAACG GCACCGTATC CTACTCTATC CTGCCCTCGC 
ACATCGGCGA CGTGTCTATC TACACCTATG TGTCTGTGAA TCCCACGAAC GGGGCCATCT 
ACGCCCTGCG CTCCTTTAAC TTCGAGCAGA CCAAGGCTTT TGAGTTCAAG GTGCTTGCTA 
AGGACTCGGG GGCGCCCGCG CACTTGGAGA GCAACGCCAC GGTGAGGGTG ACAGTGCTAG 
ACGTGAATGA CAACGCGCCA GTGATCGTGC TCCCCACGCT GCAGAACGAC ACCGCGGAGC 
TGCAGGTGCC GCGCAACGCT GGCCTGGGCT ATCTGGTGAG CACTGTGCGC GCCCTAGACA 
GCGACTTCGG CGAGAGCGGG CGTCTCACCT ACGAGATCGT GGACGGCAAC GACGACCACC 
TGTTTGAGAT CGACCCGTCC AGCGGCGAGA TCCGCACGCT GCACCCTTTC TGGGAGGACG 
TGACGCCCGT GGTGGAGCTG GTGGTGAAGG TGACCGACCA CGGCAAGCCT ACCCTGTCCG 
CAGTGGCCAA GCTCATCATC CGCTCGGTGA GCGGATCCCT TCCCGAGGGG GTACCACGGG 
TGAATGGCGA GCAGCACCAC TGGGACATGT CGCTGCCGCT CATCGTGACT CTGAGCACTA 
TCTCCATCAT CCTCCTAGCG GCCATGATCA CCATCGCCGT CAAGTGCAAG CGCGAGAACA 
AGGAGATCCG CACTTACAAC TGCCGCATCG CCGAGTACAG CCACCCGCAG CTGGGTGGGG 
GCAAGGGCAA GAAGAAGAAG ATCAACAAAA ATGATATCAT GCTGGTGCAG AGCGAAGTGG 
AGGAGAGGAA CGCCATGAAC GTCATGAACG TGGTGAGCAG CCCCTCCCTG GCCACCTCCC 
CCATGTACTT CGACTACCAG ACCCGCCTGC CCCTCAGCTC GCCCCGGTCG GAGGTGATGT 
ATCTCAAACC GGCCTCCAAC AACCTGACTG TCCCTCAGGG GCACGCGGGC TGCCACACCA 
GCTTCACCGG ACAAGGGACT AATGCAAGCG AGACCCCTGC CACTCGGATG TCCATAATTC 
AGACAGACAA TTTTCCCGCA GAGCCCAATT ACATGGGCAG CAGGCAGCAG TTTGTTCAAA 
GTATTTCAGT AGCTCCACGT TTAAGGACCC AGAAAGAGCC AGCCTGAGAG ACAGTGGGCA 
CGGGGACAGT GATCAGGCTG ACAGTGACCA AGACACTAAC AAAGGCTCCT GCTGTGACAT 
GTCTGTTAGG GAGGCACTCA AGATGAAAAC TACTTCAACT AAAAGCCAAC CACTTGAACA 
AGAACCAGAA GAGTGTGTTA ATTGCACAGA TGAATGCCGA GTGCTTGGTC ATTCTGACAG 
GTGCTGGATG CCACAGTTCC CTGCAGCCAA TCAGGCTGAA AATGCAGATT ACCGCACAAA 
TCTCTTTGTA CCTACAGTTG AAGCTAATGT TGAGACTGAG ACTTACGAAA CTGTGAATCC 
CACTGGGAAA AAGACTTTTT GTACATTTGG AAAAGACAAG CGAGAGCACA CTATTCTCAT 
TGCCAACGTT AAACCTTATT TAAAAGCCAA ACGTGCCCTG AGCCCTCTCC TCCAAGAGGT 
CCCCTCAGCA TCAAGCAGCC CAACCAAGGC GTGCATCGAG CCTTGCACCT CAACAAAAGG 
CTCCCTGGAT GGCTGTGAAG CAAAACCAGG AGCCCTGGCT GAAGCAAGCA GTCAGTACTT 
GCCCACTGAC AGTCAATATC TGTCACCTAG TAAGCAACCA AGAGACCCTC CCTTCATGGC 
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5 



TTCCGATCAG ATGGCAAGGG TCTTTGCAGA TGTGCATTCC AGAGCCAGCC GGGATTCCAG 4080 

TGAGATGGGT GCTGTTCTTG AGCAGCTTGA CCACCCCAAC AGGGATCTGG GCAGAGAGTC 4140 

TGTGGATGCA GAGGAAGTTG TGAGAGAAAT TGATAAGCTT TTGCAAGACT GCCGGGGAAA 4200 

CGACCCTGTG GCTGTGAGAA AGTGAAAAAA GAAAAAAAAA AAGGCATTGG CATTTTCTTG 4260 

TCTCTTCTGT TGATTTAAAA ATGATCCCTC CTGGTGATAA CCCATTTTAC AGGGATGAAG 4320 

AAAGACCAAT GCTGCTTTAA GGCTTTTAGT GAACATCTGA AGTGCCCACA AGTATGTTCT 4380 

TTCCACTGCT GATTTCTTTT TCAGAGATAA CAATGGTTTC GTTTTGACCA AACTTGTATT 4440 

AGGACAGAAT TAATGATGCT TAAAGAGAAA AGAAAAAAAG AGAGAAGAAA AAGGAGAGAT 4500 

GAAAAAGGAG GATGAGGAGA AGAATTACCT TTTGACAATC TGTTAGGAAG GTATGCAGTG 4560 

1U TGAGAACTGA AGTATTTCTG ATCACTCTCA GACTGTCCTC CGTGATTTAT GCTGACTTAA 4620 

CTGTTTACCT ATAAACCCCA TACAAAGCAG GGTCATAATT TGTGATCTGT GGTGGATTTC 46 BO 

TAGCAGTCAT CACAGGCTTC TACTGAAAGT CCTGAAAAGA CCTTGCAGTA GTCCAAGCTA 4740 

CACCAAACAT TAACACATAT TTGTGGTAAA CATTTCTGTA TAAAGTTACC TGACACACAT 4800 

ATAAACACAA GGAACATTCC ATATCATTAG TCGAAAACAA AAACAAAAAA AAAACCTTTG 4860 

GTCATTTGTA AGACATCTCA TGTCATATAA AAGTTAAATG TAAAAAGATA CAGTCCATTT 4920 

TGTCCTGCAC ACACGTAGAC TAATTCACGT CAAAAAAAAA AAAAAA 4966 



15 



Seq ID NO: C67 DNA Sequence 
Nucleic Acid Accession #: NM_005601.2 
ZU Coding sequence: 101.. 598 
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1 11 21 31 41 SI 

I I I I I I 

CCCAGGAGTC TGGGTGCACA GCCTCCTTCT CTCTGAGATT CAAGAGTCTG ATCAGCAGCC 60 

TCTTCCTCCT CCAGGACCCA GAAGCCCTGA GCTTATCCCC ATGGAGCTCT GCCGGTCCCT 120 

GGCCCTGCTG GGGGGCTCCC TGGGCCTGAT GTTCTGCCTG ATTGCTTTGA GCACCGATTT 180 

CTGGTTTGAG GCTGTGGGTC CCACCCACTC AGCTCACTCG GGCCTCTGGC CAACAGGGCA 240 

TGGGGACATC ATATCAGG CT ACATCCACGT GACGCAGACC TTCAGCATTA TGGCTGTTCT 300 

GTGGGCCCTG GTGTCCGTGA GCTTCCTGGT CCTGTCCTGC TTCCCCTCAC TGTTCCCCCC 360 

AGGCCACGGC CCGCTTGTCT CAACCACCGC AGCCTTTGCT GCAGCCATCT CCATGGTGGT 420 

GGCCATGGCG GTGTACACCA GCGAGCGGTG GGACCAGCCT CCACACCCCC AGATCCAGAC 480 

CTTCTTCTCC TGGTCCTTCT ACCTGGGCTG GGTCTCAGCT ATCCTCTTGC TCTGTACAGG 540 

TGCCCTGAGC CTGGGTGCTC ACTGTGGCGG TCCCCGTCCT GGCTATGAAA CCTTGTGAGC 600 

AGAAGGCAAG AGCGGCAAGA TGAGTTTTGA GCGTTGTATT CCAAAGGCCT CATCTGGAGC 660 

CTCGGGAAAG TCTGGTCCTA CATTTGCCCG CCCTTCCAGC CCTTCCCCAG CCCCTCCTCT 720 

TGTTTCTTCA TTCATTCAAC AAAATTTGGC TGGAAAAAAA AAAAAAAAAA AAAAAAAAAA 780 

AAA 783 



40 Seq ID NO: C68 dna Sequence 

Nucleic Acid Accession #: NM_006433.2 
Coding sequence: 129.. 566 

AC 1 11 21 31 41 51 

45 I I I I I I 

GTATCTGTGG TAAACCCAGT GACACGGGGG AGATGACATA CAAAAAGGGC AGGACCTGAG 60 

AAAGATTAAG CTGCAGGCTC CCTGCCCATA AAACAGGGTG TGAAAGGCAT CTCAGCGGCT 120 

GCCCCACCAT GGCTACCTGG GCCCTCCTGC TCCTTGCAGC CATGCTCCTG GGCAACCCAG 180 

GTCTGGTCTT CTCTCGTCTG AGCCCTGAGT ACTACGACCT GGCAAGAGCC CACCTGCGTG 240 

ATGAGGAGAA ATCCTGCCCG TGCCTGGCCC AGGAGGGCCC CCAGGGTGAC CTGTTGACCA 300 

AAACACAGGA GCTGGGCCGT GACTACAGGA CCTGTCTGAC GATAGTCCAA AAACTGAAGA 360 

AGATGGTGGA TAAGCCCACC CAGAGAAGTG TTTCCAATGC TGCGACCCGG GTGTGTAGGA 420 

CGGGGAGGTC ACGATGGCGC GACGTCTGCA GAAATTTCAT GAGGAGGTAT CAGTCTAGAG 480 

TTACCCAGGG CCTCGTGGCC GGAGAAACTG CCCAGCAGAT CTGTGAGGAC CTCAGGTTGT 540 

GTATACCTTC TACAGGTCCC CTCTGAGCCC TCTCACCTTG TCCTGTGGAA GAAGCACAGG 600 

CTCCTGTCCT CAGATCCCGG GAACCTCAGC AACCTCTGCC GGCTCCTCGC TTCCTCGATC 660 

CAGAATCCAC TCTCCAGTCT CCCTCCCCTG ACTCCCTCTG CTGTCCTCCC CTCTCACGAG 720 

AATAAAGTGT CAAGCAAG 738 



Seq ID NO: C69 DNA Sequence 

Nucleic Acid Accession #: NM_00298S.2 

Coding oequence: 69.. 344 



1 11 21 31 41 51 

I I 1 I I I 

GCTGCAGAGG ATTCCTGCAG AGGATCAAGA CAGCACGTGG ACCTCGCACA GCCTCTCCCA 60 

CAGGTACCAT GAAGGTCTCC GCGGCAGCCC TCGCTGTCAT CCTCATTGCT ACTGCCCTCT 120 

GCGCTCCTGC ATCTGCCTCC CCATATTCCT CGGACACCAC ACCCTGCTGC TTTGCCTACA 180 

TTGCCCGCCC ACTGCCCCGT GCCCACATCA AGGAGTATTT CTACACCAGT GGCAAGTGCT 240 

CCAACCCAGC AGTCGTCTTT GTCACCCGAA AGAACCGCCA AGTGTGTGCC AACCCAGAGA 300 

AGAAATGGGT TCGGGAGTAC ATCAACTCTT TGGAGATGAG CTAGGATGGA GAGTCCTTGA 360 

ACCTGAACTT ACACAAATTT GCCTGTTTCT GCTTGCTCTT GTCCTAGCTT GGGAGGCTTC 420 

CCCTCACTAT CCTACCCCAC CCGCTCCTTG AAGGGCCCAG ATTCTACCAC ACAGCAGCAG 480 

Id ttacaaaaac CTTCCCCAGG CTGGACGTGG TGGCTCACGC CTGTAATCCC AGCACTTTGG 540 

GAGGCCAAGG TGGGTGGATC ACTTGAGGTC AGGAGTTCGA GACCAGCCTG GCCAACATGA 600 

TGAAACCCCA TCTCTACTAA AAATACAAAA AATTAGCCGG GCGTGGTAGC GGGOGCCTGT 660 

AGTCCCAGCT ACTCGGGAGG CTGAGGCAGG AGAATGGCGT GAACCCGGGA GGCGGAGCTT 720 

GCAGTGAGCC GAGATCGCGC CACTGCACTC CAGCCTGGGC GACAGAGCGA GACTCCGTCT 780 

CAAAAAAAAA AAAAAAAAAA AAAATACAAA AATTAGCCGG GCGTGGTGGC CCACGCCTGT 840 

AATCCCAGCT ACTCGGGAGG CTAAGGCAGG AAAATTGTTT GAACCCAGGA GGTGGAGGCT 900 

GCAGTGAGCT GAGATTGTGC CACTTCACTC CAGCCTGGGT GACAAAGTGA GACTCCGTCA 960 

CAACAACAAC AACAAAAAGC TTCCCCAACT AAAGCCTAGA AGAGCTTCTG AGGCGCTGCT 1020 

TTGTCAAAAG GAAGTCTCTA GGTTCTGAGC TCTGGCTTTG CCTTGGCTTT GCCAGGGCTC 1080 
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TGTGACCAGG AAGGAAGTCA GCATGCCTCT AGAGGCAAGG AGGGGAGGAA CACTGCACTC 1140 
TTAAGCTTCC GCCGTCTCAA CCCCTCACAG GAGCTTACTG GCAAACATGA AAAATCGGCT 1200 
TACCATTAAA GTTCTCAATG CAACCATAAA AAAAAAA 1237 



Seq ID NO: C70 DNA Sequence 

Nucleic Acid Accession #: NMJJ22154.2 

Coding sequence: 13B1..1722 

1 11 21 31 41 51 

I I I 1 I 1 

AGTGTGGTTT TAGTTTTTCC TAAGAAGTGG CGTGGTTTGG GGCTTTATAT CCGGGAGGAG 60 

CATATGTACG CAAATCCTGG GGCGTTTGCA AACCCGGATC CGGGGCGTCT GGCCCCATGC 120 

CCGGCCGGGC GTTTGAGGGC TACTGCCACG CAGCGTTTCT GGAGCCTGCC GGCTGGTGCC 180 

CTGGTGGCCT TTATCTCTGT CCCCCTTTGT CCTCTTTATC TCAGGCTCTC CAGGAGGCCG 240 

GGGGGCCCAC TCCGCCTATC GCTCCCCTCG GCTACGCTGC CACTCCAATG CCCCGCAGGT 300 

CGCGAGCTGC TGTTCTTTCG AAGGCGCCGG AGAACCAGGG GCGTCCCGCG CCACCTCTGA 360 

CTCGGAGCAG CGCCGAGCAC TGACGCTCCC GCCCTTGGGC AAGGACGCCA GTGCGCCCGC 420 

GCGCGTCCCT CTGCGCGGCA GCCCGTCGCG GGCCCTCAAG GGGAAQCCCA GGCCAGGATG 480 

GCCCCGGGTC GCGCGGTGGC CGGGCTCCTG TTGCTGGCGG CCGCCGGCCT CGGAGGAGTG 540 

GCGGAGGGGC CAGGGCTAGC CTTCAGCGAG GATGTGCTGA GCGTGTTCGG CGCGAATCTG 600 

AGCCTGTCGG CGGCGCAGCT CCAGCACTTG CTGGAGCAGA TGGGAGCCGC CTCCCGCGTG 660 

GGCGTCCCGG AGCCTGGCCA GCTGCACTTC AACCAGTGTT TAACTGCTGA AGAGATCTTT 720 

TCCCTTCATG GCTTTTCAAA TGCTACCCAA ATAACCAGCT CCAAATTCTC TGTCATCTGT 780 

CCAGCAGTCT TACAGCAATT GAACTTTCAC CCATGTGAGG ATCGGCCCAA GCACAAAACA 840 

AGACCAAGTC ATTCAGAAGT TTGGGGATAT GGATTCCTGT CAGTGACGAT TATTAATCTG 900 

GCATCTCTCC TCGGATTGAT TTTGACTCCA CTGATAAAGA AATCTTATTT CCCAAAGATT 960 

TTGACCTTTT TTGTGGGGCT GGCTATTGGG ACTC TTTTTT CAAATGCAAT TTTCCAACTT 1020 

ATTCCAGAGG CATTTGGATT TGATCCCAAA GTCGACAGTT ATGTTGAGAA GGCAGTTGCT 1080 

GTGTTTGGTG GATTTTACCT ACTTTTCTTT TTTGAAAGAA TGCTAAAGAT GTTATTAAAG 1140 

ACATATGGTC AGAATGGTCA TACCCACTTT GGAAATGATA . ACTTTGGTCC TCAAGAAAAA 1200 

ACTCATCAAC CTAAAGCATT ACCTGCCATC AATGGTGTGA CATGCTATGC AAATCCTGCT 1260 

GTCACAGAAG CTAATGGACA TATCCATTTT GATAATGTCA GTGTGGTATC TCTACAGGAT 1320 

GGAAAAAAAG AGCCAAGTTC ATGTACCTGT TTGAAGGGGC CCAAACTGTC AGAAATAGGG 1380 

ACGATTGCCT GGATGATAAC GCTCTGCGAT GCCCTCCACA ATTTCATCGA TGGCCTGGCG 1440 

ATTGGGGCTT CCTGCACCTT GTCTCTCCTT CAGGGACTCA GTACTTCCAT AGCAATCCTA 1500 

TGTGAGGAGT TTCCCCACGA GTTAGGAGAC TTTGTGATCC TACTCAATGC AGGGATGAGC 1560 

ACTCGACAAG CCTTGCTATT CAACTTCCTT TCTGCATGTT CCTGCTATGT TGGGCTAGCT 1620 

TTTGGCATTT TGGTGGGCAA CAATTTCGCT CCAAATATTA TATTTGCACT TGCTGGAGGC 1680 

ATGTTCCTCT ATATTTCTCT GGCAGATATG TTTCCAGAGA TGAATGATAT GCTGAGAGAA 1740 

AAGGTAACTG GAAGAAAAAC CGATTTCACC TTCTTCATGA TTCAGAATGC TGGAATGTTA 1800 

ACTGGATTCA CAGCCATTCT ACTCATTACC TTGTATGCAG GAGAAATCGA ATTGGAGTAA 1860 

TAGAAAATGG AAGATGGTGT TGTTAATAAA GGCATTTAAT AGATAAAAAC ATCTCCAAAA 1920 

AGGATTTTGA AGCTGATCCT ATTTAGTTAA AAAGATAATT TTGCTTTCAA CTGTAGGTCC 1980 

AGAAAACTAA TTATTGGCAT CAGTCTGTGA AATAGTCCAT TATTTGTTGT TAAAAATGCT 2040 

TCAAAAGGTT TTCAGTGTCA GTCTGAGATG CCTGGTATAT AGGAGCCTTT GGGAAATACT 2100 

TATTTTTCAG TATTCCATGC ATATTAGATA TCACCATGAA GCAAGAGACA TGCATTCTAT 2160 

AATCATGTAG ACACTCAGAC TCAGGGGAAA ATACAAGTTA TATCCTGAAA GCCTTTAAAA 2220 

CTCTATGGTA GGATCAAAGA TTCAAATGGT TTCAGAGAGG TTTTATTTCA ATTAATTTGT 2280 

TCTAGTGCTT TCAAGAGCAA GTACATCAAA ATGTAGAAGG TAAAATGTAT GCAACACTAA 2340 

TATAAATTAT TCCAAGTCTT TAAGGAGCCA AAGAAAAAAA AGATTTCTCA CAGCTTTTTG 2400 

TTCTGTTTTG TATTTCAATT AGGAACTTGC AGTATTATTT TGAAAACCAT TCTAAAATAA 2460 

TAGGAGTTAG GAAATAAATA AAGTTTTGCT AGCCCTGCTA AGTTCAGGCT TAGAGGCTTA 2520 

TCGCTAAGTN TAAACTTCAC CAGATTCCAC GAAAAGCTGG ATAGCTTTTT TTCTGACTTA 2580 

TGTTGTGGTT GCACCCCTCA CAAATGGCAG AACAGTATGT AAAGCTGGTA ACACCTCGGT 2640 

TTCAGTGCAC CATGTGTTTG CTTTGTGAAG GTGAAGAATA TGTTGGTTTA GAGAAAGAAA 2700 

TTGGATGTAA TTTTATGCAA TTTACTTTTA AAGACAAACA TAACTATTTA GCAGAGAATA 2760 

TTTTAATAAA TGCAAAACAA CAGCTGGACT GCTGTACATC AAGGACAGAT TAACTGGAAA 2820 

ACATATGTTC CTTATGTGTG ATTGAGAGGC ATTCAGAAAA GACTTCCTTT GTGTTCAGCC 2BB0 

TATACTTTTC CATATGGTAT ACCTTGAAAA AAATTAGCAC ACCATGGTTA TTTTTCTACC 2940 

TTTTATAAAA GACAGAGCCT GTTTACTCAT TTAGAAGATA GAGAAAATTG GTCTAAAATT 3000 

GAACATCCTA GATTCACACT CCCAAGTCAC TTAAGGTGAT TTGATGGTGA GGAAAATGAT 3060 

TGACAAAGCC CAACAATGAT CTCAGGAATT ACATTTTCCA ACAGACCAAA AAATGTTTTC 3120 

ATGTAGCAGC AATGCAGATT TGGTGAATAT TTAATATATA TTTTAGTATG TATTTCACTT 3180 

TATGACTGAC AATTAAAAAA TATTGTTTGG CCAAATAGTA AACACCCTTT TGAAACCATG 3240 

AAAAAA 3246 



A Seq ID NO: C71 DNA Sequence 
70 Nucleic Acid Accession #: NM_004184.2 
Coding sequence: 188.. 1603 

1 11 21 31 41 51 

I 1 I I I I 

CGAAAAAAGA GGGGAAGAGT ATTAAAGACC ATTTCTGGCT GGGCAGGGCA CTCTCAGCAG 60 

CTCAACTGCC CAGCGTGACC AGTGGCCACC TCTGCAGTGT CTTCCACAAC CTGGTCTTGA 120 

CTCGTCTGCT GAACAAATCC TCTGACCTCA GGCCGGCTGT GAACGTAGTT CCTGAGAGAT 180 

AGCAAACATG CCCAACAGTG AGCCCGCATC TCTGCTGGAG CTGTTCAACA GCATCGCCAC 240 

ACAAGGGGAG CTCGTAAGGT CCCTCAAAGC GGGAAATGCG TCAAAGGATG AAATTGATTC 300 

00 TGCAGTAAAG ATGTTGGTGT CATTAAAAAT GAGCTACAAA GCTGCCGCGG GGGAGGATTA 360 

CAAGGCTGAC TGTCCTCCAG GGAACCCAGC ACCTACCAGT AATCATGGCC CAGATGCCAC 420 

AGAAGCTGAA GAGGATTTTG TGGACCCATG GACAGTACAG ACAAGCAGTG CAAAAGGCAT 480 

AGACTACGAT AAGCTCATTG TTCGGTTTGG AAGTAGTAAA ATTGACAAAG AGCTAATAAA 540 

CCGAATAGAG AGAGCCACCG GCCAAAGACC ACACCACTTC CTGCGCAGAG GCATCTTCTT 600 
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CTCACACAGA GATATGAATC AGGTTCTTGA TGCCTATGAA AATAAGAAGC CATTTTATCT 660 

GTACACGGGC CGGGGCCCCT CTTCTGAAGC AATGCATGTA GGTCACCTCA TTCCATTTAT 720 

TTTCACAAAG TGGCTCCAGG ATGTATTTAA CGTGCCCTTG GTCATCCAGA TGACGGATGA 780 

CGAGAAGTAT CTGTGGAAGG ACCTGACCCT GGACCAGGCC TATGGCGATG CTGTTGAGAA 840 

TGCCAAGGAC ATCATCGCCT GTGGCTTTGA CATCAACAAG ACTTTCATAT TCTCTGACCT 900 

GGACTACATG GGGATGAGCT CAGGTTTCTA CAAAAATGTG GTGAAGATTC AAAAGCATGT 960 

TA CCTTCA AC CAAGTGAAAG GCATTTTCGG CTTCACTGAC AGCGACTGCA TTGGGAAGAT 1020 

CAGTTTTCCT GCCATCCAGG CTGCTCCCTC CTTCAGCAAC TCATTCCCAC AGATCTTCCG 1080 

AGACAGGACG GATATCCAGT GCCTTATCCC ATGTGCCATT GACCAGGATC CTTACTTTAG 1140 

AATGACAAGG GACGTCGCCC CCAGGATCGG CTATCCTAAA CCAGCCCTGT TGCACTCCAC 1200 

CTTCTTCCCA GCCCTGCAGG GCGCCCAGAC CAAAATGAGT GCCAGCGACC CAAACTCCTC 1260 

CATCTTCCTC ACCGACACGG CCAAGCAGAT CAAAACCAAG GTCAATAAGC ATGCGTTTTC 1320 

TG GAGGG AGA GACACCATCG AGGAGCACAG GCAGTTTGGG GGCAACTGTG ATGTGGACGT 1380 

GTCTTTCATG TACCTGACCT TCTTCCTCGA GGACGACGAC AAGCTCGAGC AGATCAGGAA 1440 

GGATTACACC AG CG GAG CCA TGCTCACCGG TGAGCTCAAG AAGGCACTCA TAGAGGTTCT 1500 

GCAGCCCTTG ATCGCAGAGC ACCAGGCCCG GCGCAAGGAG GTCACGGATG AGATAGTGAA 1560 

AGAGTTCATG ACTCCCCGGA AGCTGTCCTT CGACTTTCAG TAGCACTCGT TTTACATATG 1620 

CTTATAAAAG AAGTGATGTA TCAGTAATGT ATCAATAATC CCAGCCCAGT CAAAGCACCG 1680 

CCACCTGTAG GCTTCTGTCT CATGGTAATT ACTGGGCCTG GCCTCTGTAA GCCTGTGTAT 1740 

GTTATCAATA CTGTTTCTTC CTGTGAGTTC CATTATTTCT ATCTCTTATG GGCAAAGCAT 1800 

TGTGGGTAAT TGGTGCTGGC TAACATTGCA TGGTCGGATA GAGAAGTCCA GCTGTGAGTC 1860 

TCTCCCCAAA GCAGCCCCAC AGTGGAGCCT TCGGCTGGAA GTCCATGGGC CACCCTGTTC 1920 

TTGTCCATGG AGGACTTCCG AGGGTTCCAA GTATACTCTT AAGACCCACT CTGTTTAAAA 1980 

ATATATATTC TATGTATGCG TATATGGAAT TGAAATGTCA TTATTGTAAC CTAGAAAGTG 2040 

CTTTGAAATA TTGATGTGGG GAGGTTTATT GAGCACAAGA TGTATTTCAG CCCATGCCCC 2100 

CTCCCAAAAA GAAATTGATA AGTAAAAGCT TCGTTATACA TTTGACTAAG AAATCACCCA 2160 

GCTTTAAAGC TGCTTTTAAC AATGAAGATT GAACAGAGTT CAGCAATTTT GATTAAATTA 2220 

AGACTTGGGG GTGAAACTTT CCAGTTTACT GAACTCCAGA CCATGCATGT AGTCCACTCC 2280 

AGAAATCATG CTCGCTTCCC TTGGCACACC AGTGTTCTCC TGCCAAATGA CCCTAGACCC 2340 

TCTGTCCTGC AGAGTCAGGG TGGCTTTTCC CCTGACTGTG TCCGATGCCA AGGAGTCCTG 2400 

GCCTCCGCAG ATGCTTCATT TTGACCCTTG GCTGCAGTGG AAGTCAGCAC AGAGCAGTGC 2460 

CCTGGCTGTG TCCTGGACGG GTGGACTTAG CTAGGGAGAA AGTCGAGGCA GCAGCCCTCG 2520 

AGGCCCTCAC AGATGTCTAG GCAGGCCTCA TTTCATCACG CAGCATGTGC AGGCCTGGAA 25B0 

GAGCAAAGCC AAATCTCAGG GAAGTCCTTG GTTGATGTAT CTGGGTCTCC TCTGGAGCAC 2640 

TCTGCCCTCC TGTCACCCAG TAGAGTAAAT AAACTTCCTT GGCTCCTAAA AAA 2693 

Seq ID NO: C72 DNA Sequence 

Nucleic Acid Accession Hi NMJ)04938.1 

Coding sequence: 337.. 4632 



1 11 21 31 41 SI 

I I I 1 I I 

CGGAGGACAG CCGGACCGAG CCAACGCCGG GGACTTTGTT CCCTCCACGG AGGGGACTCG 60 

GCAACTCGCA GCGGCAGGGT CTGGGGCCGG CGCCTGGGAG GGATCTGCGC CCCCCACTCA 120 

4 J CTCCCTAGCT GTGTTCCCGC CGCCGCCCCG GCTAGTCTCC GGCGCTGGCG CCTATGGTCG 180 

GCCTCCGACA GCGCTCCGGA GGGACCGGGG GAGCTCCCAG GCGCCCGGGA CTGGAGACTG 240 

ATGCATGAGG GGCCTACGGA GGCGCAGGAG CGGTGGTGAT GGTCTGGGAA GCGGAGCTGA 300 

AGTCCCCTGG GCTTTGGTGA GGCGTGACAG TTTATCATGA CCGTGTTCAG GCAGGAAAAC 360 

GTGGATGATT ACTACGACAC CGGCGAGGAA CTTGGCAGTG GACAGTTTGC GGTTGTGAAG 420 

DU AAATGCCGTG AGAAAAGTAC CGGCCTCCAG TATGCCGCCA AATTCATCAA GAAAAGGAGG 480 

ACTAAGTCCA GCCGGCGGGG TGTGAGCCGC GAGGACATCG AGCGGGAGGT CAGCATCCTG 540 

AAGGAGATCC AGCACCCCAA TGTCATCACC CTGCACGAGG TCTATGAGAA CAAGACGGAC 600 

GTCATCCTGA TCTTGGAACT CGTTGCAGGT GGCGAGCTGT TTGACTTCTT AGCTGAAAAG 660 

GAATCTTTAA CTGAAGAGGA AGCAACTGAA TTTCTCAAAC AAATTCTTAA TGGTGTTTAC 720 

TACCTGCACT CCCTTCAAAT CGCCCACTTT GATCTTAAGC CTGAGAACAT AATGCTTTTG 780 

GATAGAAATG TCCCCAAACC TCGGATCAAG ATCATTGACT TTGGGTTGGC CCATAAAATT 840 

GACTTTGGAA ATGAATTTAA AAACATATTT GGGACTCCAG AGTTTGTCGC TCCTGAGATA 900 

GTCAACTATG AACCTCTTGG TCTTGAGGCA GATATGTGGA GTATCGGGGT AATAACCTAT 960 

ATCCTCCTAA GTGGGGCCTC CCCATTTCTT GGAGACACTA AGCAAGAAAC GTTAGCAAAT 1020 

GTATCCGCTG TCAACTACGA ATTTGAGGAT GAATACTTCA GTAATACCAG TGCCCTAGCC 1080 

AAAGATTTCA TAAGAAGACT TCTGGTCAAG GATCCAAAGA AGAGAATGAC AATTCAAGAT 1140 

AGTTTGCAGC ATCCCTGGAT CAAGCCTAAA GATACACAAC AGGCACTTAG TAGAAAAGCA 1200 

TCAGCAGTAA ACATGGAGAA ATTCAAGAAG TTTGCAGCCC GGAAAAAATG GAAACAATCC 1260 

GTTCGCTTGA TATCACTGTG CCAAAGATTA TCCAGGTCAT TCCTGTCCAG AAGTAACATG 1320 

AGTGTTGCCA GAAGCGATGA TACTCTGGAT GAGGAAGACT CCTTTGTGAT GAAAGCCATC 1380 

ATCCATGCCA TCAACGATGA CAATGTCCCA GGCCTGCAGC ACCTTCTGGG CTCATTATCC 1440 

AACTATGATG TTAACCAACC CAACAAGCAC GGGACACCTC CATTACTCAT TGCTGCTGGC 1500 

TGTGGGAATA TTCAAATACT ACAGTTGCTC ATTAAAAGAG GCTCGAGAAT CGATGTCCAG 1560 

GATAAGGGCG GGTCCAATGC CGTCTACTGG GCTGCTCGGC ATGGCCACGT CGATACCTTG 1620 

AAATTTCTCA GTGAGAACAA ATGCCCTTTG GATGTGAAAG ACAAGTCTGG AGAGATGGCC 1680 

CTCCACGTGG CAGCTCGCTA TGGCCATGCT GACGTGGCTC AAGTTACTTG TGCAGCTTCG 1740 

GCTCAAATCC CAATATCCAG GACAAAGGAA GAAGAAACCC CCCTGCACTG TGCTGCTTGG 1800 

CACGGCTATT ACTCTGTGGC CAAAGCCCTT TGTGAAGCCQ GCTGTAACGT GAACATCAAG I860 

AACCGAGAAG GAGAGACGCC CCTCCTGACA GCCTCTGCCA GGGGCTACCA CGACATCGTG 1920 

GAGTGTCTGG CCGAACATGG AGCCGACCTT AATGCTTGCG ACAAGGACGG ACACATTGCC 1980 

CTTCATCTGG CTGTAAGACG GTGTCAGATG GAGGTAATCA AGACTCTCCT CAGCCAAGGG 2040 

TGTTTCGTCG ATTATCAAGA CAGGCACGGC AATACTCCCC TCCATGTGGC ATGTAAAGAT 2100 

GGCAACATGC CTATCGTGGT GGCCCTCTGT GAAGCAAACT GCAATTTGGA CATCTCCAAC 2160 

QA AAGTATGGGC GAACGCCTCT GCACCTTGCG GCCAACAACG GAATCCTAGA CGTGGTCCGG 2220 

OU TATCTCTGTC TGATGGGAGC CAGCGTTGAG GCGCTGACCA CGGACGGAAA GACGGCAGAA 2280 

GATCTTGCTA GATCGGAACA GCACGAGCAC GTAGCAGGTC TCCTTGCAAG ACTTCGAAAG 2340 

GATACGCACC GAGGACTCTT CATCCAGCAG CTCCGACCCA CACAGAACCT GCAGCCAAGA 2400 

ATTAAGCTCA AGCTGTTTGG CCACTCGGGA TCCGGGAAAA CCACCCTTGT AGAATCTCTC 2460 

AAGTGTGGGC TGCTGAGGAG CTTTTTCAGA AGGCGTCGGC CCAGACTGTC TTCCACCAAC 2520 
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TCCAGCAGGT TCCCACCTTC ACCCCTGGCT TCTAAGCCCA CAGTCTCAGT GAGCATCAAC 2580 

AACCTGTACC CAGGCTGCGA GAACGTGAGT GTGAGGAGCC GCAGCATGAT GTTCGAGCCG 2640 

GGTCTTACCA AAGGGATGCT GGAGGTGTTT GTGGCCCCGA CCCACCACCC GCACTGCTOG 2700 

GCCGATGACC AGTCCACCAA GGCCATCGAC ATCCAGAACG CTTATTTGAA TGGAGTTGGC 2760 

GATTTCAGCG TGTGGGAGTT CTCTGGAAAT CCTGTGTATT TCTGCTGTTA TGACTATTTT 2820 

GCTGCAAATG ATCCCACGTC AATCCATGTT GTTGTCTTTA GTCTAGAAGA GCCCTATGAG 2880 

ATCCAGCTGA ACCCAGTGAT TTTCTGGCTC AGTTTCCTGA AGTCCCTTGT CCCAGTTGAA 2940 

GAACCCATAG CCTTCGGTGG CAAGCTGAAG AACCCACTCC AAGTTGTCCT GGTGGCCACC 3000 

CACGCTGACA TCATGAATGT TCCTCGACCG GCTGGAGGCG AGTTTGGATA TGACAAAGAC 3060 

ACATCGTTGC TGAAAGAGAT TAGGAACAGG TTTGGAAATG ATCTTCACAT TTCAAATAAG 3120 

CTGTTTGTTC TGGATGCTGG GGCTTCTGGG TCAAAGGACA TGAAGGTACT TCGAAATCAT 3180 

CTGCAAGAAA TACGAAGCCA GATTGTTTCG GTCTGTCCTC CCATGACTCA CCTGTGTGAG 3240 

AAAATCATCT CCACGCTGCC TTCCTGGAGG AAGCTCAATG GACCCAACCA GCTGATGTCG 3300 

CTGCAGCAGT TTGTGTACGA CGTGCAGGAC CAGCTGAACC CCCTGGCCAG CGAGGAGGAC 3360 

1 J CTCAGGCGCA TTGCTCAGCA GCTCCACAGC ACAGGCGAGA TCAACATCAT GCAAAGTGAA 3420 

ACAGTTCAGG ACGTGCTGCT CCTGGACCCC CGCTGGCTCT GCACAAACGT CCTGGGGAAG 3480 

TTGCTGTCCG TGGAGACCCC ACGGGCGCTG CACCACTACC GGGGCCGCTA CACCGTGGAG 3540 

GACATCCAGC GCCTGGTGCC CGACAGCGAC GTGGAGGAGC TGCTGCAGAT CCTCGATGCC 3600 

ATGGACATCT GCGCCCGGGA CCTGAGCAGC GGGACCATGG TGGACGTCCC AGCCCTGATC 3660 

AAGACAGACA ACCTGCACCG CTCCTGGGCT GATGAGGAGG ACGAGGTGAT GGTGTATGGT 3720 

GGCGTGCGCA TCGTGCCCGT GGAACACCTC ACCCCCTTCC CATGTGGCAT CTTTCACAAG 3780 

GTCCAGGTGA ACCTGTGCCG GTGGATCCAC CAGCAAAGCA CAGAGGGCGA CGCGGACATC 3840 

CGCCTGTGGG TGAATGGCTG CAAGCTGGCC AACCGTGGGG CCGAGCTGCT GGTGCTGCTG 3900 

GTCAACCAGG GCCAGGGCAT TGAGGTCCAG GTCCGTGGCC TGGAGACGGA GAAGATCAAG 3960 

TGCTGCCTGC TGCTGGACTC GGTGTGCAGC ACCATTGAGA ACGTCATGGC CACCACGCTG 4020 

CCAGGGCTCC TGACCGTGAA GCATTACCTG AGCCCCCAGC AGCTGCGGGA GCACCATGAG 4080 

CCCGTCATGA TCTACCAGCC ACGGGACTTC TTCCGGGCAC AGACTCTGAA GGAAACCTCA 4140 

CTGACCAACA CCATGGGGGG GTACAAGGAA AGCTTCAGCA GCATCATGTG CTTCGGGTGT 4200 

CACGACGTCT ACTCACAGGC CAGCCTCGGC ATGGACATCC ATGCATCAGA CCTGAACCTC 4260 

CTCACTCGGA GGAAACTGAG TCGCCTGCTG GACCCGCCCG ACCCCCTGGG GAAGGACTGG 4320 

TGCCTTCTCG CCATGAACTT AGGCCTCCCT GACCTCGTGG CAAAGTACAA CACCAATAAC 4380 

GGGGCTCCCA AGGATTTCCT CCCCAGCCCC CTCCACGCCC TGCTGCGGGA ATGGACCACC 4440 

TACCCTGAGA GCACAGTGGG CACCCTCATG TCCAAACTGA GGGAGCTGGG TCGCCGGGAT 4500 

GCCGCAGACC TTTTGCTGAA GGCATCCTCT GTGTTCAAAA TCAACCTGGA TGGCAATGGC 4560 

CAGGAGGCCT ATGCCTCGAG CTGCAACAGC GGCACCTCTT ACAATTCCAT TAGCTCTGTT 4620 

GTATCCCGGT GAGGGCAGCC TCTGGCTTGG ACAGGGTCTG TTTGGACTGC AGAACCAAGG 4660 

GGGTGATGTA GCCCATCCTT CCCTTTGGAG ATGCTGAGGG TGTTTCTTCC TGCACCCACA 4740 

GCCAGGGGGA TGCCACTCCT CCCTCCGGCT TGACCTGTTT CTCTGCCGCT ACCTCCCTCC 4800 

CCGTCTCATT CCGTTGTCTG TGGATGGTCA TTGCAGTTTA AGAGCAGAAC AGATCTTTTA 4860 

CTTTGGCCGC TTGAAAAGCT AGTGTACCTC CTCTCAGTGT TTTGGACTCC ATCTCTCATC 4920 

CTCCAGTACC TTGCTTCTTA CTGATAATTT TGCTGGAATT CCTAACTTTT CAATGACATT 4980 

TTTTTTAACT ATCATATTGA TTGTCCTTTA AAAAAGAAAA GTGCATATTT ATCCAAAATG 5040 

TGTATTTCTT ATACGCTTTT CTGTGTTATA CCATTTCCTC AGCTTATCTC TTTTATATTT 5100 

GTAGGAGAAA CTCCCATGTA TGGAATCCCA CTGTATGATT TATAAACAGA CAATATGTGA 5160 

GTGCCTTTTG CAGAAGAGGG TGTGTTTGAA ATCATCGGAG TCAGCCAGGA GCTGTCACCA 5220 

AGG AAACG CT ACCTCTCTGT CCCTTGCTGT ATGCTGATCA TCGCCAGAGG TGCTTCACCC 52 80 

TGAGTTTTGT TTTGTATTGT TTTCTGACAG TTTTTCTGTT TTGTTTGGCA AGGAAAGGGG 5340 

AGAAGGGAAT CCTCCTCCAG GGTGATTTTA TGATCAGTGT TGTTGCTCTA GGAAGACATT 5400 

TTTCCGTTTG CTTTTGTTCC AATGTCAATG TGAACGTCCA CATGAAACCT ACACACTGTC 5460 

ATGCTTCATC ATTCCCTCTC ATCTCAGGTA GAAGGTTGAC ACAGTTGTAG GGTTACAGAG 5520 

ACCTATGTAA GAATTCAGAA GACCCCTGAC TCATCATTTG TGGCAGTCCC TTATAATTGG 5580 

TGCATAGCAG ATGGTTTCCA CATTTAGATC CTGGTTTCAT AACTTCCTGT ACTTGAAGTC 5640 

TAAAAGCAGA AAATAAAGGA AGCAAGTTTT CTTCCATGAT TTTAAATTGT GATCGAGTTT 5700 

TAAATTGATA GGAGGGAACA TGTCCTAATT CTTCTGTCCT GAGAAGCATG TAATGTTAAT 5760 

GTTAT ATCAT ATGTATATAT ATATATGCAC TATGTATATA CATATATATT AATACTGGTA 5B20 

TTTTTACTTA ATCTATAAAA TGTCGTTAAA AAGTTGTTTG TTTTTTTCTT TTTTTATAAA 5880 

TAAACTGTTG CTCGTTAAAA AAAAAAAAAA 5910 
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Seq id NO: C73 DNA Sequence 

Nucleic Acid Accession #: NM_002081.1 

Coding sequence: 222., 1898 



1 11 21 31 41 51 

« I I I i I I 

OO GGCTGCCCGA GCGAGCGTTC GGACCTCGCA CCCCGCGCGC CCCGCGCCGC CGCCGCCGCC 60 

GGCTTTTGTT GTCTCCGCCT CCTCGGCCGC CGCCGCCTCT GGACCGCGAG CCGCGCGCGC 120 

CGGGACCTTG GCTCTGCCCT TCGCGGGCGG GAACTGCGCA GGACCCGGCC AGGATCCGAG 180 

AGAGGCGCGG GCGGGTGGCC GGGGGCGCCG CCGGCCCCGC CATGGAGCTC CGGGCCCGAG 240 

7fj GCTGGTGGCT GCTATGTGCG GCCGCAGCGC TGGTCGCCTG CGCCCGCGGG GACCCGGCCA 300 

/U GCAAGAGCCG GAGCTGCGGC GAGGTCCGCC AGATCTACGG AGCCAAGGGC TTCAGCCTGA 360 

GCGACGTGCC CCAGGCGGAG ATCTCGGGTG AGCACCTGCG GATCTGTCCC CAGGGCTACA 420 

CCTGCTGCAC CAGCGAGATG GAGGAGAACC TGGCCAACCG CAGCCATGCC GAGCTGGAGA 480 

CCGCGCTCCG GGACAGCAGC CGCGTCCTGC AGGCCATGCT TGCCACCCAG CTGOGCAGCT 540 

TCGATGACCA CTTCCAGCAC CTGCTGAACG ACTCGGAGCG GACGCTGCAG GCCACCTTCC 600 

/-) CCGGCGCCTT CGGAGAGCTG TACACGCAGA ACGCGAGGGC CTTCCGGGAC CTGTACTCAG 660 

AGCTGCGCCT GTACTACCGC GGTGCCAACC TGCACCTGGA GGAGACGCTG GCCGAGTTCT 720 

GGGCCCGCCT GCTCGAGCGC CTCTTCAAGC AGCTGCACCC CCAGCTGCTG CTGCCTGATG 780 

ACTACCTGGA CTGCCTGGGC AAGCAGGCCG AGGCGCTGCG GCCCTTCGGG GAGGCCCCGA 840 

Qn GAGAGCTGCG CCTGCGGGCC ACCCGTGCCT TCGTGGCTGC TCGCTCCTTT GTGCAGGGCC 900 

oU TGGGCGTGGC CAGCGACGTG GTCCGGAAAG TGGCTCAGGT CCCCCTGGGC CCGGAGTGCT 960 

CGAGAGCTGT CATGAAGCTG GTCTACTGTG CTCACTGCCT GGGAGTCCCC GGCGCCAGGC 1020 

CCTGCCCTGA CTATTGCCGA AATGTGCTCA AGGGCTGCCT TGCCAACCAG GCCGACCTGG 1080 

ACGCCGAGTG GAGGAACCTC CTGGACTCCA TGGTGCTCAT CACCGACAAG TTCTGGGGTA 1140 

CATCGGGTGT GGAGAGTGTC ATCGGCAGCG TGCACACGTG GCTGGCGGAG GCCATCAACG 12 00 



1241 
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5 

10 
15 
20 
25 
30 
35 
40 
45 



55 
60 
65 
70 
75 
80 



CCCTCCAGGA CAACAGGGAC ACGCTCACGG CCAAGGTCAT CCAGGGCTGC GGGAACCCCA 1260 

AGGTCAACCC CCAGGGCCCT GGGCCTGAGG AGAAGCGGCG CCGGGGCAAG CTGGCCCCGC 1320 

GGGAGAGGCC ACCTTCAGGC ACGCTGGAGA AGCTGGTCTC TGAAGCCAAG GCCCAGCTCC 13 80 

GCGACGTCCA GGACTTCTGG ATCAGCCTCC CAGGGACACT GTGCAGTGAG AAGATGGCCC 1440 

TGAGCACTGC CAGTGATGAC CGCTGCTGGA ACGGGATGGC CAGAGGCCGG TACCTCCCCG 1500 

AGGTCATGGG TGACGGCCTG GCCAACCAGA TCAACAACCC CGAGGTGGAG GTGGACATCA 1560 

CCAAGCCGGA CATGACCATC CGGCAGCAGA TCATGCAGCT GAAGATCATG ACCAACCGGC 1620 

TGCGCAGCGC CTACAACGGC AACGACGTGG ACTTCCAGGA CX3CCAGTGAC GACGGCAGCG 1680 

GCTCGGGCAG CGGTGATGGC TGTCTGGATG ACCTCTGCGG CCGGAAGGTC AGCAGGAAGA 1740 

GCTCCAGCTC CCGGACGCCC TTGACCCATG CCCTCCCAGG CCTGTCAGAG CAGGAAGGAC 1800 

AGAAGACCTC GGCTGCCAGC TGCCCCCAGC CCCCGACCTT CCTCCTGCCC CTCCTCCTCT 1860 

TCCTGGCCCT TACAGTAGCC AGGCCCCGGT GGCGGTAACT GCCCCAAGGC CCCAGGGACA 1920 

GAGGCCAAGG ACTGACTTTG CCAAAAATAC AACACAGACG ATATTTAATT CACCTCAGCC 1980 

TGGAGAGGCC TGGGGTGGGA CAGGGAGGGC CGGCGGCTCT GAGCAGGGGC AGGCGCAGAG 2040 

GTCCCAGCCC CAGGCCTGGC CTCGCCTGCC TTTCTGCCTT TTAATTTTGT ATGAGGTCCT 2100 

CAGGTCAGCT GGGAGCCAGT GTGCCCAAAA GCCATGTATT TCAGGGACCT CAGGGGCACC 2160 

TCCGGCTGCC TAGCCCTCCC CCCAGCTCCC TGCACCGCCG CAGAAGCAGC CCCTCGAGGC 2220 

CTACAGAGGA GGCCTCAAAG CAACCCGCTG GAGCCCACAG CGAGCCTGTG CCTTCCTCCC 2280 

CGCCTCCTCC CACTGGGACT CCCAGCAGAG CCCACCAGCC AGCCCTGGCC CACCCCCCAG 2340 

CCTCCAGAGA AGCCCCGCAC GGGCTGTCTG GGTGTCCGCC ATCCAGGGTC TGGCAGAGCC 2400 

TCTGAGATGA TGCATGATGC CCTCCCCTCA GCGCAGGCTG CAGAGCCOGG CCCCACCTCC 2460 

CTGCGCCCTT GAGGGGCCCC AGCGTCTGCA GGGTGACGCC TGAGACAGCA CCACTGCTGA 2520 

GGAGTCTGAG GACTGTCCTC CCACAGACCC TGCAGTGAGG GGCCCTCCAT GCGCAGATGA 2580 

GGGGCCACTG ACCCACCTGC GCTTCTGCTG GAGGAGGGGA AGCTGGGCCC AAAGGCCCAG 2640 

GGAGGCAGCG TGGGCTCTGC CAATGTGGGC TGCCCCTCGC ACACAGGGCT CACAGGGCAG 2700 

GCCTTGCTGG GGTCCAGGGC TGTTGGAGGA CCCCGAGGGC TGAGGAGCAG CCAGGACCCG 2760 

CCTGCTCCCA TCCTCACCCA GATCAGGAAC CAGGGCCTCC CTGTTCACGG TGACACAGGT 2820 

CAGGGCTCAG AGTGACCCTC GGCTGTCACC TGCTCACAGG GATGCTGGTG GCTGGTGAGA 2880 

CCCCGCACTG CACACGGGAA TGCCTAGGTC CCTTCCCGAC CCAGCCAGCT GCACTGCAGG 2940 

GCACGGGGAC CTGGATAGTT AAGGGCTTTT CCAAACATGC ATCCATTTAC TGACACTTCC 3000 

TGTCCTTGTT CATGGAGAGC TGTTCGCTCC TCCCAGATGG CTTCGGAGGC CCGCAGGGCC 30 SO 

CACCTTGGAC CCTGGTGACC TCCTGTCACT CACTGAGGCC ATCAGGGCCC TGCCCCAGGC 3120 

CTGGACGGGC CCTCCTTCCC TCCTGTGCCC CAGCTGCCAG GTGGCCCTGG GGAGGGGTGG 3180 

TGTGGTGTTG GGAAGGGGTC CTGCAGGGGG AGGAGGACTT GGAGGGTCTG GGGGCAGCTG 3240 

TCCTGAACCG ACTGACCCTG AGGAGGCCGC TTAGTGCTGC TTTGCTTTTC ATCACCGTCC 3300 

CGCACAGTGG ACGGAGGTCC CCGGTTGCTG GTCAGGTCCC CATGGCTTGT TCTCTGGAAC 3360 

CTGACTTTAG ATGTTTTGGG ATCAGGAGCC CCCAACACAG GCAAGTCCAC CCCATAATAA 3420 

CCCTGCCAGT GCCAGGGTGG GCTGGGGACT CTGGCACAGT GATGCCGGGC GCCAGGACAG 34 B0 

CAGCACTCCC GCTGCACACA GACGGCCTAG GGGTGGCGCT CAGACCCCAC CCTACGCTCA 3540 

TCTCTGGAAG GGGCAGCCCT GAGTGGTCAC TGGTCAGGGC AGTGGCCAAG CCTGCTGTGT 3600 

CCTTCCTCCA CAAGGTCCCC CCACCGCTCA GTGTCAGCGG GTGACGTGTG TTCTTTTGAG 3660 

TCCTTGTATG AATAAAAGGC TGGAAACCTA AA 3692 



Seq ID NO: C74 DNA Sequence 

Nucleic Acid Accession #: BC030205.1 

Coding sequence: 45.. 878 



1 11 21 31 41 51 

50 | | | | | | 

GTGAGCAGCC CCTAACAGGC TGTTACTTCA CTACAACTGA OGATATGATC ATCTTAATTT 60 

ACTTATTTCT CTTGCTATGG GAAGACACTC AAGGATGGGG ATTCAAGGAT GGAATTTTTC 120 

ATAACTCCAT ATGGCTTGAA CGAGCAGCCG GTGTGTACCA CAGAGAAGCA CGGTCTGGCA 180 

AATACAAGCT CACCTACGCA GAAGCTAAGG CGGTGTGTGA ATTTGAAGGC GGCCATCTCG 240 

CAACTTACAA GCAGCTAGAG GCAGCCAGAA AAATTGGATT TCATGTCTGT GCTGCTGGAT 300 

GGATGGCTAA GGGCAGAGTT GGATACCCCA TTGTGAAGCC AGGGCCCAAC TGTGGATTTG 360 

GAAAAACTGG CATTATTGAT TATGGAATCC GTCTCAATAG GAGTGAAAGA TGGGATGCCT 420 

ATTGCTACAA CCCACACGCA AAGGAGTGTG GTGGCGTCTT TACAGATCCA AAGCAAATTT 480 

TTAAATCTCC AGGCTTCCCA AATGAGTACG AAGATAACCA AATCTGCTAC TGGCACATTA 540 

GACTCAAGTA TGGTCAGCGT ATTCACCTGA GTTTTTTAGA TTTTGACCTT GAAGATGACC 600 

CAGGTTGCTT GGCTGATTAT GTTGAAATAT ATGACAGTTA CGATGATGTC CATGGCTTTG 660 

TGGGAAGATA CTGTGGAGAT GAGCTTCCAG ATGACATCAT CAGTACAGGA AATGTCATGA 720 

CCTTGAAGTT TCTAAGTGAT GCTTCAGTGA CAGCTGGAGG TTTCCAAATC AAATATGTTG 780 

CAATGGATCC TGTATCCAAA TCCAGTCAAG GAAAAAATAC AAGTACTACT TCTACTGGAA 840 

ATAAAAACTT TTTAGCTGGA AGATTTAGCC ACTTATAAAA AAAAAAAAAG GATGATCAAA 900 

ACACACAGTG TTTATGTTGG AATCTTTTGG AACTCCTTTG ATCTCACTGT TATTATTAAC 960 

ATTTATTTAT TATTTTTCTA AATGTGAAAG CAATACATAA TTTAGGGAAA ATTGGAAAAT 1020 

ATAGGAAACT TTAAACGAGA AAATGAAACC TCTCATAATC CCACTGCATA GAAATAACAA 1080 

GCGTTAACAT TTTCATATTT TTTTCTTTCA GTCATTTTTC TATTTGTGGT ATATGTATAT 1140 

ATGTACCTAT ATGTATTTGC ATTTGAAATT TTGGAATCCT GCTCTATGTA CAGTTTTGTA 12 00 

TTATACTTTT TAAATCTTGA ACTTTATAAA CATTTTCTGA AATCATTGAT TATTCTACAA 1260 

AAACATGATT TTAAACAGCT GTAAAATATT CTATGATATG AATGTTTTAT GCATTATTTA 1320 

AGCCTGTCTC TATTGTTGGA ATTTCAGGTC ATTTTCATAA ATATTGTTGC AATAAATATC 1380 

CTTGAACACA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1430 



Seq ID NO: C7S DNA Sequence 

Nucleic Acid Accession #: NM_001982.1 

Coding sequence: 199.. 4227 

1 11 21 31 41 51 

11)111 

CTCTCACACA CACACACCCC TCCCCTGCCA TCCCTCCCCG GACTCOGGCT CCGGCTCCGA 60 
TTGCAATTTG CAACCTCCGC TGCCGTCGCC GCAGCAGCCA CCAATTCGCC AGCGGTTCAG 120 

1242 



WO 03/042661 



GTGGCTCTTG CCTCGATGTC CTAGCCTAGG GGCCCCCGGG CCGGACTTGG CTGGGCTCCC 180 

TTCACCCTCT GCGGAGTCAT GAGGGCGAAC GAOGCTCTGC AGGTGCTGGG CTTGCTTTTC 240 

AGCCTGGCCC GGGGCTCCGA GGTGGGCAAC TCTCAGGCAG TGTGTCCTGG GACTCTGAAT 300 

GGCCTGAGTG TGACCGGCGA TGCTGAGAAC CAATACCAGA CACTGTACAA GCTCTACGAG 360 

AGGTGTGAGG TGGTGATGGG GAACCTTGAG ATTGTGCTCA CGGGACACAA TGCCGACCTC 420 

TCCTTCCTGC AGTGGATTCG AGAAGTGACA GGCTATGTCC TCGTGGCCAT GAATGAATTC 480 

TCTACTCTAC CATTGCCCAA CCTCCGCGTG GTGCGAGGGA CCCAGGTCTA CGATGGGAAG 540 

TTTGCCATCT TCGTCATGTT GAACTATAAC ACCAACTCCA GCCACGCTCT GCGCCAGCTC 600 

CGCTTGACTC AGCTCACCGA GATTCTGTCA GGGGGTGTTT ATATTGAGAA GAACGATAAG 660 

CTTTGTCACA TGGACACAAT TGACTGGAGG GACATCGTGA GGGACCGAGA TGCTGAGATA 720 

GTGGTGAAGG ACAATGGCAG AAGCTGTCCC CCCTGTCATG AGGTTTGCAA GGGGCGATGC 780 

TGGGGTCCTG GATCAGAAGA CTGCCAGACA TTGACCAAGA CCATCTGTGC TCCTCAGTGT 840 

AATGGTCACT GCTTTGGGCC CAACCCCAAC CAGTGCTGCC ATGATGAGTG TGCCGGGGGC 900 

TGCTCAGGCC CTCAGGACAC AGACTGCTTT GCCTGCCGGC ACTTCAATGA CAGTGGAGCC 960 

TGTGTACCTC GCTGTCCACA GCCTCTTGTC TACAACAAGC TAACTTTCCA GCTGGAACCC 1020 

AATCCCCACA CCAAGTATCA GTATGGAGGA GTTTGTGTAG CCAGCTGTCC CCATAACTTT 1080 

GTGGTGGATC AAACATCCTG TGTCAGGGCC TGTCCTCCTG ACAAGATGGA AGTAGATAAA 1140 

AATGGGCTCA AGATGTGTGA GCCTTGTGGG GGACTATGTC CCAAAGCCTG TGAGGGAACA 1200 

GGCTCTGGGA GCCGCTTCCA GACTGTGGAC TCGAGCAACA TTGATGGATT TGTGAACTGC 1260 

ACCAAGATCC TGGGCAACCT GGACTTTCTG ATCACCGGCC TCAATGGAGA CCCCTGGCAC 1320 

AAGATCCCTG CCCTGGACCC AGAGAAGCTC AATGTCTTCC GGACAGTACG GGAGATCACA 1380 

GGTTACCTGA ACATCCAGTC CTGGCCGCCC CACATGCACA ACTTCAGTGT TTTTTCCAAT 1440 

TTGACAACCA TTGGAGGCAG AAGCCTCTAC AACCGGGGCT TCTCATTGTT GATCATGAAG 1500 

AACTTGAATG TCACATCTCT GGGCTTCCGA TCCCTGAAGG AAATTAGTGC TGGGCGTATC 1560 

TATATAAGTG CCAATAGGCA GCTCTGCTAC CACCACTCTT TGAACTGGAC CAAGGTGCTT 1620 

CGGGGGCCTA CGGAAGAGCG ACTAGACATC AAGCATAATC GGCCGCGCAG AGACTGCGTG 1680 

GCAGAGGGCA AAGTGTGTGA CCCACTGTGC TCCTCTGGGG GATGCTGGGG CCCAGGCCCT 1740 

GGTCAGTGCT TGTCCTGTCG AAATTATAGC CGAGGAGGTG TCTGTGTGAC CCACTGCAAC 1800 

TTTCTGAATG GGGAGCCTCG AGAATTTGCC CATGAGGCCG AATGCTTCTC CTGCCACCCG 1860 

GAATGCCAAC CCATGGGGGG CACTGCCACA TGCAATGGCT CGGGCTCTGA TACTTGTGCT 1920 

CAATGTGCCC ATTTTCGAGA TGGGCCCCAC TGTGTGAGCA GCTGCCCCCA TGGAGTCCTA 1980 

GGTGCCAAGG GCCCAATCTA CAAGTACCCA GATGTTCAGA ATGAATGTCG GCCCTGCCAT 2040 

GAGAACTGCA CCCAGGGGTG TAAAGGACCA GAGCTTCAAG ACTGTTTAGG ACAAACACTG 2100 

GTGCTGATCG GCAAAACCCA TCTGACAATG GCTTTGACAG TGATAGCAGG ATTGGTAGTG 2160 

ATTTTCATGA TGCTGGGCGG CACTTTTCTC TACTGGCGTG GGCX3CCGGAT TCAGAATAAA 2220 

AGGGCTATGA GGCGATACTT GGAACGGGGT GAGAGCATAG AGCCTCTGGA CCCCAGTGAG 2280 

AAGGCTAACA AAGTCTTGGC CAGAATCTTC AAAGAGACAG AGCTAAGGAA GCTTAAAGTG 2340 

CTTGGCTCGG GTGTCTTTGG AACTGTGCAC AAAGGAGTGT GGATCCCTGA GGGTGAATCA 2400 

ATCAAGATTC CAGTCTGCAT TAAAGTCATT GAGGACAAGA GTGGACGGCA GAGTTTTCAA 2460 

GCTGTGACAG ATCATATGCT GGCCATTGGC AGCCTGGACC ATGCCCACAT TGTAAGGCTG 2520 

CTGGGACTAT GCCCAGGGTC ATCTCTGCAG CTTGTCACTC AATATTTGCC TCTGGGTTCT 2580 

CTGCTGGATC ATGTGAGACA ACACCGGGGG GCACTGGGGC CACAGCTGCT GCTCAACTGG 2640 

GGAGTACAAA TTGCCAAGGG AATGTACTAC CTTGAGGAAC ATGGTATGGT GCATAGAAAC 2700 

CTGGCTGCCC GAAACGTGCT ACTCAAGTCA CCCAGTCAGG TTCAGGTGGC AGATTTTGGT 2760 

GTGGCTGACC TGCTGCCTCC TGATGATAAG CAGCTGCTAT ACAGTGAGGC CAAGACTCCA 2820 

ATTAAGTGGA TGGCCCTTGA GAGTATCCAC TTTGGGAAAT ACACACACCA GAGTGATGTC 2880 

TGGAGCTATG GTGTGACAGT TTGGGAGTTG ATGACCTTCG GGGCAGAGCC CTATGCAGGG 2940 

CTACGATTGG CTGAAGTACC AGACCTGCTA GAGAAGGGGG AGCGGTTGGC ACAGCCCCAG 3000 

ATCTGCACAA TTGATGTCTA CATGGTGATG GTCAAGTGTT GGATGATTGA TGAGAACATT 3060 

CGCCCAACCT TTAAAGAACT AGCCAATGAG TTCACCAGGA TGGCCCGAGA CCCACCACGG 3120 

TATCTGGTCA TAAAGAGAGA GAGTGGGCCT GGAATAGCCC CTGGGCCAGA GCCCCATGGT 3180 

CTGACAAACA AGAAGCTAGA GGAAGTAGAG CTGGAGCCAG AACTAGACCT AGACCTAGAC 3240 

TTGGAAGCAG AGGAGGACAA CCTGGCAACC ACCACACTGG GCTCCGCCCT CAGCCTACCA 3300 

GTTGGAACAC TTAATCGGCC ACGTGGGAGC CAGAGCCTTT TAAGTCCATC ATCTGGATAC 3360 

ATGCCCATGA ACCAGGGTAA TCTTGGGGGG TCTTGCCAGG AGTCTGCAGT TTCTGGGAGC 3420 

AGTGAACGGT GCCCCOGTCC AGTCTCTCTA CACCCAATGC CACGGGGATG CCTGGCATCA 3480 

GAGTCATCAG AGGGGCATGT AACAGGCTCT GAGGCTGAGC TCCAGGAGAA AGTGTCAATG 3540 

TGTAGAAGCC GGAGCAGGAG CCGGAGCCCA CGGCCACGCG GAGATAGCGC CXACCATTCC 3600 

CAGCGCCACA GTCTGCTGAC TCCTGTTACC CCACTCTCCC CACCOGGGTT AGAGGAAGAG 3660 

GATGTCAACG GTTATGTCAT GCCAGATACA CACCTCAAAG GTACTCCCTC CTCCCGGGAA 3720 

GGCACCCTTT CTTCAGTGGG TCTCAGTTCT GTCCTGGGTA CTGAAGAAGA AGATGAAGAT 3780 

GAGGAGTATG AATACATGAA CCGGAGGAGA AGGCACAGTC CACCTCATCC CCCTAGGCCA 3840 

AGTTCCCTTG AGGAGCTGGG TTATGAGTAC ATGGATGTGG GGTCAGACCT CAGTGCCTCT 3900 

CTGGGCAGCA CACAGAGTTG CCCACTCCAC CCTGTACCCA TCATGCCCAC TGCAGGCACA 3960 

ACTCCAGATG AAGACTATGA ATATATGAAT CGGCAACGAG ATGGAGGTGG TCCTGGGGGT 4020 

GATTATGCAG CCATGGGGGC CTGCCCAGCA TCTGAGCAAG GGTATGAAGA GATGAGAGCT 4080 

TTTCAGGGGC CTGGACATCA GGCCCCCCAT GTCCATTATG CCOGCCTAAA AACTCTACGT 4140 

AGCTTAGAGG CTACAGACTC TGCCTTTGAT AACCCTGATT ACTGGCATAG CAGGCTTTTC 4200 

CCCAAGGCTA ATGCCCAGAG AACGTAACTC CTGCTCCCTG TGGCACTCAG GGAGCATTTA 4260 

ATGGCAGCTA GTGCCTTTAG AGGGTACCGT CTTCTCCCTA TTCCCTCTCT CTCCCAGGTC 4320 

CCAGCCCCTT TTCCCCAGTC CCAGACAATT CCATTCAATC TTTGGAGGCT TTTAAACATT 4380 

TTGACACAAA ATTCTTATGG TATGTAGCCA GCTGTGCACT TTCTTCTCTT TCCCAACCCC 4440 

AGGAAAGGTT TTCCTTATTT TGTGTGCTTT CCCAGTCCCA TTCCTCAGCT TCTTCACAGG 4500 

CACTCCTGGA GATATGAAGG ATTACTCTCC ATATCCCTTC CTCTCAGGCT CTTGACTACT 4560 

TGGAACTAGG CTCTTATGTG TGCCTTTGTT TCCCATCAGA CTGTCAAGAA GAGGAAAGGG 4620 

AGGAAACCTA GCAGAGGAAA GTGTAATTTT GGTTTATGAC TCTTAACCCC CTAGAAAGAC 4680 

AGAAGCTTAA AATCTGTGAA GAAAGAGGTT AGGAGTAGAT ATTGATTACT ATCATAATTC 4740 

AGCACTTAAC TATGAGCCAG GCATCATACT AAACTTCACC TACATTATCT CACTTAGTCC 4800 

TTTATCATCC TTAAAACAAT TCTGTGACAT ACATATTATC TCATTTTACA CAAAGGGAAG 4860 

TCGGGCATGG TGGCTCATGC CTGTAATCTC AGCACTTTGG GAGGCTGAGG CAGAAGGATT 4920 

ACCTGAGGCA AGGAGTTTGA GACCAGCTTA GCCAACATAG TAAGACCCCC ATCTC 4 975 



Seq ID NO: C76 DNA Sequence 
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Nucleic Acid Accession tf : NM_001216. 
Coding sequence: 43.. 1422 
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GCCCGTACAC 
AGCCCCTGGC 
CTGTCACTGC 
TCCCCCTTGG 
AGTGAAGAGG 
GAGGATCTAC 
TCCCTGAAGT 
AATAATGCCC 
CCGCCCTGGC 
CGCCCCCAGC 
CTCCCGCCGC 
CCTCCTGGGC 
CTGCACTGGG 
CCTGCCGAGA 
GGGCGCCCGG 
AGTGCCTATG 
CAGGTCCCAG 
TATGAGGGGT 
CAGACAGTGA 
GGTGACTCTC 
GAGGCCTCCT 
AATTCCTGCC 
ACCAGCGTCG 
GTGAGCTACC 
TGTGAGAAGC 
ATGCCACTTC 




TCGCCGCCTT 
TCCCAGAACT 
TAGAGATGGC 
GGGCTGCAGG 
TCCACGTGGT 
GAGGCCTGGC 
AGCAGTTGCT 
GACTGGACAT 
CTCTGACTAC 
TGCTGAGTGC 
GGCTACAGCT 
TCCCTGCTGG 
TGGCTGCTGG 
CGTTCCTTGT 
GCCCAGCAGA 
CAGCCAGAGG 
CTTTTAACTG 



21 
I 

GGGACACCCC 
GATCCCGGCC 
GCCTGTCCAT 
TTCTGGGGAA 
AGAGGAGGAT 
GGATCTACCT 
AOCTACTGTT 
AGAAGGGGAT 
CCCAGCCTGC 
CPGCCCGGCC 
GCGCCTGCGC 
TCTGGGTCCC 
TCGTCCGGGC 
TCACCTCAGC 
OGTGTTGGCC 
GTCTCGCTTG 
ATCTGCACTC 
ACCGCCCTGT 
TAAGCAGCTC 
GAACTTCCGA 
AGTGGACAGC 
TGACATCCTA 
GCAGATGAGA 
GGTAGCCGAG 
CATCTGAGGG 
CCAAGAAATT 



31 
I 

ACAGTCAGCC 
CCTGCTCCAG 
CCCCAGAGGT 
GATGACCCAC 
CCACCCGGAG 
GAAGTTAAGC 
GAGGCTCCTG 
GACCAGAGTC 
GCGGGCCGCT 
CTGCGCCCCC 
AACAATGGCC 
GGGCGGGAGT 
TCGGAGCACA 
ACCGCCTTTG 
GCCTTTCTGG 
GAAGAAATCG 
CTGCCCTCTG 
GCCCAGGGTG 
CACACCCTCT 
GCGACGCAGC 
AGTCCTCGGG 
GCCCTGGTTT 
AGGCAGCACA 
ACTGGAGCCT 
GGAGCCGGTA 
TTTTAAAATA 



41 
I 

GCATGGCTCC 
GCCTCACTGT 
TGCCCCGGAT 
TGGGCGAGGA 
AGGAGGATCT 
CTAAATCAGA 
GAGATCCTCA 
ATTGGCGCTA 
TCCAGTCCCC 
TGGAACTCCT 
ACAGTGTGCA 
ACCGGGCTCT 
CTGTGGAAGG 
CCAGAGTTGA 
AGGAGGGCCC 
CTGAGGAAGG 
ACTTCAGCCG 
TCATCTGGAC 
CTGACACCCT 
CTTTGAATGG 
CTGCTGAGCC 
TTGGCCTCCT 
GAAGGGGAAC 
AGAGGCTGGA 
ACTGTCCTGT 
AATATTTATA 



51 
I 

CCTGTGCCCC 
GCAACTGCTG 
GCAGGAGGAT 
GGATCTGCCC 
ACCTGGAGAG 
AGAAGAGGGC 
AGAACCCCAG 
TGGAGGCGAC 
GGTGGATATC 
GGGCTTCCAG 
ACTGACCCTG 
GCAGCTGCAT 
CCACCGTTTC 
CGAGGCCTTG 
GGAAGAAAAC 
CTCAGAGACT 
CTACTTCCAA 
TGTGTTTAAC 
GTGGGGACCT 
GCGAGTGATT 
AGTCCAGCTG 
TTTTGCTGTC 
CAAAGGGGGT 
TCTTGGAGAA 
CCTGCTCATT 
AT 



Seq ID NO: C77 DNA Sequence 

Nucleic Acid Accession #: NMJJ04207.1 

Coding sequence: 63.. 1460 



1 
I 

GGCGAGAGGC 
CCATGGGAGG 
GGGGCTGGGC 
AGGCCGTCAG 
CAGCCTGGAT 
TGTGCGTGAA 
TGGGCATGGT 
TCATCACGGG 
ACTTCAGCAA 
TGTGTGCCCT 
TCCTCATCCT 
TGGTGGTCAC 
TGAGCGTCTT 
TGGGGCTCTT 
ACACCAAGGC 
CCGCGGGCTT 
TCTCCATGTT 
GCCTCGTGGT 
TCGAGGTGCT 
TGCTGATGGA 
CGACCCACGT 
TGATTTTGCT 
AGGTGGOGGC 
TGCGGGAGGT 
CCCCGGAAAC 
AAGCCGGCAA 
GGGCTCCAGC 
CGGGGTGGGA 
AAGGCATCCT 
CTCAAGGACC 
CCGCAGACAG 
GGCTCACATG 
CTGGAAGATG 
TT 



11 
I 

GGGCTGAGGC 
GGCCGTGGTG 
CGTGCTCTTC 
TGTCTTCTTC 
CTCCTCCATC 
CCGCTTTGGC 
GGCTGCGTCC 
GTTGGGTTTG 
GCGGCGCCCC 
GAGCCCGCTG 
GGGCGGCCTG 
GGCCCAGCCG 
CCGGGACCGC 
CGTCCCGCCC 
CGCCTTCCTG 
CGTGGCGGGG 
CTTCAACGGC 
CTTCTGCATC 
CATGGCCATC 
GGCGGTGGCC 
CTACATGTAC 
GCTGGGCAAC 
CGCGGAGGAG 
GGAGCATTTC 
AAGTGTCTGA 
CGCTTGCTAT 
TGCCGGCCCA 
ACCGTGTCAT 
CACCAGGGGC 
TGGAAACCCA 
GCTGGCAGGG 
GGGCCTGTGC 
GAAATAAACC 




CTCACCATCC 
CTTGGGAAGG 
CTCGCGGACC 
TTCTTTGGCA 
GTGGGCACCC 
GTGCTCGTCG 
GTGTTCATCC 
TTCTTCTGCA 
GAGAAGCTOC 
CTGAAGGCTG 

TTATTTTACA 
GCGGATCGTC 
TCCAGAGTGG 
CCCGCCTGCT 
TGCTTCGAGA 
CAGGTGCTGC 
CCACCCCTCT 
TGCGTGTGGG 



31 
I 

CGGCAGGTGA 
CCACAGGCGT 
TCATCACTGG 
TACAGGAGTT 
TGCTCTACGG 
TCATGCTTGT 
GCATCATCCA 
TCCAGCCCTC 
GGCTGGCGGC 
TGCAGGACCG 
GCTGCGTGTG 
CGCCGCGACC 
TTTAOGCCGT 
TGAGCTACGC 
TGGGCTTCAT 
TGCGGCCCTA 
TGGCGGGCTC 
TCTCCTACGG 
ACAAGTTCTC 
GGCCCCCTTC 
TGGCGGGGGC 
TTAGGAAGAA 
ACAAGCCTCC 
AGCCTGAGAA 
GGGGCCGGCA 
AACTGGACTG 
GCCCGATCAG 
ATCTGCGGTG 
GCTCCCAGGT 
CAACGTGACT 
GTGGGGCCCT 
TGAGTGTCTT 
TGGAGTGTTC 



41 
I 

GGCGGAACCA 
CAAGGCCCCT 
CTTCTCCTAC 
TGGGATCGGC 
GACAGGTCCG 
GGGGGGTCTC 
GGTCTACCTC 
GCTCATCATG 
AGCAGGTAGC 
CTACGGCTGG 
TGCCGCACTC 
CTCCCGGCGC 

CAAGGACCTG 
TGACATCTTC 
CTCCGTCTAC 
TACGGCGGGC 
CATGGTGGGG 
CAGTGCCATT 
GGGAGGCAAA 
CGAGGTGCTC 
GCCCAAAGAG 
TGCAGACTCG 
AAACGGGGAG 
GGCACAGGGA 
GCTCAGGCAG 
TGTTTTGAGG 
AAGCCAAGCC 
GGCCTGCGGC 
TTAATGGGAG 
CTCCAGCCCG 
GGGGACAGCT 
TCGTGCCGAA 



60 

120 ' 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1552 




TCCTACCCTG 
CTTTCCACCC 
TTCAAAAAGC 



75 Seq ID NO: C78 DNA Sequence 

Nucleic Acid Accession #: NM_000358.1 
Coding sequence: 48.. 2099 

0 _ 1 11 21 31 41 51 

80 | | | | | | 

GCTTGCCCGT CGGTOGCTAG CTCGCTCGGT GCGCGTCGTC CCGCTCCATG GCGCTCTTCG 60 

TGCGGCTGCT GGCTCTCGCC CTGGCTCTGG CCCTGGGCCC CGCCGCGACC CTGGCGGGTC 120 

CCGCCAAGTC GCCCTACCAG CTGGTGCTGC AGCACAGCAG GCTCOGGGGC CGCCAGCACG 180 

GCCCCAACGT GTGTGCTGTG CAGAAGGTTA TTGGCACTAA TAGGAAGTAC TTCACCAACT 240 
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65 
70 



GCAAGCAGTG GTACCAAAGG AAAATCTGTG GCAAATCAAC AGTCATCAGC TACGAGTGCT 300 

GTCCTGGATA TGAAAAGGTC CCTGGGGAGA AGGGCTGTCC AGCAGCCCTA CCACTCTCAA 360 

ACCTTTACGA GACCCTGGGA GTCGTTGGAT CCACCACCAC TCAGCTGTAC ACGGACCGCA 420 

- CGGAGAAGCT GAGGCCTGAG ATGGAGGGGC CCGGCAGCTT CACCATCTTC GCCCCTAGCA 400 

J ACGAGGCCTG GGCCTCCTTG CCAGCTGAAG TGCTGGACTC CCTGGTCAGC AATGTCAACA 540 

TTGAGCTGCT CAATGCCCTC CGCTACCATA TGGTGGGCAG GCGAGTCCTG ACTGATGAGC 600 

TGAAACACGG CATGACCCTC ACCTCTATGT ACCAGAATTC CAACATCCAG ATCCACCACT 660 

ATCCTAATGG GATTGTAACT GTGAACTGTG CCCGGCTCCT GAAAGCCGAC CACCATGCAA 720 

1 - CCAACGGGGT GGTGCACCTC ATCGATAAGG TCATCTCCAC CATCACCAAC AACATCCAGC 780 

IV AGATCATTGA GATCGAGGAC ACCTTTGAGA CCCTTCGGGC TGCTGTGGCT GCATCAGGGC B40 

TCAACACGAT GCTTGAAGGT AACGGCCAGT ACACGCTTTT GGCCCCGACC AATGAGGCCT 900 

TCGAGAAGAT CCCTAGTGAG ACTTTGAACC GTATCCTGGG CGACCCAGAA GCCCTGAGAG 960 

ACCTGCTGAA CAACCACATC TTGAAGTCAG CTATGTGTGC TGAAGCCATC GTTGCGGGGC 1020 

TGTCTGTAGA GACCCTGGAG GGCACGACAC TGGAGGTGGG CTGCAGCGGG GACATGCTCA 1080 

13 CTATCAACGG GAAGGCGATC ATCTCCAATA AAGACATCCT AGCCACCAAC GGGGTGATCC 1140 

ACTACATTGA TGAGCTACTC ATCCCAGACT CAGCCAAGAC ACTATTTGAA TTGGCTGCAG 1200 

AGTCTGATGT GTCCACAGCC ATTGACCTTT TCAGACAAGC CGGCCTCGGC AATCATCTCT 1260 

CTGGAAGTGA GCGGTTGACC CTCCTGGCTC CCCTGAATTC TGTATTCAAA GATGGAACCC 1320 

CTCCAATTGA TGCCCATACA AGGAATTTGC TTCGGAACCA CATAATTAAA GACCAGCTGG 1380 

CCTCTAAGTA TCTGTACCAT GGACAGACCC TGGAAACTCT GGGCGGCAAA AAACTGAGAG 1440 

TTTTTGTTTA TCGTAATAGC CTCTGCATTG AGAACAGCTG CATCGCGGCC CACGACAAGA 1500 

GGGGGAGGTA CGGGACCCTG TTCACGATGG ACCGGGTGCT GACCCCCCCA ATGGGGACTG 1560 

TCATGGATGT CCTGAAGGGA GACAATCGCT TTAGCATGCT GGTAGCTGCC ATCCAGTCTG 1620 

rye CAGGACTGAC GGAGACCCTC AACCGGGAAG GAGTCTACAC AGTCTTTGCT CCCACAAATG 1680 

ZJ AAGCCTTCCG AGCCCTGCCA CCAAGAGAAC GGAGCAGACT CTTGGGAGAT GCCAAGGAAC 1740 

TTGCCAACAT CCTGAAATAC CACATTGGTG ATGAAATCCT GGTTAGCGGA GGCATCGGGG 18 00 

CCCTGGTGCG GCTAAAGTCT CTCCAAGGTG ACAAGCTGGA AGTCAGCTTG AAAAACAATG 1860 

TGGTGAGTGT CAACAAGGAG CCTGTTGCCG AGCCTGACAT CATGGCCACA AATGGCGTGG 1920 

„ TCCATGTCAT CACCAATGTT CTGCAGCCTC CAGCCAACAG ACCTCAGGAA AGAGGGGATG 1980 

0\J AACTTGCAGA CTCTGCGCTT GAGATCTTCA AACAAGCATC AGCGTTTTCC AGGGCTTCCC 2040 

AGAGGTCTGT GCGACTAGCC CCTGTCTATC AAAAGTTATT AGAGAGGATG AAGCATTAGC 2100 

TTGAAGCACT ACAGGAGGAA TGCACCACGG CAGCTCTCCG CCAATTTCTC TCAGATTTCC 2160 

ACAGAGACTG TTTGAATGTT TTCAAAACCA AGTATCACAC TTTAATGTAC ATGGGCCGCA 2220 

nc CCATAATGAG ATGTGAGCCT TGTGCATGTG GGGGAGGAGG GAGAGAGATG TACTTTTTAA 2280 

J J ATCATGTTCC CCCTAAACAT GGCTGTTAAC CCACTGCATG CAGAAACTTG GATGTCACTG 2340 

CCTGACATTC ACTTCCAGAG AGGACCTATC CCAAATGTGG AATTGACTGC CTATGCCAAG 2400 

TCCCTGGAAA AGGAGCTTCA GTATTGTGGG GCTCATAAAA CATGAATCAA GCAATCCAGC 2460 

CTCATGGGAA GTCCTGGCAC AGTTTTTGTA AAGCCCTTGC ACAGCTGGAG AAATGGCATC 2520 

ATTATAAGCT ATGAGTTGAA ATGTTCTGTC AAATGTGTCT CACATCTACA CGTGGCTTGG 2580 

HU AGGCTTTTAT GGGGCCCTGT CCAGGTAGAA AAGAAATGGT ATGTAGAGCT TAGATTTCCC 2640 

TATTGTGACA GAGCCATGGT GTGTTTGTAA TAATAAAACC AAAGAAACAT A 2691 

. Seq ID NO: C79 DNA Sequence 

4j Nucleic Acid Accession #: NM_O06536.2 
Coding sequence : 109 . .2940 

1 11 21 31 41 51 

SO 1 1 I II I 

JU ACCTAAAACC TTGCAAGTTC AGGAAGAAAC CATCTGCATC CATATTGAAA ACCTGACACA 60 

ATGTATGCAG CAGGCTCAGT GTGAGTGAAC TGGAGGCTTC TCTACAACAT GACCCAAAGG 120 

AGCATTGCAG GTCCTATTTG CAACCTGAAG TTTGTGACTC TCCTGGTTGC CTTAAGTTCA 180 

GAACTCCCAT TCCTGGGAGC TGGAGTACAG CTTCAAGACA ATGGGTATAA TGGATTGCTC 240 

ff ATTGCAATTA ATCCTCAGGT ACCTGAGAAT CAGAACCTCA TCTCAAACAT TAAGGAAATG 300 

ATAACTGAAG CTTCATTTTA CCTATTTAAT GCTACCAAGA GAAGAGTATT TTTCAGAAAT 360 

ATAAAGATTT TAATACCTGC CACATGGAAA GCTAATAATA ACAGCAAAAT AAAACAAGAA 420 

TCATATGAAA AGGCAAATGT CATAGTGACT GACTGGTATG GGGCACATGG AGATGATCCA 480 

TACACCCTAC AATACAGAGG GTGTGGAAAA GAGGGAAAAT ACATTCATTT CACACCTAAT 540 

TTCCTACTGA ATGATAACTT AACAGCTGGC TACGGATCAC GAGGCCGAGT GTTTGTCCAT 600 

GAATGGGCCC ACCTCOGTTG GGGTGTGTTC GATGAGTATA ACAATGACAA ACCTTTCTAC 660 

ATAAATGGGC AAAATCAAAT TAAAGTGACA AGGTGTTCAT CTGACATCAC AGGCATTTTT 720 

GTGTGTGAAA AAGGTCCTTG CCCCCAAGAA AACTGTATTA TTAGTAAGCT TTTTAAAGAA 780 

GGATGCACCT TTATCTACAA TAGCACCCAA AATGCAACTG CATCAATAAT GTTCATGCAA 840 

AGTTTATCTT CTGTGGTTGA ATTTTGTAAT GCAAGTACCC ACAACCAAGA AGCACCAAAC 900 

CTACAGAACC AGATGTGCAG CCTCAGAAGT GCATGGGATG TAATCACAGA CTCTGCTGAC 960 

TTTCACCACA GCTTTCCCAT GAATGGGACT GAGCTTCCAC CTCCTCCCAC ATTCTCGCTT 1020 

GTACAGGCTG GTGACAAAGT GGTCTGTTTA GTGCTGGATG TGTCCAGCAA GATGGCAGAG 1080 

GCTGACAGAC TCCTTCAACT ACAAGAAGCC GCAGAATTTT ATTTGATGCA GATTGTTGAA 1140 

ATTCATACCT TCGTGGGCAT TGCCAGTTTC GACAGCAAAG GAGAGATCAG AGCCCAGCTA 1200 

CACCAAATTA ACAGCAATGA TGATCGAAAG TTGCTGGTTT CATATCTGCC CACCACTGTA 1260 

TCAGCTAAAA CAGACATCAG CATTTGTTCA GGGCTTAAGA AAGGATTTGA GGTGGTTGAA 1320 

AAACTGAATG GAAAAGCTTA TGGCTCTGTG ATGATATTAG TGACCAGCGG AGATGATAAG 1380 

CTTCTTGGCA ATTGCTTACC CACTGTGCTC AGCAGTGGTT CAACAATTCA CTCCATTGCC 1440 

nr C ^ X3G ^ TTCAT CTGCAGCCCC AAATCTGGAG GAATTATCAC GTCTTACAGG AGGTTTAAAG 1500 

TTCTTTGTTC CAGATATATC AAACTCCAAT AGCATGATTG ATGCTTTCAG TAGAATTTCC 1560 

TCTGGAACTG GAGACATTTT CCAGCAACAT ATTCAGCTTG AAAGTACAGG TGAAAATGTC 1620 

AAACCTCACC ATCAATTGAA AAACACAGTG ACTGTGGATA ATACTGTGGG CAACGACACT 1680 

ATGTTTCTAG TTACGTGGCA GGCCAGTGGT CCTCCTGAGA TTATATTATT TGATCCTGAT 1740 

GGACGAAAAT ACTACACAAA TAATTTTATC ACCAATCTAA CTTTTCGGAC AGCTAGTCTT 1800 

OU TGGATTCCAG GAACAGCTAA GCCTGGGCAC TGGACTTACA CCCTGAACAA TACCCATCAT 1860 

TCTCTGCAAG CCCT GAAA GT GACAGTGACC TCTCGCGCCT CCAACTCAGC TGTGCCCCCA 1920 

GCCACTGTGG AAGCCTTTGT GGAAAGAGAC AGCCTCCATT TTCCTCATCC TGTGATGATT 1980 

TATGCCAATG TGAAACAGGG ATTTTATCCC ATTCTTAATG CCACTGTCAC TGCCACAGTT 2040 

GAGCCAGAGA CTGGAGATCC TGTTACGCTG AGACTCCTTG ATGATGGAGC AGGTGCTGAT 2100 
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GTTATAAAAA 
TATAGCTTGA 
CCAGGGAGTC 
GCTCCAAGGA 
AGCTCAGGAG 
CCACCATGCA 
TGGACAGCAC 
AGTAAAAGTC 
AAGCGAAATC 
ACGAATGGAC 
GCAATACGAG 
CCTCTGTTTA 
GGAGTTTTAA 
CATACTTTAA 
ATAAATATCC 
CATACTAACA 
ATACAGATAA 
CCTTACACTT 
GCAAAGGGAA 
AATAGCCCCA 
TCATTTAGTT 
TTTACATGAA 
CTTGCTATTT 
TTTCACTGTA 
TTTATGACAA 
TTTCTAAGTT 
TACCTAGGAA 



ATGATGGAAT 
AAGTGCATGT 
ATGCTATGTA 
AATCAGTAGG 
GCTCCTTTTC 
AAATTATTGA 
CTGGAGAAGA 
TACAGAATAT 
CTCAGCAAGC 
CTGAACATCA 
CAATGGATAG 
TTCCCCCCAA 
CAGCAATGGG 
GCAGGAAAAA 
AAAGTGTCTT 
AAGTCAAATT 
GATTTTTACA 
TGGCTATGAA 
GGGTAAAGTC 
AGCAGAGAAA 
ACTTTGATTA 
GATCATGCTA 
TGTTATATAT 
AGAGGTAACC 
AGGTCTATTG 
TATTGCCTTG 
A 



TTACTCGAGG 
CAATCACTCT 
TGTACCAGGT 
CAGAAATGAG 
AGTGCTGGGA 
CCTGGAAGCT 
CTTTGATCAG 
CCAAGATGAC 
TGGCATCAGG 
GCCAAATGGA 
GAACTCCTTA 
TTCTGATCCT 
TTTGATAGGA 
GAGAGCAGAC 
CCTTCTTAGA 
AACATCAAAA 
TGGTAGATCA 
CAAATAATAA 
GGACCAGTGT 
AGGAGGGTAG 
ATTTTTCTTT 
TATTTTATAT 
ATTTCAGATG 
TTTAACAATA 
AATTTATTTG 
GGTTATTATG 



TATTTTTTCT 
CCCAGCATAA 
TACACAGCAA 
GAGGAGCGAA 
GTTCCAGCTG 
GTAAAAGTAG 
GGCCAGGCTA 
TTTAACAATG 
GAGATATTTA 
GAAACACATG 
CAGTCTGCTG 
GTACCTGCCA 
ATCATTTGCC 
AAGAAAGAGA 
TATAAGACCC 
CTGTATTAAA 
ACAATTCTTT 
AAATTATTCT 
CAAGGAAAGT 
GTCTGCATTA 
TCTCCTTATC 
ATGTAGCCCC 
ACATCTCCCT 
TGGGTATTAC 
TNTGTAAGTT 
GAATGATAGT 



CCTTTGCTGC 
GCACCCCAGC 
ACGGTAATAT 
AGTGGGGCTT 
GCCCCCACCC 
AAGAGGAATT 
CAAGCTATGA 
CTATTTTAGT 
CGTTCTCACC 
AAAGCCACAG 
TATCTAACAT 
GAGATTATCT 
TTATTATAGT 
ATGGAACAAA 
ATGGCCTTCG 
ATGCATTGAG 
TTGGGGGTAG 
TTAAAGTAAT 
TTGTTTTATT 
TAACTGTCTG 
TGTGCAGTAC 
TAATGCAAAG 
GCTAATGCTC 
CTTTGTCTCT 
TCTACTCCCA 
TATAGCCCCN 



AAATGGTAGA 
CCACTCTATT 
TCAGATGAAT 
TAGCCGAGTC 
TGATGTGTTT 
GACCCTATCT 
AATAAGAATG 
AAATACATCA 
CCAGATTTCC 
AATTTATGTT 
TGCCCAGGCG 
TATATTGAAA 
TGTGACACAT 
ATTATTATAA 
ACTACAAAAA 
TTTTTGTACA 
ATTAGAAAAC 
GTCTTTAAAG 
GAGGTGGAAA 
TGTGAAGCAA 
AGGTTGCTTG 
CTCTTTACCT 
AGAGATCTTT 
TCATACCGGT 
TCAAAGCAGC 
TATAATGCCT 



Seq ID NO: C80 DNA Sequence 

Nucleic Acid Accession ft: Bos sequence 

Coding sequence: 1..1413 



1 
I 

ATGAAGTTTC 
AGCTCTACAA 
TATGGCCTTG 
AAGGAAAAAA 
ACATCTAQCC 
AGGGAAATGC 
TACACACCTG 
TGGAGTAATG 
GTGGTTTTTG 
CTAGCCCATG 
GAATTCTGGA 
GGCCATTCCT 
AAATATGTTG 
CTGTATGGAG 
CTCTGTGACC 
TTCAAAGACA 
ATTTCTTCCT 
AGAAATCAAG 
GAGCCAAATT 
GATGCAGCTG 
TGGAGGTATG 
AACTTCCAAG 
TATTTCTTCC 
ACACTGAAAA 



11 
I 

TTCTAATACT 
GCCTGGAAAA 
AGATAAACAA 
TCCAAGAAAT 
TGGAGATGAT 
CAGGGGGGCC 
ACATGAACCG 
TTACCCCCTT 
CCCGTGGAGC 
CTTTTGGACC 
CTACACATTC 
TAGGTCTTGG 
ACATCAACAC 
ACCCAAAAGA 
CCAATTTGAG 
GGTTCTTCTG 
TATGGCCAAC 
TTTTTCTTTT 
ATCCCAAGAG 
TTTTTAACCC 
ATGAAAGGAG 
GAATCGGGCC 
AAGGATCTAA 
GCAATAGCTG 



21 
I 

GCTCCTGCAG 
AAATAATGTG 
ACTTCCAGTG 
GCAGCACTTC 
GCACGCACCT 
CGTATGGAGG 
TGAGGATGTT 
GAAATTCAGC 
TCATGGAGAC 
TGGATCTGGC 
AGGAGGCACA 
CCATTCTAGT 
ATTTCGCCTC 
GAACCAACGC 
TTTTGATGCT 
GCTGAAGGTT 
CTTGCCATCT 
TAAAGATGAC 
CATACATTCT 
ACGTTTTTAT 
ACAGATGATG 
TAAAATTGAT 
CCAATTTGAA 
GTTTGGTTGT 



31 
I 

GCCACTGCTT 
CTATTTGGTG 
ACAAAAATGA 
TTGGGTCTGA 
CGATGTGGAG 
AAACATTATA 
GACTACGCAA 
AAGATTAACA 
TTCCATGCTT 
ATTGGAGGGG 
AACTTGTTCC 
GATCCAAAGG 
TCTGCTGATG 
TTGCCAAATC 
GTCACTACCG 
TCTGAGAGAC 
GGCATTGAAG 
AAATACTGGT 
TTTGGTTTTC 
AGGAOCTACT 
GACCCTGGTT 
GCAGTCTTCT 
TATGACTTCC 
TGA 



41 

I 

CTGGAGCTCT 
AAAGATACTT 
AATATAGTGG 
AAGTGACCGG 
TCCCCGATGT 
TCACCTACAG 
TCCGGAAAGC 
CAGGCATGGC 
TTGATGGCAA 
ATGCACATTT 
TCACTGCTGT 
CCGTAATGTT 
ACATACGTGG 
CTGACAATTC 
TGGGAAATAA 
CAAAGACCAG 
CTGCTTATGA 
TAATTAGCAA 
CTAACTTTGT 
TCTTTGTAGA 
ATCCCAAACT 
ACTCTAAAAA 
TACTCCAACG 



51 
I 

TCCCCTGAAC 
AGAAAAATTT 
AAACTTAATG 
GCAACTGGAC 
CCATCATTTC 
AATCAATAAT 
TTTCCAAGTA 
TGACATTTTG 
AGGTGGAATC 
CGATGAGGAC 
TCACGAGATT 
CCCCACCTAC 
CATTCAGTCC 
AGAACCAGCT 
GATCTTTTTC 
TGTTAATTTA 
AATTGAAGCC 
TTTAAGACCA 
GAAAAAAATT 
TAACCAGTAT 
GATTACCAAG 
CAAATACTAC 
TATCACCAAA 



Seq ID NO: C8l DNA Sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding sequence: 1. .1413 



1 
I 

ATGAAGTTTC 
AGCTCTACAA 
TATGGCCTTG 
AAGGAAAAAA 
ACATCTACCC 
AGGGAAATGC 
TACACACCTG 
TGGAGTAATG 
GTGGTTTTTG 
CTAGCCCATG 
GAATTCTGGA 
GGCCATTCCT 
AAATATGTTG 
CTGTATGGAG 
CTCTGTGACC 
TTCAAAGACA 
ATTTCTTCCT 
AGAAATCAAG 
GAGCCAAATT 



11 

I 

TTCTAATACT 
GCCTGGAAAA 
AGATAAACAA 
TCCAAGAAAT 
TGGAGATGAT 
CAGGGGGGCC 
ACATGAACCG 
TTACCCCCTT 
CCCGTGGAGC 
CTTTTGGACC 
CTACACATTC 
TAGGTCTTGG 
ACATCAACAC 
ACCCAAAAGA 
CCAATTTGAG 
GGTTCTTCTG 
TATGGCCAAC 
TTTTTCTTTT 
ATCCCAAGAG 



21 

I 

GCTCCTGCAG 
AAATAATGTG 
ACTTCCAGTG 
GCAGCACTTC 
GCACGCACCT 
CGTATGGAGG 
TGAGGATGTT 
GAAATTCAGC 
TCATGGAGAC 
TGGATCTGGC 
AGGAGGCACA 
CCATTCTAGT 
ATTTCGCCTC 
GAACCAACGC 
TTTTGATGCT 
GCTGAAGGTT 
CTTGCCATCT 
TAAAGATGAC 
CATACATTCT 



31 
I 

GCCACTGCTT 
CTATTTGGTG 
ACAAAAATGA 
TTGGGTCTGA 
CGATGTGGAG 
AAACATTATA 
GACTACGCAA 
AAGATTAACA 
TTCCATGCTT 
ATTGGAGGGG 
AACTTGTTCC 
GATCCAAAGG 
TCTGCTGATG 
TTGCCAAATC 
GTCACTACCG 
TCTGAGAGAC 
GGCATTGAAG 
AAATACTGGT 
TTTGGTTTTC 



41 
I 

CTGGAGCTCT 
AAAGATACTT 
AATATAGTGG 
AAGTGACCGG 
TCCCCGATGT 
TCACCTACAG 
TCCGGAAAGC 
CAGGCATGGC 
TTGATGGCAA 
ATGCACATTT 
TCACTGCTGT 
CCGTAATGTT 
ACATACGTGG 
CTGACAATTC 
TGGGAAATAA 
CAAAGACCAG 
CTGCTTATGA 
TAATTAGCAA 
CTAACTTTGT 



51 
I 

TCCCCTGAAC 
AGAAAAATTT 
AAACTTAATG 
GCAACTGGAC 
CCATCATTTC 
AATCAATAAT 
TTTCCAAGTA 
TGACATTTTG 
AGGTGGAATC 
CGATGAGGAC 
TCACGCCATT 
CCCCACCTAC 
CATTCAGTCC 
AGAACCAGCT 
GATCTTTTTC 
TGTTAATTTA 
AATTGAAGCC 
TTTAAGACCA 
GAAAAAAATT 



2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3671 
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780 

840 
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GATGCAGCTG TTT 'lfAACCC ACGTTTTTAT AGGACCTACT TCTTTGTAGA TAACCAGTAT 12 00 

TGGAGGTATG ATGAAAGGAG ACAGATGATG GACCCTGGTT ATCCCAAACT GATTACCAAG 1260 

AACTTCCAAG GAATCGGGCC TAAAATTGAT GCAGTCTTCT ACTCTAAAAA CAAATACTAC 1320 

TATTTCTTCC AAGGATCTAA CCAATTTGAA TATGACTTCC TACTCCAACG TATCACCAAA 1380 

ACACTGAAAA GCAATAGCTG GTTTGGTTGT TGA 1413 

Seq ID NO: C82 DNA Sequence 

Nucleic Acid Accession #: nm_006952.i 

Coding sequence: 11.. 793 

1 11 21 31 41 51 

I I I I I I 

AATCCCGACA ATGGCGAAAG ACAACTCAAC TGTTCGTTGC TTCCAGGGCC TGCTGATTTT 60 

TGGAAATGTG ATTATTGGTT GTTGCGGCAT TGCCCTGACT GCGGAGTGCA TCTTCTTTGT 120 

ATCTGACCAA CACAGCCTCT ACCCACTGCT TGAAGCCACC GACAACGATG ACATCTATGG 180 

GGCTGCCTGG ATCGGCATAT TTGTGGGCAT CTGCCTCTTC TGCCTGTCTG TTCTAGGCAT 240 

TGTAGGCATC ATGAAGTCCA GCAGGAAAAT TCTTCTGGCG TATTTCATTC TGATGTTTAT 300 

AGTATATGCC TTTGAAGTGG CATCTTGTAT CACAGCAGCA ACACAACGAG ACTTTTTCAC 360 

ACCCAACCTC TTCCTGAAGC AGATGCTAGA GAGGTACCAA AACAACAGCC CTCCAAACAA 420 

TGATGACCAG TGGAAAAACA ATGGAGTCAC CAAAACCTGG GACAGGCTCA TGCTCCAGGA 480 

CAATTGCTGT GGCGTAAATG GTCCATCAGA CTGGCAAAAA TACACATCTG CCTTCCGGAC 540 

TGAGAATAAT GATGCTGACT ATCCCTGGCC TCGTCAATGC TGTGTTATGA ACAATCTTAA 60 0 

AGAACCTCTC AACCTGGAGG CTTGTAAACT AGGCGTGCCT GGTTTTTATC ACAATCAGGG 660 

CTGCTATGAA CTGATCTCTG GTCCAATGAA CCGACACGCC TGGGGGGTTG CCTGGTTTGG 720 

ATTTGCCATT CTCTGCTGGA CTTTTTGGGT TCTCCTGGGT ACCATGTTCT ACTGGAGCAG 780 

AATTGAATAT TAAGAA 796 



Seq ID NO: C83 DNA Sequence 
Nucleic Acid Accession #: NM_001793.2 
3U Coding sequence: 71.. 2560 



1 11 21 31 41 51 

I 1 I I 1 I 

AAAGGGGCAA GAGCTGAGCG GAACACCGGC CCGCCGTCGC GGCAGCTGCT TCACCCCTCT 60 

CTCTGCAGCC ATGGGGCTCC CTCGTGGACC TCTCGCGTCT CTCCTCCTTC TCCAGGTTTQ 120 

CTGGCTGCAG TGCGCGGCCT CCGAGCCGTG CCGGGCGGTC TTCAGGGAGG CTGAAGTGAC 180 

CTTGGAGGCG GGAGGCGCGG AGCAGGAGCC CGGCCAGGCG CTGGGGAAAG TATTCATGGG 240 

CTGCCCTGGG CAAGAGCCAG CTCTGTTTAG CACTGATAAT GATGACTTCA CTGTGCGGAA 300 

TGGCGAGACA GTCCAGGAAA GAAGGTCACT GAAGGAAAGG AATCCATTGA AGATCTTCCC 360 

ATCCAAACGT ATCTTACGAA GACACAAGAG AGATTGGGTG GTTGCTCCAA TATCTGTCCC 420 

TGAAAATGGC AAGGGTCCCT TCCCCCAGAG ACTGAATCAG CTCAAGTCTA ATAAAGATAG 480 

AGACACCAAG ATTTTCTACA GCATCACGGG GCCGGGGGCA GACAGCCCCC CTGAGGGTGT 540 

CTTCGCTGTA GAGAAGGAGA CAGGCTGGTT GTTGTTGAAT AAGCCACTGG ACCGGGAGGA 600 

GATTGCCAAG TATGAGCTCT TTGGCCACGC TGTGTCAGAG AATGGTGCCT CAGTGGAGGA 660 

CCCCATGAAC ATCTCCATCA TCGTGACCGA CCAGAATGAC CACAAGCCCA AGTTTACCCA 720 

GGACACCTTC OGAGGGAGTG TCTTAGAGGG AGTCCTACCA GGTACTTCTG TGATGCAGGT 780 

GACAGCCACG GATGAGGATG ATGCCATCTA CACCTACAAT GGGGTGGTTG CTTACTCCAT 840 

CCATAGCCAA GAACCAAAGG ACCCACACGA CCTCATGTTC ACCATTCACC GGAGCACAGG 900 

CACCATCAGC GTCATCTCCA GTGGCCTGGA CCGGGAAAAA GTCCCTGAGT ACACACTGAC 960 

CATCCAGGCC ACAGACATGG ATGGGGACGG CTCCACCACC ACGGCAGTGG CAGTAGTGGA 1020 

GATCCTTGAT GCCAATGACA ATGCTCCCAT GTTTGACCCC CAGAAGTACG AGGCCCATGT 1080 

GCCTGAGAAT GCAGTGGGCC ATGAGGTGCA GAGGCTGACG GTCACTGATC TGGACGCCCC 1140 

CAACTCACCA GCGTGGCGTG CCACCTACCT TATCATGGGC GGTGACGACG GGGACCATTT 1200 

TACCATCACC ACCCACCCTG AGAGCAACCA GGGCATCCTG ACAACCAGGA AGGGTTTGGA 1260 

TTTTGAGGCC AAAAACCAGC ACACCCTGTA CGTTGAAGTG ACCAACGAGG CCCCTTTTGT 1320 

GCTGAAGCTC CCAACCTCCA CAGCCACCAT AGTGGTCCAC GTGGAGGATG TGAATGAGGC 1380 

ACCTGTGTTT GTCCCACCCT CCAAAGTCGT TGAGGTCCAG GAGGGCATCC CCACTGGGGA 1440 

GCCTGTGTGT GTCTACACTG CAGAAGACCC TGACAAGGAG AATCAAAAGA TCAGCTACCG 1500 

CATCCTGAGA GACCCAGCAG GGTGGCTAGC CATGGACCCA GACAGTGGGC AGGTCACAGC 1560 

TGTGGGCACC CTCGACCGTG AGGATGAGCA GTTTGTGAGG AACAACATCT ATGAAGTCAT 1620 

GGTCTTGGCC ATGGACAATG GAAGCCCTCC CACCACTGGC ACGGGAACCC TTCTGCTAAC 1680 

ACTGATTGAT GTCAATGACC ATGGCCCAGT CCCTGAGCCC OGTCAGATCA CCATCTGCAA 1740 

CCAAAGCCCT GTGCGCCAGG TGCTGAACAT CACGGACAAG GACCTGTCTC CCCACACCTC 1800 

CCCTTTCCAG GCCCAGCTCA CAGATGACTC AGACATCTAC TGGACGGCAG AGGTCAACGA 1860 

GGAAGGTGAC ACAGTGGTCT TGTCCCTGAA GAAGTTCCTG AAGCAGGATA CATATGACGT 1920 

GCACCTTTCT CTGTCTGACC ATGGCAACAA AGAGCAGCTG ACGGTGATCA GGGCCACTGT 1980 

GTGCGACTGC CATGGCCATG TCGAAACCTG CCCTGGACCC TGGAAGGGAG GTTTCATCCT 2040 

CCCTGTGCTG GGGGCTGTCC TGGCTCTGCT GTTCCTCCTG CTGGTGCTGC TTTTGTTGGT 2100 

GAGAAAGAAG CGGAAGATCA AGGAGCCCCT CCTACTCCCA GAAGATGACA CCCGTGACAA 2160 

CGTCTTCTAC TATGGCGAAG AGGGGGGTGG CGAAGAGGAC CAGGACTATG ACATCACCCA 2220 

GCTCCACCGA GGTCTGGAGG CCAGGCCGGA GGTGGTTCTC CGCAATGACG TGGCACCAAC 2280 

CATCATCCCG ACACCCATGT ACCGTCCTCG GCCAGCCAAC CCAGATGAAA TCGGCAACTT 2340 

TATAATTGAG AACCTGAAGG CGGCTAACAC AGACCCCACA GCCCCGCCCT ACGACACCCT 2400 

CTTGGTGTTC GACTATGAGG GCAGCGGCTC CGACGCCGCG TCCCTGAGCT CCCTCACCTC 2460 

CTCCGCCTCC GACCAAGACC AAGATTACGA TTATCTGAAC GAGTGGGGCA GCCGCTTCAA 2520 

GAAGCTGGCA GACATGTACG GTGGCGGGGA GGACGACTAG GCGGCCTGCC TGCAGGGCTG 2580 

GGGACCAAAC GTCAGGCCAC AGAGCATCTC CAAGGGGTCT CAGTTCCCCC TTCAGCTGAG 2640 

GACTTCGGAG CTTGTCAGGA AGTGGCCGTA GCAACTTGGC GGAGACAGGC TATGAGTCTG 2700 

ACGTTAGAGT GGTTGCTTCC TTAGCCTTTC AGGATGGAGG AATGTGGGCA GTTTGACTTC 2760 

AGCACTGAAA ACCTCTCCAC CTGGGCCAGG GTTGCCTCAG AGGCCAAGTT TCCAGAAGCC 2820 

TCTTACCTGC CGTAAAATGC TCAACCCTGT GTCCTGGGCC TGGGCCTGCT GTGACTGACC 2880 

TACAGTGGAC TTTCTCTCTG GAATGGAACC TTCTTAGGCC TCCTGGTGCA ACTTAATTTT 2940 

TTTTTTTAAT GCTATCTTCA AAACGTTAGA GAAAGTTCTT CAAAAGTGCA GCCCAGAGCT 3000 

GCTGGGCCCA CTGGCCGTCC TGCATTTCTG GTTTCCAGAC CCCAATGCCT CCCATTCGGA 3060 
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TGGATCTCTG CGTTTTTATA CTGAGTGTGC CTAGGTTGCC CCTTATTTTT TATTTTCCCT 3120 

GTTGCGTTGC TATAGATGAA GGGTGAGGAC AATCGTGTAT ATGTACTAGA ACTTTTTTAT 3180 

TAAAGAAACT TTTCCCAGAA AAAAA 3205 

5 Seq ID NO: C84 DNA Sequence 

Nucleic Acid Accession th NM_005629.1 
Coding sequence: 639.-2546 

ir , 1 11 21 31 41 51 

10 | | | | | | 

TAGTCGGAGC GAGGTGGCGA GTCGCTGAGC CCGCCGCGGC CCCGAGAGCG GCTGCAGCCG 60 

CCGCCGCCGG GAAGGAGAGG GCGAGGCGCG CCCGAGCCGC CGCCGCCGCC GCCACCGCCG 120 

CCGCCGCCAC CACCGCCACC GGAGTCGCGG GCCAGCCGGG CAGCCTCCGC GGGCCCCGGC 180 

CGGGGCGGGG GGCGCGGGCC ACAGGCCCCT GCTCCGGCCG TCGTTTGCAG ACCGCGGGCG 240 

CCGATGTCGC CGGCGCCCCG TTAGGATGAG TCTCGGGTCG GGCGAGGAGC CGCCGCAGCC 300 

GCCGCCGCCC GAGCCGCGGG CAGGAGCCTC GGGAGCCGCC GCCGCCGCCG CCGCCGCCCG 360 

GCCGGGCCCC GACGCCGCCC GCGCGCCCCC GGGCCCCCGA CACACATGAG ATTCTTCAGG 420 

CTCACTTTCA AGTGCTTCGT GGACTGCTTC TGACTGCX3CC GCCCGCGCCC CGCACCCCGC 480 

CGTCCGCCCG CCGCCCCGTC CCCCGGCCCG GCCGCCCCCC GGCCCCCGGC CGGCCCGCGC 540 

CCTCGGGGCC CTCCCCGGTG CCGCCGGTGC CCCCCGCCTG ACCGCCGCCC CCCGTGAGGC 600 

GCCGCGACCC CGGCCCGGCC GTGCGGCCCG CCGGGGCCAT GGCGAAGAAG AGCGCCGAGA 660 

ACGGCATCTA TAGCGTGTCC GGCGACGAGA AGAAGGGCCC CCTCATCGCG CCCGGGCCCG 720 

ACGGGGCCCC GGCCAAGGGC GACGGCCCCG TGGGCCTGGG GACACCCGGC GGCCGCCTGG 780 

CCGTGCCGCC GCGCGAGACC TGGACGCGCC AGATGGACTT CATCATGTCG TGCGTGGGCT 840 

TCGCCGTGGG CTTGGGCAAC GTGTGGCGCT TCCCCTACCT GTGCTACAAG AACGGCGGAG 900 

GTGTGTTCCT TATTCCCTAC GTCCTGATCG CCCTGGTTGG AGQAATCCCC ATTTTCTTCT 960 

TAGAGATCTC GCTGGGCCAG TTCATGAAGG CCGGCAGCAT CAATGTCTGG AACATCTGTC 1020 

CCCTGTTCAA AGGCCTGGGC TACGCCTCCA TGGTGATCGT CTTCTACTGC AACACCTACT 1080 

ACATCATGGT GCTGGCCTGG GGCTTCTATT ACCTGGTCAA GTCCTTTACC ACCACGCTGC 1140 

CCTGGGCCAC ATGTGGCCAC ACCTGGAACA CTCCCGACTG CGTGGAGATC TTCCGCCATG 1200 

AAGACTGTGC CAATGCCAGC CTGGCCAACC TCACCTGTGA CCAGCTTGCT GACCGCCGGT 1260 

CCCCTGTCAT CGAGTTCTGG GAGAACAAAG TCTTGAGGCT GTCTGGGGGA CTGGAGGTGC 1320 

CAGGGGCCCT CAACTGGGAG GTGACCCTTT GTCTGCTGGC CTGCTGGGTG CTGGTCTACT 1380 

TCTGTGTCTG GAAGGGGGTC AAATCCACGG GAAAGATCGT GTACTTCACT GCTACATTCC 1440 

CCTACGTGGT CCTGGTCGTG CTGCTGGTGC GTGGAGTGCT GCTGCCTGGC GCCCTGGATG 1500 

GCATCATTTA CTATCTCAAG CCTGACTGGT CAAAGCTGGG GTCCCCTCAG GTGTGGATAG IS 60 

ATGCGGGGAC CCAGATTTTC TTTTCTTACG CCATTGGCCT GGGGGCCCTC ACAGCCCTGG 1620 

GCAGCTACAA CCGCTTCAAC AACAACTGCT ACAAGGACGC CATCATCCTG GCTCTCATCA 1680 

ACAGTGGGAC CAGCTTCTTT GCTGGCTTCG TGGTCTTCTC CATCCTGGGC TTCATGGCTG 1740 

4U CAGAGCAGGG CGTGCACATC TCCAAGGTGG CAGAGTCAGG GCCGGGCCTG GCCTTCATCG 1800 

CCTACCCGCG GGCTGTCAOG CTGATGCCAG TGGCCCCACT CTGGGCTGCC CTGTTCTTCT 1860 

TCATGCTGTT GCTGCTTGGT CTCGACAGCC AGTTTGTAGG TGTGGAGGGC TTCATCACCG 1920 

GCCTCCTCGA CCTCCTCCOG GCCTCCTACT ACTTCCGTTT CCAAAGGGAG ATCTCTGTGG 1980 

CCCTCTGTTG TGCCCTCTGC TTTGTCATCG ATCTCTCCAT GGTGACTGAT GGCGGGATGT 2040 

ACGTCTTCCA GCTGTTTGAC TACTACTOGG CCAGCGGCAC CACCCTGCTC TGGCAGGCCT 2100 

TTTGGGAGTG CGTGGTGGTG GCCTGGGTGT ACGGAGCTGA CCGCTTCATG GACGACATTG 2160 

CCTGTATGAT CGGGTACCGA CCTTGCCCCT GGATGAAATG GTGCTGGTCC TTCTTCACCC 2220 

CGCTGGTCTG CATGGGCATC TTCATCTTCA ACGTTGTGTA CTACGAGCCG CTGGTCTACA 2280 

ACAACACCTA CGTGTACCOG TGGTGGGGTG AGGCCATGGG CTGGGCCTTC GCCCTGTCCT 2340 

CCATGCTGTG CGTGCCGCTG CACCTCCTGG GCTGCCTCCT CAGGGCCAAG GGCACCATGG 2400 

CTGAGCGCTG GCAGCACCTG ACCCAGCCCA TCTGGGGCCT CCACCACTTG GAGTACCGAG 2460 

CTCAGGACGC AGATGTCAGG GGCCTGACCA CCCTGACCCC AGTGTCCGAG AGCAGCAAGG 2520 

TCGTCGTGGT GGAGAGTGTC ATGTGACAAC TCAGCTCACA TCACCAGCTC ACCTCTGGTA 2580 

GC CATAGC AG CCCCTGCTTC AGCCCCACCG CACCCCTCCA GGGGGCCTGC CTTTCCCTGA 2640 

C ACTTTT GGG GTCTGCCTGG GGGAGGAGGG GAGAAAGCAC CATGAGTGCT CACTAAAACA 2700 

ACTTTTTCCA TTTTTAATAA AACGCCAAAA ATATCACAAC CCACCAAAAA TAGATGCCTC 2760 

TCCCCCTCCA GCCCTAGCCG AGCTGGTCCT AGGCCCCGCC TAGTGCCCCA CCCCCACCCA 2820 

CAGTGCTGCA CTCCTCCTGC CCCTGCCACG CCCACCCCCT GCCCACCTCT CCAGGCTCTG 2880 

CTCTGCAGCA CACCCGTGGG TGACCCCTCA CCCCAGAAGC AGCAGTGGCA GCTTGGGAAA 2940 

TGTGAGGAAG GGAAGGAGGG AGAGACGGGA GGGAGGAGAG AGAGGAGAAG GGAGGCAGGG 3000 

GAGGGGCAGC AGAACCAAGG CAAATATTTC AGCTGGGCTA TACCCCTCTC CCCATCCCTG 3060 

TTATAGAAGC TTAGAGAGCC AGCCAGCAAT GGAACCTTCT GGTTCCTGCG CCAATCGCCA 3120 

CCAGTATCAA TTGTGTGAGC TTGGGTGCGA GTGCACGCGT GCGTGAGTAC GGAGAGTATA 3180 

TATAGATCTC TATCTCTTAG CAAAGGTGAA TGCCAGATGT AAATGGCGCC TCTGGGCAAA 3240 

GGAGGCTTGT ATTTTGCACA TTTTATAAAA ACTTGAGAGA ATGAGATTTC TGCTTGTATA 3300 

TTTCTAAAAA GAGGAAGGAG CCCAAACCAT CCTCTCCTTA CCACTCCCAT CCCTGTGAGC 3360 

CCTACCTTAC CCCTCTGCCC CTAGCCAAGG AGTGTGAATT TATAGATCTA ACTTTCATAG 3420 

GCAAAACAAA AGCTTCGAGC TGTTGCGTGT GTGAGTCTGT TGTGTGGATG TGCGTGTGTG 3480 

GTCCCCAGCC CCAGACTGGA TTGGAAAAGT GCATGGTGGG GGCCTCGGGG CTGTCCCCAC 3540 

GCTGTCCCTT TGCCACAAGT CTGTGGGGCA AGAGGCTGCA ATATTCCGTC CTGGGTGTCT 3600 

GGGCTGCTAA CCTGGCCTGC TCAGGCTTCC CACCCTGTGC GGGGCACACC CCCAGGAAGG 3660 

GACCCTGGAC ACGGCTCCCA CGTCCAGGCT TAAGGTGGAT GCACTTCCCG CACCTCCAGT 3720 

CTTCTGTGTA GCAGCTTTAA CCCACGTTTG TCTGTCACGT CCAGTCCCGA GACGGCTGAG 3780 

TGACCCCAAG AAAGGCTTCC CCGACACCCA GACAGAGGCT GCAGGGCTGG GGCTGGGTGA 3840 

O GGGTGGCGGG CCTGCGGGGA CATTCTACTG TGCTAAAAAG CCACTGCAGA CATAGCAATA 3900 

AAAACATGTC ATTTTCC 3917 
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Seq ID NO: C85 DNA Sequence 

Nucleic Acid Accession #: NM_006516.1 

Coding sequence: 180.. 1658 

<~ 

1 11 21 31 41 51 

1 I I I I I 

TAGTCGCGGG TCCCCGAGTG AGCACGCCAG GGAGCAGGAG ACCAAACGAC GGGGGTCGGA 60 
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GTCAGAGTCG 
CGCACGCCCG 
TGGAGCCCAG 
TTGGCTCCCT 
AGGAGTTCTA 
TCACCACGCT 
TCTCTGTGGG 
TGCTGGCCTT 
TGCTGATCCT 
CCATGTATGT 
AGCTGGGCAT 
GCAACAAGGA 
GCATCGTGCT 
AGAACCGGGC 
TGCAGGAGAT 
AGCTGTTCCG 
CCCAGCAGCT 
CGGGGGTGCA 
CTGTCGTGTC 
TCGCTGGCAT 
TACCCTGGAT 
TGGGTCCTGG 
CAGCTGCCAT 
GCTTCCAGTA 
TGGTTCTGTT 
ATGAGATCGC 
AGCTGTTCCA 
GCCTGCTCCC 
AACCTGACAG 
CCAGAAGAAT 
AAATCTATTC 
ATATCAGCCT 
GAGGGTGGAG 
CTGGACCTAT 
GAGGTGGCTA 
CATTAGGATT 
CCTGAGACCA 
GCCGGGTTCT 
GGGAGCCTGC 
TGCAAGATAT 
ATATCTGGAC 
TATAAATGGC 
TTTGGATGGG 
GACTCAGGAT 
TTTGATCCCT 
ATCACATATT 
AGGCTTGAAA 



CAGTGGGAGT 
TCGCCACCCG 
CAGCAAGAAG 
GCAGTTTGGC 
CAACCAGACA 
CTGGTCCCTC 
CCTTTTCGTT 
CGTGTCCGCC 

GGGTGAAGTG 
CGTCGTCGGC 
CCTGTGGCCC 
GCCCTTCTGC 
CAAGAGTGTG 
GAAGGAAGAG 
CTCCCCCGCC 
GTCTGGCATC 
GCAGCCTGTG 



GTCCTATCTG 
CCCCATCCCA 
TGCCGTTGCA 
TGTGGAGCAA 
CTTCATCTTC 
TTCCGGCTTC 
TCCCCTGGGG 
AGCAGCCCTA 
ATGTCAGCCG 
ATTCAGGACT 
AGACAAGCAA 
GAGTCTCCTG 
ACTAAGCCCT 
GTCCTAAGGA 
TGGCCACCCG 
TGCCCCTTCC 
GTTGGGAGCA 
AGTCTCCTTT 
AAACTCACTG 
TTATATATAT 
AAGCCAACTT 
TGGTTTTTAG 
AGTGAGACAG 
CCAGTCCCTT 
GTTACCCAGA 
TGATAGTTGG 
TOGCATTATT 



CCCCGGACCG 
CGTACCCGGC 
CTGACGGGTC 
TACAACACTG 
TGGGTCCACC 
TCAGTGGCCA 
AACCGCTTTG 
GTGCTCATGG 
ATCATCGGTG 
TCACCCACAG 
ATCCTCATCG 
CTGCTGCTGA 
CCCGAGAGTC 
CTAAAGAAGC 
AGTCGGCAGA 
TACCGCCAGC 
AACGCTGTCT 
TATGCCACCA 
GTGGAGCGAG 
GCCATACTCA 
AGCATCGTGG 
TGGTT CATCG 
GGCTTCTCCA 

ACCTACTTCA 
CGGCAGGGGG 
GCTGATTCCC 
AGGATCTCTC 
AGCCGGGCCT 
TAACGGCTCC 
CAGGTTTTAT 
TGCCCACATC 
GTCGAGACAC 
CACACTAATC 
TTCTGCTGGC 
CATCTCTTCC 
CTGGAGTGCA 
GCACTGAGGG 
CTCAAGAAGA 
TTTTGGTTGT 
GTAAATACAC 
AAACATGGTT 
AAGTAAGTGG 
ACACGTACCT 
GAATATATAC 
TGTTCAAAAA 
TTGAATGTGA 



GAGCACGAGC 
GCAGCCAGAG 
GCCTCATGCT 
GAGTCATCAA 
GCTATGGGGA 
TCTTTTCTGT 
GCCGGCGGAA 
GCTTCTCGAA 
TGTACTGCGG 
CCTTTCGTGG 
CCCAGGTGTT 
GCATCATCTT 
CCCGCTTCCT 
TGCGCGGGAC 
TGATGCGGGA 
CCATCCTCAT 
TCTATTACTC 
TTGGCTCCGG 
CAGGCCGGCG 
TGACCATCGC 
CCATCTTTGG 
TGGCTGAACT 
ACTGGACCTC 
CCTACGTCTT 
AAGTTCCTGA 
GAGCCAGCCA 
AAGTGTGAGT 
AGGAGCACAG 

AGGATTTTAA 
AATTTTTTTA 
CCAGGCTTCA 
TTGCCTTCTT 
GAACTATGAA 
CTGGATCTCC 
TACCCAACCA 
GGGAGGAGAG 
CCACACTATT 
CATGGAGACT 
CAATATTAAA 
CACCTCACTC 
TTGAAATGCT 
GGTTGCAACC 
CTCATCAGTG 
ATTCTTTATC 
AACACTAGTT 
AGGGAA 



CTGAGCGGGA 
CCACCAGCGC 
GGCTGTGGGA 
TGCCCCCCAG 
GAGCATCCTG 
TGGGGGCATG 
TTCAATGCTG 
ACTGGGCAAG 
CCTGACCACA 
GGCCCTGGGC 
CGGCCTGGAC 
CATCCCGGCC 
GCTCATCAAC 
AGCTGACGTG 
GAAGAAGGTC 
CGCTGTGGTG 
CACGAGCATC 
TATCGTCAAC 
GACCCTGCAC 
GCTAGCACTG 
CTTTGTGGCC 
CTTCAGCCAG 
AAATTTCATT 
CATCATCTTC 
GACTAAAGGC 
AAGTGATAAG 
CGCCCCAGAT 
GCAGCTGGAT 
TCTCCAGCCA 
CAAAAGCAAG 
TTACTGATTT 
CCCTGAATGG 
CACCCAGCTA 
CTACAAAGCT 
CCACTCTAGG 
CTCAAATTAA 
GGGAAGGGCC 
ACCATGAGAA 
CCTGCCCTGT 
TACAGACACT 
CTGTTACTTA 
TGTGGATTGA 
ACTGCAACGG 
TCCTCTTGCT 
TTGACATTCA 
TTGTGCCAGC 



GAGCGCCGCT 
AGCGCTGCCA 
GGAGCAGTGC 
AAGGTGATCG 
CCCACCACGC 
ATTGGCTCCT 
ATGATGAACC 
TCCTTTGAGA 

ACCCTGCACC 
TCCATCATGG 
CTGCTGCAGT 
CGCAACGAGG 
ACCCATGACC 
ACCATCCTGG 
CTGCAGCTGT 
TTCGAGAAGG 
ACGGCCTTCA 
CTCATAGGCC 
CTGGAGCAGC 
TTCTTTGAAG 
GGTCCACGTC 
GTGGGCATGT 
ACTGTGCTCC 
CGGACCTTCG 
ACACCCGAGG 
CACCAGCCCG 
GAGACTTCCA 
GCAATGATGT 
ACTGTTGCTC 
TGTTATTTTT 
TTCCATGCCT 
ATCTGTAGGG 
TCTATCCCAG 
GGTCAGGCTC 
TCTTTCTTTA 
AGTCTGGGCT 
GAGGGCCTGT 
TGTGTATAGA 
AAGTTATAGT 
CCTAAACAGA 
GGGTAGGAGG 
CTTAGACTTC 
CAAAAATCTG 
AGGCATTTCT 
CGTGATGCTC 



Seq ID NO: C86 DNA Sequence 

Nucleic Acid Accession #: XM_035292.2 

Coding sequence » S3.. 1576 



1 
I 

GCTGGCTGGQ 
TGCGGGCCCG 
GGAGAAGATG 
CGTGACCCTG 
TATCGGCTCG 
GCTGGCGCTG 
CGCGGAGCTC 
CTACGGCTCG 
ATCGCAGTAC 
CTGCCCGGTG 
GGCCGTGAAC 
CAAGCTCCTG 
TGTGTCCAAT 
TGTGCTGGCA 
CACAGAGGAA 
CATCGTGACG 
GCAGATGCTG 
GTCCTGGATC 
GTTCACATCC 
CTCCATGATC 
GACGCTGCTC 
CAACTGGCTC 
TGAGCTTGAG 
CCTCTTCCTG 
CATCATCCTC 
GTGGCTCCTC 
CCCCCAGGAG 




GGCACCACCA 
CTGCCCGCCT 
ATCGTGGCCC 
CCCGAGGAGG 
TGCTACAGCG 
GCCCTGGCCC 
CTAGATCCCA 
TTATACAGCG 
ATGATCAACC 
CTGGTGTACG 
TCGTCCGAGG 
ATCCCCGTCT 
TCCAGGCTCT 
CACCCACAGC 
TACGCCTTCT 
TGCGTGGCCC 
CGGCCCATCA 
ATCGCCGTCT 
AGCGGGCTGC 
CAGGGCATCT 
ACATAGCCAG 



21 
I 

CGCTAGCGGC 
AGAGCGCGGA 
TCACGCTGCT 
TGACGCGCAC 
CCGOGTGCGG 
TCTCCAAATC 
TCCTCAAGCT 
TGGTCTTCGC 
CAGCCAAGCT 
TGAAGGCCGC 
TGATCATCCT 
ACTTCTCATT 
GCCTCTTTGC 
CCTACAGAAA 
TGCTGACCAA 
CCGTGGCCGT 
TCGTGGGCCT 
TCTTCGTGGG 
TCCTCACCCC 
CCAAGGACAT 
TGGCCATCAT 
AGGTGAACCT 
CCTTCTGGAA 
CCGTCTACTT 
TCTCCACGAC 
GAGGCCGAGT 



31 
i 

AGGCCCGGCC 
GCCGGCGGCC 
CGGCTCGGCG 
CAACGGCGTG 
GGGCGTGCTC 
CGTCTTCTCC 
GGGCGGCGAC 
CTGGATCGAG 
CACCTACCTG 
CGTGGCCTGC 
CACCCGGGTC 
GCTGGGCTTC 
TGAAGGCACC 
CTATGGAGGA 
CCTGCCCCTG 
CCTGGCCTAC 
GGACTTCGGG 
GTCCTGCTTC 
GTCCCGGGAA 

CTTCTCCGTC 
CGGCATGATC 
GGCCCTGCCT 
GACACCCGTG 
CTTCGGGGTC 
CGTCCTGTGT 
GGCTGCCGGA 



41 
I 

GGTGCGCAGA 
GAGGAGAAGG 
CCGGCAGGCG 
GCCATCATCG 
AAGGAGGCAG 
ATCGTGGGCG 
TAOGCCTACA 
CTGCTCATCA 
CTCAAGCCGC 
CTCTGCGTGC 
CAGGATGCCT 
GTCCAGATCG 
AAACTGGATG 
TGGAATTACT 
GCCATCATCA 
TTCACCACCC 
AACTATCACC 
GGCTCCGTCA 
GGCCACCTGC 
CTCGTGTTCA 
ATCAACTTCT 
TGGCTGOGCC 
GTGTTCTTCA 
GAGTGTGGCA 
TGGTGGAAAA 
CAGAAGCTCA 
GGAGCATGC 



51 
I 

GCATGGCGGG 
AAGAGGCGCG 
AGGGCGAGGG 
TGGGGACCAT 

CGCTCTGCTA 
TGCTGGAGGT 
TCCGGCCTTC 
TCTTCCCCAC 
TGCTGCTCAC 
TTGCCGCCGC 
GAAAGGGTGA 
TGGGGAACAT 
TGAATTTCGT 

TGTCCACCGA 
TGGGCGTCAT 
ATGGGTCCCT 
CCTCCATCCT 
CGTGTGTGAT 
TCAGCTTCTT 
ACAGAAAGCC 
TCCTGGCCTG 
TCGGCTTCAC 
ACAAGCCCAA 
TGCAGGTGGT 



120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1O20 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2656 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1609 



Seq ID NO: CB7 DNA Sequence 

Nucleic Acid Accession #: NM 005268.1 
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Coding sequence: 168.. 989 

1 11 21 31 41 51 

I I I I I I 

TAAAAAGCAA AAGAATTCGC GGCCGCGTCG ACACGGGCTT CCCCGAAAAC CTTCCCCGCT 60 

TCTGGATATC AAATTCAAGC TGCTTGCTGA GTCCTATTGC CGGCTGCTGG GAGCCAGGAG 120 

AGCCCTGAGG AGTAGTCACT CAGTAGCAGC TGACGCGTGG GTCCACCATG AACTGGAGTA 180 

TCTTTGAGGG ACTCCTGAGT GGGGTCAACA AGTACTCCAC AGCCTTTGGG CGCATCTGGC 240 

TGTCTCTGGT CTTCATCTTC CGCGTGCTGG TGTACCTGGT GACGGCCGAG CGTGTGTGGA 300 

1U GTGATGACCA CAAGGACTTC GACTGCAATA CTCGCCAGCC CGGCTGCTCC AACGTCTGCT 360 

TTGATGAGTT CTTCCCTGTG TCCCATGTGC GCCTCTGGGC CCTGCAGCTT ATCCTGGTGA 420 

CATGCCCCTC ACTGCTCGTG GTCATGCACG TGGCCTACCG GGAGGTTCAG GAGAAGAGGC 480 

ACCGAGAAGC CCATGGGGAG AACAGTGGGC GCCTCTACCT GAACCCCGGC AAGAAGCGGG 540 

GTGGGCTCTG GTGGACATAT GTCTGCAGCC TAGTGTTCAA GGCGAGCGTG GACATCGCCT 600 

TTCTCTATGT GTTCCACTCA TTCTACCCCA AATATATCCT CCCTCCTGTG GTCAAGTGCC 660 

ACGCAGATCC ATGTCCCAAT ATAGTGGACT GCTTCATCTC CAAGCCCTCA GAGAAGAACA 720 

TTTTCACCCT CTTCATGGTG GCCACAGCTG CCATCTGCAT CCTGCTCAAC CTCGTGGAGC 780 

TCATCTACCT GGTGAGCAAG AGATGCCACG AGTGCCTGGC AGCAAGGAAA GCTCAAGCCA 840 

TGTGCACAGG TCATCACCCC CACGGTACCA CCTCTTCCTG CAAACAAGAC GACCTCCTTT 900 

ZU CGGGTGACCT CATCTTTCTG GGCTCAGACA GTCATCCTCC TCTCTTACCA GACCGCCCCC 960 

GAGACCATGT GAAGAAAACC ATCTTGTGAG GGGCTGCCTG GACTGGTCTG GCAGGTTGGG 1020 

CCTGGATGGG GAGGCTCTAG CATCTCTCAT AGGTGCAACC TGAGAGTGGG GGAGCTAAGC 1080 

CATGAGGTAG GGGCAGGCAA GAGAGAGGAT TCAGACGCTC TGGGAGCCAG TTCCTAGTCC 1140 

TCAACTCCAG CCACCTGCCC CAGCTCGACG GCACTGGGCC AGTTCCCCCT CTGCTCTGCA 1200 

GCTCGGTTTC CTTTTCTAGA ATGGAAATAG TGAGGGCCAA TGC 1243 
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Seq ID NO: 088 DNA Sequence 
Nucleic Acid Accession #: NMJJ05130 
Coding sequence: 98.. 802 

1 11 21 31 41 51 

I 1 I I I I 

CTCTACCTGA CACAGCTGCA GCCTGCAATT CACTCCCACT GCCTGGGATT GCACTGGATC 60 

CGTGTGCTCA GAACAAGGTG AACGCCCAGC TGCAGCCATG AAGATCTGTA GCCTCACCCT 120 

GCTCTCCTTC CTCCTACTGG CTGCTCAGGT GCTCCTGGTG GAGGGGAAAA AAAAAGTGAA 180 

GAATGGACTT CACAGCAAAG TGGTCTCAGA ACAAAAGGAC ACTCTGGGCA ACACCCAGAT 240 

TAAGCAGAAA AGCAGGCCCG GGAACAAAGG CAAGTTTGTC ACCAAAGACC AAGCCAACTG 300 

CAGATGGGCT GCTACTGAGC AGGAGGAGGG CATCTCTCTC AAGGTTGAGT GCACTCAATT 360 

GGACCATGAA TTTTCCTGTG TCTTTGCTGG CAATCCAACC TCATGCCTAA AGCTCAAGGA 420 

TGAGAGAGTC TATTGGAAAC AAGTTGCCCG GAATCTGCGC TCACAGAAAG ACATCTGTAG 480 

ATATTCCAAG ACAGCTGTGA AAACCAGAGT GTGCAGAAAG GATTTTCCAG AATCCAGTCT 540 

TAAGCTAGTC AGCTCCACTC TATTTGGGAA CACAAAGCCC AGGAAGGAGA AAACAGAGAT 600 

GTCCCCCAGG GAGCACATCA AGGGCAAAGA GACCACCCCC TCTAGCCTAG CAGTGACCCA 660 

GACCATGGCC ACCAAAGCTC CCGAGTGTGT GGAGGACCCA GATATGGCAA ACCAGAGGAA 720 

GACTGCCCTG GAGTTCTGTG GAGAGACTTG GAGCTCTCTC TGCACATTCT TCCTCAGCAT 780 

AGTGCAGGAC ACGTCATGCT AATGAGGTCA AAAGAGAACG GGTTCCTTTA AGAGATGTCA 840 

TGTCGTAAGT CCCTCTGTAT ACTTTAAAGC TCTCTACAGT CCCCCCAAAA TATGAACTTT 900 

TGTGCTTAGT GAGTGCAACG AAATATTTAA ACAAGTTTTG TATTTTTTGC TTTTGTGTTT 960 

TGGAATTTGC CTTATTTTTC TTGGATGCGA TGTTCAGAGG CTGTTTCCTG CAGCATGTAT 1020 

TTCCATGGCC CACACAGCTA TGTGTTTGAG CAGCGAAGAG TCTTTGAGCT GAATGAGCCA 1080 

GAGTGATAAT TTCAGTGCAA CGAACTTTCT GCTGAATTAA TGGTAATAAA ACTCTGGGTG 1140 

TTTTTCAAAA AAAAAAAAAA AAA 1163 

Seq ID NO: C89 DNA Sequence 
Nucleic Acid Accession #i BC022542 
Coding sequence: 274.. 927 

1 11 21 31 41 51 

I I I 1 I I 

ACTTGGTCCC AGCCGATAAA TCTGGGGCAG CGCGCGGTAG GAGCTGCGGG CGGCCAGGCC 60 

CCTTCCTGCG TCCGCACCTG GCCCCGCGCG CCCCTCTCGG GCGTCCGGCT TCCGGCGTCC 120 

TGGCGGCTCG GGTGGOGGCG GTTCGGGCGG CCGCCTGGCT GCTCCTCGGG GCGGCGACGG 180 

GGCTCACGCG CGGGCCCGCC ACGGCCTTCA CCGCCGCGCG CTCTGACGCC GGCATAAGGG 240 

CCATGTGTTC TGAAATTATT TTGAGGCAAG AAGTTTTGAA AGATGGTTTC CACAGAGACC 300 

TTTTAATCAA AGTGAAGTTT GGGGAAAGCA TTGAGGACTT GCACAOGTGC CGTCTCTTAA 360 

TTAAACAGGA CATTCCTGCA GGACTTTATG TGGATCCGTA TGAGTTGGCT TCATTACGAG 420 

AGAGAAACAT AACAGAGGCA GTGATGGTTT CAGAAAATTT TGATATAGAG GCCCCTAACT 480 

ATTTG TCCAA GGAGTCTGAA GTTCTCATTT ATGCCAGACG AGATTCACAG TGCATTGACT 540 

GTTTTCAAGC CTTTTTGCCT GTGCACTGCC GCTATCATCG GCCGCACAGT GAAGATGGAG 600 

AAGCCTOGAT TGTGGTCAAT AACCCAGATT TGTTGATGTT TTGTGACCAA GAGTTCCCGA 660 

TTTTGAAATG CTGGGCTCAC TCAGAAGTGG CAGCCCCTTG TGCTTTGGAT AATGAGGATA 720 

TATGCCAATG GAACAAGATG AAGTATAAAT CAGTATATAA GAATGTGATT CTACAAGTTC 780 

CAGTGGGACT GACTGTACAT ACCTCTCTAG TATGTTCTGT GACTCTGCTC ATTACAATCC 840 

TGTGCTCTAC ATTGATCCTT GTAGCAGTTT TCAAATATGG CCATTTTTCC CTATAAGTTT 900 

TATGTAGTTA AATGCTTCCT AGAAACCTAA ATAAGATCTA TTAATTTCTG AOGAGAGGTG 960 

TTCTTCTAGA ATTAATTACT TTTATCTTTT GTCTTCATTT GTGGCCAAAA TTATGTTTAC 1020 

TAGAGGAAAT TTGGGATCAT TCTCAGCTAA TTCCAAAATG TAGTGCTCTA TTGCATGGAT 1080 

CCTTGGTAAT CCTCAAGCAT CAGATGCCAT AAGGGGAAAC TTAATTCTGC TAAATTAATG 1140 

TTTATTTTGT GAGAAGTGAC TTTATCTTCA TTTGGGGTAG AAAAATTATT TCTTTATGTA 1200 

GTAGAGACAA ATTATTCTCA TTTTGCAAGT ACTTTCAATT TAAGCTACAA ATTGAGAAAA 1260 

CCGTTATAAA TAAGAATAAA ATAGGCCAGG CACAGTGGCT CACACCTGTA ATCCCAGCAC 1320 

TTTGGGAGGC CGAGGTGGGC GGATCACCAG AGGTCAAGAG TTTGAGACCA GCTTGGTGAA 1380 

ACCCTGTCTC TACTAAAAAT ACAAAAGTTA GCTGGGGCTG GTGGTGGGCA TCTGTAGTCC 1440 

CAGCTAATTG GAAGGGTGAG GCGGGAGGAT CGCTTGAACC TGGGAGGCGG AGGTTCCAGA IS 00 
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GAGCCAAGAT 
GGAAAAACAA 
ATGTCATGAG 
CTAAGAAATT 
ACGCACTCCA 
GGACTTGATG 
TTGCCATTTT 



CGCACCACTG 
AAAAGAAGAA 
ACTATTAAAG 
AATATTAATA 
TTCTCCTTTT 
AAACTGAGTA 
AAATAAAGTT 



CACTACAGCC 
TAAAATAATT 
ATGTGCCAGA 
TAAAAATTAT 
ACATTTTATC 
CTAAGATTTG 
GTACATGAAC 



TGGGCGACAG 
TGGATGAAAA 
GTTTCAATGA 
TGATAATCTT 
ATGTTTCTTT 
GTACAGAGTA 
AAAAAAAAAA 



AACGAGACCC 
TCATGTTTAT 
AAATCATTAA 
AAATTATTGA 
TGAATATATG 
TGTCAGGAAG 
AAAAAA 



TGTCTCCAAA 
TTAAATAGTA 
AGTAGGACAG 
TTATTCCTTA 
AATTGGCAAA 
ACAACTCAGA 



1560 
1620 
1660 
1740 
1600 
1860 
1906 



Seq ID NO: C90 DNA Sequence 
Nucleic Acid Accession #: NM_004994 
Coding sequence: 20.. 2 143 



1 
I 

AGACACCTCT 
GGGCTGCTGC 
CCTGAGAACC 
CACTCGGGTG 
CCAGAAGCAA 
GCGAACCCCA 
CAAGTGGCAC 
GGCGGTGATT 
CACCTTCACT 
GCACGGAGAC 
TGGCCCCGGC 
GGGCGTCGTG 
CATCTTCGAG 
CTGGTGCAGT 
GAGACTCTAC 
CCAAGGCCAA 
CGCCACCACC 
CTCGACGGTG 
GGGTAAGGAG 
TACCACCTCG 
TTTGTTCCTC 
GCCGGAGGCG 
CGACGTGAAT 
AACCACCACC 
TGTCCACCCC 
AGGTCCCCCC 
TGCCTGCAAC 
CAAGGATGGG 
CCTTATCGCC 
GCTCTCCAAG 
GGTGCTGGGC 
CGGGGCCCTC 
GTTCGACGTG 
CCCCGGGGTG 
CCAGGACCGC 
GGGCTACGTG 
GCAGTGCCAT 
CAAACTGGTA 
TCACCTTTGT 



11 
I 

GCCCTCACCA 
TTTGCTGCCC 
AATCTCACCG 
GCAGAGATGC 
CTGTCCCTGC 
CGGTGCGGGG 
CACCACAACA 
GACGACGCCT 
CGCGTGTACA 
GGGTATCCCT 
ATTCAGGGAG 
GTTCCAACTC 
GGCCGCTCCT 
ACCACGGCCA 
ACCCGGGACG 
TCCTACTCCG 
GCCAACTACG 
ATGGGGGGCA 
TACTCGACCT 
AACTTTGACA 
GTGGCGGCGC 
CTCATGTACC 
GGCATCCGGC 
ACACCGCAGC 
TCAGAGCGCC 
ACTGCTGGCC 
GTGAACATCT 
AAGTACTGGC 
GACAAGTGGC 
AAGCTTTTCT 
COGAGGCGTC 
CGGAGTGGCA 
AAGGCGCAGA 
CCTTTGGACA 
TTCTACTGGC 
ACCTATGACA 
GTAAATCCCC 
TTCTGTTCTG 
TTTTTGTTGG 



21 
I 

TGAGCCTCTG 
CCAGACAGCG 
ACAGGCAGCT 
GTGGAGAGTC 
CCGAGACCGG 
TCCCAGACCT 
TCACCTATTG 
TTGCCCGCGC 
GCCGGGACGC 
TCGACGGGAA 
ACGCCCATTT 
GGTTTGGAAA 
ACTCTGCCTG 
ACTACGACAC 
GCAATGCTGA 
CCTGCACCAC 
ACCGGGACAA 
ACTCGGCGGG 
GTACCAGCGA 
GCGACAAGAA 
ATGAGTTCGG 
CTATGTACCG 
ACCTCTATGG 
CCACGGCTCC 
CCACAGCTGG 
CTTCTACGGC 
TCGACGCCAT 
GATTCTCTGA 
CCGCGCTGCC 
TCTTCTCTGG 
TGGACAAGCT 
GGGGGAAGAT 
TGGTGGATCC 
CGCACGACGT 
GOGTGAGTTC 
TCCTGCAGTG 
ACTGGGACCA 
GAGGAAAGGG 
AGTGTTTCTA 



31 
I 

GCAGCCCCTG 
CCAGTCCACC 
GGCAGAGGAA 
GAAATCTCTG 
TGAGCTGGAT 
GGGCAGATTC 
GATCCAAAAC 
CTTCGCACTG 
AGACATCGTC 
GGACGGGCTC 
CGACGATGAC 
CGCAGATGGC 
CACCACCGAC 
CGACGACCGG 
TGGGAAACCC 
GGACGGTOGC 
GCTCTTCGGC 
GGAGCTGTGC 
GGGCCGCGGA 

CCACGCGCTG 
CTTCACTGAG 
TCCTCGCCCT 
CCCGACGGTC 
CCCCACAGGT 
CACTACTGTG 
CGCGGAGATT 
GGGCAGGGGG 
CCGCAAGCTG 
GCGCCAGGTG 



GCTGCTGTTC 
CCGGAGCGCC 
CTTCCAGTAC 
CCGGAGTGAG 
CCCTGAGGAC 
ACCCTGGGGA 
AGGAGTGGAG 
ATAAACTTGG 



41 
I 

GTCCTGGTGC 
CTTGTGCTCT 
TACCTGTACC 
GGGCCTGCGC 
AGCGCCACGC 
CAAACCTTTG 
TACTCGGAAG 
TGGAGCGCGG 
ATCCAGTTTG 
CTGGCACACG 
GAGTTGTGGT 
GCGGCCTGCC 
GGTCGCTCCG 
TTTGGCTTCT 
TGCCAGTTTC 
TCCGACGGCT 
TTCTGCCCGA 
GTCTTCCCCT 
GATGGGCGCC 
TGCCCGGACC 
GGCTTAGATC 
GGGCCCCCCT 
GAACCTGAGC 
TGCCCCACCG 
CCCCCCTCAG 
CCTTT6AGTC 
GGGAACCAGC 
AGCCGGCCGC 
GACTCGGTCT 
TGGGTGTACA 
GCGGACGTGG 
AGCGGGCGGC 
AGCGAGGTGG 
CGAGAGAAAG 
TTGAACCAGG 



AGGAGCCAGT 
GTGGGCTGGG 
ATTCTCTAAC 



51 
I 

TCCTGGTGCT 
TCCCTGGAGA 
GCTATGGTTA 

TGAAGGCCAT 
AGGGCGACCT 
ACTTGCCGOG 
TGACGCCGCT 
GTGTCGCGGA 
CCTTTCCTCC 
CCCTGGGCAA 
ACTTCCCCTT 
ACGGCTTGCC 
GCCCCAGCGA 
CATTCATCTT 
ACCGCTGGTG 
CCCGAGCTGA 
TCACTTTCCT 
TCTGGTGCGC 
AAGGATACAG 
ATTCCTCAGT 
TGCATAAGGA 
CACGGCCTCC 
GACCCCCCAC 
CTGGCCCCAC 
CGGTGGACGA 
TGTATTTGTT 
AGGGCCCCTT 
TTGAGGAGCC 
CAGGCGCGTC 
CCCAGGTGAC 
GOCTCTGGAG 
ACOGGATGTT 
CCTATTTCTG 
TGGACCAAGT 
GTCCTGCTTT 
TTGCCGGATA 
CCCTCTCTTC 
CTTT 



Seq ID NO: C91 Sequence 

Nucleic Acid Accession #: NM_000213 

Coding sequence: 188.5656 



1 
I 

GCQCTGCCCG 
TCGCCCGCGC 
GCCCCGAGGT 
GAAGAGGATG 
CAGCGTCAGC 
CACGGAGTGT 
CCGGCGCTGC 
GGTCATGGAG 
CAGCCAGATG 
TGAGCTGGAG 
CTCCAACTCC 
GGTCCTGAGC 
CAGCGTCCCG 
CCCCCCCTTC 
TAAACTGCAG 
CATCCTGCAG 
GCTGGTCTTC 
TGGCATCATG 
GTACAGGACA 
CATCATCCCC 
CTATTTCCCT 
GCTGGAGGAG 
CCGAGGCCTT 
TCACATCCGG 



11 
I 

CCTCGTCCCC 
GCTGCAGCCC 
AGGTCCAGGA 
GCAGGGCCAC 
CTCTCTGGGA 
GTCCGTGTGG 
AACACCCAGG 
AGCAGCTTCC 
TCCCCCCAAG 
GTGTTTGAGC 
ATGTCCGATG 
CAGCTCACCA 
CAGACGGACA 
TCCTTCAAGA 
GGAGAGCGGA 
ACAGCTGTGT 
TCCACCGAGT 
AGCCGCAACG 
CAGGACTACC 
ATCTTTGCTG 
GTCTCCTCAC 
GCCTTCAATC 
CGGACAGAGG 
CGGGGGGAAG 



21 
I 

ACCCCCCAAC 
CATCTCCTAG 
CGGGCGCACA 
GCCCCAGCCC 
CCTTGGCAAA 
ATAAGGACTG 
CGGAGCTGCT 
AAATCACAGA 
GCCTGCGGGT 
CACTGQAGAG 
ATCTGGACAA 
GCGACTACAC 
TGAGGCCTGA 
ACGTCATCAG 
TCTCAGGCAA 
GCACGAGGGA 
CAGCCTTCCA 
ATGAACGGTG 
CGTCGGTGCC 
TCACCAACTA 
TGGGGGTGCT 
GGATCCGCTC 
TCACCTCCAA 
TGGGTATATA 



31 
I 

CCCCCGCGCC 
CGGCAGCCCA 
GCAGCAGCCG 
ATGGGCCAGG 
CCGCTGCAAG 
CGCCTACTGC 
GGCCGCGGGC 
GGAGACCCAG 
CCGTCTGOGG 
CCCCGTGGAC 
CCTCAAGAAG 
TATTGGATTT 
GAAGCTGAAG 
CCTGACAGAA 
CCTGGATGCT 
CATTGGCTGG 
CTATGAGGCT 
CCACCTGGAC 
CACCCTGGTG 
CTCCTATAGC 
GCAGGAGGAC 
CAACCTGGAC 
GATGTTCCAG 
CCAGGTGCAG 



41 
I 

CGCCCTCGGA 
GGCGCGGAGG 
AGGCTGGCCG 
CTGCTCCTGG 
AAGGCCCCAG 
ACAGACGAGA 
TGCCAGCGGG 
ATTGACACCA 
CCCGGTGAGG 
CTGTACATCC 
ATGGGGCAGA 
GGCAAGTTTG 
GAGCCCTGGC 
GATGTGGATG 
CCTGAGGGCG 
CGCCCGGACA 
GATGGCGCCA 
ACCACGGGCA 
CGCCTGCTCG 
TACTACGAGA 
TCGTCCAACA 
ATCCGGGCCC 
AAGACGAGGA 
CTGCGGGCCC 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 
900 
960 

1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2334 



51 
I 

CAGTCCCTGC 
GAGCGAGTCC 
GGAGAGGGAG 
CAGCCTTGAT 
TGAAGAGCTG 
TGTTCAGGGA 
AGAGCATCGT 
CCCTGCGGCG 
AGCGGCATTT 
TCATGGACTT 
ACCTGGCTCG 
TGGACAAAGT 
CCAACAGTGA 
AGTTCCGGAA 
GCTTCGATGC 
GCACCCACCT 
ACGTGCTGGC 
CCTACACCCA 
CCAAGCACAA 
AGCTTCACAC 
TCGTGGAGCT 
TAGACAGCCC 
CTGGGTCCTT 
TTGAGCACGT 



60 

120 

180 

240 

300 

360 

420 

4B0 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 
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1260 

1320 

1380 

1440 
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GGATGGGACG CACGTGTGCC AGCTGCCGGA GGACCAGAAG GGCAACATCC ATCTGAAACC 1500 

TTCCTTCTCC GACGGCCTCA AGATGGACGC GGGCATCATC TGTGATGTGT GCACCTGCGA 1560 

GCTGCAAAAA GAGGTGCGGT CAGCTCGCTG CAGCTTCAAC GGAGACTTCG TGTGCGGACA 1620 

GTGTGTGTGC AGCGAGGGCT GGAGTGGCCA GACCTGCAAC TGCTCCACCG GCTCTCTGAG 1680 

TGACATTCAQ CCCTGCCTGC GGGAGGGCGA GGACAAGCCG TGCTCCGGCC GTGGGGAGTG 1740 

CCAGTGCGGG CACTGTGTGT GCTACGGCGA AGGCCGCTAC GAGGGTCAGT TCTGCGAGTA 1800 

TGACAACTTC CAGTGTCCCC GCACTTCCGG GTTCCTGTGC AATGACCGAG GACGCTGCTC 1860 

CATGGGCCAG TGTGTGTGTG AGCCTGGTTG GACAGGCCCA AGCTGTGACT GTCCCCTCAG 1920 

CAATGCCACC TGCATCGACA GCAATGGGGG CATCTGTAAT GGACGTGGCC ACTGTGAGTG 1980 

TGGCCGCTGC CACTGCCACC AGCAGTCGCT CTACACGGAC ACCATCTGCG AGATCAACTA 2040 

CTCGGCGATC CACCCGGGCC TCTGCGAGGA CCTACGCTCC TGCGTGCAGT GCCAGGCGTG 2100 

GGGCACCGGC GAGAAGAAGG GGCGCACGTG TGAGGAATGC AACTTCAAGG TCAAGATGGT 2160 

GGACGAGCTT AAGAGAGCCG AGGAGGTGGT GGTGCGCTGC TCCTTCCGGG ACGAGGATGA 2220 

CGACTGCACC TACAGCTACA CCATGGAAGG TGACGGCGCC CCTGGGCCCA ACAGCACTGT 2280 

CCTGGTGCAC AAGAAGAAGG ACTGCCCTCC GGGCTCCTTC TGGTGGCTCA TCCCCCTGCT 2340 

CCTCCTCCTC CTGCCGCTCC TGGCCCTGCT ACTGCTGCTA TGCTGGAAGT ACTGTGCCTG 2400 

CTGCAAGGCC TGCCTGGCAC TTCTCCCGTG CTGCAACCGA GGTCACATGG TGGGCTTTAA 2460 

GGAAGACCAC TACATGCTGC GGGAGAACCT GATGGCCTCT GACCACTTGG ACACGCCCAT 2 520 

GCTGCGCAGC GGGAACCTCA AGGGCCGTGA CGTGGTCCGC TGGAAGGTCA CCAACAACAT 2580 

GCAGCGGCCT GGCTTTGCCA CTCATGCCGC CAGCATCAAC CCCACAGAGC TGGTGCCCTA 2640 

CGGGCTGTCC TTGCGCCTGG CCCGCCTTTG CACCGAGAAC CTGCTGAAGC CTGACACTCG 2700 

GGAGTGCGCC CAGCTGCGCC AGGAGGTGGA GGAGAACCTG AACGAGGTCT ACAGGCAGAT 2760 

CTCCGGTGTA CACAAGCTCC AGCAGACCAA GTTCCGGCAG CAGCCCAATG CCGGGAAAAA 2820 

GCAAGACCAC ACCATTGTGG ACACAGTGCT GATGGCGCCC CGCTCGGCCA AGCCGGCCCT 2880 

GCTGAAGCTT ACAGAGAAGC AGGTGGAACA GAGGGCCTTC CACGACCTCA AGGTGGCCCC 2940 

CGGCTACTAC ACCCTCACTG CAGACCAGGA CGCCCGGGGC ATGGTGGAGT TCCAGGAGGG 3000 

CGTGGAGCTG GTGGACGTAC GGGTGCCCCT CTTTATCCGG CCTGAGGATG ACGACGAGAA 3060 

GCAGCTGCTG GTGGAGGCCA TCGACGTGCC CGCAGGCACT GCCACCCTCG GCCGCCGCCT 3120 

GGT AAA CATC ACCATCATCA AGGAGCAAGC CAGAGACGTG GTGTCCTTTG AGCAGCCTGA 3180 

GTTCTCGGTC AGCCGCGGGG ACCAGGTGGC CCGCATCCCT GTCATCCGGC GTGTCCTGGA 3240 

CGGCGGGAAG TCCCAGGTCT CCTACCGCAC ACAGGATGGC ACCGCGCAGG GCAACCGGGA 3300 

CTACATCCCC GTGGAGGGTG AGCTGCTGTT CCAGCCTGGG GAGGCCTGGA AAGAGCTGCA 3360 

GGTGAAGCTC CTGGAGCTGC AAGAAGTTGA CTCCCTCCTG CGGGGCCGCC AGGTCCGCCG 3420 

TTTCCACGTC CAGCTCAGCA ACCCTAAGTT TGGGGCCCAC CTGGGCCAGC CCCACTCCAC 3480 

CACCATCATC ATCAGGGACC CAGATGAACT GGACCGGAGC TTCACGAGTC AGATGTTGTC 3540 

ATCACAGCCA CCCCCTCACG GCGACCTGGG CGCCCCGCAG AACCCCAATG CTAAGGCCGC 3600 

TGGGTCCAGG AAGATCCATT TCAACTGGCT GCCCCCTTCT GGCAAGCCAA TGGGGTACAG 3660 

GGTAAAGTAC TGGATTCAGG GCGACTCCGA ATCCGAAGCC CACCTGCTCG ACAGCAAGGT 3720 

GCCCTCAGTG GAGCTCACCA ACCTGTACCC GTATTGCGAC TATGAGATGA AGGTGTGCGC 3780 

CTACGGGGCT CAGGGCGAGG GACOCTACAG CTCCCTGGTG TCCTGCCGCA CCCACCAGGA 3840 

AGTGCCCAGC GAGCCAGGGC GTCTGGCCTT CAATGTCGTC TCCTCCACGG TGACCCAGCT 3900 

GAGCTGGGCT GAGCCGGCTG AGACCAACGG TGAGATCACA GCCTACGAGG TCTGCTATGG 3960 

CCTGGTCAAC GATGACAACC GACCTATTGG GCCCATGAAG AAAGTGCTGG TTGACAACCC 4020 

TAAGAACCGG ATGCTGCTTA TTGAGAACCT TCGGGAGTCC CAGCCCTACC GCTACACGGT 4080 

GAAGGCGCGC AACGGGGCCG GCTGGGGGCC TGAGCGGGAG GCCATCATCA ACCTGGCCAC 4140 

CCAGCCCAAG AGGCCCATGT CCATCCCCAT CATCCCTGAC ATCCCTATCG TGGACGCCCA 4200 

GAGCGGGGAG GACTACGACA GCTTCCTTAT GTACAGCGAT GACGTTCTAC GCTCTCCATC 4260 

GGGCAGCCAG AGGCCCAGCG TCTCCGATGA CACTGGCTGC GGCTGGAAGT TCGAGCCCCT 4320 

GCTGGGGGAG GAGCTGGACC TGCGGCGCGT CACGTGGCGG CTGCCCCCGG AGCTCATCCC 4380 

GCGCCTGTCG GCCAGCAGCG GGCGCTCCTC CGACGCCGAG GCCCCCACGG CCCCCCGGAC 4440 

GACGGCGGCG CGGGCGGGAA GGGCGGCAGC CGTGCCCCGC AGTGCGACAC CCGGGCCCCC 4500 

CGGAGAGCAC CTGGTGAATG GCCGGATGGA CTTTGCCTTC CCGGGCAGCA CCAACTCCCT 4560 

GCACAGGATG ACCACGACCA GTGCTGCTGC CTATGGCACC CACCTGAGCC CACACGTGCC 4620 

CCACCGCGTG CTAAGCACAT CCTCCACCCT CACACGGGAC TACAACTCAC TGACCCGCTC 4680 

AGAACACTCA CACTCGACCA CACTGCCCAG GGACTACTCC ACCCTCACCT CCGTCTCCTC 4740 

CCACGACTCT CGCCTGACTG CTGGTGTGCC CGACACGCCC ACCCGCCTGG TGTTCTCTGC 4800 

CCTGGGGCCC ACATCTCTCA GAGTGAGCTG GCAGGAGCCG CGGTGCGAGC GGCCGCTGCA 4860 

GGGCTACAGT GTGGAGTACC AGCTGCTGAA CGGCGGTGAG CTGCATCGGC TCAACATCCC 4920 

CAACCCTGCC CAGACCTCGG TGGTGGTGGA AGACCTCCTG CCCAACCACT CCTACGTGTT 4980 

CCGCGTGCGG GCCCAGAGCC AGGAAGGCTG GGGCCGAGAG CGTGAGGGTG TCATCACCAT 5040 

TGAATCCCAG GTGCACCCGC AGAGCCCACT GTGTCCCCTG CCAGGCTCCG CCTTCACTTT 5100 

GAGCACTCCC AGTGCCCCAG GCCCGCTGGT GTTCACTGCC CTGAGCCCAG ACTCGCTGCA 5160 

GCTGAGCTGG GAGCGGCCAC GGAGGCCCAA TGGGGATATC GTCGGCTACC TGGTGACCTG 5220 

TGAGATGGCC CAAGGAGGAG GGCCAGCCAC CGCATTCCGG GTGGATGGAG ACAGCCCCGA 5280 

GAGCCGGCTG ACCGTGCCGG GCCTCAGCGA GAACGTGCCC TACAAGTTCA AGGTGCAGGC 5340 

CAGGACCACT GAGGGCTTCG GGCCAGAGCG CGAGGGCATC ATCACCATAG AGTCCCAGGA 5400 

TGGAGGACCC TTCCCGCAGC TGGGCAGCCG TGCCGGGCTC TTCCAGCACC CGCTGCAAAG 5460 

CGAGTACAGC AGCATCACCA CCACCCACAC CAGCGCCACC GAGCCCTTCC TAGTGGATGG 5520 

GCTGACCCTG GGGGCCCAGC ACCTGGAGGC AGGCGGCTCC CTCACCCGGC ATGTGACCCA 5580 

GGAGTTTGTG AGCCGGACAC TGACCACCAG CGGAACCCTT AGCACCCACA TGGACCAACA 5640 

GTTCTTCCAA ACTTGACCGC ACCCTGCCCC ACCCCCGCCA TGTCCCACTA GGOGTCCTCC 5700 

CGACTCCTCT CCCGGAGCCT CCTCAGCTAC TCCATCCTTG CACCCCTGGG GGCCCAGCCC 5760 

ACCCGCATGC ACAGAGCAGG GGCTAGGTGT CTCCTGGGAG GCATGAAGGG GGCAAGGTCC 5820 

GTCCTCTGTG GGCCCAAACC TATTTGTAAC CAAAGAGCTG GGAGCAGCAC AAGGACCCAG 5880 

CCTTTGTTCT GCACTTAATA AATGGTTTTG CTACTGCTAA AAAAAAAAAA AAAAAAAAAA 5940 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAA 5994 

Seq ID NO: C92 DNA Sequence 
Nucleic Acid Accession ft: NM_02391S 
Coding sequence: 250.. 1326 

1 11 21 31 41 51 

I I I I 1 1 

GGCACGAGGG TTTCGTTTTC ATGCTTTACC AGAAAATCCA CTTCCCTGCC GACCTTAGTT 60 
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TCAAAGCTTA TTCTTAATTA GAGACAAGAA ACCTGTTTCA ACTTGAAGAC ACCGTATGAG 120 

GTGAATGGAC AGCCAGCCAC CACAATGAAA GAAATCAAAC CAGGAATAAC CTATGCTGAA 180 

CCCACGCCTC AATCGTCCCC AAGTGTTTCC TGACACGCAT CTTTGCTTAC AGTGCATCAC 240 

AACTGAAGAA TGGGGTTCAA CTTGACGCTT GCAAAATTAC CAAATAACGA GCTGCACGGC 300 

CAAGAGAGTC ACAATTCAGG CAACAGGAGC GACGGGCCAG GAAAGAACAC CACCCTTCAC 360 

AATGAATTTG ACACAATTGT CTTGCCGGTG CTTTATCTCA TTATATTTGT GGCAAGCATC 420 

TTGCTGAATG GTTTAGCAGT GTGGATCTTC TTCCACATTA GGAATAAAAC CAGCTTCATA 480 

TTCTATCTCA AAAACATAGT GGTTGCAGAC CTCATAATGA CGCTGACATT TCCATTTCGA 540 

ATAGTCCATG ATGCAGGATT TGGACCTTGG TACTTCAAGT TTATTCTCTG CAGATACACT 600 

TCAGTTTTGT TTTATGCAAA CATGTATACT TCCATCGTGT TCCTTGGGCT GATAAGCATT 660 

GATCGCTATC TGAAGGTGGT CAAGCCATTT GGGGACTCTC GGATGTACAG CATAACCTTC 720 

ACGAAGGTTT TATCTGTTTG TGTTTGGGTG ATCATGGCTG TTTTGTCTTT GCCAAACATC 780 

ATCCTGACAA ATGGTCAGCC AACAGAGGAC AATATCCATG ACTGCTCAAA ACTTAAAAGT 840 

CCTTTGGGGG TCAAATGGCA TACGGCAGTC ACCTATGTGA ACAGCTGCTT GTTTGTGGCC 900 

GTGCTGGTGA TTCTGATCGG ATGTTACATA GCCATATCCA GGTACATCCA CAAATCCAGC 960 

AGGCAATTCA TAAGTCAGTC AAGCCGAAAG CGAAAACATA ACCAGAGCAT CAGGGTTGTT 1020 

GTGGCTGTGT TTTTTACCTG CTTTCTACCA TATCACTTGT GCAGAATTCC TTTTACTTTT 1080 

AGTCACTTAG ACAGGCTTTT AGATGAATCT GCACAAAAAA TCCTATATTA CTGCAAAGAA 1140 

ATTACACTTT TCTTGTCTGC GTGTAATGTT TGCCTGGATC CAATAATTTA CTTTTTCATG 1200 

TGTAGGTCAT TTTCAAGAAG GCTGTTCAAA AAATCAAATA TCAGAACCAG GAGTGAAAGC 1260 

ATCAGATCAC TGCAAAGTGT GAGAAGATCG GAAGTTCGCA TATATTATGA TTACACTGAT 1320 

GTGTAGGCCT TTTATTGTTT GTTGGAATCG ATATGTACAA AGTGTAAATA AATGTTT CTT 1380 

TTCATTATCC TTAAAAAAAA AA 1402 

25 Seq ID NO: C93 DNA Sequence 

Nucleic Acid Accession #: NM_020789.1 
Coding sequence: 2 08.. 3 6 99 

„ 1 11 21 31 41 51 

30 i | | | | i 

GGCACGAGGG TGGAGCCGAG CGGTGCGGAG CAGATCTGGT GGTTCTCCGG AGAGCAGCTT 60 

CCTTGGGTGT TACATGAGCC AAGCCCTCAC TGTACAGAAG AGTGAGAGCT GAAACCTGTT 120 

CCCTGAGCTG ATCAGAAGGA CATCCCTTGG CCCCTCCATC TGGGCTCCTG TGGATAGGAG 180 

GGGCTGGGTG AGCAGGCCAG CTGGGCTATG GTGTGGTGCC TCGGCCTGGC CGTCCTCAGC 240 

CTGGTCATCA GCCAGGGGGC TGACGGTCGA GGGAAGCCTG AGGTGGTATC GGTGGTGGGC 300 

CGGGCTGAGG AGAGTGTGGT GCTGGGCTGT GACCTGCTGC CCCCGGCCGG CCGGCCCCCC 360 

CTGCATGTCA TCGAGTGGCT GCGCTTTGGA TTCCTGCTTC CCATCTTCAT CCAGTTCGGC 420 

CTCTACTCTC CCCGAATTGA CCCTGATTAC GTGGGACGAG TCCGGCTGCA GAAGGGGGCC 480 

TCTCTCCAGA TTGAGGGTCT CCGGGTGGAA GACCAGGGCT GGTACGAGTG CCGCGTGTTC 540 

TTCCTGGACC AGCACATCCC TGAAGACGAT TTTGCTAACG GCTCCTGGGT GCATCTGACA 600 

GTCAATTCAC CCCCTCAATT CCAGGAGACA CCTCCTGCTG TGTTGGAAGT GCAGGAACTG 660 

GAGCCTGTGA CCCTGCGTTG TGTGGCCCGT GGCAGCCCCC TGCCTCATGT GACGTGGAAG 720 

CTCCGAGGAA AGGACCTTGG CCAGGGCCAG GGCCAGGTGC AAGTGCAGAA CGGGACGCTG 780 

CGGATCCGCC GGGTAGAGCG AGGCAGCTCT GGGGTCTACA CCTGCCAAGC CTCCAGCACT 840 

43 GAGGGCAGCG CCACCCACGC CACCCAGCTG CTAGTGCTAG GACCCCCAGT CATCGTGGTG 900 

CCCCCCAAGA ACAGCACAGT CAATGCCTCC CAGGATGTTT CATTGGCCTG CCATGCTGAG 960 

GCATACCCTG CTAACCTCAC CTACAGCTGG TTCCAGGACA ACATCAATGT CTTCCACATT 1020 

AGCCGCCTGC AGCCCCGGGT GCAGATCCTG GTGGACGGGA GCCTGCGGCT GCTGGCCACC 1080 

CAGCCTGATG ATGCCGGCTG CTACACCTGT GTGCCCAGCA ATGGCCTCCT GCATCCACCC 1140 

TCAGCCTCTG CCTACCTCAC TGTGCTCTGC ATGCCGGGGG TGATCCGCTG CCCGGTTCGT 1200 

GCCAACCCCC CACTGCTCTT TGTCAGCTGG ACCAAGGATG GAAAGGCCCT GCAGCTGGAC 1260 

AAGTTCCCTG GCTGGTCCCA GGGCACAGAA GGCTCACTGA TCATOGCCCT GGGGAACGAG 1320 

GATGCCCTGG GAGAATACTC CTGCACCCCC TACAACAGTC TTGGTACCGC CGGGCCCTCT 1380 

CCTGTGACCC GCGTGCTGCT CAAGGCTCCC CCAGCTTTTA TAGAGCGGCC CAAGGAAGAA 1440 

TATTTCCAAG AAGTAGGGCG GGAGCTGCTC ATCCCCTGCT CCGCCCAAGG GGACCCTCCT 1500 

CCTGTTGTCT CTTGGACCAA GGTGGGCCGG GGGCTGCAAG GCCAGGCCCA GGTGGACAGC 1560 

AACAGCAGCC TCATCCTGCG ACCATTGACC AAGGAGGCCC ACGGGCACTG GGAATGCAGT 1620 

GCCAGCAATG CTGTGGCCCG AGTGGCCACC TCCACGAACG TCTACGTGCT GGGCACTAGC 1680 

CCTCATGTTG TCACCAATGT GTCCGTGGTG GCTTTGCCCA AGGGTGCCAA TGTCTCCTGG 1740 

GAGCCTGGCT TTGATGGTGG TTATCTGCAG AGATTCAGTG TCTGGTACAC CCCACTGGCC 1800 

AAGCGTCCTG ACCGAATGCA CCATGACTGG GTGTCCTTGG CAGTGCCTGT GGGGGCTGCT 1860 

CACCTCCTAG TGCCAGGGCT GCAGCCCCAC ACCCAGTACC AGTTCAGCGT GCTAGCTCAG 1920 

AACAAGCTGG GGAGTGGTCC CTTCAGCGAA ATCGTCTTGT CTGCTCCGGA AGGGCTTCCT 1980 

ACCACGCCAG CTGCACCCGG GCTTCCCCCA ACAGAGATAC CGCCTCCCCT GTCCCCTCCG 2040 

CGGGGTCTGG TGGCAGTGAG GACACCCCGG GGGGTACTCC TGCATTGGGA TCCCCCAGAG 2100 

CTGGTCCCTA AGAGACTGGA TGGCTACGTC TTGGAAGGCC GGCAAGGCTC CCAGGGCTGG 2160 

GAGGTGCTGG ACCCGGCTGT GGCAGGCACA GAAACAGAGC TGCTGGTGCC AGGCCTCATC 2220 

AAGGATGTTC TCTACGAGTT CCGCCTCGTG GCCTTCGCGG GCAGCTTCGT CAGCGACCCC 2280 

AGCAACACGG CCAACGTCTC CACTTCCGGT CTGGAGGTCT ACCCTTCGCG CACGCAGCTG 2340 

CCGGGCCTCC TGCCTCAGCC CGTGCTGGCC GGCGTGGTGG GCGGAGTCTG CTTTCTGGGA 2400 

GTGGCOGTCC TTGTGAGCAT CCTGGCCGGC TGCCTCCTGA ACCGGCGCAG GGCTGCCCGC 2460 

CGCCGCCGCA AGCGCCTCCG CCAAGATCCA CCTCTTATCT TCTCTCCGAC CGGGAAGTCA 2520 

GCTGCACCCT CTGCTCTGGG CTCAGGCAGT CCTGACAGCG TGGCGAAGCT GAAGCTCCAG 2580 

GGATCCCCAG TCCCCAGCCT GCGCCAGAGT CTGCTCTGGG GGGATCCTGC CGGAACTCCC 2640 

AGCCCCCACC CGGATCCTCC ATCTAGCCGG GGACCCTTAC CTCTGGAGCC CATTTGCCGG 2700 

GGCCCAGACG GGCGCTTTGT GATGGGGCCC ACTGTGGCGG CCCCCCAGGA AAGGTCAGGC 2760 

CGGGAGCAGG CAGAACCTCG GACTCCAGCC CAGCGTCTGG CCCGGTCCTT TGACTGTAGC 2820 

AGCAGCAGCC CCAGTGGGGC ACCCCAGCCC CTCTGCATTG AAGACATCAG CCCTGTGGCA 2880 

CCCCCTCCAG CAGCCCCACC CAGTCCCTTG CCAGGTCCTG GACCCCTGCT CCAGTACCTG 2940 

AGCCTGCCCT TCTTCCGAGA GATGAATGTG GATGGGGACT GGCCCCCGCT TGAGGAGCCC 3000 

AGCCCTGCTG CACCCCCAGA TTACATGGAT ACCCGGCGCT GTCCCACCTC ATCTTTCCTT 3060 

CGTTCTCCAG AAACCCCTCC TGTATCCCCC AGGGAATCAC TTCCTGGGGC TGTGGTAGGG 3120 

GCTGGGGCCA CTGCAGAGCC CCCTTACACA GCCCTGGCTG ACTGGACACT GAGGGAGCGG 3180 

CTGCTGCCAG GCCTTCTCCC TGCTGCCCCT CGAGGCAGCC TCACCAGCCA GAGCAGCGGG 3240 
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CGAGGCAGCG CTTCGTTCCT GCGGCCCCCC TCCACAGCCC CCTCTGCAGG AGGCAGCTAC 3300 

CTCAGCCCTG CTCCAGGAGA CACCAGCAGC TGGGCCAGTG GCCCTGAGAG ATGGCCCCGA 3360 

AGGGAGCATG TGGTGACAGT CAGCAAGAGG AGGAACACAT CTGTGGACGA GAACTATGAG 3420 

TGGGACTCAG AATTCCCTGG GGACATGGAA TTGCTGGAGA CTTTGCACCT GGGCTTGGCC 3480 

J AGCTCCCGGC TCAGACCTGA AGCTGAGACA GAGCTAGGTG TGAAGACTCC AGAGGAGGGC 3540 

TGCCTCCTGA ACACTGCCCA TGTTACTGGC CCTGAGGCCC GCTGTGCTGC CCTTCGGGAG 3600 

GAATTCCTGG CCTTCCGCCG CCGCCGAGAT GCTACTAGGG CTCGGCTACC AGCCTATCGA 3660 

CAGCCAGTCC CCCACCCCGA ACAGGCCACT CTGCTGTGAA CATCCCTAAT GTGAGGCTGT 3720 

GAAAAGGCAT ATGGACCTGC AAAGGAGGCC CCCAACCAGA CAGACCTAGT TTCAAACGAG 3780 

iU GGCACTGCCC CTGCCTGCCC CTTTGGTGCC CAGGCACAGA CCCTGATAGT GGGTTTGGGT 3840 

CACCTTGGTA TGGAATGTAT GTGCTGACCC CCTAGGTGAG TCTGGGGATT GGAACAGGGA 3900 

TCTTAGGTCT GCCTCTCTCT CTCTCTCTCT CTCTCTCTCT CTCTCTCTGT GTGTGTGTGT 3960 

GTGTGTGAAG TTTTTTACAG GTGAATAAAC AAAGTTTGAA AGAAAAAAAA AAAAAAAAAA 4020 
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Seq ID NO: C94 DNA Sequence 
Nucleic Acid Accession ft; NMJ)06875 
Coding sequence: 186.. 1190 



Seq ID NO: C95 DNA Sequence 
, A Nucleic Acid Accession #: NM_002510.1 
OU Coding sequence: 92.. 1774 



4024 



1 11 21 31 41 51 

I I I I I I 

GAATTCGGCA CGAGCGCGCG GOGAATCTCA ACGCTGCGCC GTCTGOGGGC GCTTCCGGGC 60 

CACCAGTTTC TCTGCTTTCC ACCCTGGCGC CCCCCAGCCC TGGCTCCCCA GCTGCGCTGC 120 

CCCGGGCGTC CACGCCCTGC GGGCTTAGCG GGTTCAGTGG GCTCAATCTG CGCAGCGCCA 180 

CCTCCATGTT GACCAAGCCT CTACAGGGGC CTCCCGCGCC CCCCGGGACC CCCACGCCGC 240 

CGCCAGGAGG CAAGGATCGG GAAGCGTTCG AGGCCGAGTA TCGACTCGGC CCCCTCCTGG 300 

GTAAGGGGGG CTTTGGCACC GTCTTCGCAG GACACCGCCT CACAGATCGA CTCCAGGTGG 360 

CCATCAAAGT GATTCCCCGG AATCGTGTGC TGGGCTGGTC CCCCTTGTCA GACTCAGTCA 420 

CATGCCCACT CGAAGTCGCA CTGCTATGGA AAGTGGGTGC AGGTGGTGGG CACCCTGGCG 480 

TGATCCGCCT GCTTGACTGG TTTGAGACAC AGGAAGGCTT CATGCTGGTC CTCGAGCGGC 540 

CTTTGCCCGC CCAGGATCTC TTTGACTATA TCACAGAGAA GGGCCCACTG GGTGAAGGCC 600 

CAAGCCGCTG CTTCTTTGGC CAAGTAGTGG CAGCCATCCA GCACTGCCAT TCCCGTGGAG 660 

TTGTCCATCG TGACATCAAG GATGAGAACA TCCTGATAGA CCTACGCCGT GGCTGTGCCA 720 

AACTCATTGA TTTTGGTTCT GGTGCCCTGC TTCATGATGA ACCCTACACT GACTTTGATG 780 

GGACAAGGGT GTACAGCCCC CCAGAGTGGA TCTCTCGACA CCAGTACCAT GCACTCCCGG 840 

CCACTGTCTG GTCACTGGGC ATCCTCCTCT ATGACATGGT GTGTGGGGAC ATTCCCTTTG 900 

AGAGGGACCA GGAGATTCTG GAAGCTGAGC TCCACTTCCC AGCCCATGTC TCCCCAGACT 960 

GCTGTGCCCT AATCCGCCGG TGCCTGGCCC CCAAACCTTC TTCCCGACCC TCACTGGAAG 1020 

AGATCCTGCT GGACCCCTGG ATGCAAACAC CAGCCGAGGA TGTTACCCCT CAACCCCTCC 1080 

4U AAAGGAGGCC CTGCCCCTTT GGCCTGGTCC TTGCTACCCT AAGCCTGGCC TGGCCTGGCC 1140 

TGGCCCCCAA TGGTCAGAAG AGCCATCCCA TGGCCATGTC ACAGGGATAG ATGGACATTT 1200 

GTTGACTTGG TTTTACAGGT CATTACCAGT CATTAAAGTC CAGTATTACT AAGGTAAGGG 1260 

ATTGAGGATC AGGGGTTAGA AGACATAAAC CAAGTTTGCC CAGTTCCCTT CCCAATCCTA 1320 

CAAAGGAGCC TTCCTCCCAG AACCTGTGGT CCCTGATTTT GGAGGGGGAA C ITC rT G CTT 1380 

40 CTCATTTTGC TAAGGAAGTT TATTTTGGTG AAGTTGTTCC CATTTTGAGC CCCGGGACTC 1440 

TTATTTTGAT GATGTGTCAC CCCACATTGG CACCTCCTAC TACCACCACA CAAACTTAGT 1500 

TCATATGCTT TTACTTGGGC AAGGGTGCTT TCCTTCCAAT ACCCCAGTAG CTTTTATTTT 1560 

AGTAAAGGGA CCCTTTCCCC TAGCCTAGGG TCCCATATTG GGTCAAGCTG CTTACCTGCC 1620 

TCAGCCCAGG ATTTTTTATT TTGGGGGAGG TAATGCCCTG TTGTTACCCC AAGGCTTCTT 1680 

DU TTT TTITTTT TTTTTTTTTG GGTGAGGGGA CCCTACTTTG TTATCCCAAG TGCTCTTATT 1740 

CTGGTGAGAA GAACCTTAAT TCCATAATTT GGGAAGGAAT GGAAGATGGA CACCACCGGA 1800 

CACCACCAGA CAATAGGATG GGATGGATGG TTTTTTGGGG GATGGGCTAG GGGAAATAAG 1860 

GCTTGCTGTT TGTTTTCCTG GGGCGCTCCC TCCAATTTTG CAGATTTTTG CAACCTCCTC 1920 

CTGAGCCGGG ATTGTCCAAT TACTAAAATG TAAATAATCA CGTATTGTGG GGAGGGGAGT 1980 

TCCAAGTGTG CCCTCCTTTT TTTTCCTGCC TGGATTATTT AAAAAGCCAT GTGTGGAAAC 2040 

CCACTATTTA ATAAAAGTAA TAGAATCAGA AAAAAAAAAA AAAAAAAA 2088 



1 11 21 31 41 51 

I I I I I I 

CAGATGCCAG AAGAACACTG TTGCTCTTGG TGGACGGGCC CAGAGGAATT CAGAGTTAAA 60 

CCTTGAGTGC CTGCGTCCGT GAGAATTCAG CATGGAATGT CTCTACTATT TCCTGGGATT 120 

TCTGCTCCTG GCTGCAAGAT TGCCACTTGA TGCCGCCAAA CGATTTCATG ATGTGCTGGG 180 

CAATGAAAGA CCTTCTGCTT ACATGAGGGA GCACAATCAA TTAAATGGCT GGTCTTCTGA 240 

TGAAAATGAC TGGAATGAAA AACTCTACCC AGTGTGGAAG OGGGGAGACA TGAGGTGGAA 300 

AAACTCCTGG AAGGGAGGCC GTGTGCAGGC GGTCCTGACC AGTGACTCAC CAGCCCTCGT 360 

GGGCTCAAAT ATAACATTTG CGGTGAACCT GATATTCCCT AGATGCCAAA AGGAAGATGC 420 

CAATGGCAAC ATAGTCTATG AGAAGAACTG CAGAAATGAG GCTGGTTTAT CTGCTGATCC 4 B0 

ATATGTTTAC AACTGGACAG CATGGTCAGA GGACAGTGAC GGGGAAAATG GCACCGGCCA 540 

AAGCCATCAT AACGTCTTCC CTGATGGGAA ACCTTTTCCT CACCACCCCG GATGGAGAAG 600 

ATGGAATTTC ATCTACGTCT TCCACACACT TGGTCAGTAT TTCCAGAAAT TGGGACGATG 660 

TTCAGTGAGA GTTTCTGTGA ACACAGCCAA TGTGACACTT GGGCCTCAAC TCATGGAAGT 720 

GACTGTCTAC AGAAGACATG GACGGGCATA TGTTCCCATC GCACAAGTGA AAGATGTGTA 780 

CGTGGTAACA GATCAGATTC CTGTGTTTGT GACTATGTTC CAGAAGAACG ATCGAAATTC 840 

ATCCGACGAA ACCTTCCTCA AAGATCTCCC CATTATGTTT GATGTCCTGA TTCATGATCC 900 

TAGCCACTTC CTCAATTATT CTACCATTAA CTACAAGTGG AGCTTCGGGG ATAATACTGG 960 

CCTGTTTGTT TCCACCAATC ATACTGTGAA TCACACGTAT GTGCTCAATG GAACCTTCAG 1020 

CCTTAACCTC ACTGTGAAAG CTGCAGCACC AGGACCTTGT CCGCCACCGC CACCACCACC 10 80 

CAGACCTTCA AAACCCACCC CTTCTTTAGG ACCTGCTGGT GACAACCCCC TGGAGCTGAG 1140 

TAGGATTCCT GATGAAAACT GCCAGATTAA CAGATATGGC CACTTTCAAG CCACCATCAC 1200 

AATTGTAGAG GGAATCTTAG AGGTTAACAT CATCCAGATG ACAGACGTCC TGATGCCGGT 1260 
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GCCATGGCCT 
GGAGGTCTGT 
CCCTGTGGAT 
GACGTACTGT 
GATTTCTGTT 
CTCCGTTGGC 
CAAGGAATAC 
TGTCTTTCTC 
ACTCAAAAAC 
TTTTCAGTGC 
TTATTGTTAA 
TTTAGAGATG 
AAAGCAACTT 
TAACTAGTAG 
AGCCTAACCC 
ATGCATAAAG 
TTCAATACAC 
GTGCACACTT 
TGACAACCTA 
GGACATTTAG 
ATTTCCAAAT 
AGATGAGGTC 
ACTAAAACCA 
AAGTGTGGGA 



GAAAGCTCCC 
ACCATCATTT 
GTGGATGAGA 
GTGAACCTCA 
CCTGACAGAG 
TGCTTGGCCA 
AACCCAATAG 
AACCGTGCAA 
CAAGAATTTA 
CATTGATGTG 
ATAGATATTG 
GGGAGAGGGA 
AGCAAGGCTT 
GATAGAAACA 
AGGTTAACTG 
CCAATGTAGT 
ACTCATGAAC 
GCTAGACTCA 
CTTTGCTTGG 
TTAGTGCTTT 
TTTTGTATAG 

TCTACTATAT 
AGAGACAAAA 



TAATAGACTT 
CTGACCCCAC 
TGTGTCTGCT 
CCCTGGGGGA 
ACCCAGCCTC 
TATTTGTCAC 
AAAATAGTCC 
AAGCCGTGTT 
AAGGAGTTTC 
AGATGTGCTG 
TGGTTTGGGG 
TTATACTGCA 
CTTTTCATTA 
CTGTGTCCCG 
CAAGAAGAGG 
CCAGTTTCTA 
TCCTGATGGA 
GAAAAAATAC 
CTGAGTGAAG 
TTATATACCA 
TCGCTGCACA 
CATGGCAACT 
GTTAGACATG 
AAAAAAAAA 



TGTCGTGACC 
CTGCGAGATC 
GACTGTGAGA 
TGACACAAGC 
GCCTTTAAGG 
TGTGATCTCC 
TGGGAATGTG 
CTTCCCGGGA 
TTAAATTTCG 
GAGTGGCTAT 
AAGTTGAATT 
GGCAGCTTCA 
TTTTTTATGT 
AGAGTAAGGA 
CGGGATACTT 
AGATCATGTT 
ACAATAACAG 
TACTCTCATA 
GAATGATATT 
GGCATGATGC 
TATTTGAAAT 
TGATCAGTAA 
ACATTCTTTT 



TGCCAAGGGA 
ACCCAGAACA 
CGAACCTTCA 
CTGGCTCTCA 
ATGGCAAACA 
CTCTTGGTGT 
GTCAGAAGCA 
AACCAGGAAA 
ACCTTGTTTC 
TAACCTTTTT 
TTTTATAGGT 
GCCATGTTGT 
TTCACTTATA 
GAGAAGCTAC 
TCAGCTTTCC 
CCAAGCTAAC 
GCCCAAGCCT 
AATGGGTGGG 
CATATATTCA 
TGAGTGACAC 
CATATATTAA 
GGATTTCACC 
TCTCTCCTTC 



GCATTCCCAC 
CAGTCTGCAG 
ATGGGTCTGG 
CGAGCACCCT 
GTGCCCTGAT 
ACAAAAAACA 
AAGGCCTGAG 
AGGATCCGCT 
TGAAGCTCAC 
TTCCTAAAGA 
TAAATGTCAT 
GAAACTGATA 
AAGTCTTAGG 
TATTGATTAG 
ATGTAACTGT 
TGAATCCCAC 
GTGGTATGAT 
AGTATTTTGG 
TTTATTCCAT 
TCTTGTGTAT 
GACTTTCCAA 
TCTGTTTGTA 
CTGAAAAATA 



Seq ID NO: C96 DNA Sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1-4247 



1 
I 

ATGCGAATCC 
TGGGCTAATG 
ACAGGAGCAC 
CAATCTCCTA 
AAATTTCAGG 
ACAGTGGAAA 
TTTAAAGCAA 
GAGCATAGTT 
GACCGATTTT 
ATTTTGTTTG 
GAAAGTGTTA 
CTGCCAAACT 
ACAGACACAG 

TTACAAAACA 
ACTGGAAAGG 
GACCCAGAGA 
ACCATGATTG 
CATGAATTTT 
AATATGAGTT 
AGCGACCAAC 
TTAATTGGAA 
GCTATTGTGA 
ATTTCTACCA 
CGATCCCCAA 
AATTCCACTT 
CAGACTGTGA 
GGCTCTAAAA 
AATACAGTTT 
ACTGGAGCTG 
GAGAACATAT 
GTCCTTATAC 
GAATCACTAA 
ACAGCACAGC 
ATACGTGTTG 
CAGGGTCCCT 
ACTGAGGTAA 
GTCAACGTGG 
CATGAGTCTC 
CTTGTGATCG 
TACTGGAGGA 
GTTATATCCA 
ATAAAGCACT 
TTTGAGGAAG 
CCAGACAACA 
AAGCTAGCAC 
GTTGATGGCT 
GCTGAAGATT 
AACCTCGTGG 
GAGTACGGGA 
AGGAATTTTA 
GGACGTGTGG 
TCCCTGCCAG 



11 
I 

TAAAGCGTTT 
GATACTACAG 
TGAATCAAAA 
TCAATATTGA 
GTTGGGATAA 
TTAATCTCAC 
GCAAGATAAC 
TAGAAGGACA 
CAAGTTTTGA 
AGGTTGGGAC 
GTCGTTTTGG 
CAACTGACAA 
TTGACTGGAT 
GTGAAGTTCT 
ATTTTOGAGA 
AAGAGATTCA 
ATTATACCAG 
AGAAGTTTGC 
TGACAGATGG 
ATGTTCTTCA 
TGATTGTCGA 
CTGAAGAAAT 
ATCCTGGTAG 
CAACACACTA 
CAAGAGGAAG 
CCCAACCAGT 
CTGAACTGCC 
CTGTTCTTAG 
CTATAACAGA 
AAGATTCTTC 
CCCAAGGGTA 
CAGAATCTGC 
AGGATCCTTC 
CCGATGTTGG 
ATGAATCTGA 
CAGTTACAGA 
CACCTCATGC 
TATACTCGCA 
GTATTGGTCT 
TGTCAGCCCT 
AATGCTTCCA 
CACCTCCAAC 
TTCCAAAGCA 
TGCAGAGCTG 
AGCACAAGAA 
AGCTTGCTGA 
ACAACAGACC 
TCTGGAGAAT 
AGAAAGGAAG 
ACTTTCTGGT 
CTCTAAGAAA 
TCACACAGTA 
TGCTGACCTT 



21 
I 

CCTCGCTTGC 
ACAACAGAGA 
AAATTGGGGA 
TGAAGATCTT 
AACATCATTG 
TAATGACTAC 
TTTTCACTGG 
AAAATTTCCA 
GGAAGCAGTC 
AGAAGAAAAT 
GAAGCAGGCT 
GTATTACATT 
TGTTTTTAAA 
TACAATGCAA 
GCAACAGTAC 
TGAAGCAGTT 
CCTTCTTGTT 
AGTTTTGTAC 
CTATCAAGAC 
GATAGTAGCC 
CATGCCTACT 
AATCAAGGAG 
AGACAGTGCT 
CAATCGCATA 
TGAATTCTCT 
CACTAAATTA 
ACCTCACACT 
ATCTCCACAT 
ATATGAGGAG 
AGGCTCCAGT 
TATATTTTCC 
TAGAAATGCT 
TATGGAGGGA 
ATCAGGCAGA 
GAAGACAACC 
TCTGGAAATG 
TTTTACCCCA 
GACAACCCAA 
AGCTGAGGGG 
GACTTTTATC 
GACTGCACAC 
ACCTATCTTT 
TGTTGCAGAT 
TACTGTTGAC 
TCGATACATA 
AAAGGATGGC 
AAAAGCTTAT 
GATATGGGAA 
GAGAAAATGT 
CACTCAGAAG 
CACAAAAATA 
TCACTACACG 
TGTGAGAAAG 



31 
I 

ATTCAGCTCC 
AAACTTGTTG 
AAGAAATATC 
ACACAAGTAA 
GAAAACACAT 
CGTGTCAGCG 
GGAAAATGCA 
CTTGAGATGC 
AAAGGAAAAG 
TTGGATTTCA 
GCTTTAGATC 
TACAATGGCT 
GATACAGTTA 
CAATCTGGTT 
AAGTTCTCTA 
TGTAGTTCAG 
ACATGGGAAA 
CAGCAGTTGG 
TTGGGTGCTA 
ATATGCACTA 
GATAATCCTG 
GAGGAAGAGG 
ACAAACCAAA 
GGGACGAAAT 
GGAAAGGGTG 
GCCACAGAAA 
GTGGAAGGTA 
ATGAACTTGT 
GAGAGTTTAT 
CCCGCAACTT 
TCCGAAAACC 
TCCGAAGATT 
AATGTGTGGT 
GAGAGCTTTC 
AAGTCCTTTT 
CCACATTATT 
TCCTCCAGAC 
CCGGTATACA 
TTGGAATCCG 
TGTCTAGTGG 
TTTTACTTAG 
CCAATTTCAG 
TTACATGCAA 
TTAGGTATTA 
AATATCGTTG 
AAACTGACTG 
ATTGCTGCCC 
CATAATGTGG 
GATCAGTACT 
AGTGTGCAAG 
AAAAAGGGCT 
CAGTGGCCTG 
GCAGCCTATG 



41 

I 

TCTGTGTTTG 
AAGAGATTGG 
CAACATGTAA 
ATGTGAATCT 
TCATTCATAA 
GAGGAGTTTC 
ATATGTCATC 
AAATCTACTG 
GGAAGTTAAG 
AAGCGATTAT 
CATTCATACT 
CATTGACATC 
GCATCTCTGA 
ATGTCATGCT 
GACAGGTGTT 
AACCAGAAAA 
GACCTCGAGT 
ATGGAGAGGA 
TTCTCAATAA 
ATGGCTTATA 
AACTTGATCT 
GAAAAGACAT 
TCAGGAAAAA 
ACAATGAAGC 
ATGTTCCCAA 
AAGATATTTC 
CTTCAGCCTC 
CGGGGACTGC 
TGACCAGTTT 
CTGCTATCCC 
CAGAGACAAT 
CAACTTCATC 
TTCCTAGCTC 
TCCAGACTAA 
CTGCAGGCCC 
CTACCTTTGC 
AACAGGATTT 
ATGAGGCCAG 
AGAAGAAGGC 
TTCTTGTGGG 
AGGACAGTAC 
ATGATGTCGG 
GTAGTGGGTT 
CAGCAGACAG 
CCTATGATCA 
ATTATATCAA 
AAGGCCCACT 
AAGTTATTGT 
GGCCTGCCGA 
TGCTTGCCTA 
CCCAGAAAGG 
ACATGGGAGT 
CCAAGCGCCA 




AAGCCAGTTG 
GATGGACTAC 
TTCCTCATAC 
TGTTCAGGCT 
CGTTTATGAT 
CCAAACCAAG 
TTTGCTACCC 
TGGAAAATAC 
TTTCCCTGAA 
TGAAGAAGGC 
GGAACCCCAG 
CAAGACTAAC 
TACATCTTTA 
CTTGACTTCT 
TTTAAATGAT 
AGAATCCTTA 
CAAGCTTGAT 
ATTCATCTCT 
AACATATGAT 
AGGTTCAGAA 
TACAGACATA 
TTACACTGAG 
AGTGATGTCA 
CTACTTCCCA 
GGTCTCCACG 
TAATAGTAGC 
AGTTATACCC 
TATTCTCATC 
ATCCCCTAGA 
AGCAATTCCA 
TACTGAAGAA 
CTCCAACCAC 
TAGCAGGGTT 
TGCCAATTAT 
GAAATCCACA 
CATGATAACA 
TGGGAGTGAG 
TTATACTGTG 
AAGACCCAGT 
ACCAGAGTAC 
TGCAGTGGGG 



1320 
1380 
1440 
1500 
1560 
1620 
16B0 
1740 
1600 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2669 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 
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CCTGTTGTCG 
AGTATGTTGC 
ATCCGTTCAC 
ACACTGGTTQ 
TATGTTAATG 
CAGCTCCTGA 
AACAGGGAAA 
TCATCCCTGA 
CAGAGCAATG 
AGGATGATAT 
GCAGAAGATG 
AAGGTCACTC 
CAGGACTTTA 
TGTCCTAAAT 
ATAAAAGAAG 
GTGACGGCAG 
TCCGTGGATG 
GACATTGAGC 
GAAGAGAATC 
GCTGAGAGCT 



TCCACTGCAG 
AGCAGATTCA 
AAAGAAATTA 
AGGCCATACT 
CACTCCTCAT 
GCCAGTCAAA 
AGAATCGAAC 
GTGGAGAAGG 
AATTCATCAT 
GGGACCATAA 
AATTTGTTTA 
TTATGGCTGA 
TCTTAGAAGC 
GGCCAAATCC 
AAGCTGCCAA 
GAACTTTCTG 
TTTACCAGGT 
AGTATCAGTT 
CATCCACCTC 
TAGAGTCTTT 



TGCTGGAGTT 
ACACGAAGGA 
TTTGGTACAA 
TAGTAAAGAA 
TCCTGGACCA 
TATACAGCAG 
TTCTTCTATC 
CACAGACTAC 
TACCCAGCAC 
TGCCCAACTG 
CTGGCCAAAT 
AGAACACAAA 
TACACAGGAT 
AGATAGCCCC 
TAGGGATGGG 
TGCTCTGACA 
AGCCAAGATG 
TCTCTACAAA 
TCTGGACAGT 
AGTTTAA 



GGAAGAACAG 
ACTGTCAACA 
ACTGAGGAGC 
ACTGAGGTGC 
GCAGGCAAAA 
AGTGACTATT 

ATCAATGCCT 
CCTCTCCTTC 
GTGGTTATGA 
AAAGATGAGC 
TGTCTATCTA 
GATTATGTAC 
ATTAGTAAAA 
CCTATGATTG 
ACCCTTATGC 
ATCAATCTGA 
GTGATCCTCA 
AATGGTGCAG 



GCACATATAT 
TATTTGGCTT 
AATATGTCTT 
TGGACAGTCA 
CAAAGCTAGA 
CTGCAGCCCT 
AAAGATCAAG 
CCTATATCAT 
ATACCATCAA 
TTCCTGATGG 
CTATAAATTG 
ATGAGGAAAA 
TTGAAGTGAG 
CTTTTGAACT 
TTCATGATGA 
ACCAACTAGA 
TGAGGCCAGG 
GCCTTGTGAG 
CATTGCCTGA 



TGTGCTAGAC 
CTTAAAACAC 
CATTCATGAT 
TATTCATGCC 
GAAACAATTC 
AAAGCAATGC 
GGTTGGCATT 
GGGCTATTAC 
GGATTTCTGG 
CCAAAACATG 
TGAGAGCTTT 
ACTTATAATT 
GCACTTTCAG 
TATAAGTGTT 
GCATGGAGGA 
AAAAGAAAAT 
AGTCTTTGCT 
CACAAGGCAG 
TGGAAATATA 
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51 
I 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 
CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 
CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 
CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 
CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATGAAAAAAA TTGGGGAAAG 
AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 
CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 
AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 
GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 
AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 
GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 
GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 
GATTTCAAAG CGATTATTGA TGGAGJTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 
TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 
AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 
ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 
TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 
TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 
AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 
TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 
CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 
GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 
TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 
AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 
GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 
AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 
ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 
AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 
ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 
GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 
AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 
AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 
GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 
GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 
GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 
GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 
AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 
TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 
CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 
TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 
GTATACAATG GTGAGACACC TCTTCAACCT TCCTACAGTA GTGAAGTCTT TCCTCTAGTC 
ACCCCTTTGT TGCTTGACAA TCAGATCCTC AACACTACCC CTGCTGCTTC AAGTAGTGAT 
TCGGCCTTGC ATGCTACGCC TGTATTTCCC AGTGTCGATG TGTCATTTGA ATCCATCCTG 
TCTTCCTATG ATGGTGCACC TTTGCTTCCA TTTTCCTCTG CTTCCTTCAG TAGTGAATTG 
TTTCGCCATC TGCATACAGT TTCTCAAATC CTTCCACAAG TTACTTCAGC TACCGAGAGT 
GATAAGGTGC CCTTGCATGC TTCTCTGCCA GTGGCTGGGG GTGATTTGCT ATTAGAGCCC 
AGCCTTGCTC AGTATTCTGA TGTGCTGTCC ACTACTCATG CTGCTTCAGA GACGCTGGAA 
TTTGGTAGTG AATCTGGTGT TCTTTATAAA ACGCTTATGT TTTCTCAAGT TGAACCACCC 
AGCAGTGATG CCATGATGCA TGCACGTTCT TCAGGGCCTG AACCTTCTTA TGCCTTGTCT 
GATAATGAGG GCTCCCAACA CATCTTCACT GTTTCTTACA GTTCTGCAAT ACCTGTGCAT 

GATTCTGTGG gtgtaactta tcagggttcc ttatttagcg gccctagcca tataccaata 

CCTAAGTCTT CGTTAATAAC CCCAACTGCA TCATTACTGC AGCCTACTCA TGCCCTCTCT 
GGTGATGGGG AATGGTCTGG AGCCTCTTCT GATAGTGAAT TTCTTTTACC TGACACAGAT 
GGGCTGACAG CCCTTAACAT TTCTTCACCT GTTTCTGTAG CTGAATTTAC ATATACAACA 
TCTGTGTTTG GTGATGATAA TAAGGCGCTT tctaaaagtg aaataatata tggaaatgag 
ACTGAACTGC AAATTCCTTC TTTCAATGAG ATGGTTTACC CTTCTGAAAG CACAGTCATG 
CCCAACATGT ATGATAATGT AAATAAGTTG AATGCGTCTT TACAAGAAAC CTCTGTTTCC 
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ATTTCTAGCA CCAAGGGCAT GTTTCCAGGG TCCCTTGCTC ATACCACCAC TAAGGTTTTT 3480 

GATCATGAGA TTAGTCAAGT TCCAGAAAAT AACTTTTCAG TTCAACCTAC ACATACTGTC 3S40 

TCTCAAGCAT CTGGTGACAC TTCGCTTAAA CCTGTGCTTA GTGCAAACTC AGAGCCAGCA 3600 

TCCTCTGACC CTGCTTCTAG TGAAATGTTA TCTCCTTCAA CTCAGCTCTT ATTTTATGAG 3660 

ACCTCAGCTT CTTTTAGTAC TGAAGTATTG CTACAACCTT CCTTTCAGGC TTCTGATGTT 3720 

GACACCTTGC TTAAAACTGT TCTTCCAGCT GTGCCCAGTG ATCCAATATT GGTTGAAACC 3780 

CCCAAAGTTG ATAAAATTAG TTCTACAATG TTGCATCTCA TTGTATCAAA TTCTGCTTCA 3840 

AGTGAAAACA TGCTGCACTC TACATCTGTA CCAGTTTTTG ATGTGTCGCC TACTTCTCAT 3900 

ATGCACTCTG CTTCACTTCA AGGTTTGACC ATTTCCTATG CAAGTGAGAA ATATGAACCA 3960 

GTTTTGTTAA AAAGTGAAAG TTCCCACCAA GTGGTACCTT CTTTGTACAG TAATGATGAG 4020 

TTGTTCCAAA CGGCCAATTT GGAGATTAAC CAGGCCCATC CCCCAAAAGG AAGGCATGTA 4080 

TTTGCTACAC CTGTTTTATC AATTGATGAA CCATTAAATA CACTAATAAA TAAGCTTATA 4140 

CATTCCGATG AAATTTTAAC CTCCACCAAA AGTTCTGTTA CTGGTAAGGT ATTTGCTGGT 4200 

ATTCCAACAG TTGCTTCTGA TACATTTGTA TCTACTGATC ATTCTGTTCC TATAGGAAAT 4260 

GGGCATGTTG CCATTACAGC TGTTTCTCCC CACAGAGATG GTTCTGTAAC CTCAACAAAG 4320 

TTGCTGTTTC CTTCTAAGGC AACTTCTGAG CTGAGTCATA GTGCCAAATC TGATGCCGGT 4380 

TTAGTGGGTG GTGGTGAAGA TGGTGACACT GATGATGATG GTGATGATGA TGATGATGAC 4440 

AGAGGTAGTG ATGGCTTATC CATTCATAAG TGTATGTCAT GCTCATCCTA TAGAGAATCA 4500 

CAGGAAAAGG TAATGAATGA TTCAGACACC CACGAAAACA GTCTTATGGA TCAGAATAAT 4560 

CCAATCTCAT ACTCACTATC TGAGAATTCT GAAGAAGATA ATAGAGTCAC AAGTGTATCC 4620 

TCAGACAGTC AAACTGGTAT GGACAGAAGT CCTGGTAAAT CACCATCAGC AAATGGGCTA 4680 

TCCCAAAAGC ACAATGATGG AAAAGAGGAA AATGACATTC AGACTGGTAG TGCTCTGCTT 4740 

CCTCTCAGCC CTGAATCTAA AGCATGGGCA GTTCTGACAA GTGATGAAGA AAGTGGATCA 4800 

GGGCAAGGTA CCTCAGATAG CCTTAATGAG AATGAGACTT CCACAGATTT CAGTTTTGCA 4860 

GACACTAATG AAAAAGATGC TGATGGGATC CTGGCAGCAG GTGACTCAGA AATAACTCCT 4920 

GGATTCCCAC AGTCCCCAAC ATCATCTGTT. ACTAGCGAGA ACTCAGAAGT GTTCCACGTT 4980 

TCAGAGGCAG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 5040 

GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 5100 

CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 5160 

TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 5220 

ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 5280 

CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGACACTGA AAGAGTTTTA CCAGGAAGTG 5340 

CAGAGCTGTA CTGTTGACTT AGGTATTACA GCAGACAGCT CCAACCACCC AGACAACAAG' 5400 

CACAAGAATC GATACATAAA TATCGTTGCC TATGATCATA GCAGGGTTAA GCTAGCACAG 5460 

CTTGCTGAAA AGGATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC 5520 

AACAGACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA AATCCACAGC TGAAGATTTC 5580 

TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTGGAG 5640 

AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCCGATG GGAGTGAGGA GTACGGGAAC 5700 

TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 5760 

CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA GACCCAGTGG ACGTGTGGTC S820 

ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 5880 

CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 5940 

CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG TGCTAGACAG TATGTTGCAG 6000 

CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT TAAAACACAT CCGTTCACAA 6060 

AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 6120 

GCCATACTTA GTAAAGAAAC TGAGGTGCTG GACAGTCATA TTCATGCCTA TGTTAATGCA 6180 

CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA AACAATTCCA GCTCCTGAGC 6240 

CAGTCAAATA TACAGCAGAG TGACTATTCT GCAGCCCTAA AGCAATGCAA GAGGGAAAAG 6300 

AATCGAACTT CTTCTATCAT CCCTGTGGAA AGATCAAGGG TTGGCATTTC ATCCCTGAGT 6360 

GGAGAAGGCA CAGACTACAT CAATGCCTCC TATATCATGG GCTATTACCA GAGCAATGAA 6420 

TTCATCATTA CCCAGCACCC TCTCCTTCAT ACCATCAAGG ATTTCTGGAG GATGATATGG 6480 

GACCATAATG CCCAACTGGT GGTTATGATT CCTGATGGCC AAAACATGGC AGAAGATGAA 6540 

TTTGTTTACT GGCCAAATAA AGATGAGCCT ATAAATTGTG AGAGCTTTAA GGTCACTCTT 6600 

ATGGCTGAAG AACACAAATG TCTATCTAAT GAGGAAAAAC TTATAATTCA GGACTTTATC 6660 

TTAGAAQCTA CACAGGATGA TTATGTACTT GAAGTGAGGC ACTTTCAGTG TCCTAAATGG 6720 

CCAAATCCAG ATAGCCCCAT TAGTAAAACT TTTGAACTTA TAAGTGTTAT AAAAGAAGAA 6780 

GCTGCCAATA GGGATGGGCC TATGATTGTT CATGATGAGC ATGGAGGAGT GACGGCAGGA 6840 

ACTTTCTGTG CTCTGACAAC CCTTATGCAC CAACTAGAAA AAGAAAATTC CGTGGATGTT 6900 

TACCAGGTAG CCAAGATGAT CAATCTGATG AGGCCAGGAG TCTTTGCTGA CATTGAGCAG 6960 

TATCAGTTTC TCTACAAAGT GATCCTCAGC CTTGTGAGCA CAAGGCAGGA AGAGAATCCA 7020 

TCCACCTCTC TGGACAGTAA TGGTGCAGCA TTGCCTGATG GAAATATAGC TGAGAGCTTA 7080 

GAGTCTTTAG TTTAACACAG AAAGGGGTGG GGGGACTCAC ATCTGAGCAT TGTTTTCCTC 7140 

TTCCTAAAAT TAGGCAGGAA AATCAGTCTA GTTCTGTTAT CTGTTGATTT CCCATCACCT 7200 

GACAGTAACT TTCATGACAT AGGATTCTGC CGCCAAATTT ATATCATTAA CAATGTGTGC 7260 

CTTTTTGCAA GACTTGTAAT TTACTTATTA TGTTTGAACT AAAATGATTG AATTTTACAG 7320 

TATTTCTAAG AATGGAATTG TGGTATTTTT TTCTGTATTG ATTTTAACAG AAAATTTCAA 7380 

TTTATAGAGG TTAGGAATTC CAAACTACAG AAAATGTTTG TTTTTAGTGT CAAATTTTTA 7440 

GCTGTATTTG TAGCAATTAT CAGGTTTGCT AGAAATATAA CTTTTAATAC AGTAGCCTGT 7500 

AAATAAAACA CTCTTCCATA TGATATTCAA CATTTTACAA CTGCAGTATT CACCTAAAGT 7560 

AGAAATAATC TGTTACTTAT TGTAAATACT GCCCTAGTGT CTCCATGGAC CAAATTTATA 7620 

TTTATAATTG TAGATTTTTA TATTTTACTA CTGAGTCAAG TTTTCTAGTT CTGTGTAATT 7680 

GTTTAGTTTA ATGACGTAGT TCATTAGCTG GTCTTACTCT ACCAGTTTTC TGACATTGTA 7740 

TTGTGTTACC TAAGTCATTA ACTTTGTTTC AGCATGTAAT TTTAACTTTT GTGGAAAATA 7800 

GAAATACCTT CATTTTGAAA GAAGTTTTTA TGAGAATAAC ACCTTACCAA ACATTGTTCA 7860 

AATGGTTTTT ATCCAAGGAA TTGCAAAAAT AAATATAAAT ATTGCCATTA AAAAAAAAAA 7920 

AAAAAAAAAA AAAAAAAAAA AAAA 7944 
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CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 1BO 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

5 AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

10 GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

15 ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

20 CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

25 AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

30 AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

35 GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

40 GTATACAATG CAGAGGCCAG TAATAGTAGC CATGAGTCTC GTATTGGTCT AGCTGAGGGG 2460 

TTGGAATCCG AGAAGAAGGC AGTTATACCC CTTGTGATCG TGTCAGCCCT GACTTTTATC 2520 

TGTCTAGTGG TTCTTGTGGG TATTCTCATC TACTGGAGGA AATGCTTCCA GACTGCACAC 2580 

TTTTACTTAG AGGACAGTAC ATCCCCTAGA GTTATATCCA CACCTCCAAC ACCTATCTTT 2640 

CCAATTTCAG ATGATGTCGG AGCAATTCCA ATAAAGCACT TTCCAAAGCA TGTTGCAGAT 2700 

45 TTACATGCAA GTAGTGGGTT TACTGAAGAA TTTGAGACAC TGAAAGAGTT TTACCAGGAA 2760 

GTGCAGAGCT GTACTGTTGA CTTAGGTATT ACAGCAGACA GCTCCAACCA CCCAGACAAC 2820 

AAGCACAAGA ATCGATACAT AAATATCGTT GCCTATGATC ATAGCAGGGT TAAGCTAGCA 2880 

CAGCTTGCTG AAAAGGATGG CAAACTGACT GATTATATCA ATGCCAATTA TGTTGATGGC 2940 

TACAACAGAC CAAAAGCTTA TATTGCTGCC CAAGGCCCAC TGAAATCCAC AGCTGAAGAT 3000 

50 TTCTGGAGAA TGATATGGGA ACATAATGTG GAAGTTATTG TCATGATAAC AAACCTCGTG 3060 

GAGAAAGGAA GGAGAAAATG TGATCAGTAC TGGCCTGCCG ATGGGAGTGA GGAGTACGGG 3120 

AACTTTCTGG TCACTCAGAA GAGTGTGCAA GTGCTTGCCT ATTATACTGT GAGGAATTTT 3180 

ACTCTAAGAA ACACAAAAAT AAAAAAGGGC TCCCAGAAAG GAAGACCCAG TGGACGTGTG 3240 

GTCACACAGT ATCACTACAC GCAGTGGCCT GACATGGGAG TACCAGAGTA CTCCCTGCCA 3300 

55 GTGCTGACCT TTGTGAGAAA GGCAGCCTAT GCCAAGCGCC ATGCAGTGGG GCCTGTTGTC 3360 

GTCCACTGCA GTGCTGGAGT TGGAAGAACA GGCACATATA TTGTGCTAGA CAGTATGTTG 3420 

CAGCAGATTC AACACGAAGG AACTGTCAAC ATATTTGGCT TCTTAAAACA CATCOGTTCA 3480 

CAAAGAAATT ATTTGGTACA AACTGAGGAG CAATATGTCT TCATTCATGA TACACTGGTT 3540 

GAGGCCATAC TTAGTAAAGA AACTGAGGTG CTGGACAGTC ATATTCATGC CTATGTTAAT 3600 

60 GCACTCCTCA TTCCTGGACC AGCAGGCAAA ACAAAGCTAG AGAAACAATT CCAGCTCCTG 3660 

AGCCAGTCAA AT AT ACAG CA GAGTGACTAT TCTGCAGCCC TAAAGCAATG CAACAGGGAA 3720 

AAGAATCGAA CTTCTTCTAT CATCCCTGTG GAAAGATCAA GGGTTGGCAT TTCATCCCTG 3780 

AGTGGAGAAG GCACAGACTA CATCAATGCC TCCTATATCA TGGGCTATTA CCAGAGCAAT 3840 

GAATTCATCA TTACCCAGCA CCCTCTCCTT CATACCATCA AGGATTTCTG GAGGATGATA 3900 

65 TGGGACCATA ATGCCCAACT GGTGGTTATG ATTCCTGATG GCCAAAACAT GGCAGAAGAT 3960 

GAATTTGTTT ACTGGCCAAA TAAAGATGAG CCTATAAATT GTGAGAGCTT TAAGGTCACT 4020 

CTTATGGCTG AAGAACACAA ATGTCTATCT AATGAGGAAA AACTTATAAT TCAGGACTTT 4080 

ATCTTAGAAG CTACACAGGA TGATTATGTA CTTGAAGTGA GGCACTTTCA GTGTCCTAAA 4140 

TGGCCAAATC CAGATAGCCC CATTAGTAAA ACTTTTGAAC TTATAAGTGT TATAAAAGAA 4200 

70 GAAGCTGCCA ATAGGGATGG GCCTATGATT GTTCATGATG AGCATGGAGG AGTGACGGCA 4260 

GGAACTTTCT GTGCTCTGAC AACCCTTATG CACCAACTAG AAAAAGAAAA TTCCGTGGAT 4320 

GTTTACCAGG TAGCCAAGAT GATCAATCTG ATGAGGCCAG GAGTCTTTGC TGACATTGAG 4380 

CAGTATCAGT TTCTCTACAA AGTGATCCTC AGCCTTGTGA GCACAAGGCA GGAAGAGAAT 4440 

CCATCCACCT CTCTGGACAG TAATGGTGCA GCATTGCCTG ATGGAAATAT AGCT GAGAG C 4500 

75 TTAGAGTCTT TAGTTTAACA CAGAAAGGGG TGGGGGGACT CACATCTGAG CATTGTTTTC 4560 

CTCTTCCTAA AATTAGGCAG GAAAATCAGT CTAGTTCTGT TATCTGTTGA TTTCCCATCA 4620 

CCTGACAGTA ACTTTCATGA CATAGGATTC TGCCGCCAAA TTTATATCAT TAACAATGTG 4680 

TGCCTTTTTG CAAGACTTGT AATTTACTTA TTATGTTTGA ACTAAAATGA TTGAATTTTA 4740 

CAGTATTTCT AAGAATGGAA TTGTGGTATT TTTTTCTGTA TTGATTTTAA CAGAAAATTT 4800 

80 CAATTTATAG AGGTTAGGAA TTCCAAACTA CAGAAAATGT TTGTTTTTAG TGTCAAATTT 4860 

TTAGCTGTAT TTGTAGCAAT TATCAGGTTT GCTAGAAATA TAACTTTTAA TACAGTAGCC 4920 

TGTAAATAAA ACACTCTTCC ATATGATATT CAACATTTTA CAACTGCAGT ATTCACCTAA 4980 

AGTAGAAATA ATCTGTTACT TATTGTAAAT ACTGCCCTAG TGTCTCCATG GACCAAATTT 5040 

ATATTTATAA TTGTAGATTT TTATATTTTA CTACTGAGTC AAGTTTTCTA GTTCTGTGTA 5100 
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ATTGTTTAGT TTAATGACGT AGTTCATTAG CTGGTCTTAC TCTACCAGTT TTCTGACATT SI 60 

GTATTGTGTT ACCTAAGTCA TTAACTTTGT TTCAGCATGT AATTTTAACT TTTGTGGAAA 5220 

ATAGAAATAC CTTCATTTTG AAAGAAGTTT TTATGAGAAT AACACCTTAC CAAACATTGT 5280 

TCAAATGGTT TTTATCCAAG GAATTGCAAA AATAAATATA AATATTGCCA TTAAAAAAAA 5340 

AAAAAAAAAA AAAAAAAAAA AAAAAAA 5367 

Seq ID NO: C99 DNA Sequence 

Nucleic Acid Accession # : Eos sequence 

Coding sequence: 501.. 4514 

1 11 21 31 41 51 

I I I I I I 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAT TGGGGAAAGA 300 

AATATCCAAC ATGTAATAGC CCAAAACAAT CTCCTATCAA TATTGATGAA GATCTTACAC 360 

AAGTAAATGT GAATCTTAAG AAACTTAAAT TTCAGGGTTG GGATAAAACA TCATTGGAAA 420 

ACACATTCAT TCATAACACT GGGAAAACAG TGGAAATTAA TCTCACTAAT GACTACCGTG 480 

TCAGCGGAGG AGTTTCAGAA ATGGTGTTTA AAGCAAGCAA GATAACTTTT CACTGGGGAA 540 

AATGCAATAT GTCATCTGAT GGATCAGAGC ATAGTTTAGA AGGACAAAAA TTTCCACTTG 600 

AGATGCAAAT CTACTGCTTT GATGCGGACC GATTTTCAAG TTTTGAGGAA GCAGTCAAAG 660 

GAAAAGGGAA GTTAAGAGCT TTATCCATTT TGTTTGAGGT TGGGACAGAA GAAAATTTGG 720 

ATTTCAAAGC GATTATTGAT GGAGTCGAAA GTGTTAGTCG TTTTGGGAAG CAGGCTGCTT 780 

TAGATCCATT CATACTGTTG AACCTTCTGC CAAACTCAAC TGACAAGTAT TACATTTACA 840 

ATGGCTCATT GACATCTCCT CCCTGCACAG ACACAGTTGA CTGGATTGTT TTTAAAGATA 900 

CAGTTAGCAT CTCTGAAAGC CAGTTGGCTG TTTTTTGTGA AGTTCTTACA ATGCAACAAT 960 

CTGGTTATGT CATGCTGATG GACTACTTAC AAAACAATTT TCGAGAGCAA CAGTACAAGT 1020 

TCTCTAGACA GGTGTTTTCC TCATACACTG GAAAGGAAGA GATTCATGAA GCAGTTTGTA 1080 

GTTCAGAACC AGAAAATGTT CAGGCTGACC CAGAGAATTA TACCAGCCTT CTTGTTACAT 1140 

GGGAAAGACC TCGAGTCGTT TATGATACCA TGATTGAGAA GTTTGCAGTT TTGTACCAGC 1200 

AGTTGGATGG AGAGGACCAA ACCAAGCATG AATTTTTGAC AGATGGCTAT CAAGACTTGG 1260 

GTGCTATTCT CAATAATTTG CTACCCAATA TGAGTTATGT TCTTCAGATA GTAGCCATAT 1320 

GCACTAATGG CTTATATGGA AAATACAGCG ACCAACTGAT TGTCGACATG CCTACTGATA 1380 

ATCCTGAACT TGATCTTTTC CCTGAATTAA TTGGAACTGA AGAAATAATC AAGGAGGAGG 1440 

AAGAGGGAAA AGACATTGAA GAAGGCGCTA TTGTGAATCC TGGTAGAGAC AGTGCTACAA 1500 

ACCAAATCAG GAAAAAGGAA CCCCAGATTT CTACCACAAC ACACTACAAT CGCATAGGGA 1560 

CGAAATACAA TGAAGCCAAG ACTAACCGAT CCCCAACAAG AGGAAGTGAA TTCTCTGGAA 1620 

AGGGTGATGT TCCCAATACA TCTTTAAATT CCACTTCCCA ACCAGTCACT AAATTAGCCA 1680 

CAGAAAAAGA TATTTCCTTG ACTTCTCAGA CTGTGACTGA ACTGCCACCT CACACTGTGG 1740 

AAGGTACTTC AGCCTCTTTA AATGATGGCT CTAAAACTGT TCTTAGATCT CCACATATGA 1800 

ACTTGTCGGG GACTGCAGAA TCCTTAAATA CAGTTTCTAT AACAGAATAT GAGGAGGAGA 1860 

GTTTATTGAC CAGTTTCAAG CTTGATACTG GAGCTGAAGA TTCTTCAGGC TCCAGTCCCG 1920 

CAACTTCTGC TATCCCATTC ATCTCTGAGA ACATATCCCA AGGGTATATA TTTTCCTCCG 1980 

AAAACCCAGA GACAATAACA TATGATGTCC TTATACCAGA ATCTGCTAGA AATGCTTCCG 2040 

AAGATTCAAC TTCATCAGGT TCAGAAGAAT CACTAAAGGA TCCTTCTATG GAGGGAAATG 2100 

TGTGGTTTCC TAGCTCTACA GACATAACAG CACAGCCOGA TGTTGGATCA GGCAGAGAGA 2160 

GCTTTCTCCA GACTAATTAC ACTGAGATAC GTGTTGATGA ATCTGAGAAG ACAACCAAGT 2220 

CCTTTTCTGC AGGCCCAGTG ATGTCACAGG GTCCCTCAGT TACAGATCTG GAAATGCCAC 2280 

ATTATTCTAC CTTTGCCTAC TTCCCAACTG AGGTAACACC TCATGCTTTT ACCCCATCCT 2340 

CCAGACAACA GGATTTGGTC TCCACGGTCA ACGTGGTATA CTCGCAGACA ACCCAACCGG 2400 

TATACAATGA GGCCAGTAAT AGTAGCCATG AGTCTOGTAT TGGTCTAGCT GAGGGGTTGG 2460 

AATCCGAGAA GAAGGCAGTT ATACCCCTTG TGATCGTGTC AGCCCTGACT TTTATCTGTC 2520 

TAGTGGTTCT TGTGGGTATT CTCATCTACT GGAGGAAATG CTTCCAGACT GCACACTTTT 2580 

ACTTAGAGGA CAGTACATCC CCTAGAGTTA TATCCACACC TCCAACACCT ATCTTTCCAA 2640 

TTTCAGATGA TGTCGGAGCA ATTCCAATAA AGCACTTTCC AAAGCATGTT GCAGATTTAC 2700 

ATGCAAGTAG TGGGTTTACT GAAGAATTTG AGACACTGAA AGAGTTTTAC CAGGAAGTGC 2760 

AGAGCTGTAC TGTTGACTTA GGTATTACAG CAGACAGCTC CAACCACCCA GACAACAAGC 2820 

ACAAGAATCG ATACATAAAT ATCGTTGCCT ATGATCATAG CAGGGTTAAG CTAGCACAGC 2880 

TTGCTGAAAA GGATGGCAAA CTGACTGATT ATATCAATGC CAATTATGTT GATGGCTACA 2940 

ACAGACCAAA AGCTTATATT GCTGCCCAAG GCCCACTGAA ATCCACAGCT GAAGATTTCT 3000 

GGAGAATGAT ATGGGAACAT AATGTGGAAG TTATTGTCAT GATAACAAAC CTCGTGGAGA 3060 

AAGGAAGGAG AAAATGTGAT CAGTACTGGC CTGCCGATGG GAGTGAGGAG TACGGGAACT 3120 

TTCTGGTCAC TCAGAAGAGT GTGCAAGTGC TTGCCTATTA TACTGTGAGG AATTTTACTC 3180 

TAAGAAACAC AAAAATAAAA AAGGGCTCCC AGAAAGGAAG ACCCAGTGGA OGTGTGGTCA 3240 

CACAGTATCA CTACACGCAG TGGCCTGACA TGGGAGTACC AGAGTACTCC CTGCCAGTGC 3300 

TGACCTTTGT GAGAAAGGCA GCCTATGCCA AGCGCCATGC AGTGGGGCCT GTTGTOGTCC 3360 

ACTGCAGTGC TGGAGTTGGA AGAACAGGCA CATATATTGT GCTAGACAGT ATGTTGCAGC 3420 

AGATTCAACA CGAAGGAACT GTCAACATAT TTGGCTTCTT AAAACACATC CGTTCACAAA 3480 

GAAATTATTT GGTACAAACT GAGGAGCAAT ATGTCTTCAT TCATGATACA CTGGTTGAGG 3540 

CCATACTTAG TAAAGAAACT GAGGTGCTGG ACAGTCATAT TCATGCCTAT GTTAATGCAC 3600 

TCCTCATTCC TGGACCAGCA GGCAAAACAA AGCTAGAGAA ACAATTCCAG CTCCTGAGCC 3660 

AGTCAAATAT ACAGCAGAGT GACTATTCTG CAGCCCTAAA GCAATGCAAC AGGGAAAAGA 3720 

ATCGAACTTC TTCTATCATC CCTGTGGAAA GATCAAGGGT TGGCATTTCA TCCCTGAGTG 3780 
GAGAAGGCAC AGACTACATC AATGCCTCCT ATATCATGGG CTATTACCAG AGCAATGAAT 3840 
TCATCATTAC CCAGCACCCT CTCCTTCATA CCATCAAGGA TTTCTGGAGG ATGATATGGG 3900 
ACCATAATGC CCAACTGGTG GTTATGATTC CTGATGGCCA AAACATGGCA GAAGATGAAT 3960 
TTGTTTACTG GCCAAATAAA GATGAGCCTA TAAATTGTGA GAGCTTTAAG GTCACTCTTA 4020 
TGGCTGAAGA ACACAAATGT CTATCTAATG AGGAAAAACT TATAATTCAG GACTTTATCT 4080 
TAGAAGCTAC ACAGGATGAT TATGTACTTG AAGTGAGGCA CTTTCAGTGT CCTAAATGGC 4140 
CAAATCCAGA TAGCCCCATT AGTAAAACTT TTGAACTTAT AAGTGTTATA AAAGAAGAAG 4200 
CTGCCAATAG GGATGGGCCT ATGATTGTTC ATGATGAGCA TGGAGGAGTG ACGGCAGGAA 4260 
CTTTCTGTGC TCTGACAACC CTTATGCACC AACTAGAAAA AGAAAATTCC GTGGATGTTT 4320 
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ACCAGGTAGC 
ATCAGTTTCT 
CCACCTCTCT 
A6TCTTTAGT 
TCCTAAAATT 
ACAGTAACTT 
TTTTTGCAAG 
ATTTCTAAGA 
TTATAGAGGT 
CTGTATTTGT 
AATAAAACAC 
GAAATAATCT 
TTATAATTGT 
TTTAGTTTAA 
TGTGTTACCT 
AAATACCTTC 
ATGGTTTTTA 
AAAAAAAAAA 



CAAGATGATC 
CTACAAAGTG 
GGACAGTAAT 
TTAACACAGA 
AGGCAGGAAA 
TCATGACATA 
ACTTGTAATT 
ATGGAATTGT 
TAGGAATTCC 
AGCAATTATC 
TCTTCCATAT 
GTTACTTATT 
AGATTTTTAT 
TGACGTAGTT 
AAGTCATTAA 
ATTTTGAAAG 
TCCAAGGAAT 
AAAAAAAAAA 



AATCTGATGA 
ATCCTCAGCC 
GGTGCAGCAT 
AAGGGGTGGG 
ATCAGTCTAG 
GGATTCTGCC 
TACTTATTAT 
GGTATTTTTT 
AAACTACAGA 
AGGTTTGCTA 
GATATTCAAC 
GTAAATACTG 
ATTTTACTAC 
CATTAGCTGG 
CTTTGTTTCA 
AAGTTTTTAT 
TGCAAAAATA 
AAA 



GGCCAGGAGT 
TTGTGAGCAC 
TGCCTGATGG 
GGGACTCACA 
TTCTGTTATC 
GCCAAATTTA 
GTTTGAACTA 
TCTGTATTGA 
AAATGTTTGT 
GAAATATAAC 
ATTTTACAAC 
CCCTAGTGTC 
TGAGTCAAGT 
TCTTACTCTA 
GCATGTAATT 
GAGAATAACA 
AATATAAATA 



CTTTGCTGAC 
AAGGCAGGAA 
AAATATAGCT 
TCTGAGCATT 
TGTTGATTTC 
TATCATTAAC 
AAATGATTGA 
TTTTAACAGA 
TTTTAGTGTC 
TTTTAATACA 
TGCAGTATTC 
TCCATGGACC 
TTTCTAGTTC 
CCAGTTTTCT 
TTAACTTTTG 
CCTTACCAAA 
TTGCCATTAA 



ATTGAGCAGT 
GAGAATCCAT 
GAGAGCTTAG 
GTTTTCCTCT 
CCATCACCTG 
AATGTGTGCC 
ATTTTACAGT 
AAATTTCAAT 
AAATTTTTAG 
GTAGCCTGTA 
ACCTAAAGTA 
AAATTTATAT 
TGTGTAATTG 
GACATTGTAT 
TGGAAAATAG 
CATTGTTCAA 
AAAAAAAAAA 



Seq ID NO j C100 DNA Sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding sequence : 148 . . 4362 



11 



31 



41 



51 



4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4960 
5040 
5100 
5160 
5220 
5280 
5340 
5363 



21 

! I I I 1 I 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 
CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 
CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AACGTTTCCT CGCTTGCATT 
CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 
CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 
AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 
CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 
AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATJA ATCTCACTAA TGACTACCGT 
GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 
AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 
GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 
GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 
GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 
TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 
AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 
ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTTGT GAAGTTCTTA CAATGCAACA 
ATCTGGTTAT GTCATGCTGA TGGACTACTT ACAAAACAAT TTTCGAGAGC AACAGTACAA 
GTTCTCTAGA CAGGTGTTTT CCTCATACAC TGGAAAGGAA GAGATTCATG AAGCAGTTTG 
TAGTTCAGAA CCAGAAAATG TTCAGGCTGA CCCAGAGAAT TATACCAGCC TTCTTGTTAC 
ATGGGAAAGA CCTOGAGTCG TTTATGATAC CATGATTGAG AAGTTTGCAG TTTTGTACCA 
GCAGTTGGAT GGAGAGGACC AAACCAAGCA TGAATTTTTG ACAGATGGCT ATCAAGACTT 
GGGTGCTATT CTCAATAATT TGCTACCCAA TATGAGTTAT GTTCTTCAGA TAGTAGCCAT 
ATGCACTAAT GGCTTATATG GAAAATACAG CGACCAACTG ATTGTCGACA TGCCTACTGA 
TAATCCTGAA CTTGATCTTT TCCCTGAATT AATTGGAACT GAAGAAATAA TCAAGGAGGA 
GGAAGAGGGA AAAGACATTG AAGAAGGCGC TATTGTGAAT CCTGGTAGAG ACAGTGCTAC 
AAACCAAATC AGGAAAAAGG AACCCCAGAT TTCTACCACA ACACACTACA ATCGCATAGG 
GACGAAATAC AATGAAGCCA AGACTAACGG ATCCCCAACA AGAGGAAGTG AATTCTCTGG 
AAAGGGTGAT GTTCOCAATA CATCTTTAAA TTCCACTTCC CAACCAGTCA CTAAATTAGC 
CACAGAAAAA GATATTTCCT TGACTTCTCA GACTGTGACT GAACTGCCAC CTCACACTGT 
GGAAGGTACT TCAGCCTCTT TAAATGATGG CTCTAAAACT GTTCTTAGAT CTCCACATAT 
GAACTTGTCG GGGACTGCAG AATCCTTAAA TACAGTTTCT ATAACAGAAT ATGAGGAGGA 
GAGTTTATTG ACCAGTTTCA AGCTTGATAC TGGAGCTGAA GATTCTTCAG GCTCCAGTCC 
CGCAACTTCT GCTATCCCAT TCATCTCTGA GAACATATCC CAAGGGTATA TATTTTCCTC 
CGAAAACCCA GAGACAATAA CATATGATGT CCTTATACCA GAATCTGCTA GAAATGCTTC 
CGAAGATTCA ACTTCATCAG GTTCAGAAGA ATCACTAAAG GATCCTTCTA TGGAGGGAAA 
TGTGTGGTTT CCTAGCTCTA CAGACATAAC AGCACAGCCC GATGTTGGAT CAGGCAGAGA 
GAGCTTTCTC CAGACTAATT ACACTGAGAT ACGTGTTGAT GAATCTGAGA AGACAACCAA 
GTCCTTTTCT GCAGGCCCAG TGATGTCACA GGGTCCCTCA GTTACAGATC TGGAAATGCC 
ACATTATTCT ACCTTTGCCT ACTTCCCAAC TGAGGTAACA CCTCATGCTT TTACCCCATC 
CTCCAGACAA CAGGATTTGG TCTCCACGGT CAACGTGGTA TACTCGCAGA CAACCCAACC 
GGTATACAAT GAGGCCAGTA ATAGTAGCCA TGAGTCTCGT ATTGGTCTAG CTGAGGGGTT 
GGAATCCGAG AAGAAGGCAG TTATACCCCT TGTGATGGTG TCAGCCCTGA CTTTTATCTG 
TCTAGTGGTT CTTGTGGGTA TTCTCATCTA CTGGAGGAAA TGCTTCCAGA CTGCACACTT 
TTACTTAGAG GACAGTACAT CCCCTAGAGT TATATCCACA CCTCCAACAC CTATCTTTCC 
AATTTCAGAT GATGTCGGAG CAATTCCAAT AAAGCACTTT CCAAAGCATG TTGCAGATTT 
ACATGCAAGT AGTGGGTTTA CTGAAGAATT TGAGACACTG AAAGAGTTTT ACCAGGAAGT 
GCAGAGCTGT ACTGTTGACT TAGGTATTAC AGCAGACAGC TCCAACCACC CAGACAACAA 
GCACAAGAAT CGATACATAA ATATCGTTGC CTATGATCAT AGCAGGGTTA AGCTAGCACA 
GCTTGCTGAA AGGATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC 
AACAGACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA AATCCACAGC TGAAGATTTC 
TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTGGAG 
AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCCGATG GGAGTGAGGA GTACGGGAAC 
TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 
CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA GACCCAGTGG ACGTGTGGTC 
ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 
CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 
CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG TGCTAGACAG TATGTTGCAG 
CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT TAAAACACAT CCGTTCACAA 
AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 
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180 
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360 
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600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 
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1800 
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1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 
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3060 

3120 
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3240 

3300 
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GCCATACTTA 
CTCCTCATTC 
CTGTCACCCA 
GGCTTAACTG 
TCAAATATAC 
CGAACTTCTT 
GAAGGCACAG 
ATCATTACCC 
CATAATGCCC 
GTTTACTGGC 
GCTGAAGAAC 
GAAGCTACAC 
AATCCAGATA 
GCCAATAGGG 
TTCTGTGCTC 
CAGGTAGCCA 
CAGTTTCTCT 
ACCTCTCTGG 
TCTTTAGTTT 
CTAAAATTAG 
AGTAACTTTC 
TTTGCAAGAC 
TTCTAAGAAT 
ATAGAGGTTA 
GTATTTGTAG 
TAAAACACTC 
AATAATCTGT 
ATAATTGTAG 
TAGTTTAATG 
TGTTACCTAA 
ATACCTTCAT 
GGTTTTTATC 
AAAAAAAAAA 



GTAAAGAAAC 
CTGGACCAGC 
GGCTGGAGTG 
ATCCTCCTAC 
AGCAGAGTGA 
CTATCATCCC 
ACTACATCAA 
AGCACCCTCT 
AACTGGTGGT 
CAAATAAAGA 
ACAAATGTCT 
AGGATGATTA 
GCCCCATTAG 
ATGGGCCTAT 
TGACAACCCT 
AGATGATCAA 
ACAAAGTGAT 
ACAGTAATGG 
AACACAGAAA 
GCAGGAAAAT 
ATGACATAGG 
TTGTAATTTA 
GGAATTGTGG 
GGAATTCCAA 
CAATTATCAG 
TTCCATATGA 
TACTTATTGT 
ATTTTTATAT 
ACGTAGTTCA 
GTCATTAACT 
TTTGAAAGAA 
CAAGGAATTG 
AAAAAAAAAA 



TGAGGTGCTG 
AGGCAAAACA 
CAGAGGCACA 
CTCAGCCTCC 
CTATTCTGCA 
TGTGGAAAGA 
TGCCTCCTAT 
CCTTCATACC 
TATGATTCCT 
TGAGCCTATA 
ATCTAATGAG 
TGTACTTGAA 
TAAAACTTTT 
GATTGTTCAT 
TATGCACCAA 
TCTGATGAGG 
CCTCAGCCTT 
TGCAGCATTG 

CAGTCTAGTT 
ATTCTGCCGC 
CTTATTATGT 
TATTTTTTTC 
ACTACAGAAA 
GTTTGCTAGA 
TATTCAACAT 
AAATACTGCC 
TTTACTACTG 
TTAGCTGGTC 
TTGTTTCAGC 
GTTTTTATGA 
CAAAAATAAA 
A 



GACAGTCATA 
AAGCTAGAGA 
ATCTCGGCTC 
CGAGTGGCTG 
GCCCTAAAGC 
TCAAGGGTTG 
ATCATGGGCT 
ATCAAGGATT 
GATGGCCAAA 
AATTGTGAGA 
GAAAAACTTA 
GTGAGGCACT 
GAACTTATAA 
GATGAGCATG 
CTAGAAAAAG 
CCAGGAGTCT 
GTGGGCACAA 
CCTGATGGAA 
GACTCACATC 
CTGTTATCTG 
CAAATTTATA 
TTGAACTAAA 
TGTATTGATT 
ATGTTTGTTT 
AATATAACTT 
TTTACAACTG 
CTAGTGTCTC 
AGTCAAGTTT 
TTACTCTACC 
ATGTAATTTT 
GAATAACACC 
TATAAATATT 



TTCATGCCTA 
AACAATTCCA 
ACTGCAACCT 
GGACTATACT 
AATGCAACAG 
GCATTTCATC 
ATTACCAGAG 
TCTGGAGGAT 
ACATGGCAGA 
GCTTTAAGGT 
TAATTCAGGA 
TTCAGTGTCC 
GTGTTATAAA 
GAGGAGTGAC 
AAAATTCCGT 
TTGCTGACAT 
GGCAGGAAGA 
ATATAGCTGA 
TGAGCATTGT 
TTGATTTCCC 
TCATTAACAA 
ATGATTGAAT 
TTAACAGAAA 
TTAGTGTCAA 
TTAATACAGT 
CAGTATTCAC 
CATGGACCAA 
TCTAGTTCTG 
AGTTTTCTGA 
AACTTTTGTG 
TTACCAAACA 
GCCATTAAAA 



TGTTAATGCA 
GGGTCTCACT 
TCCTCTCCCT 
CCTGAGCCAG 
GGAAAAGAAT 
CCTGAGTGGA 
CAATGAATTC 
GATATGGGAC 
AGATGAATTT 
CACTCTTATG 
CTTTATCTTA 
TAAATGGCCA 
AGAAGAAGCT 
GGCAGGAACT 
GGATGTTTAC 
TGAGCAGTAT 
GAATCCATCC 
GAGCTTAGAG 
TTTCCTCTTC 
ATCACCTGAC 
TGTGTGCCTT 
TTTACAGTAT 
ATTTCAATTT 
ATTTTTAGCT 
AGCCTGTAAA 
CTAAAGTAGA 
ATTTATATTT 
TGTAATTGTT 
CATTGTATTG 
GAAAATAGAA 
TTGTTCAAAT 
AAAAAAAAAA 



Seq ID NO: CI 01 DNA Sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding sequence: 1..3340 

1 11 21 31 41 

I I I I I 

ATGCGAATCC TAAAGCGTTT CCTCGCTTGC ATTCAGCTCC TCTGTGTTTG 
TGGGCTAATG GATACTACAG ACAACAGAGA AAACTTGTTG AAGAGATTGG 
ACAGGAGCAC TGAATCAAAA AAATTGGGGA AAGAAATATC CAACATGTAA 
CAATCTCCTA TCAATATTGA TGAAGATCTT ACACAAGTAA ATGTGAATCT 
AAATTTCAGG GTTGGGATAA AACATCATTG GAAAACACAT TCATTCATAA 
ACAGTGGAAA TTAATCTCAC TAATGACTAC CGTGTCAGCG GAGGAGTTTC 
TTTAAAGCAA GCAAGATAAC TTTTCACTGG GGAAAATGCA ATATGTCATC 
GAGCATAGTT TAGAAGGACA AAAATTTCCA CTTGAGATGC AAATCTACTG 
GACCGATTTT CAAGTTTTGA GGAAGCAGTC AAAGGAAAAG GGAAGTTAAG 
ATTTTGTTTG AGGTTGGGAC AGAAGAAAAT TTGGATTTCA AAGCGATTAT 
GAAAGTGTTA GTCGTTTTGG GAAGCAGGCT GCTTTAGATC CATTCATACT 
CTGCCAAACT CAACTGACAA GTATTACATT TACAATGGCT CATTGACATC 
ACAGACACAG TTGACTGGAT TGTTTTTAAA GATACAGTTA GCATCTCTGA 
GCTGTTTTTT GTGAAGTTCT TACAATGCAA CAATCTGGTT ATGTCATGCT 
TTACAAAACA ATTTTCGAGA GCAACAGTAC AAGTTCTCTA GACAGGTGTT 
ACTGGAAAGG AAGAGATTCA TGAAGCAGTT TGTAGTTCAG AACCAGAAAA 
GACCCAGAGA ATTATACCAG CCTTCTTGTT ACATGGGAAA GACCTCGAGT 
ACCATGATTG AGAAGTTTGC AGTTTTGTAC CAGCAGTTGG ATGGAGAGGA 
CATGAATTTT TGACAGATGG CTATCAAGAC TTGGGTGCTA TTCTCAATAA 
AATATGAGTT ATGTTCTTCA GATAGTAGCC ATATGCACTA ATGGCTTATA 
AGCGACCAAC TGATTGTCGA CATGCCTACT GATAATCCTG AGGCCAGTAA 
GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG GAATCCGAGA AGAAGGCAGT 
GTGATCGTGT CAGCCCTGAC TTTTATCTGT CTAGTGGTTC TTGTGGGTAT 
TGGAGGAAAT GCTTCCAGAC TGCACACTTT TACTTAGAGG ACAGTACATC 
ATATCCACAC CTCCAACACC TATCTTTCCA ATTTCAGATG ATGTCGGAGC 
AAGCACTTTC CAAAGCATGT TGCAGATTTA CATGCAAGTA GTGGGTTTAC 
GAGACACTGA AAGAGTTTTA CCAGGAAGTG CAGAGCTGTA CTGTTGACTT 
GCAGACAGCT CCAACCACCC AGACAACAAG CACAAGAATC GATACATAAA 
TATGATCATA GCAGGGTTAA GCTAGCACAG CTTGCTGAAA AGGATGGCAA 
TATATCAATG CCAATTATGT TGATGGCTAC AACAGACCAA AAGCTTATAT 
GGCCCACTGA AATCCACAGC TGAAGATTTC TGGAGAATGA TATGGGAACA 
GTTATTGTCA TGATAACAAA CCTCGTGGAG AAAGGAAGGA GAAAATGTGA 
CCTGCCGATG GGAGTGAGGA GTACGGGAAC TTTCTGGTCA CTCAGAAGAG 
CTTGCCTATT ATACTGTGAG GAATTTTACT CTAAGAAACA CAAAAATAAA 
CAGAAAGGAA GACCCAGTGG ACGTGTGGTC ACACAGTATC ACTACACGCA 
ATGGGAGTAC CAGAGTACTC CCTGCCAGTG CTGACCTTTG TGAGAAAGGC 
AAGCGCCATG CAGTGGGGCC TGTTGTCGTC CACTGCAGTG CTGGAGTTGG 
ACATATATTG TGCTAGACAG TATGTTGCAG CAGATTCAAC ACGAAGGAAC 
TTTGGCTTCT TAAAACACAT CCGTTCACAA AGAAATTATT TGGTACAAAC 
TATGTCTTCA TTCATGATAC ACTGGTTGAG GCCATACTTA GTAAAGAAAC 
GACAGTCATA TTCATGCCTA TGTTAATGCA CTCCTCATTC CTGGACCAGC 
AAGCTAGAGA AACAATTCCA GCTCCTGAGC CAGTCAAATA TACAGCAGAG 
GCAGCCCTAA AGCAATGCAA CAGGGAAAAG AATCGAACTT CTTCTATCAT 
AGATCAAGGG TTGGCATTTC ATCCCTGAGT GGAGAAGGCA CAGACTACAT 




AAGCCAGTTG 
GATGGACTAC 
TTCCTCATAC 
TGTTCAGGCT 
CGTTTATGAT 
CCAAACCAAG 
TTTGCTACCC 
TGGAAAATAC 
TAGTAGCCAT 
TATACCCCTT 
TCTCATCTAC 
CCCTAGAGTT 
AATTCCAATA 
TGAAGAATTT 
AGGTATTACA 
TATCGTTGCC 
ACTGACTGAT 
TGCTGCCCAA 
TAATGTGGAA 
TCAGTACTGG 
TGTGCAAGTG 
AAAGGGCTCC 
GTGGCCTGAC 
AGCCTATGCC 
AAGAACAGGC 
TGTCAACATA 
TGAGGAGCAA 
TGAGGTGCTG 
AGGCAAAACA 
TGACTATTCT 
CCCTGTGGAA 
CAATGCCTCC 



3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4 8 60 
4920 
4980 
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5160 
5220 
5280 
5340 
5400 
5460 
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60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 
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TATATCATGG GCTATTACCA GAGCAATGAA TTCATCATTA CTCAGCACCC TCTCCTTCAT 2700 

ACCATCAAGG ATTTCTGGAG GATGATATGG GACCATAATG CCCAACTGGT GGTTATGATT 2760 

CCTGATGGCC AAAACATGGC AGAAGATGAA TTTGTTTACT GGCCAAATAA AGATGAGCCT 2820 

ATAAATTGTG AGAGCTTTAA GGTCACTCTT ATGGCTGAAG AACACAAATG TCTATCTAAT 2880 

GAGGAAAAAC TTATAATTCA GGACTTTATC TTAGAAGCTA CACAGGATGA TTATGTACTT 2940 

GAAGTGAGGC ACTTTCAGTG TCCTAAATGG CCAAATCCAG ATAGCCCCAT TAGTAAAACT 3000 

TTTGAACTTA TAAGTGTTAT AAAAGAAGAA GCTGCCAATA GGGATGGGCC TATGATTGTT 3060 

CATGATGAGC ATGGAGGAGT GACGGCAGGA ACTTTCTGTG CTCTGACAAC CCTTATGCAC 3120 

CAACTAGAAA AAGAAAATTC CGTGGATGTT TACCAGGTAG CCAAGATGAT CAATCTGATG 3180 

AGGCCAGGAG TCTTTGCTGA CATTGAGCAG TATCAGTTTC TCTACAAAGT GATCCTCAGC 3240 

CTTGTGAGCA CAAGGCAGGA AGAGAATCCA TCTACCTCTC TGGACAGTAA TGGTGCAGCA 3300 

TTGCCTGATG GAAATATAGC TGAGAGCTTA GAGTCTTTAG 3340 

Seq ID NO: C102 DMA Sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding sequence: 1..4480 

1 11 21 31 41 51 

i i i i.i i 

ATGCGAATCC TAAAGCGTTT CCTCGCTTGC ATTCAGCTCC TCTGTGTTTG CCGCCTGGAT 60 

TGGGCTAATG GATACTACAG ACAACAGAGA AAACTTGTTG AAGAGATTGG CTGGTCCTAT 120 

ACAGGAGCAC TGAATCAAAA AAATTGGGGA AAGAAATATC CAACATGTAA TAGCCCAAAA 180 

CAATCTCCTA TCAATATTGA TGAAGATCTT ACACAAGTAA ATGTGAATCT TAAGAAACTT 240 

AAATTTCAGG GTTGGGATAA AACATCATTG GAAAACACAT TCATTCATAA CACTGGGAAA 300 

ACAGTGGAAA TTAATCTCAC TAATGACTAC CGTGTCAGCG GAGGAGTTTC AGAAATGGTG 360 

TTTAAAGCAA GCAAGATAAC TTTTCACTGG GGAAAATGCA ATATGTCATC TGATGGATCA 420 

GAGCATAGTT TAGAAGGACA AAAATTTCCA CTTGAGATGC AAATCTACTG CTTTGATGCA 480 

GACCGATTTT CAAGTTTTGA GGAAGCAGTC AAAGGAAAAG GGAAGTTAAG AGCTTTATCC 540 

ATTTTGTTTG AGGTTGGGAC AGAAGAAAAT TTGGATTTCA AAGCGATTAT TGATGGAGTC 600 

GAAAGTGTTA GTCGTTTTGG GAAGCAGGCT GCTTTAGATC CATTCATACT GTTGAACCTT 6 SO 

CTGCCAAACT CAACTGACAA GTATTACATT TACAATGGTT CATTGACATC TCCTCCCTGC 720 

ACAGACACAG TTGACTGGAT TGTTTTTAAA GATACAGTTA GCATCTCTGA AAGCCAGTTG 780 

GCTGTTTTTT GTGAAGTTCT TACAATGCAA CAATCTGGTT ATGTCATGCT GATGGACTAC 840 

TTACAAAACA ATTTTCGAGA GCAACAGTAC AAGTTCTCTA GACAGGTGTT TTCCTCATAC 900 

ACTGGAAAGG AAGAGATTCA TGAAGCAGTT TGTAGTTCAG AACCAGAAAA TGTTCAGGCT 960 

GACCCAGAGA ATTATACCAG CCTTCTTGTT ACATGGGAAA GACCTCGAGT CGTTTATGAT 1020 

ACCATGATTG AGAAGTTTGC AGTTTTGTAC CAGCAGTTGG ATGGAGAGGA CCAAACCAAG 1080 

CATGAATTTT TGACAGATGG CTATCAAGAC TTGGGTGCTA TTCTCAATAA TTTGCTACCC 1140 

AATATGAGTT ATGTTCTTCA GATAGTAGCC ATATGCACTA ATGGCTTATA TGGAAAATAC 1200 

AGCGACCAAC TGATTGTCGA CATGCCTACT GATAATCCTG AACTTGATCT TTTCCCTGAA 1260 

TTAATTGGAA CTGAAGAAAT AATCAAGGAG GAGGAAGAGG GAAAAGACAT TGAAGAAGGC 1320 

GCTATTGTGA ATCCTGGTAG AGACAGTGCT ACAAACCAAA TCAGGAAAAA GGAACCCCAG 1380 

ATTTCTACCA CAACACACTA CAATCGCATA GGGACGAAAT ACAATGAAGC CAAGACTAAC 1440 

CGATCCCCAA CAAGAGGAAG TGAATTCTCT GGAAAGGGTG ATGTTCCCAA TACATCTTTA 1500 

AATTCCACTT CCCAACCAGT CACTAAATTA GCCACAGAAA AAGATATTTC CTTGACTTCT 1560 

CAGACTGTGA CTGAACTGCC ACCTCACACT GTGGAAGGTA CTTCAGCCTC TTTAAATGAT 1620 

GGCTCTAAAA CTGTTCTTAG ATCTCCACAT ATGAACTTGT CGGGGACTGC AGAATCCTTA 1680 
AATACAGTTT CTATAACAGA ATATGAGGAG GAGAGTTTAT TGACCAGTTT CAAGCTTGAT 1740 
ACTGGAGCTG AAGATTCTTC AGGCTCCAGT CCCGCAACTT CTGCTATCCC ATTCATCTCT 1800 
GAGAACATAT CCCAAGGGTA TATATTTTCC TCCGAAAACC CAGAGACAAT AACATATGAT 1860 
GTCCTTATAC CAGAATCTGC TAGAAATGCT TCCGAAGATT CAACTTCATC AGGTTCAGAA 1920 
GAATCACTAA AGGATCCTTC TATGGAGGGA AATGTGTGGT TTCCTAGCTC TACAGACATA 1980 
ACAGCACAGC CCGATGTTGG ATCAGGCAGA GAGAGCTTTC TCCAGACTAA TTACACTGAG 2040 
ATACGTGTTG ATGAATCTGA GAAGACAACC AAGTCCTTTT CTGCAGGCCC AGTGATGTCA 2100 
CAGGGTCCCT CAGTTACAGA TCTGGAAATG CCACATTATT CTACCTTTGC CTACTTCCCA 2160 
ACTGAGGTAA CACCTCATGC TTTTATCCCA TCCTCCAGAC AACAGGATTT GGTCTCCACG 2220 
GTCAACGTGG TATACTCGCA GACAACCCAA CCGGTATACA ATGAGGCCAG TAATAGTAGC 2280 
CATGAGTCTC GTATTGGTCT AGCTGAGGGG TTGGAATCCG AGAAGAAGGC AGTTATACCC 2340 
CTTGTGATGG TGTCAGCCCT GACTTTTATC TGTCTAGTGG TTCTTGTGGG TATTCTCATC 2400 
TACTGGAGGA AATGCTTCCA GACTGCACAC TTTTACTTAG AGGACAGTAC ATCCCCTAGA 2460 
GTTATATCCA CACCTCCAAC ACCTATCTTT CCAATTTCAG ATGATGTCGG AGCAATTCCA 2520 
ATAAAGCACT TTCCAAAGCA TGTTGCAGAT TTACATGCAA GTAGTGGGTT TACTGAAGAA 2580 
TTTGAGACAC TGAAAGAGTT TTACCAGGAA GTGCAGAGCT GTACTGTTGA CTTAGGTATT 2640 
ACAGCAGACA GCTCCAACCA CCCAGACAAC AAGCACAAGA ATCGATACAT AAATATCGTT 2700 
GCCCATGATC ATAGCAGGGT TAAGCTAGCA CAGCTTGCTG AAAAGGATGG CAAACTGACT 2760 
GATTATATCA ATGCCAATTA TGTTGATGGC TACAACAGAC CAAAAGCTTA TATTGCTGCC 2820 
CAAGGCCCAC TGAAATCCAC AGCTGAAGAT TTCTGGAGAA TGATATGGGA ACATAATGTG 2880 
GAAGTTATTG TCATGATAAC AAACCTCGTG GAGAAAGGAA GGAGAAAATG TGATCAGTAC 2940 
TGGCCTGCCG ATGGGAGTGA GGAGTACGGG AACTTTCTGG TCACTCAGAA GAGTGTGCAA 3000 
GTGCTTGCCT ATTATACTGT GAGGAATTTT ACTCTAAGAA ACACAAAAAT AAAAAAGGGC 3060 
TCCCAGAAAG GAAGACCCAG TGGACGTGTG GTCACACAGT ATCACTACAC GCAGTGGCCT 3120 
GACATGGGAG TACCAGAGTA CTCCCTGCCA GTGCTGACCT TTGTGAGAAA GGCAGCCTAT 3180 
GCCAAGCGCC ATGCAGTGGG GCCTGTTGTC GTCCACTGCA GTGCTGGAGT TGGAAGAACA 3240 
GGCACATATA TTGTGCTAGA CAGTATGTTG CAGCAGATTC AACACGAAGG AACTGTCAAC 3300 
ATATTTGGCT TCTTAAAACA CATCCGTTCA CAAAGAAATT ATTTGGTACA AACTGAGGAG 3360 
CAATATGTCT TCATTCATGA TACACTGGTT GAGGCCATAC TTAGTAAAGA AACTGAGGTG 3420 
CTGGACAGTC ATATTCATGC CTATGTTAAT GCACTCCTCA TTCCTGGACC AGCAGGCAAA 3480 
ACAAAGCTAG AGAAACAATT CCAGGGTCTC ACTCTGTCAC CCAGGCTGGA GTGCAGAGGC 3S40 
ACAATCTCGG CTCACTGCAA CCTTCCTCTC CCTGGCTTAA CTGATCCTCC TACCTCAGCC 3600 
TCCCGAGTGG CTAGGACTAT ACTCCTGAGC CAGTCAAATA TACAGCAGAG TGACTATTCT 3660 
GCAGCCCTAA AGCAATGCAA CAGGGAAAAG AATCGAACTT CTTCTATCAT CCCTGTGGAA 3720 
AGATCAAGGG TTGGCATTTC ATCCCTGAGT GGAGAAGGCA CAGACTACAT CAATGCCTCC 3780 
TATATCATGG GCTATTACCA GAGCAATGAA TTCATCATTA CCCAGCACCC TCTCCTTCAT 3840 
ACCATCAAGG ATTTCTGGAG GATGATATGG GACCATAATG CCCAACTGGT GGTTATGATT 3900 
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CCTGATGGCC AAAACATGGC AGAAGATGAA TTTGTTTACT GGCCAAATAA AGATGAGCCT 3960 

ATAAATTGTG AGAGCTTTAA GGTCACTCTT ATGGCTGAAG AACACAAATG TCTATCTAAT 4020 

GAGGAAAAAC TTATAATTCA GGACTTTATC TTAGAAGCTA CACAGGATGA TTATGTACTT 4080 

GAAGTGAGGC ACTTTCAGTG TCCTAAATGG CCAAATCCAG ATAGCCCCAT TAGTAAAACT 4140 

TTTGAACTTA TAAGTGTTAT AAAAGAAGAA GCTGCCAATA GGGATGGGCC TATGATTGTT 4200 

CATGATGAGC ATGGAGGAGT GACGGCAGGA ACTTTCTGTG CTCTGACAAC CCTTATGCAC 4260 

CAACTAGAAA AAGAAAATTC CGTGGATGTT TACCAGGTAG CCAAGATGAT CAATCTGATG 4320 

AGGCCAGGAG TCTTTGCTGA CATTGAGCAG TATCAGTTTC TCTACAAAGT GATCCTCAGC 4360 

CTTGTGAGCA CAAGGCAGGA AGAGAATCCA TCCACCTCTC TGGACAGTAA TGGTGCAGCA 4440 

TTGCCTGATG GAAATATAGC TGAGAGCTTA GAGTCTTTAG 4480 



Seq ID NO: C103 DNA Sequence 

Nucleic Acid Accession fl: Eos sequence 

Coding sequence: 1..4220 



ATGCGAATCC 
TGGGCTAATG 
ACAGGAGCAC 
CAATCTCCTA 
AAATTTCAGG 
ACAGTGGAAA 
TTTAAAGCAA 
GAGCATAGTT 
GACCGATTTT 
ATTTTGTTTG 
GAAAGTGTTA 
CTGCCAAACT 
ACAGACACAG 
GCTGTTTTTT 
TTACAAAACA 
ACTGGAAAGG 
GACCCAGAGA 
ACCATGATTG 
CATGAATTTT 
AATATGAGTT 
AGCGACCAAC 
TTAATTGGAA 
GCTATTGTGA 
ATTTCTACCA 
CGATCCCCAA 
AATTCCACTT 
CAGACTGTGA 
GGCTCTAAAA 
AATACAGTTT 
ACTGGAGCTG 
GAGAACATAT 
GTCCTTATAC 
GAATCACTAA 
ACAGCACAGC 
ATACGTGTTG 
CAGGGTCCCT 
ACTGAGGTAA 
GTCAACGTGG 
CATGAGTCTC 
CTTGTGATCG 
TACTGGAGGA 
GTTATATCCA 
ATAAAGCACT 
TTTGAGACAC 
ACAGCAGACA 
GCCTATGATC 
GATTATATCA 
CAAGGCCCAC 
GAAGTTATTG 
TGGCCTGCCG 
GTGCTTGCCT 
TCCCAGAAAG 
GACATGGGAG 
GCCAAGCGCC 
GGCACATATA 
ATATTTGGCT 
CAATATGTCT 
CTGGACAGTC 
ACAAAGCTAG 
TCTGCAGCCC 
GAAAGATCAA 
TCCTATATCA 
CATACCATCA 
ATTCCTGATG 
CCTATAAATT 
AATGAGGAAA 
GACGGCAGGA 



11 
I 

TAAAGCGTTT 
GATACTACAG 
TGAATCAAAA 
TCAATATTGA 
GTTGGGATAA 
TTAATCTCAC 
GCAAGATAAC 
TAGAAGGACA 
CAAGTTTTGA 
AGGTTGGGAC 
GTCGTTTTGG 
CAACTGACAA 
TTGACTGGAT 
GTGAAGTTCT 
ATTTTCGAGA 
AAGAGATTCA 
ATTATACCAG 
AGAAGTTTGC 
TGACAGATGG 
ATGTTCTTCA 
TGATTGTCGA 
CTGAAGAAAT 
ATCCTGGTAG 
CAACACACTA 
CAAGAGGAAG 
CCCAACCAGT 
CTGAACTGCC 
CTGTTCTTAG 
CTATAACAGA 
AAGATTCTTC 
CCCAAGGGTA 
CAGAATCTGC 
AGGATCCTTC 
COGATGTTGG 
ATGAATCTGA 
CAGTTACAGA 
CACCTCATGC 
TATACTCGCA 
GTATTGGTCT 
TGTCAGCCCT 
AATGCTTCCA 
CACCTCCAAC 
TTCCAAAGCA 
TGAAAGAGTT 
GCTCCAACCA 
ATAGCAGGGT 
ATGCCAATTA 
TGAAATCCAC 
TCATGATAAC 
ATGGGAGTGA 
ATTATACTGT 
GAAGACCCAG 
TACCAGAGTA 
ATGCAGTGGG 
TTGTGCTAGA 
TCTTAAAACA 
TCATTCATGA 
ATATTCATGC 
AGAAACAATT 
TAAAGCAATG 
GGGTTGGCAT 
TGGGCTATTA 
AGGATTTCTG 
GCCAAAACAT 
GTGAGAGCTT 
AACTTATAAT 
ACTTTCTGTG 



21 
I 

CCTCGCTTGC 
ACAACAGAGA 
AAATTGGGGA 
TGAAGATCTT 
AACATCATTG 
TAATGACTAC 
TTTTCACTGG 
AAAATTTCCA 
GGAAGCAGTC 
AGAAGAAAAT 
GAAGCAGGCT 
GTATTACATT 
TGTTTTTAAA 
TACAATGCAA 
GCAACAGTAC 
TGAAGCAGTT 
CCTTCTTGTT 
AGTTTTGTAC 
CTATCAAGAC 
GATAGTAGCC 
CATGCCTACT 
AATCAAGGAG 
AGACAGTGCT 
CAATCGCATA 
TGAATTCTCT 
CACTAAATTA 
ACCTCACACT 
ATCTCCACAT 
ATATGAGGAG 
AGGCTCCAGT 
TATATTTTCC 
TAGAAATGCT 
TATGGAGGGA 
ATCAGGCAGA 
GAAGACAACC 
TCTGGAAATG 
TTTTACCCCA 
GACAACCCAA 
AGCTGAGGGG 
GACTTTTATC 
GACTGCACAC 
ACCTATCTTT 
TGTTGCAGAT 
TTACCAGGAA 
CCCAGACAAC 
TAAGCTAGCA 
TGTTGATGGC 
AGCTGAAGAT 
AAACCTCGTG 
GGAGTACGGG 
GAGGAATTTT 
TGGACGTGTG 
CTCCCTGCCA 
GCCTGTTGTC 
CAGTATGTTG 
CATCCGTTCA 
TACACTGGTT 
CTATGTTAAT 
CCAGCTCCTG 
CAACAGGGAA 
TTCATCCCTG 
CCAGAGCAAT 
GAGGATGATA 
GGCAGAAGAT 
TAAGGTCACT 
TCAGGACTTT 
CTCTGACAAC 



31 



ATTCAGCTCC 
AAACTTGTTG 
AAGAAATATC 
ACACAAGTAA 
GAAAACACAT 
CGTGTCAGCG 
GGAAAATGCA 
CTTGAGATGC 
AAAGGAAAAG 
TTGGATTTCA 
GCTTTAGATC 
TACAATGGCT 
GATACAGTTA 
CAATCTGGTT 
AAGTTCTCTA 
TGTAGTTCAG 
ACATGGGAAA 
CAGCAGTTGG 
TTGGGTGCTA 
ATATGCACTA 
GATAATCCTG 
GAGGAAGAGG 
ACAAACCAAA 
GGGACGAAAT 
GGAAAGGGTG 
GCCACAGAAA 
GTGGAAGGTA 
ATGAACTTGT 
GAGAGTTTAT 
CCCGCAACTT 
TCCGAAAACC 
TCCGAAGATT 
AATGTGTGGT 
GAGAGCTTTC 
AAGTCCTTTT 
CCACATTATT 
TCCTCCAGAC 
CCGGTATACA 
TTGGAATCCG 
TGTCTAGTGG 
TTTTACTTAG 
CCAATTTCAG 
TTACATGCAA 
GTGCAGAGCT 
AAGCACAAGA 
CAGCTTGCTG 
TACAACAGAC 
TTCTGGAGAA 
GAGAAAGGAA 
AACTTTCTGG 
ACTCTAAGAA 
GTCACACAGT 
GTGCTGACCT 
GTCCACTGCA 
CAGCAGATTC 
CAAAGAAATT 
GAGGCCATAC 
GCACTCCTCA 
AGCCAGTCAA 
AAGAATCGAA 
AGTGGAGAAG 
GAATTCATCA 
TGGGACCATA 
GAATTTGTTT 
CTTATGGCTG 
ATCTTAGAAG 
CCTTATGCAC 



41 
I 

TCTGTGTTTG 
AAGAGATTGG 
CAACATGTAA 
ATGTGAATCT 
TCATTCATAA 
GAGGAGTTTC 
ATATGTCATC 
AAATCTACTG 
GGAAGTTAAG 
AAGCGATTAT 
CATTCATACT 
CATTGACATC 
GCATCTCTGA 
ATGTCATGCT 
GACAGGTGTT 
AACCAGAAAA 
GACCTCGAGT 
ATGGAGAGGA 
TTCTCAATAA 
ATGGCTTATA 
AACTTGATCT 
GAAAACACAT 
TCAGGAAAAA 
ACAATGAAGC 
ATGTTCCCAA 
AAGATATTTC 
CTTCAGCCTC 
CGGGGACTGC 
TGACCAGTTT 
CTGCTATCCC 
CAGAGACAAT 
CAACTTCATC 
TTCCTAGCTC 
TCCAGACTAA 
CTGCAGGCCC 
CTACCTTTGC 
AACAGGATTT 
ATGAGGCCAG 
AGAAGAAGGC 
TTCTTGTGGG 
AGGACAGTAC 
ATGATGTCGG 
GTAGTGGGTT 
GTACTGTTGA 
ATCGATACAT 
AAAAGGATGG 
CAAAAGCTTA 
TGATATGGGA 
GGAGAAAATG 
TCACTCAGAA 
ACACAAAAAT 
ATCACTACAC 
TTGTGAGAAA 
GTGCTGGAGT 
AACACGAAGG 
ATTTGGTACA 
TTAGTAAAGA 
TTCCTGGACC 
ATATACAGCA 
CTTCTTCTAT 
GCACAGACTA 
TTACCCAGCA 
ATGCCCAACT 
ACTGGCCAAA 
AAGAACACAA 
CTACACAGGC 
CAACTAGAAA 



51 



CCGCCTGGAT 
CTGGTCCTAT 
TAGCCCAAAA 
TAAGAAACTT 
CACTGGGAAA 
AGAAATGGTG 
TGATGGATCA 
CTTTGATGOG 
AGCTTTATCC 
TGATGGAGTC 
GTTGAACCTT 
TCCTCCCTGC 
AAGCCAGTTG 
GATGGACTAC 
TTCCTCATAC 
TGTTCAGGCT 
CGTTTATGAT 
CCAAACCAAG 
TTTGCTACCC 
TGGAAAATAC 
TTTCCCTGAA 
TGAAGAAGGC 
GGAACCCCAG 
CAAGACTAAC 
TACATCTTTA 
CTTGACTTCT 
TTTAAATGAT 
AGAATCCTTA 
CAAGCTTGAT 
ATTCATCTCT 
AACATATGAT 
AGGTTCAGAA 
TACAGACATA 
TTACACTGAG 
AGTGATGTCA 
CTACTTCCCA 
GGTCTCCACG 
TAATAGTAGC 
AGTTATACCC 
TATTCTCATC 
ATCCCCTAGA 
AGCAATTCCA 
TACTGAAGAA 
CTTAGGTATT 
AAATATCGTT 
CAAACTGACT 
TATTGCTGCC 
ACATAATGTG 
TGATCAGTAC 
GAGTGTGCAA 
AAAAAAGGGC 
GCAGTGGCCT 
GGCAGCCTAT 
TGGAAGAACA 
AACTGTCAAC 
AACTGAGGAG 
AACTGAGGTG 
AGCAGGCAAA 
GAGTGACTAT 
CATCCCTGTG 
CATCAATGCC 




60 

120 

180 

240 

300 

360 
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480 
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600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 
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3780 

3840 

3900 

3960 
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5 

10 



CGTGGATGTT TACCAGGTAG CCAAGATGAT CAATCTGATG AGGCCAGGAG TCTTTGCTGA 4080 

CATTGAGCAG TATCAGTTTC TCTACAAAGT GATCCTCAGC CTTGTGAGCA CAAGGCAGGA 4140 

AGAGAATCCA TCCACCTCTC TGGACAGTAA TGGTGCAGCA TTGCCTGATG GAAATATAGC 4200 

TGAGAGCTTA GAGTCTTTAG 4220 

Seq ID NO: C104 DNA Sequence 

Nucleic Acid Accession fl: XMJJ02914.6 

Coding sequencer 1..4314 



1 11 - 21 31 41 51 

I I I I I I 

ATGAAGGATA TCGACATAGG AAAAGAGTAT ATCATCCCCA GTCCTGGGTA TAGAAGTGTG 60 

AGGGAGAGAA CCAGCACTTC TGGGACGCAC AGAGACOGTG AAGATTCCAA GTTCAGGAGA 120 

ACTCGACCGT TGGAATGCCA AGATGCCTTG GAAACAGCAG CCCGAGCCGA GGGCCTCTCT 1B0 

CTTGATGCCT CCATGCATTC TCAGCTCAGA ATCCTGGATG AGGAGCATCC CAAGGGAAAG 240 

TACCATCATG GCTTGAGTGC TCTGAAGCCC ATCCGGACTA CTTCCAAACA CCAGCACCCA 300 

GTGGACAATG CTGGGCTTTT TTCCTGTATG ACTTTTTCGT GGCTTTCTTC TCTGGCCCGT 360 

GTGGCCCACA AGAAGGGGGA GCTCTCAATG GAAGACGTGT GGTCTCTGTC CAAGCACGAG 420 

TCTTCTGACG TGAACTGCAG AAGACTAGAG AGACTGTGGC AAGAAGAGCT GAATGAAGTT 480 

GGGCCAGACG CTGCTTCCCT GCGAAGGGTT GTGTGGATCT TCTGCCGCAC CAGGCTCATC 540 

CTGTCCATCG TGTGCCTGAT GATCACGCAG CTGGCTGGCT TCAGTGGACC AGCCTTCATG 600 

GTGAAACACC TCTTGGAGTA TACCCAGGCA ACAGAGTCTA ACCTGCAGTA CAGCTTGTTG 660 

TTAGTGCTGG GCCTCCTCCT GACGGAAATC GTGOGGTCTT GGTCGCTTGC ACTGACTTGG 720 

GCATTGAATT ACCGAACCGG TGTCCGCTTG CGGGGGGCCA TCCTAACCAT GGCATTTAAG 780 

AAGATCCTTA AGTTAAAGAA CATTAAAGAG AAATCCCTGG GTGAGCTCAT CAACATTTGC 840 

TCCAACGATG GGCAGAGAAT GTTTGAGGGA GCAGCCGTTG GCAGCCTGCT GGCTGGAGGA 900 

CCCGTTGTTG CCATCTTAGG CATGATTTAT AATGTAATTA TTCTGGGACC AACAGGCTTC 960 

CTGGGATCAG CTGTTTTTAT CCTCTTTTAC CCAGCAATGA TGTTTGCATC ACGGCTCACA 1020 

GCATATTTCA GGAGAAAATG CGTGGCCGCC ACGGATGAAC GTGTCCAGAA GATGAATGAA 1080 

GTTCTTACTT ACATTAAATT TATCAAAATG TATGCCTGGG TCAAAGCATT TTCTCAGAGT 1140 

GTTCAAAAAA TCCGCGAGGA GGAGCGTCGG ATATTGGAAA AAGCTGGGTA CTTCCAGAGC 1200 

ATCACTGTGG GTGTGGCTCC CATTGTGGTG GTGATTGCCA GCGTGGTGAC CTTCTCTGTT 1260 

CATATGACCC TGGGCTTCGA TCTGACAGCA GCACAGGCTT TCACAGTGGT GACAGTCTTC 1320 

AATTCCATGA CTTTTGCTTT GAAAGTAACA CCGTTTTCAG TAAAGTCCCT CTCAGAAGCC 1380 

TCAGTGGCTG TTGACAGATT TAAGAGTTTG TTTCTAATGG AAGAGGTTCA CATGATAAAG 1440 

AACAAACCAG CCAGTCCTCA CATCAAGATA GAGATGAAAA ATGCCACCTT GGCATGGGAC 1500 

TCCTCCCACT CCAGTATCCA GAACTCGCCC AAGCTGACCC CCAAAATGAA AAAAGACAAG 1560 

AGGGCTTCCA GGGGCAAGAA AGAGAAGGTG AGGCAGCTGC AGCGCACTGA GCATCAGGCG 1620 

GTGCTGGCAG AGCAGAAAGG CCACCTCCTC CTGGACAGTG ACGAGCGGCC CAGTCCCGAA 1680 

GAGGAAGAAG GCAAGCACAT CCACCTGGGC CACCTGCGCT TACAGAGGAC ACTGCACAGC 1740 

ATCGATCTGG AGATCCAAGA GGGTAAACTG GTTGGAATCT GCGGCAGTGT GGGAAGTGGA 1800 

AAAACCTCTC TCATTTCAGC CATTTTAGGC CAGATGACGC TTCTAGAGGG CAGCATTGCA 1860 

ATCAGTGGAA CCTTCGCTTA TGTGGCCCAG CAGGCCTGGA TCCTCAATGC TACTCTGAGA 1920 

GACAACATCC TGTTTGGGAA GGAATATGAT GAAGAAAGAT ACAACTCTGT GCTGAACAGC 1980 

TGCTGCCTGA GGCCTGACCT GGCCATTCTT CCCAGCAGCG ACCTGACGGA GATTGGAGAG 2040 

CGAGGAGCCA ACCTGAGCGG TGGGCAGCGC CAGAGGATCA GCCTTGCCCG GGCCTTGTAT 2100 

AGTGACAGGA GCATCTACAT CCTGGACGAC CCCCTCAGTG CCTTAGATGC CCATGTGGGC 2160 

AACCACATCT TCAATAGTGC TATCCGGAAA CATCTCAAGT CCAAGACAGT TCTGTTTGTT 2220 

ACCCACCAGT TACAGTACCT GGTTGACTGT GATGAAGTGA TCTTCATGAA AGAGGGCTGT 2280 

ATTAOGGAAA GAGGCACCCA TGAGGAACTG ATGAATTTAA ATGGTGACTA TGCTACCATT 2340 

TTTAATAACC TGTTGCTGGG AGAGACACCG CCAGTTGAGA TCAATTCAAA AAAGGAAACC 2400 

AGTGGTTCAC AGAAGAAGTC ACAAGACAAG GGTCCTAAAA CAGGATCAGT AAAGAAGGAA 2460 

AAAGCAGTAA AGCCAGAGGA AGGGCAGCTT GTGCAGCTGG AAGAAAAAGG GCAGGGTTCA 2520 

GTGCCCTGGT CAGTATATGG TGTCTACATC CAGGCTGCTG GGGGCCCCTT GGCATTCCTG 2580 

GTTATTATGG CCCTTTTCAT GCTGAATGTA GGCAGCACCG CCTTCAGCAC CT UU TUUTT U 2640 

AGTTACTGGA TCAAGCAAGG AAGCGGGAAC ACCACTGTGA CTCGAGGGAA CGAGACCTCG 2700 

GTGAGTGACA GCATGAAGGA CAATCCTCAT ATGCAGTACT ATGCCAGCAT CTACGCCCTC 2760 

TCCATGGCAG TCATGCTGAT CCTGAAAGCC ATTCGAGGAG TTGTCTTTGT CAAGGGCACG 2820 

CTGCGAGCTT CCTCCCGGCT GCATGACGAG CTTTTCCGAA GGATCCTTCG AAGCCCTATG 2880 

AAGTTTTTTG ACACGACCCC CACAGGGAGG ATTCTCAACA GGTTTTCCAA AGACATGGAT 2940 

GAAGTTGACG TGCGGCTGCC GTTCCAGGCC GAGATGTTCA TCCAGAACGT TATCCTGGTG 3000 

TTCTTCTGTG TGGGAATGAT CGCAGGAGTC TTCCCGTGGT TCCTTGTGGC AGTGGGGCCC 3060 

CTTGTCATCC TCTTTTCAGT CCTGCACATT GTCTCCAGGG TCCTGATTCG GGAGCTGAAG 3120 

CGTCTGGACA ATATCACGCA GTCACCTTTC CTCTCCCACA TCACGTCCAG CATACAGGGC 3180 

CTTGCCACCA TCCACGCCTA CAATAAAGGG CAGGAGTTTC TGCACAGATA CCAGGAGCTG 3240 

CTGGATGACA ACCAAGCTCC TTTTTTTTTG TTTACGTGTG CGATGCGGTG GCTGGCTGTG 3300 

CGGCTGGACC TCATCAGCAT CGCCCTCATC ACCACCACGG GGCTGATGAT CGTTCTTATG 3360 

CACGGGCAGA TTCCCCCAGC CTATGCGGGT CTCGCCATCT CTTATGCTGT CCAGTTAACG 3420 

GGGCTGTTCC AGTTTACGGT CAGACTGGCA TCTGAGACAG AAGCTCGATT CACCTCGGTG 3480 

GAGAGGATCA ATCACTACAT TAAGACTCTG TCCTTGGAAG CACCTGCCAG AATTAAGAAC 3540 

AAGGCTCCCT CCCCTGACTG GCCCCAGGAG GGAGAGGTGA CCTTTGAGAA CGCAGAGATG 3600 

AGGTACCGAG AAAACCTCCC TCTCGTCCTA AAGAAAGTAT CCTTCACGAT CAAACCTAAA 3660 

GAGAAGATTG GCATTGTGGG GCGGACAGGA TCAGGGAAGT CCTCGCTGGG GATGGCCCTC 3720 

TTCCGTCTGG TGGAGTTATC TGGAGGCTGC ATCAAGATTG ATGGAGTGAG AATCAGTGAT 3780 

/!> ATTGGCCTTG CCGACCTCCG AAGCAAACTC TCTATCATTC CTCAAGAGCC GGTGCTGTTC 3840 

AGTGGCACTG TCAGATCAAA TTTGGACCCC TTCAACCAGT ACACTGAAGA CCAGATTTGG 3900 

GATGCCCTGG AGAGGACACA CATGAAAGAA TGTATTGCTC AGCTACCTCT GAAACTTGAA 3960 

TCTGAAGTGA TGGAGAATGG GGATAACTTC TCAGTGGGGG AACGGCAGCT CTTGTGCATA 4020 

GCTAGAGCCC TGCTCOGCCA CTGTAAGATT CTGATTTTAG ATGAAGCCAC AGCTGCCATG 4080 

OU GACACAGAGA CAGACTTATT GATTCAAGAG ACCATCCGAG AAGCATTTGC AGACTGTACC 4140 

ATGCTGACCA TTGCCCATCG CCTGCACACG GTTCTAGGCT CCGATAGGAT TATGGTGCTG 4200 

GCCCAGGGAC AGGTGGTGGA GTTTGACACC CCATCGGTCC TTCTGTCCAA CGACAGTTCC 4260 

CGATTCTATG CCATGTTTGC TGCTGCAGAG AACAAGGTCQ CTGTCAAGGG CTAG 4314 
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Seq ID NO: CIO 5 DMA Sequence 

Nucleic Acid Accession «i NM_00S688.1 

Coding sequence: 126.. 4439 

5 1 11 21 31 41 SI 

I I 1 I I I 

CCGGGCAGGT GGCTCATGCT CGGGAGCGTG GTTGAGCGGC TGGCGCGGTT GTCCTGGAGC 60 

AGGGGCGCAG GAATTCTGAT GTGAAACTAA CAGTCTGTGA GCCCTGGAAC CTCCGCTCAG 120 

AGAAGATGAA GGATATCGAC ATAGGAAAAG AGTATATCAT CCCCAGTCCT GGGTATAGAA 190 

10 GTGTGAGGGA GAGAACCAGC ACTTCTGGGA CGCACAGAGA CCGTGAAGAT TCCAAGTTCA 240 

GGAGAACTCG ACCGTTGGAA TGCCAAGATG CCTTGGAAAC AGCAGCCCGA GCCGAGGGCC 300 

TCTCTCTTGA TGCCTCCATG CATTCTCAGC TCAGAATCCT GGATGAGGAG CATCCCAAGG 360 

GAAAGTACCA TCATGGCTTG AGTGCTCTGA AGCCCATCCG GACTACTTCC AAACACCAGC 420 

ACCCAGTGGA CAATGCTGGG CTTTTTTCCT GTATGACTTT TTCGTGGCTT TCTTCTCTGG 480 

15 CCCGTGTGGC CCACAAGAAG GGGGAGCTCT CAATGGAAGA CGTGTGGTCT 'CTGTCCAAGC 540 

ACGAGTCTTC TGACGTGAAC TGCAGAAGAC TAGAGAGACT GTGGCAAGAA GAGCTGAATG 600 

AAGTTGGGCC AGACGCTGCT TCCCTGCGAA GGGTTGTGTG GATCTTCTGC CGCACCAGGC 660 

TCATCCTGTC CATCGTGTGC CTGATGATCA CGCAGCTGGC TGGCTTCAGT GGACCAGCCT 720 

TCATGGTGAA ACACCTCTTG GAGTATACCC AGGCAACAGA GTCTAACCTG CAGTACAGCT 780 

20 TGTTGTTAGT GCTGGGCCTC CTCCTGACGG AAATCGTGCG GTCTTGGTCG CTTGCACTGA 840 

CTTGGGCATT GAATTACCGA ACCGGTGTCC GCTTGCGGGG GGCCATCCTA ACCATGGCAT 900 

TTAAGAAGAT CCTTAAGTTA AAGAACATTA AAGAGAAATC CCTGGGTGAG CTCATCAACA 960 

TTTGCTCCAA CGATGGGCAG AGAATGTTTG AGGCAGCAGC CGTTGGCAGC CTGCTGGCTG 1020 

GAGGACCCGT TGTTGCCATC TTAGGCATGA TTTATAATGT AATTATTCTG GGACCAACAG 1080 

25 GCTTCCTGGG ATCAGCTGTT TTTATCCTCT TTTACCCAGC AATGATGTTT GCATCACGGC 1140 

TCACAGCATA TTTCAGGAGA AAATGCGTGG CCGCCACGGA TGAAOGTGTC CAGAAGATGA 1200 

ATGAAGTTCT TACTTACATT AAATTTATCA AAATGTATGC CTGGGTCAAA GCATTTTCTC 1260 

AGAGTGTTCA AAAAATCCGC GAGGAGGAGC GTCGGATATT GGAAAAAGCC GGGTACTTCC 1320 

AGGGTATCAC TGTGGGTGTG GCTCCCATTG TGGTGGTGAT TGCCAGCGTG GTGACCTTCT 1380 

30 CTGTTCATAT GACCCTGGGC TTCGATCTGA CAGCAGCACA GGCTTTCACA GTGGTGACAG 1440 

TCTTCAATTC CATGACTTTT GCTTTGAAAG TAACACCGTT TTCAGTAAAG TCCCTCTCAG 1500 

AAGCCTCAGT GGCTGTTGAC AGATTTAAGA GTTTGTTTCT AATGGAAGAG GTTCACATGA 1560 

TAAAGAACAA ACCAGCCAGT CCTCACATCA AGATAGAGAT GAAAAATGCC ACCTTGGCAT 1620 

GGGACTCCTC CCACTCCAGT ATCCAGAACT CGCCCAAGCT GACCCCCAAA ATGAAAAAAG 1680 

35 ACAAGAGGGC TTCCAGGGGC AAGAAAGAGA AGGTGAGGCA GCTGCAGCGC ACTGAGCATC 1740 

AGGCGGTGCT GGCAGAGCAG AAAGGCCACC TCCTCCTGGA CAGTGACGAG CGGCCCAGTC 1800 

CCGAAGAGGA AGAAGGCAAG CACATCCACC TGGGCCACCT GCGCTTACAG AGGACACTGC 1860 

ACAGCATOGA TCTGGAGATC CAAGAGGGTA AACTGGTTGG AATCTGCGGC AGTGTGGGAA 1920 

GTGGAAAAAC CTCTCTCATT TCAGCCATTT TAGGCCAGAT GACGCTTCTA GAGGGCAGCA 1980 

40 TTGCAATCAG TGGAACCTTC GCTTATGTGG CCCAGCAGGC CTGGATCCTC AATGCTACTC 2040 

TGAGAGACAA CATCCTGTTT GGGAAGGAAT ATGATGAAGA AAGATACAAC TCTGTGCTGA 2100 

ACAGCTGCTG CCTGAGGCCT GACCTGGCCA TTCTTCCCAG CAGCGACCTG ACGGAGATTG 2160 

GAGAGCGAGG AGCCAACCTG AGOGGTGGGC AGCGCCAGAG GATCAGCCTT GCCCGGGCCT 2220 

TGTATAGTGA CAGGAGCATC TACATCCTGG ACGACCCCCT CAGTGCCTTA GATGCCCATG 2280 

45 TGGGCAACCA CATCTTCAAT AGTGCTATCC GGAAACATCT CAAGTCCAAG ACAGTTCTGT 2340 

TTGTTACCCA CCAGTTACAG TACCTGGTTG ACTGTGATGA AGTGATCTTC ATGAAAGAGG 2400 

GCTGTATTAC GGAAAGAGGC ACCCATGAGG AACTGATGAA TTTAAATGGT GACTATGCTA 2460 

CCATTTTTAA TAACCTGTTG CTGGGAGAGA CACCGCCAGT TGAGATCAAT TCAAAAAAGG 2520 

AAACCAGTGG TTCACAGAAG AAGTCACAAG ACAAGGGTCC TAAAACAGGA TCAGTAAAGA 2580 

50 AGGAAAAAGC AGTAAAGCCA GAGGAAGGGC AGCTTGTGCA GCTGGAAGAG AAAGGGCAGG 2640 

GTTCAGTGCC CTGGTCAGTA TATGGTGTCT ACATCCAGGC TGCTGGGGGC CCCTTGGCAT 2700 

TCCTGGTTAT TATGGCCCTT TTCATGCTGA ATGTAGGCAG CACCGCCTTC AGCACCTGGT 2760 

GGTTGAGTTA CTGGATCAAG CAAGGAAGCG GGAACACCAC TGTGACTCGA GGGAACGAGA 2820 

CCTGGGTGAG TGACAGCATG AAGGACAATC CTCATATGCA GTACTATGCC AGCATCTACG 2880 

55 CCCTCTCCAT GGCAGTCATG CTGATCCTGA AAGCCATTOG AGGAGTTGTC TTTGTCAAGG 2940 

GCACGCTGCG AGCTTCCTCC CGGCTGCATG ACGAGCTTTT CCGAAGGATC CTTCGAAGCC 3000 

CTATGAAGTT TTTTGACACG ACCCCCACAG GGAGGATTCT CAACAGGTTT TCCAAAGACA 3060 

TGGATGAAGT TGACGTGOGG CTGCCGTTCC AGGCCGAGAT GTTCATCCAG AACGTTATCC 3120 

- TGGTGTTCTT CTGTGTGGGA ATGATCGCAG GAGTCTTCCC GTGGTTCCTT GTGGCAGTGG 3180 

60 GGCCCCTTGT CATCCTCTTT TCAGTCCTGC ACATTGTCTC CAGGGTCCTG ATTCGGGAGC 3240 

TGAAGCGTCT GGACAATATC ACGCAGTCAC CTTTCCTCTC CCACATCACG TCCAGCATAC 3300 

AGGGCCTTGC CACCATCCAC GCCTACAATA AAGGGCAGGA GTTTCTGCAC AGATACCAGG 3360 

AGCTGCTGGA TGACAACCAA GCTCCTTTTT TTTTGTTTAC GTGTGCGATG CGGTGGCTGG 3420 
e , c CTGTGCGGCT GGACCTCATC AGCATCGCCC TCATCACCAC CACGGGGCTG ATGATCGTTC 3480 
65 TTATGCACGG GCAGATTCCC CCAGCCTATG CGGGTCTCGC CATCTCTTAT GCTGTCCAGT 3S40 
TAACGGGGCT GTTCCAGTTT ACGGTCAGAC TGGCATCTGA GACAGAAGCT CGATTCACCT 3600 
OGGTGGAGAG GATCAATCAC TACATTAAGA CTCTGTCCTT GGAAGCACCT GCCAGAATTA 3660 
AGAACAAGGC TCCCTCCCCT GACTGGCCCC AGGAGGGAGA GGTGACCTTT GAGAACGCAG 3720 
AGATGAGGTA CCGAGAAAAC CTCCCTCTTG TCCTAAAGAA AGTATCCTTC ACGATCAAAC 3780 
70 CTAAAGAGAA GATTGGCATT GTGGGGCGGA CAGGATCAGG GAAGTCCTCG CTGGGGATGG 3840 
CCCTCTTCCG TCTGGTGGAG TTATCTGGAG GCTGCATCAA GATTGATGGA GTGAGAATCA 3900 
GTGATATTGG CCTTGCCGAC CTCCGAAGCA AACTCTCTAT CATTCCTCAA GAGCCGGTGC 3960 
TGTTCAGTGG CACTGTCAGA TCAAATTTGG ACCCCTTCAA CCAGTACACT GAAGACCAGA 4020 
TTTGGGATGC CCTGGAGAGG ACACACATGA AAGAATGTAT TGCTCAGCTA CCTCTGAAAC 4080 
75 TTGAATCTGA AGTGATGGAG AATGGGGATA ACTTCTCAGT GGGGGAACGG CAGCTCTTGT 4140 
GCATAGCTAG AGCCCTGCTC CGCCACTGTA AGATTCTGAT TTTAGATGAA GCCACAGCTG 4200 
CCATGGACAC AGAGACAGAC TTATTGATTC AAGAGACCAT CCGAGAAGCA TTTGCAGACT 4260 
GTACCATGCT GACCATTGCC CATCGCCTGC ACACGGTTCT AGGCTCCGAT AGGATTATGG 4320 
TGCTGGCCCA GGGACAGGTG GTGGAGTTTG ACACCCCATC GGTCCTTCTG TCCAACGACA 4380 
80 GTTCCCGATT CTATGCCATG TTTGCTGCTG CAGAGAACAA GGTCGCTGTC AAGGGCTGAC 4440 
TCCTCCCTGT TGACGAAGTC TCTTTTCTTT AGAGCATTGC CATTCCCTGC CTGGGGCGGG 4500 
CCCCTCATCG CGTCCTCCTA CCGAAACCTT GCCTTTCTCG ATTTTATCTT TCGCACAGCA 4560 
GTTCCGGATT GGCTTGTGTG TTTCACTTTT AGGGAGAGTC ATATTTTGAT TATTGTATTT 4620 
ATTCCATATT CATGTAAACA AAATTTAGTT TTTGTTCTTA ATTGCACTCT AAAAGGTTCA 4680 
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GGGAACCGTT 
TCTATATATA 
TATTAAAATA 
TTGCTGTACT 
CTCTAGCTGG 
ATAGTGGGCC 
GAGACGGGTG 
CTGTCCTGGT 
TTTCACTCCC 
TTTCCTGCCT 
TCCCACTGCC 
GTTGGTTCCA 
ATTCCCACAC 
CTCACCGCAG 
CAGCTCTTGC 
ACCTCAGGTT 
GGGGCTGGTA 
ATGTCGTGAC 
CAAAAATCTG 
AAAAAAAAAA 



ATTATAATTG 
ATTCTGTACA 
AGCACTGTGC 
AGAGATCTGG 
TGGTTTCACG 
CTCCGACAGC 
GGCGGCTGGA 
GTCACTTACT 
TCCATCAAGA 
TCTTCTTTTT 
TCAGGTTCCT 
AGCCCTGGAG 
CTCCACAGTT 
TOGTCGCACA 
TAATCAGTGT 
GCTGGTTGCT 
GCTCAGGTGG 
CAACTAGACA 
AAAATGTGAA 



TATCAGAGGC 
TAGCCTATAT 
TAATAACAGT 
TTTTGCTATT 
GTGCCAGGTT 
CCCCTCTGCC 
GACCATGCAG 
GTTTCTGTCA 
ATGGGGATCA 
GCTGTTGTTT 
ATGGCTGGCC 
CCAACTGCTG 
CAGTGGCAGG 
GTCTCTCTCT 
CTCACACTGG 
GTGTGGTTTG 
GCGTGGTCAC 

TAAAATTATT 



CTATAATGAA 
TTACAGTGAA 
GCATATTCCT 
AGACTGTAGG 
TTCTGGGTGT 
GCCTCCCCAC 
AGCGCOGTGA 
GGAGAGCAGC 
CAGAGACATT 
CTAAACAAGA 
ACTGCACAGA 
CTTTTTGAGG 
GCTCAGGATT 
CTCTCTCCCC 
CGTAGAAGTT 
GTGTGTTCCC 
TGCTGTCATC 
TTAGCATGTT 
TTGGATTTTG 



GCTTTATACG 
AATGTAAGCT 
TTCTATCATT 
AAGAGTAGCA 
CCAAAGGAAG 
AGCCGCTCCA 
GTTCTCAGGG 
GGGGCGAAGC 
CCTCCGAGCC 
ATCAGTCTAT 
GCTCTCCAGC 
TGGCACTTTT 
TCGTGGGTCT 
TCAAAGTCTG 
TTTGTACTGT 
GCAAACCCCC 
AGTTGAATGG 
TGCTGAACAC 
TAAAAAAAAA 



TGTAGCTATA 
GTTTATTTTA 
TTTGTACAGT 
TTTCATTCTT 
ACGTGTGGCA 
GGGGTGGCTG 
CTCCTGCCTT 
CCAGGCCCCT 
GGGGAGTTTC 
CCACAGAGAG 
TCCAAGACCT 
TCATTTGCCT 
GTTTTCCTTT 
CAACTTTAAG 
AAAGAGACCT 
TTTGTGCTGT 
TCAGCGTTGC 
CTTGTGGAAG 
AAAAAAAAAA 
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Seq ID NO: C10G DMA Sequence 
Nucleic Acid Accession #: NM_005562 
Coding sequence: 90.. 3671 

1 11 21 31 41 51 

I I I I I I 

ACAGOGGAGC GCAGAGTGAG AACCACCAAC CGAGGCGCCG GGCAGCGACC CCTGCAGCGG 
AGACAGAGAC TGAGCGGCCC GGCACCGCCA TGCCTGCGCT CTGGCTGGGC TGCTGCCTCT 
GCTTCTCGCT CCTCCTGCCC GCAGCCCGGG CCACCTCCAG GAGGGAAGTC TGTGATTGCA 
ATGGGAAGTC CAGGCAGTGT ATCTTTGATC GGGAACTTCA CAGACAAACT GGTAATGGAT 
TCCGCTGCCT CAACTGCAAT GACAACACTG ATGGCATTCA CTGCGAGAAG TGCAAGAATG 
GCTTTTACCG GCACAGAGAA AGGGACCGCT GTTTGCCCTG CAATTGTAAC TCCAAAGGTT 
CTCTTAGTGC TCGATGTGAC AACTCTGGAC GGTGCAGCTG TAAACCAGGT GTGACAGGAG 
CCAGATGCGA CCGATGTCTG CCAGGCTTCC ACATGCTCAC GGATGCGGGG TGCACCCAAG 
ACCAGAGACT GCTAGACTCC AAGTGTGACT GTGACCCAGC TGGCATCGCA GGGCCCTGTG 
ACGCGGGCCG CTGTGTCTGC AAGCCAGCTG TTACTGGAGA AOGCTGTGAT AGGTGTCGAT 
CAGGTTACTA TAATCTGGAT GGGGGGAACC CTGAGGGCTG TACCCAGTGT TTCTGCTATG 
GGCATTCAGC CAGCTGCCGC AGCTCTGCAG AATACAGTGT CCATAAGATC ACCTCTACCT 
TTCATCAAGA TGTTGATGGC TGGAAGGCTG TCCAACGAAA TGGGTCTCCT GCAAAGCTCC 
AATGGTCACA GCGCCATCAA GATGTGTTTA GCTCAGCCCA AOGACTAGAC CCTGTCTATT 
TTGTGGCTCC TGCCAAATTT CTTGGGAATC AACAGGTGAG CTATGGGCAA AGCCTGTCCT 
TTGACTACCG TGTGGACAGA GGAGGCAGAC ACCCATCTGC CCATGATGTG ATTCTGGAAG 
GTGCTGGTCT ACGGATCACA GCTCCCTTGA TGCCACTTGG CAAGACACTQ CCTTGTGGGC 
TCACCAAGAC TTACACATTC AGGTTAAATG AGCATCCAAG CAATAATTGG AGCCCCCAGC 
TGAGTTACTT TGAGTATCGA AGGTTACTGC GGAATCTCAC AGCCCTCCGC ATCCGAGCTA 
CATATGGAGA ATACAGTACT GGGTACATTG ACAATGTGAC CCTGATTTCA GCCCGCCCTG 
TCTCTGGAGC CCCAGCACCC TGGGTTGAAC AGTGTATATG TCCTGTTGGG TACAAGGGGC 
AATTCTGCCA GGATTGTGCT TCTGGCTACA AGAGAGATTC AGCGAGACTG GGGCCTTTTG 
GCACCTGTAT TCCTTGTAAC TGTCAAGGGG GAGGGGCCTG TGATCCAGAC ACAGGAGATT 
GTTATTCAGG GGATGAGAAT CCTGACATTG AGTGTGCTGA CTGCCCAATT GGTTTCTACA 
ACGATCCGCA CGACCCCCGC AGCTGCAAGC CATGTCCCTG TCATAACGGG TTCAGCTGCT 
CAGTGATGOC GGAGACGGAG GAGGTGGTGT GCAATAACTG CCCTCCCGGG GTCACCGGTG 
CCCGCTGTGA GCTCTGTGCT GATGGCTACT TTGGGGACCC CTTTGGTGAA CATGGCCCAG 
TGAGGCCTTG TCAGCCCTGT CAATGCAACA ACAATGTGGA CCCCAGTGCC TCTGGGAATT 
GTGACCGGCT GACAGGCAGG TGTTTGAAGT GTATCCACAA CACAGCCGGC ATCTACTGCG 
ACCAGTGCAA AGCAGGCTAC TTOGGGGACC CATTGGCTCC CAACCCAGCA GACAAGTGTC 
GAGCTTGCAA CTGTAACCCC ATGGGCTCAG AGCCTGTAGG ATGTCGAAGT GATGGCACCT 
GTGTTTGCAA GCCAGGATTT GGTGGCCCCA ACTGTGAGCA TGGAGCATTC AGCTGTCCAG 
CTTGCTATAA TCAAGTGAAG ATTCAGATGG ATCAGTTTAT GCAGCAGCTT CAGAGAATGG 
AGGCCCTGAT TTCAAAGGCT CAGGGTGGTG ATGGAGTAGT ACCTGATACA GAGCTGGAAG 
GCAGGATGCA GCAGGCTGAG CAGGCCCTTC AGGACATTCT GAGAGATGCC CAGATTTCAG 
AAGGTGCTAG CAGATCCCTT GGTCTCCAGT TGGCCAAGGT GAGGAGCCAA GAGAACAGCT 
ACCAGAGCCG CCTGGATGAC CTCAAGATGA CTGTGGAAAG AGTTCGGGCT CTGGGAAGTC 
AGTACCAGAA CCGAGTTCGG GATACTCACA GGCTCATCAC TCAGATGCAG CTGAGCCTGG 
CAGAAAGTGA AGCTTCCTTG GGAAACACTA ACATTCCTGC CTCAGACCAC TACGTGGGGC 
CAAATGGCTT TAAAAGTCTG GCTCAGGAGG CCACAAGATT AGCAGAAAGC CACGTTGAGT 
CAGCCAGTAA CATGGAGCAA CTGACAAGGG AAACTGAGGA CTATTCCAAA CAAGCCCTCT 
CACTGGTGCG CAAGGCCCTG CATGAAGGAG TCGGAAGCGG AAGCGGTAGC CCGGACGGTG 
CTGTGGTGCA AGGGCTTGTG GAAAAATTGG AGAAAACCAA GTCCCTGGCC CAGCAGTTGA 
CAAGGGAGGC CACTCAAGCG GAAATTGAAG CAGATAGGTC TTATCAGCAC AGTCTCCGCC 
TCCTGGATTC AGTGTCTCGG CTTCAGGGAG TCAGTGATCA GTCCTTTCAG GTGGAAGAAG 
CAAAGAGGAT CAAACAAAAA GCGGATTCAC TCTCAACGCT GGTAACCAGG CATATGGATG 
AGTTCAAGCG TACACAAAAG AATCTGGGAA ACTGGAAAGA AGAAGCACAG CAGCTCTTAC 
AGAATGGAAA AAGTGGGAGA GAGAAATCAG ATCAGCTGCT TTCCCGTGCC AATCTTGCTA 
AAAGCAGAGC ACAAGAAGCA CTGAGTATGG GCAATGCCAC TTTTTATGAA GTTGAGAGCA 
TCCTTAAAAA CCTCAGAGAG TTTGACCTGC AGGTGGACAA CAGAAAAGCA GAAGCTGAAG 
AAGCCATGAA GAGACTCTCC TACATCAGCC AGAAGGTTTC AGATGCCAGT GACAAGACCC 
AGCAAGCAGA AAGAGCCCTG GGGAGCGCTG CTGCTGATGC ACAGAGGGCA AAGAATGGGG 
CCGGGGAGGC CCTGGAAATC TCCAGTGAGA TTGAACAGGA GATTGGGAGT CTGAACTTGG 
AAGCCAATGT GACAGCAGAT GGAGCCTTGG CCATGGAAAA GGGACTGGCC TCTCTGAAGA 
GTGAGATGAG GGAAGTGGAA GGAGAGCTGG AAAGGAAGGA GCTGGAGTTT GACACGAATA 
TGGATGCAGT ACAGATGGTG ATTACAGAAG CCCAGAAGGT TGATACCAGA GCCAAGAACG 
CTGGGGTTAC AATCCAAGAC ACACTCAACA CATTAGACGG CCTCCTGCAT CTGATGGACC 
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AGCCTCTCAG TGTAGATGAA GAGGGGCTGG TCTTACTGQA GCAGAAGCTT TCCCGAGCCA 3480 

AGACCCAGAT CAACAGCCAA CTGCGGCCCA TGATGTCAGA GCTGGAAGAG AGGGCACGTC 3540 

AGCAGAGGGG CCACCTCCAT TTGCTGGAGA CAAGCATAGA TGGGATTCTG GCTGATGTGA 3600 

AGAACTTGGA GAACATTAGG GACAACCTGC CCCCAGGCTG CTACAATACC CAGGCTCTTG 3660 

AGCAACAGTG AAGCTGCCAT AAATATTTCT CAACTGAGGT TCTTGGGATA CAGATCTCAG 3720 

GGCTCGGGAG CCATGTCATG TGAGTGGGTG GGATGGGGAC ATTTGAACAT GTTTAATGGG 3780 

TATGCTCAGG TCAACTGACC TGACCCCATT CCTGATCCCA TGGCCAGGTG GTTGTCTTAT 3840 

TGCACCATAC TCCTTGCTTC CTGATGCTGG GCAATGAGGC AGATAGCACT GGGTGTGAGA 3900 

ATGATCAAGG ATCTGGACCC CAAAGAATAG ACTGGATGGA AAGACAAACT GCACAGGCAG 3960 

ATGTTTGCCT CATAATAGTC GTAAGTGGAG TCCTGGAATT TGGACAAGTG CTGTTGGGAT 4020 

ATAGTCAACT TATTCTTTGA GTAATGTGAC TAAAGGAAAA AACTTTGACT TTGCCCAGGC 4080 

ATGAAATTCT TCCTAATGTC AGAACAGAGT GCAACCCAGT CACACTGTGG CCAGTAAAAT 4140 

ACTATTGCCT CATATTGTCC TCTGCAAGCT TCTTGCTGAT CAGAGTTCCT CCTACTTACA 4200 

ACCCAGGGTG TGAACATGTT CTCCATTTTC AAGCTGGAAG AAGTGAGCAG TGTTGGAGTG 42 SO 

AGGACCTGTA AGGCAGGCCC ATTCAGAGCT ATGGTGCTTG CTGGTGCCTG CCACCTTCAA 4320 

GTTCTGGACC TGGGCATGAC ATC CTTTCTT TTAATGATGC CATGGCAACT TAGAGATTGC 4380 

ATTTTTATTA AAGCATTTCC TACCAGCAAA GCAAATGTTG GGAAAGTATT TACTTTTTCG 4440 

GTTTCAAAGT GATAGAAAAG TGTGGCTTGG GCATTGAAAG AGGTAAAATT CTCTAGATTT 4500 

ATTAGTCCTA ATTCAATCCT ACTTTTCGAA CACCAAAAAT GATGCGCATC AATGTATTTT 4560 

ATCTTATTTT CTCAATCTCC TCTCTCTTTC CTCCACCCAT AATAAGAGAA TGTTCCTACT 4620 

CACACTTCAG CTGGGTCACA TCCATCCCTC CATTCATCCT TCCATCCATC TTTCCATCCA 4680 

TTACCTCCAT CCATCCTTCC AACATATATT TATTGAGTAC CTACTGTGTG CCAGGGGCTG 4740 

GTGGGACAGT GGTGACATAG TCTCTGCCCT CATAGAGTTG ATTGTCTAGT GAGGAAGACA 4800 

AGCATTTTTA AAAAATAAAT TTAAACTTAC AAACTTTGTT TGTCACAAGT GGTGTTTATT 4860 

GCAATAACCG CTTGGTTTGC AACCTCXTTG CTCAACAGAA CATATGTTGC AAGACCCTCC 4920 

CATGGGGGCA CTTGAGTTTT GGCAAGGCTG ACAGAGCTCT GGGTTGTGCA CATTTCTTTG 4980 

CATTCCAGCT GTCACTCTGT GCCTTTCTAC AACTGATTGC AACAGACTGT TGAGTTATGA 5040 

TAACACCAGT GGGAATTGCT GGAGGAACCA GAGGCACTTC CACCTTGGCT GGGAAGACTA 5100 

A TGGTGCTGCC TTGCTTCTGT ATTTCCTTGG ATTTTCCTGA AAGTGTTTTT AAATAAAGAA 5160 

D\) CAATTGTTAG ATGCC 5175 
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Seq ID NO: CI 07 dna Sequence 
Nucleic Acid Accession #: NM_021101 
Coding sequence: 221.. 856 

1 11 21 31 41 51 

I I 1 I I I 

GAATTCGGCA CGAGGCCTCG TGCCGGGGAG CAACCTCAGC TTCTAGTATC CAGACTCCAG 60 

CGCCGCCCCG GGCGCGGACC CCAACCCCGA CCCAGAGCTT CTCCAGCGGC GGCGCAGCGA 120 

GCAGGGCTCC CCGCCTTAAC TTCCTCCGCG GGGCCCAGCC ACCTTCGGGA GTCCGGGTTG 180 . 

CCCACCTGCA AACTCTCCGC CTTCTGCACC TGCCACCCCT GAGCCAGCGC GGGCGCCCGA 240 

GCGAGTCATG GCCAAOGCGG GGCTGCAGCT GTTGGGCTTC ATTCTCGCCT TCCTGGGATG 300 

GATCGGCGCC ATCGTCAGCA CTGCCCTGCC CCAGTGGAGG ATTTACTCCT ATGCCGGCGA 360 

CAACATCGTG ACCGCCCAGG CCATGTACGA GGGGCTGTGG ATGTCCTGCG TGTCGCAGAG 420 

CACCGGGCAG ATCCAGTGCA AAGTCTTTGA CTCCTTGCTG AATCTGAGCA GCACATTGCA 480 

AGCAACCCGT GCCTTGATGG TGGTTGGCAT CCTCCTGGGA GTGATAGCAA TCTTTGTGGC 540 

CACCGTTGGC ATGAAGTGTA TGAAGTGCTT GGAAGACGAT GAGGTGCAGA AGATGAGGAT 600 

GGCTGTCATT GGGGGCGCGA TATTTCTTCT TGCAGGTCTG GCTATTTTAG TTGCCACAGC 660 

ATGGTATGGC AATAGAATOG TTCAAGAATT CTATGACCCT ATGACCCCAG TCAATGCCAG 720 

GTACGAATTT GGTCAGGCTC TCTTCACTGG CTGGGCTGCT GCTTCTCTCT GCCTTCTGGG 780 

AGGTGCCCTA CTTTGCTGTT CCTGTCCCCG AAAAACAACC TCTTACCCAA CACCAAGGCC 840 

CTATCCAAAA CCTGCACCTT CCAGCGGGAA AGACTACGTG TGACACAGAG GCAAAAGGAG 900- 

AAAATCATGT TGAAACAAAC CGAAAATGGA CATTGAGATA CTATCATTAA CATTAGGACC 960 

TTAGAATTTT GGGTATTGTA ATCTAAAGTA TGTTATTACA AAACAAACAA ACAAACAAAA 1020 

AACCCATGTG TTAAAATACT CAGTGCTAAA CATGGCTTAA TCTTATTTTA TCTTCTTTCC 1080 

TCAATATAGG AGGGAAGATT TTTCCATTTG TATTAGTGCT TCCCATTGAG TAATCATACT 1140 

CAAATGGGGG AAGGGGTGCT CCTTAAATAT ATATAGATAT GTATATATAC ATGTTTTTCT 1200 

ATTAAAAATA GCCAGTAAAA AAAAAAAAAA AAAAAAA 1237 

Seq ID NO: CI 08 DNA Sequence 
Nucleic Acid Accession #: AF508964.1 
Coding sequence: 98..1S31 



1 11 21 31 41 51 

65 | | | | | | 

CAGAGCCGCA AGCGCAGGGA AGGCCTCCCC GCACGGTGGG GGAAAGCGGC CGGTGCAGCG 60 

CGGGGACAGG CACTCGGGCT GGCACTGGCT GCTAGGGATG TCGTCCTGGA TAAGGTGGCA 120 

TGGACCCGCC ATGGCGCGGC TCTGGGGCTT CTGCTGGCTG GTTGTGGGCT TCTGGAGGGC 180 

CGCTTTCGCC TGTCCCACGT CCTGCAAATG CAGTGCCTCT CGGATCTGGT GCAGCGACCC 240 

TTCTCCTGGC ATCGTGGCAT TTCCGAGATT GGAGCCTAAC AGTGTAGATC CTGAGAACAT 300 

CACCGAAATT TTCATCGCAA ACCAGAAAAG GTTAGAAATC ATCAACGAAG ATGATGTTGA 360 

AGCTTATGTG GGACTGAGAA ATCTGACAAT TGTGGATTCT GGATTAAAAT TTGTGGCTCA 420 

TAAAGCATTT CTGAAAAACA GCAACCTGCA GCACATCAAT TTTACCCGAA ACAAACTGAC 480 

GAGTTTGTCT AGGAAACATT TCCGTCACCT TGACTTGTCT GAACTGATCC TGGTGGGCAA 540 

TCCATTTACA TGCTCCTGTG ACATTATGTG GATCAAGACT CTCCAAGAGG CTAAATCCAG 600 

TCCAGACACT CAGGATTTGT ACTGCCTGAA TGAAAGCAGC AAGAATATTC CCCTGGCAAA 660 

CCTGCAGATA CCCAATTGTG GTTTGCCATC TGCAAATCTG GCCGCACCTA ACCTCACTGT 720 

GGAGGAAGGA AAGTCTATCA CATTATCCTG TAGTGTGGCA GGTGATCCGG TTCCTAATAT 780 

GTATTGGGAT GTTGGTAACC TGGTTTCCAA ACATATGAAT GAAACAAGCC ACACACAGGG 840 

CTCCTTAAGG ATAACTAACA TTTCATCCGA TGACAGTGGG AAGCAGATCT CTTGTGTGGC 900 

GGAAAATCTT GTAGGAGAAG ATCAAGATTC TGTCAACCTC ACTGTGCATT TTGCACCAAC 960 

TATCACATTT CTCGAATCTC CAACCTCAGA CCACCACTGG TGCATTCCAT TCACTGTGAA 1020 

AGGCAACCCC AAACCAGCGC TTCAGTGGTT CTATAACGGG GCAATATTGA ATGAGTCCAA 1080 

ATACATCTGT ACTAAAATAC ATGTTACCAA TCACACGGAG TACCACGGCT GCCTCCAGCT 1140 
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GGATAATCCC ACTCACATGA ACAATGGGGA CTACACTCTA ATAGCCAAGA ATGAGTATGG 1200 

GAAGGATGAG AAACAGATTT CTGCTCACTT CATGGGCTGG CCTGGAATTG AOGATGGTGC 1260 

AAACCCAAAT TATCCTGATG TAATTTATGA AGATTATGOA ACTGCAGCGA ATGACATCGG 1320 

GGACACCACG AACAGAAGTA ATGAAATCCC TTCCACAGAC GTCACTGATA AAACCGGTCG 1380 

GGAACATCTC TCGGTCTATG CTGTGGTGGT GATTGCGTCT GTGGTGGGAT TTTGCCTTTT 1440 

GGTAATGCTG TTTCTGCTTA AGTTGGCAAG ACACTCCAAG TTTGGCATGA AAGGTTTTGT 1500 

TTTGTTTCAT AAGATCCCAC TGGATGGGTA GCTGAAATAA AGGAAAAGAC AGAGAAAGGG 1560 

GCTGTGGTGC TTGTTGGTTG ATGCTGCCAT GTAAGCTGGA CTCCTGGGAC TGCTGTTGGC 1620 

TTATCCCGGG AAGTGCTGCT TATCTGGGGT TTTCTGGTAG ATGTGGGCGG TGTTTGGAGG 1600 

CTGTACTATA TGAAGCCTGC ATATACTGTG AGCTGTGATT GGGGAACACC AATGCAGAGG 1740 

TAACTCTCAG GCAGCTAAGC AGCACCTCAA GAAAACATGT TAAATTAATG CTTCTCTTCT 1800 

TACAGTAGTT CAAATACAAA ACTGAAATGA AATCCCATTG GATTGTACTT CTCTTCTGAA I860 

AAGTGTGCTT TTTGACCCTA CTGGACATTT ATTGACTTAA TTGCTTCTGT TTATTAAAAT 1920 

TGACCTGCAA AGTTAAAAAA AAATTAAAGT TGAGAACAGG TATAAGTGCA CACTGAATAG 1980 

TCTAATCTAC ATGTAACACA TATTTTAGTA TGATTTTCTA TACTCTAATC AGCACTGAAT 2040 

TCAGAGGGTT TGACTTTTTC ATCTATAACA CAGTGACTAA AAGAGTTAAG GGTATATATA 2100 

CCATCACTTT GGGACTTGGT AGTATTATTA AAAGGTTATT TCCTTCACTG TCAATAAAAG 2160 

TCCAAATGTT TAGCTTAGGT CTGAGAGTCA AACAATGTTA AGGATTGTCT TAAAGTTCCT 2220 

TAGCCAGCAA AACAAAACAA AACAAAACAA ACAAATGAAA AACGTTTAAA AAGAAGAAGA 2280 

AGAAAAAAAA CAAGAACAAG CAGCAACAGC TGTTTTGTTG GGGCTATAGA TTTAAGTTAG 2340 

GCATAGTCAA TTTCAGAATA ACTAAGAGTG GAATATATGC ATATGGTGAA ATTATAACCT 2400 

TGCCCTTTTT TATTTGCCCT CTGCGATCCA CCTGGCTTTT TAGAAGTCTG CCGAGTGAGA 2460 

AGGCCACAGT ATCTCATGCT GTTTGCATTA CAGAACTGCA GCTTTTCTAC TCTGAAAAGG 2520 

CCTGGGAGCA GAATGGCTGG CCTGCTGTGA GCAGGAGAGG AGATTCTAAG AAGGATAGTC 2580 

CCCCCTACAA CATACTGTCA TACTGCTGGG TTTTCATGGG TAGGAAAGCT TGTCCTGACC 2640 

CCAGCAGCAA AGAGGTGGCA GGTCGCTAAT GAATATATGC TTTATAATGT CCTTCTTCAT 2700 

TGCTGAGAGG GCAGCCTTAG AGCTGTGGAT TTCTGCATCC CCCCTGAGTC TGACCCATGG 2760 

ACACCTGTTT CATTCACTTT AGCATCACAG TGACCTTTGT ATGCTCTGTT CAGTCTGTGT 2820 

CAGGCAGTAT GCTTGTCCTG AAGAGAGGTT TGGCTATCCC CCACCCCACC CACCCACCCT 2880 

GTTCCTTTTT TATCAGGAGG ACTTCAGAGC CAGGCCTGCA GCATTTTGTT TGAAAACACA 2940 

ATCAGCTCTG ACAGTTAGAC ATGCACACAG ACGCCATAGC TGGATTGGAA ACATTGATGT 3000 

TTTAAAAATT TATTTTTTTT GGAAATAGTT GCACAAATGC TGCAATTTAG CTTTAAGGTT 3060 

CTATAGATTT TTAACTAGTC CAACACAGTC AGAAACATTG TTTTGAATCC TCTGTAAACC 3120 

AAGGCATTAA TCTTAATAAA CCAGGATCCA TTTAGGTACC ACTTGATATA AAAAGGATAT 3180 

CCATAATGAA TATTTTATAC TGCATCCTTT ACATTAGCCA CTAAATACGT TATTGCTTGA 3240 

TGAAGACCTT TCACAGAATC CTATGGATTG CAGCATTTCA CTTGGCTACT TCATACCCAT 3300 

GCCTTAAAGA GGGGCAGTTT CTCAAAAGCA GAAACATGCC GCCAGTTCTC AAGTTTTCCT 3360 

CCTAACTCCA TTTGAATGTA AGGGCAGCTG GCCCCCAATG TGGGGAGGTC CGAACATTTT 3420 

CTGAATTCCC ATTTTCTTGT TCGCGGCTAA ATGACAGTTT CTGTCATTAC TTAGATTCCG 3480 

ATCTTTCCCA AAGGTGTTGA TTTACAAAGA GGCCAGCTAA TAGCAGAAAT CATGACCCTG 3540 

AAAGAGAGAT GAAATTCAAG CTGTGAGCCA GGCAGGAGCT CAGTATGGCA AAGGTTCTTG 3600 

AGAATCAGCC ATTTGGTACA AAAAAGATTT TTAAAGCTTT TATGTTATAC CATGGAGCCA 3660 

TAGAAAGGCT ATGGATTGTT TAAGAACTAT TTTAAAGTGT TCCAGACCCA AAAAGGAAAA 3720 

ATAAAAAAAA AGGAATATTT GTACCCAACA GCTAGAAGGA TTGCAAGGTA GATTTTTGTT 3780 

TTAAAATGGA GAGAAGTGGA CAGATAAGGC CATTTAATAT ATCAAAGATC AGTTGACATC 3840 

TCTAGGGAAT GATGAAACAG CAGGCTATTA GAAAATTATT TCATATAGTT CTCGTGTTCT 3900 

TTTCTTTTTT TTAATCCCTG AAGGGAGATC AGTAACATAG CTTCTCTTTT CTGTACTCTA 3960 

GACCACCCCT TTTCATCATT TTGCTTTTTA TGTCTCCCAT AAGAAATGTG CTTTTTAGAG 4020 

CTTCCTAATG CATGTGTTGC ATTATTGCAG CATTAGAAAA GGAGAGGTAG CATTTTTGCT 4080 

GAAATCGGGC CTGTCACTCT CCAATAAAGG TTCTGGCACT TCAATGCCAG GCAGGTCTCC 4140 

TAAATGAACA GAATGATCTG TGTGAGCCGA TGCCTGCCCT TCCAGAGGGG CCACTGTCCC 4200 

CAGCCGCAGC CAACTGTGTC CCACAGGAAT GGGAGCCTAG GTTTCCAAAT CTTGTGATTC 4260 
TTTAGGAGAA ACATGAAACC TGGATTTCGT GTGAAATGTC CCGATTGTTA AAAAGTTGGC 4320 

TCAATTATTT TTAAAACATT TTGTAAGCCA ACAAAAGTCT GTGGGCTGCC AGTTTATTAC 4380 

TTTTGTCTTA AAACATGATC ATTGTTCTCT CACGGTATCC TTCTGTCTTC CCGTTGCAAA 4440 
TTCACTTTTC TTTCTTCCTG ACATTGCCAT TGAGGGCTTT GTTACCACAA GCTAAGAAAC 4500 
TGAGTTTAAC AGCCCAGTTA TCTGCAACAT GTCAATTACC TTTGCTCCTC TCCTGTGATT 4560 
CCCACCATGC TGTGACCCTC AGCTGTCTCC CTTTGCTGGG AATTCTGCAC CAATGTCTTC 4620 
CCCTCAACCC ATTCCCTGGT TGGTCCTACT CCCGTGTGGC CAGAGACATC CTAGCAAATC 4680 
CTTCCTCCTA TTATATCTGA CACTAATTTC TTTTCAACAG CGCTCATGTC TCTTGGCCCA 4740 
GTCAGGTGCT GCCAGGTTTA GATAGGAAAG TACATGTCCC ATTTTCATGG GTGCCCTTAA 4800 
TGTGGTCCAC GTCCTATATC TTATTATATT TACTCATGGC TCAATGGGGG CCTCCAGAGA 4860 
CCCTCTCAGG CTGCTGAGCT AGACTAAGGA ATGCATCCAC TGTCATCACA TGAGACACTG 4920 
ACTCTGTGAC GACAAAAGTA CAAACAGTCT GAGGCTAAGA AAGGTTCATC TCACAACAGG 4980 
AAAAACAAAT CTCAACACAC ATTAGAGATA ATTGATTCAG GGGTTTTCTC TCCCAGTCTC 5040 
CCAGCAGGGA CTGATTTCAT TTCTGACCCA CTAGGTTTTC TTTCCAGAAA TAGGTAGCAA 5100 
GGACAAGAAC TAAACAATCC CAGCCCCACC CAGCAACACA GAACACAGGA GTTTGCTTTT 5160 
GGCTTCTCAC TCTCCAAGTA ACCCTGAATT AGGCCCAGAA TGGCTGAGGC TTGGAGCATC 5220 
TCCTCAGACA GAGCAGAGGC GACACCTCTT CAGGGGTGTG TGGAGTAAAT AGCTCGAAGA 5280 
GCTGAAGACA GAAAACCAGT TTCACGCCAG GTGCGAGAGA GAGCATAATG GAGGGAAGCC 5340 
CGCTTTCTCT CTCCTCTTCT TTTCTCTTTA TTTCTTTAGA GCACTTGACT TTTTTTTCTC 5400 
TCTCTCTCTA GTATTCTAAA CTGACCCCAT GACCAACTGA GAATTTATTT TTGTTTCATT 5460 
GGTTGTTTCA CAGAATTAGA ACACACACGA CTTTTTATTC CTCCATTGCA AAATGGAATC 5520 
AAGATACTAC ACAAGACCTG TGCTTTCTTC CTTTGCATGA TTTACACCTC CGCCTGTTTT 5580 
GGTGCTAGCT GTCTAGAACT TCTCTCTTGG TTTGAATCTG ATTCCTTCAC ACTACACTAG 5640 
AAGTTTATTT CATCTTGTTT TGTCTAGACT CCAGATACAG AGGGACAGCT GGACTGAGGA 5700 
CAAGCAATTC CAT CT AG CAT AGGGTCTCTC AGGGTTGGTG CATCCAGCCA CATGGGCAGG 5760 
GCCAGTCACA TCTAGTCTAT GTCCCCAGAG CCCTTGGAGT TGCGCAGCTT AGCTGACTTG 5820 
ACTCCAAGGA AATTAGTACA GAAGTAACCA CTCTATTAAG TGTGTTCTGC TATGTTCACA 5880 
TGCCTGTAGT ACCTGCAAAC CATGCCAGGT TCATCTAAAG ACATAGGGGA AGATTAAGGA 5940 
CTCTTTTGGA CAGACCATGA ATTGAATTTG CTGCCAGGTG CTGCCAGACT GAATTTGGCT 6000 
GACAGAACTC CCAGCCCAGG AAAGTTCCAT GACAATGACT GTCGCAGAAG GAAATTTCCC 6060 
ACTAAAGTCA GTCCATTTTC AAGTTTTGGT CTTCAGAGAC AAAAGAACGT CCCAGCCACC 6120 
TGATTTTGAT GGTGAGGTAA CTCTAAGTTG AATTCAGGCT AGTGTTGCAG TATAGCTTTG 6180 
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GCATGTTCAT GAGTGAGCAC CCAGAATGTG TTGAACCAAC CCCCACCCCT AACTACTGAC 6240 

TATGACTGCA GTGGGTTTTT ATGGGGAAAA AAAGTGTGAA AAGCAAAAAG AAAGGAACAG 6300 

AGATTTTTTA TCACCTTTAT TGTAAGACAG TCCATTTATG AATTGAGTAT AAACACATAC 6360 

AAAGTAACAA GAGATTCCTA AGAAACGCAA ATCCTTGAGT TTCACGCACT TCATGTTCAA 6420 

CCATTTGCTG TAATCCAGAG GCAGCCTGTG AATCATTCTC ATGCCCTGTT T TTTTT TT TT 6480 

TTTTCC TATA ATGTTCTGGG TTTAAAAGCC ATCTTTTCCA CATTTTCTGT AAATAATGGA 6540 

TAATCATTTT AAAAAGTTTT ATTTTTAGTG CTGTTTTAAC AATGTAGATA GATCATAAAT 6600 

GTACTTGCTG AATTCAATCA TTTTTAACAA GCCAATAAAG TTTGATAATT CAAAAAAAAA 6660 

AAAAAAAAAA AAAAAAAAA 6679 

Seq ID NO: CIO 9 DNA Sequence 

Nucleic Acid Accession #: NM_006180.1 

Coding sequence: 352.. 2820 ~ 



L 11 21 31 41 51 

I I I I I I 

CCCCCATTCG CATCTAACAA GGAATCTGCG CCCCAGAGAG TCCCGGACGC CGCCGGTCGG 60 

TGCCCGGCGC GCCGGQCCAT GCAGCGACGG CCGCCGCGGA GCTCCGAGCA GOGGTAGCGC 120 

CCCCCTGTAA AGCGGTTCGC TATGCCGGGA CCACTGTGAA CCCTGCCGCC TGCCGGAACA 180 

ZU CTCTTCGCTC CGGACCAGCT CAGCCTCTGA TAAGCTGGAC TCGGCACGCC CGCAACAAGC 240 

ACCGAGGAGT TAAGAGAGCC GCAAGCGCAG GGAAGGCCTC CCCGCACGGG TGGGGGAAAG 300 

CGGCCGGTGC AGCGCGGGGA CAGGCACTCG GGCTGGCACT GGCTGCTAGG GATGTCGTCC 360 

TGGATAAGGT GGCATGGACC CGCCATGGCG CGGCTCTGGG GCTTCTGCTG GCTGGTTGTG 420 

GGCTTCTGGA GGGCCGCTTT CGCCTGTCCC ACGTCCTGCA AATGCAGTGC CTCTCGGATC 480 

/J TGGTGCAGCG ACCCTTCTCC TGGCATCGTG GCATTTCCGA GATTGGAGCC TAACAGTGTA 540 

GATCCTGAGA ACATCACCGA AATTTTCATC GCAAACCAGA AAAGGTTAGA AATCATCAAC 600 

GAAGATGATG TTGAAGCTTA TGTGGGACTG AGAAATCTGA CAATTGTGGA TTCTGGATTA 660 

AAATTTGTGG CTCATAAAGC ATTTCTGAAA AACAGCAACC TGCAGCACAT CAATTTTACC 720 

~ ~ CGAAACAAAC TGACGAGTTT GTCTAGGAAA CATTTCCGTC ACCTTGACTT GTCTGAACTG 780 

JU ATCCTGGTGG GCAATCCATT TACATGCTCC TGTGACATTA TGTGGATCAA GACTCTCCAA 840 

GAGGCTAAAT CCAGTCCAGA CACTCAGGAT TTGTACTGCC TGAATGAAAG CAGCAAGAAT 900 

ATTCCCCTGG CAAACCTGCA GATACCCAAT TGTGGTTTGC CATCTGCAAA TCTGGCCGCA 960 

CCTAACCTCA CTGTGGAGGA AGGAAAGTCT ATCACATTAT" CCTGTAGTGT GGCAGGTGAT 1020 

„ CCGGTTCCTA ATATGTATTG GGATGTTGGT AACCTGGTTT CCAAACATAT GAATGAAACA 1080 

J J AGCCACACAC AGGGCTCCTT AAGGATAACT AACATTTCAT CCGATGACAG TGGGAAGCAG 1140 

ATCTCTTGTG TGGCGGAAAA TCTTGTAGGA GAAGATCAAG ATTCTGTCAA CCTCACTGTG 1200 

CATTTTGCAC CAACTATCAC ATTTCTCGAA TCTCCAACCT CAGACCACCA CTGGTGCATT 1260 

CCATTCACTG TGAAAGGCAA CCCCAAACCA GCGCTTCAGT GGTTCTATAA CGGGGCAATA 1320 

A TTGAATGAGT CCAAATACAT CTGTACTAAA ATACATGTTA CCAATCACAC GGAGTACCAC 1380 

4U GGCTGCCTCC AGCTGGATAA TCCCACTCAC ATGAACAATG GGGACTACAC TCTAATAGCC 1440 

AAGAATGAGT ATGGGAAGGA TGAGAAACAG ATTTCTGCTC ACTTCATGGG CTGGCCTGGA 1500 

ATTGACGATG GTGCAAACCC AAATTATCCT GATGTAATTT ATGAAGATTA TGGAACTGCA 1560 

GCGAATGACA TCGGGGACAC CACGAACAGA AGTAATGAAA TCCCTTCCAC AGACGTCACT 1620 

A r GATAAAACCG GTCGGGAACA TCTCTCGGTC TATGCTGTGG TGGTGATTGC GTCTGTGGTG 1680 

42) GGATTTTGCC TTTTGGTAAT GCTGTTTCTG CTTAAGTTGG CAAGACACTC CAAGTTTGGC 1740 

ATGAAAGGCC CAGCCTCCGT TATCAGCAAT GATGATGACT CTGCCAGCCC ACTCCATCAC 1800 

ATCTCCAATG GGAGTAACAC TCCATCTTCT TCGGAAGGTG GCCCAGATGC TGTCATTATT 1860 

GGAATGACCA AGATCCCTGT CATTGAAAAT CCCCAGTACT TTGGCATCAC CAACAGTCAG 1920 

_ n CTCAAGCCAG ACACATTTGT TCAGCACATC AAGCGACATA ACATTGTTCT GAAAAGGGAG 1980 

3U CTAGGCGAAG GAGCCTTTGG AAAAGTGTTC CTAGCTGAAT GCTATAACCT CTGTCCTGAG 2040 

CAGGACAAGA TCTTGGTGGC AGTGAAGACC CTGAAGGATG CCAGTGACAA TGCACGCAAG 2100 

GACTTCCACC GTGAGGCCGA GCTCCTGACC AACCTCCAGC ATGAGCACAT CGTCAAGTTC 2160 

TATGGCGTCT GCGTGGAGGG OGACCCCCTC ATCATGGTCT TTGAGTACAT GAAGCATGGG 2220 

GACCTCAACA AGTTCCTCAG GGCACACGGC CCTGATGCCG TGCTGATGGC TGAGGGCAAC 2280 

J J CCGCCCACGG AACTGACGCA GTCGCAGATG CTGCATATAG CCCAGCAGAT CGCCGCGGGC 2340 

ATGGTCTACC TGGCGTCCCA GCACTTCGTG CACCGOGATT TGGCCACCAG GAACTGCCTG 2400 

GTCGGGGAGA ACTTGCTGGT GAAAATCGGG GACTTTGGGA TGTCCCGGGA CGTGTACAGC 2460 

ACTGACTACT ACAGGGTCGG TGGCCACACA ATGCTGCCCA TTCGCTGGAT GCCTCCAGAG 2520 

AGCATCATGT ACAGGAAATT CACGACGGAA AGCGACGTCT GGAGCCTGGG GGTCGTGTTG 2580 

OU TGGGAGATTT TCACCTATGG CAAACAGCCC TGGTACCAGC TGTCAAACAA TGAGGTGATA 2640 

GAGTGTATCA CTCAGGGCCG AGTCCTGCAG CGACCCCGCA CGTGCCCCCA GGAGGTGTAT 2700 

GAG CTGATGC TGGGGTGCTG GCAGCGAGAG CCCCACATGA GGAAGAACAT CAAGGGCATC 2760 

CATACCCTCC TTCAGAACTT GGCCAAGGCA TCTCCGGTCT ACCTGGACAT TCTAGGCTAG 2820 

s-r GGCCCTTTTC CCCAGACCGA TCCTTCCCAA CGTACTCCTC AGACGGGCTG AGAGGATGAA 2880 

OJ CATCTTTTAA CTGCCGCTGG AGGCCACCAA GCTGCTCTCC TTCACTCTGA CAGTATTAAC 2940 

ATCAAAGACT CCGAGA AGCT CTCGAGGGAA GCAGTGTGTA CTTCTTCATC CATAGACACA 3000 

GTA TTGACTT CTTTTTGGCA TTATCTCTTT CTCTCTTTCC ATCTCCCTTG GTTGTTCCTT 3060 

TTTCTTTTTT TAAATTTTCT TTTTCTTCTT TTTTTTCGTC TTCCCTGCTT CACGATTCTT 3120 

ACCCTTTCTT TTGAATCAAT CTGGCTTCTG CATTACTATT AACTCTGCAT AGACAAAGGC 3180 

/U CTTAACAAAC GTAATTTGTT ATATCAGCAG ACACTCCAGT TTGCCCACCA CAACTAACAA 3240 

TGCCTTGTTG TATTCCTGCC TTTGATGTGG ATGAAAAAAA GGGAAAACAA ATATTTCACT 3300 

TAAACTTTGT CACTTCTGCT GTACAGATAT CGAGAGTTTC TATGGATTCA CTTCTATTTA 3360 

TTTATTATTA TTACTGTTCT TATTGTTTTT GGATGGCTTA AGCCTGTGTA TAAAAAAGAA 3420 

~c AACTTGTGTT CAATCTGTGA AGCCTTTATC TATGGGAGAT TAAAACCAGA GAGAAAGAAG 3480 

/J ATTTATTATG AACCGCAATA TGGGAGGAAC AAAGACAACC ACTGGGATCA GCTGGTGTCA 3540 

GTCCCTACTT AGGAAATACT CAGCAACTGT TAGCTGGGAA GAATGTATTC GGCACCTTCC 3600 

CCTGAGGACC TTTCTGAGGA GTAAAAAGAC TACTGGCCTC TGTGCCATGG ATGATTCTTT 3660 

TCCCATCACC AGAAATGATA GCGTGCAGTA GAGAGCAAAG ATGGCTT 3707 



Seq ID NO: C110 DNA Sequence 

Nucleic Acid Accession #i Eos sequence 

Coding sequence: 939.-2372 

1 11 21 31 41 51 
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I 1 I I I I 

AAQACGGATT CTCAGACAAG GCTTGCAAAT GCCCCGCAGC CATCATTTAA CTGCACCCGC 60 

AGAATAGTTA CGGTTTGTCA CCCGACCCTC CCGGATCGCC TAATTTGTCC CTAGTGAGAC 120 

CCCGAGGCTC TGCCCGCGCC TGGCTTCTTC GTAGCTGGAT GCATATCGTG CTCCGGGCAG 180 

CGCGGGCGCA GGGCACGCGT TOGOGCACAC CCTAGCACAC ATGAACACGC GCAAGAGCTG 240 

AACCAAGCAC GGTTTCCATT TCAAAAAGGG AGACAGCCTC TACCGCGATT GTAGAAGAGA 300 

CTGTGGTGTG AATTAGGGAC CGGGAGGCGT OGAAOGGAGG AAOGGTTCAT CTTAGAGACT 360 

AATTTTCTGG AGTTTCTGCC CCTGCTCTGC GTCAGCCCTC ACGTCACTTC GCCAGCAGTA 420 

GCAGAGGCGG CGGCGGCGGC TCCCGGAATT GGGTTGGAGC AGGAGCCTCG CTGGCTGCTT 460 

CGCTCGOGCT CTACGCGCTC AGTCCCOGGC GGTAGCAGGA GCCTGGACCC AGGCGCCGCC S40 

GGCGGGCGTG AGGCGCCGGA GCCCGGCCTC GAGGTGCATA CCGGACCCCC ATTCGCATCT 600 

AACAAGGAAT CTGCGCCCCA GAGAGTCCCG GGAGCGCCGC CGGTCGGTGC CCGGCGCGCC 660 

GGGCCATGCA GCGACGGCCG CCGCGGAGCT CCGAGCAGCG GTAGCGCCCC CCTGTAAAGC 720 

GGTTCGCTAT GCCGGGGCCA CTGTGAACCC TGCCGCCTGC CGGAACACTC TTCGCTCCGG 780 

ACCAGCTCAG CCTCTGATAA GCTGGACTOG GCACGCCCGC AACAAGCACC GAGGAGTTAA 840 

GAGAGCCGCA AGCGCAGGGA AGGCCTCCCC GCACGGGTGG GGGAAAGCGG CCGGTGCAGC 900 

GCGGGGACAG GCACTCGGGC TGGCACTGGC TGCTAGGGAT GTCGTCCTGG ATAAGGTGGC 960 

ATGGACCCGC CATGGCGCGG CTCTGGGGCT TCTGCTGGCT GGTTGTGGGC TTCTGGAGGG 1020 

COGCTTTCGC CTGTCCCACG TCCTGCAAAT GCAGTGCCTC TCGGATCTGG TGCAGCGACC 1080 

CTTCTCCTGG CATCGTGGCA TTTCCGAGAT TGGAGCCTAA CAGTGTAGAT CCTGAGAACA 1140 

TCACCGAAAT TTTCATCGCA AACCAGAAAA GGTTAGAAAT CATCAACGAA GATGATGTTG 12 0O 

AAGCTTATGT GGGACTGAGA AATCTGACAA TTGTGGATTC TGGATTAAAA TTTGTGGCTC 1260 

ATAAAGCATT TCTGAAAAAC AGCAACCTGC AGCACATCAA TTTTACCCGA AACAAACTGA 1320 

CGAGTTTGTC TAGGAAACAT TTCCGTCACC TTGACTTGTC TGAACTGATC CTGGTGGGCA 1360 

ATCCATTTAC ATGCTCCTGT GACATTATGT GGATCAAGAC TCTCCAAGAG GCTAAATCCA 1440 

GTCCAGACAC TCAGGATTTG TACTGCCTGA ATGAAAGCAG CAAGAATATT CCCCTGGCAA 1S0O 

ACCTGCAGAT ACCCAATTGT GGTTTGCCAT CTGCAAATCT GGCCGCACCT AACCTCACTG 1560 

TGGAGGAAGG AAAGTCTATC ACATTATCCT GTAGTGTGGC AGGTGATCOG GTTCCTAATA 1620 

TGTATTGGGA TGTTGGTAAC CTGGTTTCCA AACATATGAA TGAAACAAGC CACACACAGG 1680 

GCTCCTTAAG GATAACTAAC ATTTCATCCG ATGACAGTGG GAAGCAGATC TCTTGTGTGG 1740 

CGGAAAATCT TGTAGGAGAA GATCAAGATT CTGTCAACCT CACTGTGCAT TTTGCACCAA 1800 

CTATCACATT TCTCGAATCT CCAACCTCAG ACCACCACTG GTGCATTCCA TTCACTGTGA 1860 

AAGGCAACCC CAAACCAGCG CTTCAGTGGT TCTATAACGG GGCAATATTG AATGAGTCCA 1920 

AATACATCTG TACTAAAATA CATGTTACCA ATCACACGGA GTACCAOGGC TGCCTCCAGC 1980 

TGGATAATCC CACTCACATG AACAATGGGG ACTACACTCT AATAGCCAAG AATGAGTATG 2040 

GGAAGGATGA GAAACAGATT TCTGCTCACT TCATGGGCTG GCCTGGAATT GACGATGGTG 2100 

CAAACCCAAA TTATCCTGAT GTAATTTATG AAGATTATGG AACTGCAGCG AATGACATCG 2160 

GGGACACCAC GAACAGAAGT AATGAAATCC CTTCCACAGA CGTCACTGAT AAAACCGGTC 2220 

GGGAACATCT CTCGGTCTAT GCTGTGGTGG TGATTGCGTC TGTGGTGGGA TTTTGCCTTT 2280 

TGGTAATGCT GTTTCTGCTT AAGTTGGCAA GACACTCCAA GTTTGGCATG AAAGGTTTTG 2340 

TTTTGTTTCA TAAGATCCCA CTGGATGGGT AGCTGAAATA AAGGAAAAGA CAGAGAAAGG 2400 

GGCTGTGGTG CTTGTTGGTT GATGCTGCCA TGTAAGCTGG ACTCCTGGGA CTGCTGTTGG 2460 

CTTATCCCGG GAAGTGCTGC TTATCTGGGG TTTTCTGGTA GATGTGGGCG GTGTTTGGAG 2520 

GCTGTACTAT ATGAAGCCTG CATAXACTGT GAGCTGTGAT TGGGGAACAC CAATGCAGAG 2580 

GTAACTCTCA GGCAGCTAAG CAGCACCTCA AGAAAACATG TTAAATTAAT GCTTCTCTTC 2640 

TTACAGTAGT TCAAATACAA AACTGAAATG AAATCCCATT GGATTGTACT TCTCTTCTGA 2700 

AAAGTGTGCT TTTTGACCCT ACTGGACATT TATTGACTTA ATTGCTTCTG TTTATTAAAA 2760 

TTGACCTGCA AAGTTAAAAA AAAATTAAAG TTGAGAACAG GTATAAGTGC ACACTGAATA 2820 

GTCTAATCTA CATGTAACAC ATATTTTAGT GTGATTTTCT ATACTCTAAT CAGCACTGAA 2880 

TTCAGAGGGT TTGACTTTTT CATCTATAAC ACAGTGACTA AAAGAGTTAA GGGTATATAT 2940 

ACCATCACTT TGGGACTTGG TAGTATTATT AAAAGGTTAT TTCCTTCACT GTCAATAAAA 3000 

GTCCAAATGT TTAGCTTAGG TCTGAGAGTC AAACAATGTT AAGGATTGTC TTAAAGTTCC 3060 

TTAGCCAGCA AAACAAAACA AAACAAAACA AACAAATGAA AAACGTTTAA AAAGAAGAAG 3120 

AAGAAAAAAA ACAAGAACAA GCAGCAACAG CTGTTTTGTT GGGGCTATAG ATTTAAGTTA 3180 

GGCATAGTCA ATTTCAGAAT AACTAAGAGT GGAATATATG CATATGGTGA AATTATAACC 3240 

TTGCCCTTTT TTATTTGCCC TCTGCGATCC ACCTGCTTTT TAGAAGTCTG CCGAGTGAGA 3300 

AGGCCACAGT ATCTCATGCT GTTTGCATTA CAGAACTGCA GCTTTTCTAC TCTGAAAAGG 3360 

CCTGGGAGCA GAATGGCTGG CCTGCTGTGA GCAGGAGAGG AGATTCTAAG AAGGATAGTC 3420 

CCCCCTACAA CATACTGTCA TACTGCTGGG TTTTCATGGG TAGGAAAGCT TGTC C TGACC 3480 

CCAGCAGCAA AGAGGTGGCA GGTCGCTAAT GAATATATGC TTTATAATGT CCTTCTTCAT 3540 

TGCTGAGAGG GCAGCCTTAG AGCTGTGGAT TTCTGCATCC CCCCTGAGTC TGACCCATGG 3600 

ACACCTGTTT CATTCACTTT AGCATCACAG TGACCTTTGT ATGCTCTGTT CAGTCTGTGT 3660 

CAGGCAGTAT GCTTGTCCTG AAGAGAGGTT TGGCTATCCC CACCCCACCC CACCCCACCC 3720 

TGTTCCTTTT TTATCAGGAG GACTTCAGAG CCAGGCCTGC AGCATTTTGT TTGAAAACAC 3780 

AATCAGCTCT GACAGTTAGA CATGCACACA GACGCCATAG CTGGATTGGA AACATTGATG 3840 

TTTTAAAAAT TTATTTTTTT TGGAAATAGT TGCACAAATG CTGCAATTTA GCTTTAAGGT 3900 

TCTATAGATT TTTAACTAGT CCAACACAGT CAGAAACATT GTTTTGAATC CTCTGTAAAC 3960 

CAAGGCATTA ATCTTAATAA ACCAGGATCC ATTTAGGTAC CACTTGATAT AAAAAGGATA 4020 

TCCATAATGA ATATTTTATA CTGCATCCTT TACATTAGCC ACTAAATACG TTATTGCTTG 4080 

ATGAAGACCT TTCACAGAAT CCTATGGATT GCAGCATTTC ACTTGGCTAC TTCATACCCA 4140 

TGCCTTAAAG AGGGGCAGTT TCTCAAAAGC AGAAACATGC CGCCAGTTCT CAAGTTTTCC 4200 

TCCTAACTCC ATTTGAATGT AAGGGCAGCT GGCCCCCAAT GTGGGGAGGT CCGAACATTT 4260 

TCTGAATTCC CATTTTCTTG TTCGCGGCTA AATGACAGTT TCTGTCATTA CTTAGATTCC 4320 

GATCTTTCCC AAAGGTGTTG ATTTACAAAG AGGCCAGCTA ATAGCAGAAA TCATGACCCT 4380 

GAAAGAGAGA TGAAATTCAA GCTGTGAGCC AGGGAGGAGC TCAGTATGGC AAAGGTTCTT 4440 

GAGAATCAGC CATTTGGTAC AAAAAAGATT TTTAAAGCTT TTATGTTATA CCATGGAGCC 4500 

ATAGAAAGGC TATGGATTGT TTAAGAACTA TTTTAAAGTG TTCCAGACCC AAAAAGGAAA 4560 

AATAAAAAAA AAGGAATATT TGTACCCAAC AGCTAGAAGG ATTGCAAGGT AGATTTTTGT 4620 

TTTAAAATGG AGAGAAGTGG ACAGATAAGG CCATTTAATA TATCAAAGAT CAGTTGACAT 4660 

OU CTCCTAGGGA ATGATGAAAA CAGCAGGCTA T 4711 
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ATGCCACTGA AGCATTATCT CCTTTTGCTG GTGGGCTGCC AAGCCTGGGG TGCAGGGTTG 60 

J GCCTACCATG GCTGCCCTAG CGAGTGTACC TGCTCCAGGG CCTCCCAGGT GGAGTGCACC 120 

GGGGCACGCA TTGTGGCGGT GCCCACCCCT CTGCCCTGGA ACGCCATGAG CCTGCAGATC 180 

CTCAACACGC ACATCACTGA ACTCAATGAG TCCCCGTTCC TCAATATCTC AGCCCTCATC 240 

GCCCTGAGGA TTGAGAAGAA TGAGCTGTCG CGCATCACGC CTGGGGCCTT CCGAAACCTG 300 

GGCTCGCTGC GCTATCTCAG CCTCGCCAAC AACAAGCTGC AGGTTCTGCC CATCGGCCTC 360 

IU TTCCAGGGCC TGGACAGCCT TGAGTCTCTC CTTCTGTCCA GTAACCAGCT GTTGCAGATC 420 

CAGCCGGCCC ACTTCTCCCA GTGCAGCAAC CTCAAGGAGC TGCAGTTGCA CGGCAACCAC 480 

CTGGAATACA TCCCTGACGG AGCCTTCGAC CACCTGGTAG GACTCACGAA GCTCAATCTG 540 

GGCAAGAATA GCCTCACCCA CATCTCACCC AGGGTCTTCC AGCACCTGGG CAATCTCCAG 600 

- GTCCTCCGGC TGTATGAGAA CAGGCTCACG GATATCCCCA TGGGCACTTT TGATGGGCTT 660 

1 J GTTAACCTGC AGGAACTGGC TCTACAGCAG AACCAGATTG GACTGCTCTC CCCTGGTCTC 720 

TTCCACAACA ACCACAACCT CCAGAGACTC TACCTGTCCA ACAACCACAT CTCCCAGCTG 780 

CCACCCAGCA TCTTCATGCA GCTGCCCCAG CTCAACCGTC TTACTCTCTT TGGGAATTCC 840 

CTGAAGGAGC TCTCTCTGGG GATCTTCGGG CCCATGCCCA ACCTGCGGGA GCTTTGGCTC 900 

TATGACAACC ACATCTCTTC TCTACCCGAC AATGTCTTCA GCAACCTCCG CCAGTTGCAG 960 

Z\) GTCCTGATTC TTAGCCGCAA TCAGATCAGC TTCATCTCCC CGGGTGCCTT CAACGGGCTA 1020 

ACGGAGCTTC GGGAGCTGTC CCTCCACACC AACGCACTGC AGGACCTGGA OGGGAATGTC 1080 

TTCOGCATGT TGGCCAACCT GCAGAACATC TCCCTGCAGA ACAATCGCCT CAGACAGCTC 1140 

CCAGGGAATA TCTTCGCCAA CGTCAATGGC CTCATGGCCA TCCAGCTGCA GAACAACCAG 1200 

CTGGAGAACT TGCCCCTCGG CATCTTCGAT CACCTGGGGA AACTGTGTGA GCTGOGGCTG 1260 

ZO TATGACAATC CCTGGAGGTG TGACTCAGAC ATCCTTCCGC TCCGCAACTG GCTCCTGCTC 1320 

AACCAGCCTA GGTTAGGGAC GGACACTGTA CCTGTGTGTT TCAGCCCAGC CAATGTCCGA 1380 

GGCCAGTCCC TCATTATCAT CAATGTCAAC GTTGCTGTTC CAAGCGTCCA TGTCCCTGAG 1440 

GTGCCTAGTT ACCCAGAAAC ACCATGGTAC CCAGACACAC CCAGTTACCC TGACACCACA 1500 

- TCCGTCTCTT CTACCACTGA GCTAACCAGC CCTGTGGAAG ACTACACTGA TCTGACTACC 1560 

ATTCAGGTCA CTGATGACCG CAGCGTTTGG GGCATGACCC AGGCCCAGAG CGGGCTGGCC 1620 

ATTGCCGCCA TTGTAATTGG CATTGTCGCC CTGGCCTGCT CCCTGGCTGC CTGCGTCGGC 1680 

TGTTGCTGCT GCAAGAAGAG GAGCCAAGCT GTCCTGATGC AGATGAAGGC ACCCAATGAG 1740 

TGTTAA 1746 

35 Seq ID NO: C112 DNA Sequence 

Nucleic Acid Accession #: NM_002658.1 
Coding sequence: 77.. 1372 

AC\ } 11 21 31 41 51 
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GTCCCCGCAG CGCCGTCGCG CCCTCCTGCC GCAGGCCACC GAGGCCGCCG CCGTCTAGCG 60 

CCCCGACCTC GCCACCATGA GAGCCCTGCT GGCGCGCCTG CTTCTCTGCG TCCTGGTCGT 120 

GAGOGACTCC AAAGGCAGCA ATGAACTTCA TCAAGTTCCA TCGAACTGTG ACTGTCTAAA 180 

TGGAGGAACA TGTGTGTCCA ACAAGTACTT CTCCAACATT CACTGGTGCA ACTGCCCAAA 240 

43 GAAATTCGGA GGGCAGCACT GTGAAATAGA TAAGTCAAAA ACCTGCTATG AGGGGAATGG 300 

TCACTTTTAC CGAGGAAAGG CCAGCACTGA CACCATGGGC CGGCCCTGCC TGCCCTGGAA 360 

CTCTGCCACT GTCCTTCAGC AAAOGTACCA TGCCCACAGA TCTGATGCTC TTCAGCTGGG 420 

CCTGGGGAAA CATAATTACT GCAGGAACCC AGACAACCGG AGGCGACCCT GGTGCTATGT 480 

_ n GCAGGTGGGC CTAAAGCCGC TTGTCCAAGA GTGCATGGTG CATGACTGCG CAGATGGAAA 540 

DU AAAGCCCTCC TCTCCTCCAG AAGAATTAAA ATTTCAGTGT GGCCAAAAGA CTCTGAGGCC 600 

CCGCTTTAAG ATTATTGGGG GAGAATTCAC CACCATCGAG AACCAGCCCT GGTTTGCGGC 660 

CATCTACAGG AGGCACCGGG GGGGCTCTGT CACCTACGTG TGTGGAGGCA GCCTCATCAG 720 

CCCTTGCTGG GTGATCAGCG CCACACACTG CTTCATTGAT TACCCAAAGA AGGAGGACTA 780 

CATCGTCTAC CTGGGTCGCT CAAGGCTTAA CTCCAACACG CAAGGGGAGA TGAAGTTTGA 840 

03 GGTGGAAAAC CTCATCCTAC ACAAGGACTA CAGCGCTGAC AOGCTTGCTC ACCACAACGA 900 

CATTGCCTTG CTGAAGATCC GTTCCAAGGA GGGCAGGTGT GCGCAGCCAT CCCGGACTAT 960 

ACAGACCATC TGCCTGCCCT CGATGTATAA CGATCCCCAG TTTGGCACAA GCTGTGAGAT 1020 

CACTG GCTTT GGAAAAGAGA ATTCTACCGA CTATCTCTAT CCGGAGCAGC TGAAAATGAC 1080 

TGTTGTGAAG CTGATTTCCC ACCGGGAGTG TCAGCAGCCC CACTACTACG GCTCTGAAGT 1140 

OU CACCACCAAA ATGCTATGTG CTGCTGACCC CCAATGGAAA ACAGATTCCT GCCAGGGAGA 1200 

CTCAGGGGGA CCCCTCGTCT GTTCCCTCCA AGGCCGCATG ACTTTGACTG GAATTGTGAG 1260 

CTGGGGCCGT GGATGTGCCC TGAAGGACAA GCCAGGCGTC TACACGAGAG TCTCACACTT 1320 

CTTACCCTGG ATCCGCAGTC ACACCAAGGA AGAGAATGGC CTGGCCCTCT GAGGGTCCCC 1380 

^_ AGGGAGGAAA CGGGCACCAC CCGCTTTCTT GCTGGTTGTC ATTTTTGCAG TAGAGTCATC 1440 

OD TCCATCAGCT GTAAGAAGAG ACTGGGAAGA TAGGCTCTGC ACAGATGGAT TTGCCTGTGG 1500 

CACCACCAGG GTGAACGACA ATAGCTTTAC CCTCACGGAT AGGCCTGGGT GCTGGCTGCC IS 60 

CAGACCCTCT GGCCAGGATG GAGGGGTGGT CCTGACTCAA CATGTTACTG ACCAGCAACT 1620 

TGTCTTTTTC TGGACTGAAG CCTGCAGGAG TTAAAAAGGG CAGGGCATCT CCTGTGCATG 1680 

GGCTGGAAGG GAGAGCCAGC TCCCCCGACC GGTGGGCATT TGTGAGGCCC ATGGTTGAGA 1740 

/U AATGAATAAT TTCCCAATTA GGAAGTGTAA GCAGCTGAGG TCTCTTGAGG GAGCTTAGCC 1800 

AATGTGGGAG CAGCGGTTTG GGGAGCAGAG ACACTAACGA CTTCAGGGCA GGGCTCTGAT 1860 

ATTCCATGAA TGTATCAGGA AATATATATG TGTGTGTATG TTTGCACACT TGTTGTGTGG 1920 

GCTGTGAGTG TAAGTGTGAG TAAGAGCTGG TGTCTGATTG TTAAGTCTAA ATATTTCCTT 1980 

~r AAACTGTGTG GACTGTGATG CCACACAGAG TGGTCTTTCT GGAGAGGTTA TAGGTCACTC 2040 

/> CTGGGGCCTC TTGGGTCCCC CACGTGACAG TGCCTGGGAA TGTACTTATT CTGCAGCATG 2100 

ACCTGTGACC AGCACTGTCT CAGTTTCACT TTCACATAGA TGTCCCTTTC TTGGCCAGTT 2160 

ATCCCTTCCT TTTAGCCTAG TTCATCCAAT CCTCACTGGG TGGGGTGAGG ACCACTCCTT 2220 

ACACTGAATA TTTATATTTC ACTATTTTTA TTTATATTTT TGTAATTTTA AATAAAAGTG 2280 

ATCAATAAAA TGTGATTTTT CTGA 2304 
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AGAGTACGTG TTTACAGATA AAACTGGTAC ACTGACAGAA AATGAGATGC AGTTTCGGGA 60 
ATGTTCAATT AATGGCATGA AATACCAAGA AATTAATGGT AGACTTGTAC CCGAAGGACC 120 
AACACCAGAC TCTTCAGAAG GAAACTTATC TTATCTTAGT AGTTTATCCC ATCTTAACAA 180 
CTTATCCCAT CTTACAACCA GTTCCTCTTT CAGAACCAGT CCTGAAAATG AAACTGAACT 240 
AATTAAAGAA CATGATCTCT TCTTTAAAGC AGTCAGTCTC TGTCACACTG TACAGATTAG 300 
CAATGTTCAA ACTGACTGCA CTGGTGATGG TCCCTGGCAA TCCAACCTGG CACCATCGCA 360 
GTTGGAGTAC TATGCATCTT CACCAGATGA AAAGGCTCTA GTAGAAGCTG CTGCAAGGAT 420 
TGGTATTGTG TTTATTGGCA ATTCTGAAGA AACTATGGAG GTTAAAACTC TTGGAAAACT 480 
GGAACGGTAC AAACTGCTTC ATATTCTGGA ATTTGATTCA GATCGTAGGA GAATGAGTGT 540 
AATTGTTCAG GCACCTTCAG GTGAGAAGTT ATTATTTGCT AAAGGAGCTG AGTCATCAAT 600 
TCTCCCTAAA TGTATAGGTG GAGAAATAGA AAAAACCAGA ATTCATGTAG ATGAATTTGC 660 
TTTGAAAGGG CTAAGAACTC TGTGTATAGC ATATAGAAAA TTTACATCAA AAGAGTATGA 720 
GGAAATAGAT AAACGCATAT TTGAAGCCAG GACTGCCTTG CAGCAGCGGG AAGAGAAATT 780 
GGCAGCTGTT TTCCAGTTCA TAGAGAAAGA CCTGATATTA CTTGGAGCCA CAGCAGTAGA 840 
AGACAGACTA CAAGATAAAG TTCGAGAAAC TATTGAAGCA TTGAGAATGG CTGGTATCAA 900 
AGTATGGGTA CTTACTGGGG ATAAACATGA AACAGCTGTT AGTGTGAGTT TATCATGTGG 960 

CCATTTTCAT AGAACCATGA ACATCCTTGA ACTTATAAAC CAGAAATCAG ACAGCGAGTG 1020 

TGCTGAACAA TTGAGGCAGC TTGCCAGAAG AATTACAGAG GATCATGTGA TTCAGCATGG 1080 

GCTGGTAGTG GATGGGACCA GCCTATCTCT TGCACTCAGG GAGCATGAAA AACTATTTAT 1140 

GGAAGTTTGC AGAAATTGTT CAGCTGTATT ATGCTGTCGT ATGGCTCCAC TGCAGAAAGC 1200 

AAAAGTAATA AGACTAATAA AAATATCACC TGAGAAACCT ATAACATTGG CTGTTGGTGA 1260 

TGGTGCTAAT GACGTAAGCA TGATACAAGA AGCCCATGTT GGCATAGGAA TCATGGGTAA 1320 

AGAAGGAAGA CAGGCTGCAA GAAACAGTGA CTATGCAATA GCCAGATTTA AGTTCCTCTC 1380 

CAAATTGCTT TTTGTTCATG GTCATTTTTA TTATATTAGA ATAGCTACCC TTGTACAGTA 1440 

TTTTTTTTAT AAGAATGTGT GCTTTATCAC ACCCCAGTTT TTATATCAGT TCTACTOTTT 1500 

GTTTTCTCAG CAAACATTGT ATGACAGCGT GTACCTGACT TTATACAATA TTTGTTTTAC 1560 

TTCCCTACCT ATTCTGATAT ATAGTCTTTT GGAACAGCAT GTAGACCCTC ATGTGTTACA 1620 

AAATAAGCCC ACCCTTTATC GAGACATTAG TAAAAACCGC CTCTTAAGTA TTAAAACATT 1680 

TCTTTATTGG ACCATCCTGG GCTTCAGTCA TGCCTTTATT TTCTTTTTTG GATCCTATTT 1740 

ACTAATAGGG AAAGATACAT CTCTGCTTGG AAATGGCCAG ATGTTTGGAA ACTGGACATT 1800 

TGGCACTTTG GTCTTCACAG TCATGGTTAT TACAGTCACA GTAAAGATGG CTCTGGAAAC 1860 

TCATTTTTGG ACTTGGATCA ACCATCTCGT TACCTGGGGA TCTATTATAT TTTATTTTGT 1920 

ATTTTCCTTG TTTTATGGAG GGATTCTCTG GCCATTTTTG GGCTCCCAGA ATATGTATTT 1980 

TGTGTTTATT CAGCTCCTGT CAAGTGGTTC TGCTTGGTTT GCCATAATCC TCATGGTTGT 2040 

TACATGTCTA TTTCTTGATA TCATAAAGAA GGTCTTTGAC CGACACCTCC ACCCTACAAG 2100 

TACTGAAAAG GCACAGCTTA CTGAAACAAA TGCAGGTATC AAGTGCTTGG ACTCCATGTG 2160 

CTGTTTCCCG GAAGGAGAAG CAGCGTGTGC ATCTGTTGGA AGAATGCTGG AACGAGTTAT 2220 

AGGAAGATGT AGTCCAACCC ACATCAGCAG ATCATGGAGT GCATCGGATC CTTTCTATAC 2280 

CAACGACAGG AGCATCTTGA CTCTCTCCAC AATGGACTCA TCTACTTGTT AAAGGGGCAG 2340 

TAGTACTTTG TGGGAGCCAG TTCACCTCCT TTCCTAAAAT TCAGTGTGAT CACCCTGTTA 2400 

ATGGCCACAC TAGCTCTGAA ATTAATTTCC AAAATCTTTG TAGTAGTTCA TACCCACTCA 2460 

GAGTTATAAT GGCAAACAAA CAGAAAGCAT TAGTACAAGC CCCTCCCAAC ACCCTTAATT 2520 

TGAATCTGAA CATGTTAAAA TTTGAGAATA AAGAGACATT TTTCATCTCT TTGTCTGGTT 2580 

TGTCCCTTGT GCTTATGGGA CTCCTAATGG CATTTCAGTC TGTTGCTGAG GCCATTATAT 2640 

TTTAATATAA ATGTAGAAAA AAGAGAGAAA TCTTAGTAAA GAGTATTTTT TAGTATTAGC 2700 

TTGATTATTG ACTCTTCTAT TTAAATCTGC TTCTGTAAAT TATGCTGAAA GTTTGCCTTG 2760 

AGAACTCTAT TTTTTTATTA GAGTTATATT TAAAGCTTTT CATGGGAAAA GTTAATGTGA 2820 

ATACTGAGGA ATTTTGGTCC CTCAGTGACC TGTGTTGTTA ATTCATTAAT GCATTCTGAG 2880 

TTCACAGAGC AAATTAGGAG AATCATTTCC AACCATTATT TACTGCAGTA TGGGGAGTAA 2940 

^™ CCA ArrCCTCTAA CTGTACTGTA ACACAGCCTG TAAAGTTAGC CATATAAATG 3000 

CAAGGGTATA TCATATATAC AAATCAGGAA TCAGGTCCGT TCACCGAACT TCAAATTGAT 3060 

GTTTACTAAT ATTTTTGTGA CAGAGTATAA AGACCCTATA GTGGGTAAAT TAGATACTAT 3120 

TAGCATATTA TTAATTTAAT GTCTTTATCA TTGGATCTTT TGCATGCTTT AATCTGGTTA 3180 

ACATATTTAA ATTTGCTTTT TTTCTCTTTA CCTGAAGGCT CTGTGTATAG TATTTCATGA 3240 

CATCGTTGTA CAGTTTAACT ATATCAATAA AAAGTTTGGA CAGTATTTAA ATATTGCAAA 3300 

™GTTTAAT TATACAAATC AGAATAGTAT GGGTAATTAA ATGAATACAA AAAGAAGAGC 3360 

CTCTTTCTGC AGCCGACTTA GACATGCTCT TCCCTTTCTA TAAGCTAGAT TTTAGAATAA 3420 

AGGGTTTCAG TTAATAATCT TATTTTCAGG TTATGTCATC TAACTTATAG CAAACTACCA 3480 

CAATACAGTG AGTTCTGCCA GTGTCCCAGT ACAAGGCATA TTTCAGGTGT GG CTGTGGAA 3540 

TGTAAAAATG CTCAACTTGT ATCAGGTAAT GTTAGCAATA AATTAAATGC TAAGAATGAT 3600 

TAATCGGGTA CATGTTACTG TAATTAACTC ATTGCACTTC AAAACCTAAC TTCCATCCTG 3660 

AATTTATCAA GTAGTTCAGT ATTGTCATTT GTTTTTGTTT TATTGAAAAG TAATGTTGTC 3720 

TTAAGATTTA GAAGTGATTA TTAGCTTGAG AACTATTACC CAGCTCTAAG CAAATAATGA 3780 

TTGTATACAT ATTAAGATAA TGGTTAAATG CGGTTTTACC AAGTTTTCCC TTGAAAATGT 3840 

AATTCCTTTA TGGAGATTTA TTGTGCAGCC CTAAGCTTCC TTCCCATTTC ATGAATATAA 3900 

GGCTTCTAGA ATTGGACTGG CAGGGGAAAG AATGGTAGAG ACAGAAATTA AGACTTTATC 3960 

CTTGTTTGCT TGTAAACTAT TATTTTCTTG CTAATGTAAC ATTTGTCTGT TCCAGTGATG 4020 

TAAGGATATT AAGTTATTAA GCTAAATATT AATTTTCAAA AATAGTCCTT CTTTAACTTA 4080 

GATATTTCAT AGCTGGATTT AGGAAGATCT GTTATTCTGG AAGTACTAAA AAGAATAATA 4140 

CAACGTACAA TGTCTGCATT CACTAATTCA TGTTCCAGAA GAGGAAATAA TGAAGATATA 4200 

CTCAGTAGAG TACTAGGTGG GAGGATATGG AAATTTGCTC ATAAAATCTC TTATAAAACG 4260 

TGCATATAAC AAAATGACAC CCAGTAGGCC TGCATTACAT TTACATGACC GTGTTTATTT 4320 

GCCATCAAAT AAACTGAGTA CTGACACCAG ACAAAGACTC CAAAGTCATA AAATAGCCTA 4380 

TGACCAACTG CAGCAAGACA GGAGGTCAGC TCGCCTATAA TGGTGCTTAA AGTGTGATTG 4440 

ATGTAATTTT CTGTACTCAC CATTTGAAGT TAGTTAAGGA GAACTTTATT TTTTTAAAAA 4500 

AAGTAAATGG CAACCACTAG TGTGCTCATC CTGAACTGTT ACTCCAAATC CACTCCGTTT 4S60 

TTAAAGCAAA ATTATC TTGT GATTTTAAGA AAAGAGTTTT CTATTTATTT AAGAAAGTAA 4620 

CAATG CAGTC TGCAAGCTTT CAGTAGTTTT CTAGTGCTAT ATTCATCCTG TAAAACTCTT 4680 

ACTACGTAAC CAGTAATCAC AAGGAAAGTG TCCCCTTTGC ATATTTCTTT AAAATTCTTT 4740 

CTTTGGAAAG TATGATGTTG ATAATTAACT TACCCTTATC TGCCAAAACC AGAGCAAAAT 4800 

GCTAAATACG TTATTGCTAA TCAGTGGTCT CAAATCGATT TGCCTCCCTT TGCCTCGTCT 4860 

GAGGGCTGTA AGCCTGAAGA TAGTGGCAAG CACCAAGTCA GTTTCCAAAA TTGCCCCTCA 4920 
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GCTGCTTTAA GTGACTCAGC ACCCTGCCTC AGCTTCAGCA GGCGTAGGCT CACCCTGGGC 4 980 

GGAGCAAAGT ATGGGCCAGG GAGAACTACA GCTACGAAGA CCTGCTGTCG AGTTGAGAAA SO 40 

AGGGGAGAAT TTATGGTCTG AATTTTCTAA CTGTCCTCTT TCTTGGGTCT AAAGCTCATA 5100 

ATACACAAAG GCTTCCAGAC CTGAGCCACA CCCAGGCCCT ATCCTGAACA GGAGACTAAA 5160 

CAGAGGCAAA TCAACCCTAG GAAATACTTG CATTCTGCCC TACGGTTAGT ACCAGGACTG 5220 

AGGTCATTTC TACTGGAAAA GATTGTGAGA TTGAACTTAT CTGATCGCTT GAGACTCCTA 5280 

ATAGGCAGGA GTCAAGGCCA CTAGAAAATT GACAGTTAAG AGCCAAAAGT TTTTAAAATA 5340 

TGCTACTCTG AAAAATCTOG TGAAGGCTGT AGGAAAAGGG AGAATCTTCC ATGTTGGTGT 5400 

TTTTCCTGTA AAGATCAGTT TGGGGTATGA TATAAGCAGG TATTAATAAA AATAACACAC 5460 

CAAAGAGTTA CGTAAAACAT GTTTTATTAA TTTTGGTCCC CACGTACAGA CATTTTATTT 5520 

CTATTTTGAA ATGAGTTATC TATTTTCATA AAAGTAAAAC ACTATTAAAG TGCTGTTTTA 5580 

TGTGAAATAA CTTGAATGTT GTTCCTATAA AAAATAGATC ATAACTCATG ATATGTTTGT 5640 

AATCATGGTA ATTTAGATTT TTATGAGGAA TGAGTATCTG GAAATATTGT AGCAATACTT 5700 

GGTTTAAAAT TTTGGACCTG AGACACTGTG GCTGTCTAAT GTAATCCTTT AAAAATTCTC 5760 

TGCATTGTCA GTAAATGTAG TATATTATTG TACAGCTACT CATAATTTTT TAAAGTTTAT 5820 

GAAGTTATAT TTATCAAATA AAAACTTTCC TATAT 5855 



Seq ID NO: C114 DNA Sequence 
Nucleic Acid Accession ft: XM 087461.1 
ZO Coding sequence: 236.. 1138 
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CCGGCGCCGG GGGCGGOGGG GAACCCCAAA CGCAACCGGG TCTGGAGGGA TCCCCGCGCC 60 

ZD GAGCCAGCCG CCGTCACCGC CTCCGCGCCG CCCCTGCGGG CTTGGCAGGC GCCCGGCGCG 120 

CCCGCACTGC GCCCGGCCGC CGGCTCCCGC GGTCCCACCG TGAGCTCGCC GGCCCGTCGC 180 

CCGCTCGCCA TGCAACCGCC GCCGGCCTCG CGCGCGTAGG CGCCCGCCGC AGGCCATGCT 240 

GCCCCTGCTC GCGGCGCTCC TGGCCGCCGC CTGCCCGCTG CCGCCCGTCC GCGGCGGGGC 300 

CGCGGACGCG CCCGGCCTCC TCGGGGTGCC CTCCAATGCT TCAGTCAACG CGTCCTCCGC 360 

GGCGAGCCCA TCGCCCCGCG GCTGCTGGCC TCGGCGGCCC CCGGGCCCCC CGAGCGCCCG 420 

GGCCCGGAGG AGGCGGCGGC GGCGGCGGCG CCTGTGCAAC ATCAGCGTGC AGCGGCAGAT 480 

GCTGAGCTCG CTGCTCGTGC GCTGGGGCCG CCCGCGGGGC TTCCAGTGCG ACCTACTGCT 540 

CTTCTCCACC AACGCGCACG GCCGCGCTTT CTTCGCCGCC GCCTTCCACC GCGTCGGGCC 600 

GCCGCTGCTC ATCGAGCACC TGGGGCTGGC GGCGGGCGGC GCGCAGCAGG ACCTGCGCCT 660 

CTGCGTGGGC TGCGGCTGGG TGCGCGGTCG CCGCACCGGC CGCCTCCGGC CCGCCGCCGC 720 

CCCCAGCGCC GCCGCCGCCA CCGCCGGGGC GCCCACCGCG CTGCCAGCCT ACCCCGCGGC 780 

CGAGCCGCCC GGGCCGCTGT GGCTGCAGGG CGAGCCGCTG CATTTCTGCT GCCTAGACTT 840 

CAGCCTGGAG GAGCTGCAGG GCGAGCCGGG CTGGCGGCTG AACCGTAAGC CCATTGAGTC 900 

CACGCTGGTG GCCTGCTTCA TGACCCTGGT CATCGTGGTG TGGAGCGTGG CCGCCCTCAT 960 

CTGGCCGGTG CCCATCATCG CCGGCTTCCT GCCCAACGGC ATGGAACAGC GCCGGACCAC 1020 

CGCCAGCACC ACCGCAGCCA CCCCCGCCGC AGTGCCCGCA GGGACCACGG CAGCCGCCGC 1080 

CGCCGCCGCC GCTGCCGCCG CCGCCGCGGC CGTCACTTCG GGGGTGGCGA CCAAGTGACC 1140 

CGCTCCGCTC CTCCCTGTGT CCGTCCTGTG TCCGCGCGCG CGGGTGCCTT TCCCGCCGGA 1200 

GACTCGGCCG GTGTGCTTCG TGCTGTAGTT ATCGTTAGTT CCTCTTCCCG AGATGGGGCC 1260 

GCCGAGAGAC CCCAGCGCCT TTGAAAAGCA AGGTTTGTGC TGCGCTTCCA GTTCCGAAAA 1320 

GCAGATGTTT AAGCCCTTGG ACTGAGGGTG GGATCGCAGC TCCGAAGACG GAGAGGAGGG 1380 

AAATGGGGCC CTTTCCCCTC TATTGCATCC CCCTGCCCGA CTCCTTCCCC GCACCCACGT 1440 

GCC CTAGA TT CATGGCAGAA AATGACCAAA TCCTGTGTAT TTGTTTTATA TATTTAATAA 1500 

CTGTTTTAAA TGAAAGTTTT AGTAAAAAAA ATACAAAACA AAAAGATTAA ATTGCTATTG 1560 

CTGTAGTAAG AGAAGCTCTT TGTATCTGAA CATAGTTGTA TTTGAAATTT GTGGTTTTTT 1620 

AATTTATTTA AAATTGGGGG GAGGGCATGG GAAGGATTTA ACACCGATAT ATTGTTACCG 1680 

CTGAAAATGA ACTTTATGAA CCTTTTCCAA GTTGATCTAT CCAGTGACGT GGCCTGGTGG 1740 

GCGTTTCTTC TTGTACTTAT GTGGTTTTTT GGCTTTTAAT ACAGACATTT TCCTCC 1796 

55 Seq ID NO: C115 DNA Sequence 

Nucleic Acid Accession ft: XMJ)51S22.4 
Coding sequencei 127.. 1215 

sc. 1 11 21 31 41 51 

60 | || | | | 

ACAGTTGTTG CAAAGTGCTC AGCACTAAGG GAGCCAGCGC ACAGCACAGC CAGGAAGGOG 60 

AGCGAGCCCA GCCAGCCCAG CCAGCCCAGC CAGCCCGGAG GTCATTTGAT TGCCCGCCTC 120 

AGAACGATGG ATCTGCATCT CTTCGACTAC TCAGAGCCAG GGAACTTCTC GGACATCAGC 180 

TGGCCATGCA ACAGCAGCGA CTGCATCGTG GTGGACACGG TGATGTGTCC CAACATGCCC 240 

AACAAAAGGG TCCTGCTCTA CACGCTCTCC TTCATTTACA TTTTCATCTT CGTCATCGGC 300 

ATGATTGCCA ACTCCGTGGT GGTCTGGGTG AATATCCAGG CCAAGACCAC AGGCTATGAC 360 

ACGCACTGCT ACATCTTGAA CCTGGCCATT GCCGACCTGT GGGTTGTCCT CACCATCCCA 420 

GTCTGGGTGG TCAGTCTCGT GCAGCACAAC CAGTGGCCCA TGGGCGAGCT CACGTGCAAA 480 

GTCACACACC TCATCTTCTC CATCAACCTC TTCGGCAGCA TTTTCTTCCT CACGTGCATG 540 

AGCGTGGACC GCTACCTCTC CATCACCTAC TTCACCAACA CCCCCAGCAG CAGGAAGAAG 600 

ATGGTACGCC GTGTCGTCTG CATCCTGGTG TGGCTGCTGG CCTTCTGCGT GTCTCTGCCT 660 

GACACCTACT ACCTGAAGAC CGTCACGTCT GCGTCCAACA ATGAGACCTA CTGCCGGTCC 720 

TTCTA CCCCG AGCACAGCAT CAAGGAGTGG CTGATCGGCA TGGAGCTGGT CTCCGTTGTC 780 

TTGGGCTTTG CCGTTCCCTT CTCCATTATC GCTGTCTTCT ACTTCCTGCT GGCCAGAGCC 840 

ATCTCGGOGT CCAGTGACCA GGAGAAGCAC AGCAGCCGGA AGATCATCTT CTCCTACGTG 900 

GTGGTCTTCC TTGTCTGCTG GCTGCCCTAC CACGTGGCGG TGCTGCTGGA CATCTTCTCC 960 

ATCCTGCACT ACATCCCTTT CACCTGCCGG CTGGAGCACG CCCTCTTCAC GGCCCTGCAT 1020 

GTCACACAGT GCCTGTCGCT GGTGCACTGC TGCGTCAACC CTGTCCTCTA CAGCTTCATC 1080 

AATCGCAACT ACAGGTACGA GCTGATGAAG GCCTTCATCT TCAAGTACTC GGCCAAAACA 1140 

GGGCTCACCA AGCTCATCGA TGCCTCCAGA GTCTCAGAGA CGGAGTACTC TGCCTTGGAG 1200 

CAGAGCACCA AATGATCTGC CCTGGAGAGG CTCTGGGACG GGTTTACTTG TTTTTGAACA 1260 

GGGTGATGGG CCCTATGGTT TTCTAGAGCA AAGCAAAGTA GCTTCGGGTC TTGATGCTTG 1320 

AGTAGAGTGA AGAGGGGAGC ACGTGCCCCC TGCATCCATT CTCTCTTTCT CTTGATGACG 1380 

CAGCTGTCAT TTGGCTGTGC GTGCTGACAG TTTTGCAACA GGCAGAGCTG TGTCGCACAG 1440 
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CAGTGCTGTG 
TTTCTGTGTT 
TCTCACTATT 
TGGGAGGCAT 
AGTTTTGACT 
ATATAAATAT 
ATCTGTGTGG 
TTGTGACATT 
TGCAAATCTG 
ATTTTTTTTT 



CGTCAGAGCC 
TCCTGAATTT 
GGTGTACCTT 
AGTGCTGACA 
AAGGATGACA 
ATAAATATAT 
TGTTTTGTAC 
AATAGTATTG 
CACACACAAC 
AATAAAGATT 



AGCTGAGGAC 
TTTATATGGT 
ATAAATGTAT 
TATATTCAGA 
CTAATTGTTA 
GCCAGTCTTG 
CGGCACGGGA 
TAAAGTTACA 
GAACAGTTGC 
TTTGTTTCCT 



AGGCTTGCCT 
GATTTGTATT 
TTGAAAGTTA 
GTGTTGTAGT 
GCTGTTTTGA 
GCTGAAATGT 
TATGGAACGA 
TTTTAAAATA 
ATTTCAGAGA 



GGACTTCTGT 
TAAATTTTAA 
AATATATTTT 
TTTAAGGTTA 
AATTATATAT 
TTTATTTACC 
AAACTGCTTT 
AACAAAAAAC 
GTTCTCTCAA 



AAGATAGGAT 
GACTTTATTT 
AAATATTGTT 
GCGTGACTTC 
ATATAAATAT 
ATAGTTTTAT 
GTAATGCAGT 
TGTTCTGGAC 
TTTGTAAGTT 



Seg ID NO: CI 16 DNA Sequence 

Nucleic Acid Accession #t NMJ300350.1 

Coding sequence : 82 . .6903 



1500 
1560 
1620 
16B0 
1740 
1800 
1860 
1920 
1980 
2010 



1 11 21 31 41 51 

I I I I I I 

CTGGCTCTTA ACGGCGTTTA TGTCCTTTGC TGTCTGAGGG GCCTCAGCTC TGACCAATCT 60 

GGTCTTCGTG TGGTCATTAG CATGGGCTTC GTGAGACAGA TACAGCTTTT GCTCTGGAAG 120 

AACTGGACCC TGCGGAAAAG GCAAAAGATT CGCTTTGTGG TGGAACTCGT GTGGCCTTTA 180 

TCTTTATTTC TGGTCTTGAT CTGGTTAAGG AATGCCAACC CGCTCTACAG CCATCATGAA 240 

TGCCATTTCC CCAACAAGGC GATGCCCTCA GCAGGAATGC TGCCGTGGCT CCAGGGGATC 300 

TTCTGCAATG TGAACAATCC CTGTTTTCAA AGCCCCACCC CAGGAGAATC TCCTGGAATT 360 

GTGTCAAACT ATAACAACTC CATCTTGGCA AGGGTATATC GAGATTTTCA AGAACTCCTC 420 

ATGAATGCAC CAGAGAGCCA GCACCTTGGC CGTATTTGGA CAGAGCTACA CATCTTGTCC 480 

CAATTCATGG ACACCCTCOG GACTCACCCG GAGAGAATTG CAGGAAGAGG AATACGAATA 540 

AGGGATATCT TGAAAGATGA AGAAACACTG ACACTATTTC TCATTAAAAA CATCGGCCTG 600 

TCTGACTCAG TGGTCTACCT TCTGATCAAC TCTCAAGTCC GTCCAGAGCA GTTCGCTCAT 660 

GGAGTCCCGG ACCTGGCGCT GAAGGACATC GCCTGCAGCG AGGCCCTCCT GGAGCGCTTC 720 

ATCATCTTCA GCCAGAGACG CGGGGCAAAG ACGGTGCGCT ATGCCCTGTG CTCCCTCTCC 780 

CAGGGCACCC TACAGTGGAT AGAAGACACT CTGTATGCCA ACGTGGACTT CTTCAAGCTC 840 

TTCCGTGTGC TTCCCACACT CCTAGACAGC CGTTCTCAAG GTATCAATCT GAGATCTTGG 900 

GGAGGAATAT TATCTGATAT GTCACCAAGA ATTCAAGAGT TTATCCATCG GCCGAGTATG 960 

CAGGACTTGC TGTGGGTGAC CAGGCCCCTC ATGCAGAATG GTGGTCCAGA GACCTTTACA 1020 

AAGCTGATGG GCATCCTGTC TGACCTCCTG TGTGGCTACC CCGAGGGAGG TGGCTCTCGG 1080 

GTGCTCTCCT TCAACTGGTA TGAAGACAAT AACTATAAGG CCTTTCTGGG GATTGACTCC 1140 

ACAAGGAAGG ATCCTATCTA TTCTTATGAC AGAAGAACAA CATCCTTTTG TAATGCATTG 1200 

ATCCAGAGCC TGGAGTCAAA TCCTTTAACC AAAATCGCTT GGAGGGCGGC AAAGCCTTTG 1260 

CTGATGGGAA AAATCCTGTA CACTCCTGAT TCACCTGCAG CACGAAGGAT ACTGAAGAAT 1320 

GCCAACTCAA CTTTTGAAGA ACTGGAACAC GTTAGGAAGT TGGTCAAAGC CTGGGAAGAA 1380 

GTAGGGCCCC AGATCTGGTA CTTCTTTGAC AACAGCACAC AGATGAACAT GATCAGAGAT 1440 

ACCCTGGGGA ACCCAACAGT AAAAGACTTT TTGAATAGGC AGCTTGGTGA AGAAGGTATT 1500 

ACTGCTGAAG CCATCCTAAA CTTCCTCTAC AAGGGCCCTC GGGAAAGCCA GGCTGACGAC 1560 

ATGGCCAACT TCGACTGGAG GGACATATTT AACATCACTG ATCGCACCCT CCGCCTGGTC 1620 

AATCAATACC TGGAGTGCTT GGTCCTGGAT AAGTTTGAAA GCTACAATGA TGAAACTCAG 1680 

CTCACCCAAC GTGCCCTCTC TCTACTGGAG GAAAACATGT TCTGGGCCGG AGTGGTATTC 1740 

CCTGACATGT ATCCCTGOAC CAGCTCTCTA CCACCCCACG TGAAGTATAA GATCCGAATG 1800 

GACATAGACG TGGTGGAGAA AACCAATAAG ATTAAAGACA GGTATTGGGA TTCTGGTCCC 1860 

AGAGCTGATC CCGTGGAAGA TTTCCGGTAC ATCTGGGGCG GGTTTGCCTA TCTGCAGGAC 1920 

ATGGTTGAAC AGGGGATCAC AAGGAGCCAG GTGCAGGCGG AGGCTCCAGT TGGAATCTAC 1980 

CTCCAGCAGA TGCCCTACCC CTGCTTCGTG GACGATTCTT TCATGATCAT CCTGAACCGC 2040 

TGTTTCCCTA TCTTCATGGT GCTGGCATGG ATCTACTCTG TCTCCATGAC TGTGAAGAGC 2100 

ATCGTCTTGG AGAAGGAGTT GCGACTGAAG GAGACCTTGA AAAATCAGGG TGTCTCCAAT 2160 

GCAGTGATTT GGTGTACCTG GTTCCTGGAC AGCTTCTCCA TCATGTCGAT GAGCATCTTC 2220 

CTCCTGACGA TATTCATCAT GCATGGAAGA ATCCTACATT ACAGCGACCC ATTCATCCTC 2280 

TTCCTGTTCT TGTTGGCTTT CTCCACTGCC ACCATCATGC TGTGCTTTCT GCTCAGCACC 2340 

TTCTTCTCCA AGGCCAGTCT GGCAGCAGCC TGTAGTGGTG TCATCTATTT CACCCTCTAC 2400 

CTGCCACACA TCCTGTGCTT CGCCTGGCAG GACCGCATGA CCGCTGAGCT GAAGAAGGCT 2460 

GTGAGCTTAC TGTCTCCGGT GGCATTTGGA TTTGGCACTG AGTACCTGGT TOSCTTTGAA 2520 

GAGCAAGGCC TGGGGCTGCA GTGGAGCAAC ATCGGGAACA GTCCCACGGA AGGGGACGAA 2580 

TTCAGCTTCC TGCTGTCCAT GCAGATGATG CTCCTTGATG CTGCGTGCTA TGGCTTACTC 2640 

GCTTGGTACC TTGATCAGGT GTTTCCAGGA GACTATGGAA CCCCACTTCC TTGGTACTTT 2700 

CTTCTACAAG AGTCGTATTG GCTTAGCGGT GAAGGGTGTT CAACCAGAGA AGAAAGAGCC 2760 

CTGGAAAAGA CCGAGCCCCT AACAGAGGAA ACGGAGGATC CAGAGCACCC AGAAGGAATA 2820 

CACGACTCCT TCTTTGAACG TGAGCATCCA GGGTGGGTTC CTGGGGTATG CGTGAAGAAT 2880 

CTGGTAAAGA TTTTTGAGCC CTGTGGCCGG CCAGCTGTGG ACCGTCTGAA CATCACCTTC 2940 

TACGAGAACC AGATCACCGC ATTCCTGGGC CACAATGGAG CTGGGAAAAC CACCACCTTG 3000 

TCCATCCTGA CGGGTCTGTT GCCACCAACC TCTGGGACTG TGCTCGTTGG GGGAAGGGAC 3060 

ATTGAAACCA GCCTGGATGC AGTCCGGCAG AGCCTTGGCA TGTGTCCACA GCACAACATC 3120 

CTGTTCCACC ACCTCACGGT GGCTGAGCAC ATGCTGTTCT ATGCCCAGCT GAAAGGAAAG 3180 

TCCCAGGAGG AGGCCCAGCT GGAGATGGAA GCCATGTTGG AGGACACAGG CCTCCACCAC 3240 

AAGCGGAATG AAGAGGCTCA GGACCTATCA GGTGGCATGC AGAGAAAGCT GTCGGTTGCC 3300 

ATTGCCTTTG TGGGAGATGC CAAGGTGGTG ATTCTGGACG AACCCACCTC TGGGGTGGAC 3360 

CCTTACTCGA GACGCTCAAT CTGGGATCTG CTCCTGAAGT ATCGCTCAGG CAGAACCATC 3420 

ATCATGCCCA CTCACCACAT GGACGAGGCC GACCACCAAG GGGACCGCAT TGCCATCATT 3480 

GCCCAGGGAA GGCTCTACTG CTCAGGCACC CCACTCTTCC TGAAGAACTG CTTTGGCACA 3540 

GGCTTGTACT TAACCTTGGT GCGCAAGATG AAAAACATCC AGAGCCAAAG GAAAGGCAGT 3600 

GAGGGGACCT GCAGCTGCTC GTCTAAGGGT TTCTCCACCA CGTGTCCAGC CCACGTCGAT 3660 

GACCTAACTC CAGAACAAGT CCTGGATGGG GATGTAAATG AGCTGATGGA TGTAGTTCTC 3720 

CACCATGTTC CAGAGGCAAA GCTGGTGGAG TGCATTGGTC AAGAACTTAT CTTCCTTCTT 3780 

CCAAATAAGA ACTTCAAGCA CAGAGCATAT GCCAGCCTTT TCAGAGAGCT GGAGGAGACG 3840 

CTGGCTGACC TTGGTCTCAG CAGTTTTGGA ATTTCTGACA CTCCCCTGGA AGAGATTTTT 3900 

CTGAAGGTCA CGGAGGATTC TGATTCAGGA CCTCTGTTTG CGGGTGGCGC TCAGCAGAAA 3960 

AGAGAAAACG TCAACCCCCG ACACCCCTGC TTGGGTCCCA GAGAGAAGGC TGGACAGACA 4020 

1274 



WO 03/042661 



CCAATGTCTG CTCCCCAGGQ GCGCCGGCTG CTCACCCAGA GGGCCAGCCT 4080 

CCCCCAGAGC CAGAGTGCCC AGGCCCGCAG CTCAACACGG GGACACAGCT GGTCCTCCAG 414D 

CATGTGCAGG CGCTGCTGGT CAAGAGATTC CAACACACCA TCCGCAGCCA CaIgSotc 4200 

^SS^ ^IS?' CCC GGCTACCTTT GTGTTTTTGG CTCTGATGCT TTCTATTGTT 4260 

tI^SSZ J* 3003 *™ CCCCX3CTTTG ACCCTTCACC CCTGGATATA TGGGCAGCAG 4320 

TACACCTTCT TCAGCATGGA TGAACCAGGC AGTGAGCAGT TCACGGTACT TGCAGACGTC 4380 

™ C r^ Th AGCCAGGCTT TGGCAACCGC TGCCTGAAGG AA(SS 4440 

CCCTGTGGCA ACTCAACACC CTGGAAGACT CCTTCTGTGT CCCCAAACAT CACCCAGCTG 4500 

JSES?* ACAGGTCAAC ccrrcACCAT cctgcagg^g S o 

SS^SS CCATGCTGCC AGAGTGCCCC GAGGGTGCOQ GGGGCCTCCC GCCCCCCCAG 4620 

AGAACACAGC GCAGCACGGA AATTCTACAA GACCTGACGG ACAGGAACAT CTCCGACTTC 4680 

^ A T CCTCC TCTTATAAGA AGCAGCTTAA AGAGCAAATT SS A ill 

GAACAGAGGT ATGGAGGAAT TTCCATTGGA GGAAAGCTCC CAGTCGTCCC CATCACGGGG 4800 

GAAGCACTTG TTGGGTTTTT AAGCGACCTT GGCCGGATCA TGAATGTGAG OGG^S 4860 

al^™^ AGGCCTCTAA AGAAATACCT GATTTCCTTA AACATCTAGA AACTGAAGAC 4920 

AACATTAAGG TGTGGTTTAA TAACAAAGGC TGGCATGCCC TGGTCAGCTT TCTCAATGTG 4980 

SEESt^ CCATCTTACG GGCCAGCCTG CCTAAGGACA GGAGCCCCgI S5S££E 5040 

^^S* ™ GCCAACC CCTGAACCTG ACCAAGGAGC AGCTCTCAGA GATTACAGTG "oO 

CTGACCACTT CAGTGGATGC TGTGGTTGCC ATCTGCGTGA TTTTCTCCAT GTCCTTCGTC 5160 

CCAGCCAGCT TTGTCCTTTA TTTGATCCAG GAGCGGGTGA ACAAATCCAA GCACCTCCAG 5220 

TTTATCAGTG GAGTGAGCCC CACCACCTAC TGGGTGACCA ACTTCCTCTG GGACATCATG 5280 

AATTATTCCG TGAGTGCTGG GCTGGTGGTG GGCATCTTCA TCGGGTTTCA GAAGAAAGCC 5340 

TACACTTCTC CAGAAAACCT TCCTGCCCTT GTGGCACTGC TCCTGCTGTA TGGATGGGOG lltl 

™™ ACCC AGCATCCTTC CTGTTTGATO TCCCCAGCAC A^C^S 5460 

rS^™ GTTCATCGGC ATCAACAGCA GTGCTATTAC CTTCATCTTG 5520 

GAATTATTTG ATAATAACCG GACGCTGCTC AGGTTCAACG CCGTGCTGAG GAAGCTGCTC 5580 

ATTGTCTTCC CCCACTTCTG CCTGGGCGGG GGCCTCATTG ACCTTGCACT GAGCCAGGCT 56^0 

CTGACAGATC TCTATGCCCG GTTTGGTGAG GAGCACTCTG CAAATCCGTT CCACTGGGAC 5700 

CTGATTGGGA AGAACCTGTT TGCCATGGTG GTGGAAGGGG TGGTGTACTT CCTCCTGACC 5760 

CTGCTGGTCC AGCGCCACTT CTTCCTCTCC CAATGGATTG CCGAGCCCAC TAAGGAGCCC 5820 

ATTGTTGATG AAGATGATGA TGTGGCTGAA GAAAGACAAA GAATTATTAC TGGTGGAAAT 5880 

AAAACTGACA TCTTAAGGCT ACATGAACTA ACCAAGATTT ATCTGGGCAC CTCCAGCCCA 5940 

^^P* OGGAGTTCGC CCTGGAGAGT GCTTTGGCCT CCTGGGAGTG 6000 

£j^nS CACATTCAAG ATGCTCACTC GGGACACCAC AGTGACCTCA 6060 

GGGGATGCCA CCGTAGCAGG CAAGAGTATT TTAACCAATA TTTCTGAAGT CCATCAAAAT 6120 

ATGGGCTACT GTCCTCAGTT TGATGCAATC GATGAGCTGC TCACAGGACG AGAACATCTT 6180 

It^Sl™ CCOGGCTTCG AGGTGTACCA GCAGAAGAAA TCGAAAAGGT 6240 

^^2* GCCTGGGCCT GACTCTCTAC GCCGACTGCC TGGCTGGCAC GTACAGTGGG 6300 

GGCAACAAGC GGAAACTCTC CACAGCCATC GCACTCATTG GCTGCCCACC GCTGGTCCTG 6360 

S^^^ CGACCACAGG GATGGACCCC CAGGCACGCC GCATGCTGTG GAACGTCATC 6420 

GAGGGCTCTG GTCCTCACAT CCCACAGCAT GGAAGAATGT 6480 

Gj^CACTGT GTACCCGGCT GGCCATCATG GTAAAGGGCG CCTTTCGATG TATGGGCACC 6540 

rSS^ T^EE^ ATTTGGAGAT GGCTATATCG TCACAATGAA GATCAAATGC SSOO 

CCGAAGGACG ACCTGCTTCC TGACCTGAAC CCTGTGGAGC AGTTCTTCCA GGGGAACTTC 6660 

CCAGGGAGTG TGCAGAGGGA GAGGCACTAC AACATGCTCC AGTTCCAGGT C^cSS 6720 

TCCCTGGOGA GGATCTTCCA GCTCCTCCTC TCCCACAAGG ACAGCCTGCT CATCGAGGAG 6780 

TACTCAGTCA CACAGACCAC ACTGGACCAG GTGTTTGTAA ATTTTGCTAA ACAGCAGACT 6840 

r^^^ GCACCCTCGA GCTGCTGGAG CCAGTC^ct ££SS£ S900 

TGATCTTTCA CACCGCTCGT TCCTGCAGCC AGAAAGGAAC TCTGGGCAGC TGGAGGCGCA 6960 

GGAGCCTGTG CCCATATGGT CATCCAAATG GACTGGCCCA gStaS " ^ 

CGAATTCAGA AAGAGGTCTT TCAGAAGGAA 7080 

ACCGAAAC J G ACTTGCTCAC CTGGAACACC TGATGGTGAA ACCAAACAAA TACAAAATCC 7140 

TTCTCCAGAC CCCAGAACTA GAAACCCCGG GCCATCCCAC TAGCAGCTTT GGCCTCCATA 7200 

TTGCTCTCAT TTCAAGCAGA TCTGCTTTTC TGCATGTTTG TCTGTGTGTC TGOGTTGTGT 7260 

GTGATTTTCA TGGAAAAATA AAATGCAAAT GCACTCATCA CAAAAAAAAA AAAAAAAA 7318 

Seq ID NO: C117 DNA Sequence 

Nucleic Acid Accession fh NM_006671.2 

Coding sequence : 13 8 . . 182 0 



1 
I 

GGCACGAGGC 
GCCCAGACCC 
AGCAGGCCAG 
GGCGGAATGG 
TCTTCTTGAG 
AGCTCCTGAT 
CCGGACTTGC 
ACTACCTGTG 
ACOCAGGCAG 
CAGCCGATGC 
CATTCAAACA 
AGGAGGCCCC 
TGCAGAACTT 
GCACCAGCGA 
TGCTGGGCCG 
AGTCGGTCAT 
TCATTGCGGG 
TCTACTCAGT 
TCTACTTCTT 
TGCTCATCGC 
TGCTGGAGAA 
CCATCAACAT 



11 
I 

TGGTGTTTAG 
TGTGCCCCCG 
GCTCACCATG 
ACTCCTCATC 
GACCCGGCGC 
GAGGATGCTG 
CTCCCTGGAT 
GACCACCTTC 
CGCGGCCCAG 
CCTGTTGGAC 
GTACCGCACC 
TCCTCGGCGG 
CGCCCTGGAC 
TGGCATGAAT 
CATGGGTGAC 
GAAGATOGTG 
TAAGATCCTG 
CACCGTGGTG 
CATCACCAAG 
GCTGGCCACC 
CAACCACATC 
GGACGGCACT 



21 
I 

CAACTCCGAC 
GCCGGGCTCT 
GTGCCGCATG 
CTGTCTGTGC 
CTCTCACCAC 
AAGATGATGA 
GCCAAGACCT 
ATGGCTGTCA 
AAGGAGACCA 
CTCATCCGGA 
AAGACCACCC 
ATCCTCATCT 
CTGACCCCGC 
GTGCTGGGCA 
AGOGGGGCCC 
GCGGTGGCTG 
GAGATGGACG 
TGCGGGCTGG 
AAGAATCCCA 
TCCTCCAGCT 
GACCGGCGCA 
GCGCTCTACG 



31 
I 

CACCTGCCTG 
CATGCGTGGA 
CCATCTTGGC 
TGTCTGTCAT 
AGGAAATTAG 
TCCTGCCACT 
CTAGCCGCCT 
TCGTGGGCAT 
CGGAGCAGAG 
ACATGTTCCC 
CAGTTGTCAA 
ACGGGGTCCA 
CGCCCGAGGT 
TCGTCTTCTT 
CCCTGGTCAG 
TGTGGTATTT 
ACCCCAGGGC 
TGCTCCACGG 
TCGTCTTCAT 
CAGCCACACT 
TCGCTCGCTT 
AGGCTGTGGC 



41 

! 

CTGAGGGGCT 
ATGGTGCTGT 
ACGGGGGAGG 
CGTGGGCTGC 
TTACTTCCAG 
GGTGGTCTCC 
GGGCGTCCTC 
CTTCATGGTC 
TGGGAAGCCC 
AGCCAACCTA 
GTCCCCCAAG 
GGAGGAGAAT 
CGTTTACAAG 
CTCTGCCACC 
CTTCTGCCAG 
CCCCTTCGGC 
CGTCGGCAAG 
GCTCTTTATC 
CCGCGGCATC 
GCCCATCACC 
CGTGCTGCCC 
CGCCATCTTC 



51 
I 

AGAGCCCTCA 
GCCCCTTGCC 
GACGTGTGCA 
CTCCTCGGCT 
TTCCCTGGAG 
AGCTTGATGT 
ACCGTGGCGT 
TCCATCATCC 
ATCATGAGCT 
GTAGAAGCCA 
GTGGCACCAG 
GGCTCCCATG 
TCAGAGCCGG 
ATGGGCATCA 
TGCCTCAATG 
ATTGTGTTCC 
AAGCTGGGCT 
CTGCCCCTGC 
CTGCAGGCTC 
TTCAAGTGCC 
GTGGGTGCCA 
ATCGCCCAGG 



60 

120 

160 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 



1275 
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TCAACAACTA 
CCAGCATTGG 
CCTCCGTGGG 
ACCGTTTCCG 
ATATATGTCG 
CCAAGCCAGT 
TAGCCGAGGC 
TGGAGCGGGA 
AGCTGGAGAC 
GGGTCCTGAG 
CACACATTCT 
CAGAGAAGGG 
CTGAGGCCCC 
TATCCGGCCC 
AAGGTCTCAC 
GGAGGACAGG 
TCAGCCACAC 
GGCCACAGTG 
GGAGACAGAT 
ATGGAATAGG 
AGCTGCCGAT 
AGGTTTTCTA 
TTTTCAAAAA 



CGAGCTGGAC 
GGCAGCTGGC 
ACTGCCCACC 
CACCATGATT 
GAAGGATTTT 
GAGCCTCCAG 
CTCCGAGCTC 
TGAGGAGCTG 
CAATGTCTGA 
GCAGGAACTC 
TCTCACCCTT 
AAAGGCTGCA 
GCTGTGACAG 
CACTTTCTGG 
GTTAAGGTCA 
GCAGCCTGCA 
AGTTCTCCCC 
CGTCCAGTCT 
GTCTTTACTA 
TAGGCAAGCC 
GGGGCCCTGG 
TTCCTTTCTG 



TTTGGCCAGA 
ATCCCCCAGG 
GATGACATCA 
AACGTGCTGG 
GCCCGGGACA 
GAGATCGTGG 
ACCCTGGGCC 
CCCGCTGCGA 
GCCTGCGGAG 
GACTCTCCAA 
GAGAGGCTGG 
TGGGGGAGCC 
TTTCCCCGGT 
ATGACAGACT 
AGACACTAAC 
GCTGTGTCCA 
AAGGGGAGCA 
CCTGTCCTGC 
GAGCTGAAAG 
AGGACTCCGG 
TGTCAGAACT 
TGTTTGCAAA 
AAA 



TCATCACCAT 
CCGGCCTCGT 
CCCTCATCAT 
GTGATGCGCT 
CAGGCACCGA 
CAGCCCAGCA 
CCACCTGCCC 
GTCTGAACCA 
CTGCAGGGGC 
CCCTCCTGAG 
AATTAACCCC 
CCATCCAGGG 
GTGAGCCCGG 
TGAGGCTCTG 
TCAAATCTTT 
GGCCCAGGCC 
GCCCAGGGCC 
CACGTGTCTT 
GCCCCCTTGA 
GAAGGAGGTG 
CCCCAAAGGC 
TTCAGTGTTA 



CAGTATCACA 
CACCATGGTC 
TGCCGTTGAC 
GGCAGCGGGG 
GAAACTGCTG 
GAATGGCTGT 
CCACCACGTC 
CTGCACCATC 
AGGCGAGGCC 
CAGCCGGCAG 
GCTTGACGGA 
AGTGATGGGC 
TGAGGGCGGC 
AGAGCTGAAA 
CAAGCCCCGC 
CCACCCCATA 
AAGCCCCGCT 
TTGCAAAGCT 
CACATCCAGG 
CAGCCAGGAT 
CTGTGCGTCC 
ACTAAATAAA 



GCCACTGCAG 
ATCGTGCTCA 
TGGGCTCTGG 
ATCATGGCCC 
CCCTGCGAGA 
GTGAAGAGTG 
CCCGTTCAAG 
CAGATCAGCG 
TCCAGGGGCA 
GGGCCAGGAT 
AAATGTATCT 
CCGGCATTGC 
AGGCAGGGGT 
ACACTTGTCC 
CTCTCCTCTT 
ACAGGTGGCC 
GCCTTCCCCA 
CCTTGGATGT 
CCAACCTCCC 

AAGTGGAGTC 
GGTATTTTGT 



1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
I860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2663 



Seq ID NO: CUB DNA Sequence 
Nucleic Acid Accession #: NM_005689 
Coding sequence: 278.. 2806 

1 11 21 31 41 51 

I I I I I I 

GGGCCTGCAG TTGGCAGAAG GGTCCCGGGC CCAGAGCCAG CGGGGCCGTG CTGAGACGGC 60 

GTACGTGCCC TGCGTGAGTG CGTGGCGGCG GCGCGTGCGC TAGGGGAGTG GGCGGTGAGG 120 

CCTGGTCCAC GTGCGTCCCT TCCCGGGACC CCCGCAGCTT GGCGCCCAGC GGCTACGTGA 180 

GCCAAGGCAC CCGGATGTCC GCGCCCCTCT CCGAGTGACA AGTCCCGGCC TCCGGTCCCG 240 

CAGTGCCCGC AGCCTCGGCC GGCGTCCACG CATTGCCATG GTGACTGTGG GCAACTACTG 300 

CGAGGCCGAA GGGCCCGTGG GTCCGGCCTG GATGCAGGAT GGCCTGAGTC CCTGCTTCTT 360 

CTTCACGCTC GTGCCCTCGA CGCGGATGGC TCTAGGGACT CTGGCCTTGG TGCTGGCTCT 420 

TCCCTGCAGA CGCCGGGAGC GGCCCGCTGG TGCTGATTCG CTGTCTTGGG GGGCCGGCCC 480 

TCGCATCTCT CCCTACGTGC TGCAGCTGCT TCTGGCCACA CTTCAGGCGG CGCTGCCCCT 540 

GGCCGGCCTG GCTGGCCGGG TGGGCACTGC CCGGGGGGCC CCACTGCCAA GCTATCTACT 600 

TCTGGCCTCC GTGCTGGAGA GTCTGGCCGG CGCCTGTGGC CTGTGGCTGC TTGTCGTGGA 660 

GCGGAGCCAG GCACGGCAGC GTCTGGCAAT GGGCATCTGG ATCAAGTTCA GGCACAGCCC 720 

TGGTCTCCTG CTCCTCTGGA CTGTGGCGTT TGCAGCTGAG AACTTGGCCC TGGTGTCTTG 780 

GAACAGCCCA CAGTGGTGGT GGGCAAGGGC AGACTTGGGC CAACAGGTTC AGTTTAGCCT 840 

GTGGGTGCTG CGGTATGTGG TCTCTGGAGG GCTGTTTGTC CTGGGTCTCT GGGCCCCTGG 900 

ACTTCGTCCC CAGTCCTATA CATTGCAGGT TCATGAAGAG GACCAAGATG TGGAAAGGAG 960 

CCAGGTTCGG TCAGCAGCCC AACAGTCTAC CTGGCGAGAT TTTGGCAGGA AGCTCCGCCT 1020 

CCTGAGTGGC TACCTGTGGC CTOGAGGGAG TCCAGCTCTG CAGCTGGTGG TGCTCATCTG 1080 

CCTGGGGCTC ATGGGTTTGG AACGGGCACT CAATGTGTTG GTGCCTATAT TCTATAGGAA 1140 

CATTGTGAAC TTGCTGACTG AGAAGGCACC TTGGAACTCT CTGGCCTGGA CTGTTACCAG 1200 

TTACGTCTTC CTCAAGTTCC TCCAGGGGGG TGGCACTGGC AGTACAGGCT TCGTGAGCAA 1260 

CCTGCGCACC TTCCTGTGGA TCCGGGTGCA GCAGTTCACG TCTCGGCGGG TGGAGCTGCT 1320 

CATCTTCTCC CACCTGCACG AGCTCTCACT GCGCTGGCAC CTGGGGCGCC GCACAGGGGA 1380 

GGTGCTGCGG ATCGCGGATC GGGGCACATC CAGTGTCACA GGGCTGCTCA GCTACCTGGT 1440 

GTTCAATGTC ATCCCCACGC TGGCCGACAT CATCATTGGC ATCATCTACT TCAGCATGTT 1500 

CTTCAACGCC TGGTTTGGCC TCATTGTGTT CCTGTGCATG AGTCTTTACC TCACCCTGAC 1560 

CATTGTGGTC ACTGAGTGGA GAACCAAGTT TCGTCGTGCT ATGAACACAC AGGAGAACGC 1620 

TACCCGGGCA CGAGCAGTGG ACTCTCTGCT AAACTTCGAG ACGGTGAAGT ATTACAACGC 1680 

CGAGAGTTAC GAAGTGGAAC GCTATCGAGA GGCCATCATC AAATATCAGG GTTTGGAGTG 1740 

GAAGTCGAGC GCTTCACTGG TTTTACTAAA TCAGACCCAG AACCTGGTGA TTGGGCTCGG 1800 

GCTCCTCGCC GGCTCCCTGC TTTGCGCATA CTTTGTCACT GAGCAGAAGC TACAGGTTGG 1860 

GGACTATGTG CTCTTTGGCA CCTACATTAT CCAGCTGTAC ATGCOCCTCA ATTGGTTTGG 1920 

CACCTACTAC AGGATGATCC AGACCAACTT CATTGACATG GAGAACATGT TTGACTTGCT 1980 

GAAAGAGGAG ACAGAAGTGA AGGACCTTCC TGGAGCAGGG CCCCTTCGCT TTCAGAAGGG 2040 

CCGTATTGAG TTTGAGAACG TGCACTTCAG CTATGCCGAT GGGCGGGAGA CTCTGCAGGA 2100 

CGTGTCTTTC ACTGTGATGC CTGGACAGAC ACTTGCCCTG GTGGGCCCAT CTGGGGCAGG 2160 

GAAGAGCACA ATTTTGCGCC TGCTGTTTCG CTTCTACGAC ATCAGCTCTG GCTGCATCCG 2220 

AATAGATGGG CAGGACATTT CACAGGTGAC CCAGGCCTCT CTCCGGTCTC ACATTGGAGT 2280 

TGTGCCCCAA GACACTGTCC TCTTTAATGA CACCATCGCC GACAATATCC GTTACGGCCG 2340 

TGTCACAGCT GGGAATGATG AGGTGGAGGC TGCTGCTCAG GCTGCAGGCA TCCATGATGC 2400 

CATTATGGCT TTCCCTGAAG GGTACAGGAC ACAGGTGGGC GAGCGGGGAC TGAAGCTGAG 2460 

CGGCGGGGAG AAGCAGCGCG TCGCCATTGC COGCACCATC CTCAAGGCTC CGGGCATCAT 2520 

TCTGCTGGAT GAGGCAACGT CAGCGCTGGA TACATCTAAT GAGAGGGCCA TCCAGGCTTC 2580 

TCTGGCCAAA GTCTGTGCCA ACCGCACCAC CATCGTAGTG GCACACAGGC TCTCAACTGT 2640 

GGTCAATGCT GACCAGATCC TCGTCATCAA GGATGGCTGC ATCGTGGAGA GGGGACGACA 2700 

CGAGGCTCTG TTGTCCCGAG GTGGGGTGTA TGCTGACATG TGGCAGCTGC AGCAGGGACA 2760 

GGAAGAAACC TCTGAAGACA CTAAGCCTCA GACCATGGAA CGGTGACAAA AGTTTGGCCA 2820 

CTTCCCTCTC AAAGACTAAC CCAGAAGGGA ATAAGATGTG TCTCCTTTCC CTGGCTTATT 2880 

TCATCCTGGT CTTGGGGTAT GGTGCTAGCT ATGGTAAGGG AAAGGGACCT TTCCGAAAAA 2940 

CATCTTTTGG GGAAATAAAA ATGTGGACTG TGAAAAAAAA AAAAAAAAAA AAA 2993 

Seq ID NO: CI 19 DNA Sequence 
Nucleic Acid Accession #: NM_000676 
Coding sequence: 333.. 1331 



1276 



WO 03/042661 



PCT/US02/36810 



1 11 21 31 41 SI 

I I I I I I 

GGGCAATTTG TTAGTTATCC GCCGCCACCA AGACGCGGCA CGGCGCCTGG ACCGGAGGGG 60 

J CCCCGCGCGG GCGCGAACTT TGGGCTCGGG CQAGTGGGTG GTGCTCCGCC CAGCCCGAGA 120 

CGGGCGGGCG CGCGGGCCAA TGGGTGCCGC CTCTTGGCCG CGGGGGGCCC CGACCCGTGG 180 

GTCCCGGCCA CCAGCGCCCC AGCCCCGAGG CTCAGAAGCG GCAGGCGGAG GCGCGGTCCG 240 

GGCGCTATGG CCATGCCCGG CGGGTCTCAC GCGGCTGCCC CTCGCCCGGC GCGCCTTCGG 300 

TAGGGGGCGC CCGGGGCCCA GCTGGCCGGG CCATGCTGCT GGAGACACAG GACGCGCTGT 360 

1U ACGTGGCGCT GGAGCTGGTC ATCGCCGCGC TTTGGGTGGC GGGCAACGTG CTGGTGTGCG 420 

CCGCGGTGGG CACGGCGAAC ACTCTGCAGA CGCCCACCAA CTACTTCCTG GTGTCCCTGG 460 

CTGCGGCCGA CGTGGCCGTG GGGCTCTTCG CCATCCCCTT TGCCATCACC ATCAGCCTGG S40 

GCTTCTGCAC TGACTTCTAC GGCTGCCTCT TCCTCGCCTG CTTCGTGCTG GTGCTCAOGC 600 

AGAGCTCCAT CTTCAGCCTT CTGGCCGTGG CAGTCGACAG ATACCTGGCC ATCTGTGTCC 660 

I J CGCTCAGGTA TAAAAGTTTG GTCACGGGGA CCCGAGCAAG AGGGGTCATT GCTGTCCTCT 720 

GGGTCCTTGC CTTTGGCATC GGATTGACTC CATTCCTGGG GTGGAACAGT AAAGACAGTG 780 

CCACCAACAA CTGCACAGAA CCCTGGGATG GAACCACGAA TGAAAGCTGC TGCCTTGTGA 840 

AGTGTCTCTT TGAGAATGTG GTCCCCATGA GCTACATGGT ATATTTCAAT TTCTTTGGGT 900 

_ GTGTTCTGCC CCCACTGCTT ATAATGCTGG TGATCTACAT TAAGATCTTC CTGGTGGCCT 960 

ZU GCAGGCAGCT TCAGCGCACT GAGCTGATGG ACCACTCGAG GACCACCCTC CAGCGGGAGA 1020 

TCCATGCAGC CAAGTCACTG GCCATGATTG TGGGGATTTT TGCCCTGTGC TGGTTACCTG 1080 

TGCATGCTGT TAACTGTGTC ACTCTTTTCC AGCCAGCTCA GGGTAAAAAT AAGCCCAAGT 1140 

GGGCAATGAA TATGGCCATT CTTCTGTCAC ATGCCAATTC AGTTGTCAAT CCCATTGTCT 1200 

rye ATGCTTACCG GAACCGAGAC TTCCGCTACA CTTTTCACAA AATTATCTCC AGGTATCTTC 1260 

ZJ TCTGCCAAGC AGATGTCAAG AGTGGGAATG GTCAGGCTGG GGTACAGCCT GCTCTCGGTG 1320 

TGGGCCTATG ATCTAGGCTC TCGCCTCTTC CAGGAGAAGA TACAAATCCA CAAGAAACAA 13 BO 

AGAGGACACG GCTGGTTTTC ATTGTGAAAG ATAGCTACAC CTCACAAGGA AATGGACTGC 1440 

CTCTCTTGAG CACTTCCCTG GAGCTACCAC GTATCTAGCT AATATGTATG TGTCAGTAGT 1500 

AGGCTCCAAG GATTGACAAA TATATTTATG ATCTATTCAG CTGCTTTTAC TGTGTGGATT 1560 

J\) ATGCCAACAG CTTGAATGGA TTCTAACAGA CTCTTTTGTT TTTAAAAGTC TGCCTTGTTT 1620 

AT GGTGGA AA ATTACTGAAA CTATTTTACT GTGAAACAGT GTGAACTATT ATAATGCAAA 1680 

TACTTTTTAA CTTAGAGGCA ATGGAAAAAT AAAAGTTGAC TGTACTAAAA ATG 1733 
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Seq 10 NO: C120 DNA Sequence 
Nucleic Acid Accession #t NM_052932 
Coding sequences 217.. 786 



1 11 21 31 41 51 

A(\ 1 I I I I | 

W CCCAGCCCGG CCCCGCCGCC CCGGCTGCGC ACGCGACGCC CCCTCCAGGC CCCGCTCCTG 60 

CGCCCTATTT GGTCATTCGG GGGGCAAGCG GCGGGAGGGG AAACGTGCGC GGCCGAAGGG 120 

GAAGCGGAGC CGGCGCCGGC TGCGCAGAGG AGCCGCTCTC GCCGCCGCCA CCTCGGCTGG 180 

GAGCCCACGA GGCTGCCGCA TCCTGCCCTC GGAACAATGG GACTCGGCGC GCGAGGTGCT 240 

A C TGGGCCGCGC TGCTCCTGGG GACGCTGCAG GTGCTAGCGC TGCTGGGGGC CGCCCATGAA 300 

43 AGCGCAGCCA TGGCGGAGAC TCTCCAACAT GTGCCTTCTG ACCATACAAA TGAAACTTCC 360 

AACAGTACTG TGAAACCACC AACTTCAGTT GCCTCAGACT CCAGTAATAC AACGGTCACC 420 

ACCATGAAAC CTACAGCGGC ATCTAATACA ACAACACCAG GGATGGTCTC AACAAATATG 480 

ACTTCTACCA CCTTAAAGTC TACACCCAAA ACAACAAGTG TTTCACAGAA CACATCTCAG 540 

« n ATATCAACAT CCACAATGAC CGTAACCCAC AATAGTTCAG TGACATCTGC TGCTTCATCA 600 

GTAACAATCA CAACAACTAT GCATTCTGAA GCAAAGAAAG GATCAAAATT TGATACTGGG 660 

AGCTTTGTTG GTGGTATTGT ATTAACGCTG GGAGTTTTAT CTATTCTTTA CATTGGATGC 720 

AAAATGTATT ACTCAAGAAG AGGCATTCGG TATCGAACCA TAGATGAACA TGATGCCATC 780 

ATTTAAGGAA ATCCATGGAC CAAGGATGGA ATACAGATTG ATGCTGCCCT ATCAATTAAT 840 

TTTGGTTTAT TAATAGTTTA AAACAATATT CTCTTTTTGA AAATAGTATA AACAGGCCAT 900 

3D GCATATAATG TACAGTGTAT TACGTAAATA TGTAAAGATT CTTCAAGGTA ACAAGGGTTT 960 

GGGTTTTGAA ATAAACATCT GGATCTTATA GACCGTTCAT ACAATGGTTT TAGCAAGTTC 1020 

ATAGTAAGAC AAACAAGTCC TATCTTTTTT TTTTTGGCTG GGGTGGGGGC ATTGGTCACA 1080 

TATGACCAGT AATTGAAAGA CGTCATCACT GAAAGACAGA ATGCCATCTG GGCATACAAA 1140 

s n TAAGAAGTTT GTCACAGCAC TCAGGATTTT GGGTATCTTT TGTAGCTCAC ATAAAGAACT 1200 

OU TCAGTGCTTT TGAGAGCTGG ATATATCTTA ATTACTAATG CCACACAGAA ATTATACAAT 1260 

CAAACTAGAT CTGAAGCATA ATTTAAGAAA AACATCAACA TTTTTTGTGC TTTAAACTGT 1320 

AGTAGTTGGT CTAGAAACAA AATACTCCAA GAAAAAGAAA ATTTTCAAAT AAAACCCAAA 1380 

ATAATAGCTT TGCTTAGCCC TGTTAGGGAT CCATTGGAGC ATTAAGGAGC ACATATTTTT 1440 

ATTAACTTCT TTTGAGCTTT CAATGTTGAT GTAATTTTTG TTCTCTGTGT AATTTAGGTA 1500 

03 AACTGCAGTG TTTAACATAA TAATGTTTTA AAGACTTAGT TGTCAGTATT AAATAATCCT 1560 

GGCATTATAG GGAAAAAACC TCCTAGAAGT TAGATTATTT GCTACTGTGA GAATATTGTC 1620 

ACCACTGGAA GTTACTTTAG TTCATTTAAT TTTAATTTTA TATTTTGTGA ATATTTTAAG 1680 

AACTGTAGAG CTGCTTTCAA TATCTAGAAA TTTTTAATTG AGTGTAAACA CACCTAACTT 1740 

„ TAAGAAAAAG AACCGCTTGT ATGATTTTCA AAAGAACATT TAGAATTCTA TAGAGTCAAA 1800 

/U ACTATAGCGT AATGCTGTGT TTATTAAGCC AGGGATTGTG GGACTTCCCC CAGGCAACTA 1860 

AACCTGCAGG ATGAAAATGC TATATTTTCT TTCATGCACT GTCGATATTA CTCAGATTTG 1920 

GGGAAATGAC ATTTTTATAC TAAAACAAAC ACCAAAATAT TTTAGAATAA ATTCTTAGAA 1980 

AGTTTTGAGA GGAATTTTTA GAGAGGACAT TTCCTCCTTC CTGATTTGGA TATTCCCTCA 2040 

_ AATCCCTCCT CTTACTCCAT GCTGAAGGAG AAGTACTCTC AGATGCATTA TGTTAATGGA 2100 

GAGRAAAAGC ACAGTATTGT AGAGACACCA ATATTAGCTA ATGTATTTTG GAGTGTTTTC 2160 

CATTTTACAG TTTATATTCC AGCACTCAAA ACTCAGGGTC AAGTTTTAAC AAAAGAGGTA 2220 

TGTAGTCACA GTAAATACTA AGATGGCATT TCTATCTCAG AGGGCCAAAG TGAATCACAC 2280 

CAGTTTCTGA AGGTCCTAAA AATAGCTCAG ATGTCCTAAT GAACATGCAC CTACATTTAA 2340 

TAGGAGTACA ATAAAACTGT TGTCAGCTTT TGTTTTACAG AGAACGCTAG ATATTAAGAA 2400 

OU TTTTGAAATG GATCATTTCT ACTTGCTGTG CATTTTAACC AATAATCTGA TGAATATAGA 2460 

AAAAAATGAT CCAAAATATG GATATGATTG GATGTATGTA ACACATACAT GGAGTATGGA 2520 

GGAAATTTTC TGAAAAATAC ATTTAGATTA GTTTAGTTTG AAGGAGAGGT GGGCTGATGG 2580 

CTGAGTTGTA TGTTACTAAC TTGGCCCTGA CTGGTTGTGC AACCATTGCT TCATTTCTTT 2640 

GCAAAATGTA GTTAAGATAT ACTTTATTCT AATGAAGGCC TTTTAAATTT GTCCACTGCA 2700 
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TTCTTGGTAT TTCACTACTT CAAGTCAGTC AGAACTTCGT AGACCGACCT GAAG TT TC1T 2760 

TTTGAATACT TGTTTCTTTA GCACTTTGAA GATAGAAAAA CCACTTTTTA AGTACTAAGT 2820 

CATCATTTGC CTTGAAAGTT TCCTCTGCAT TGGGTTTGAA GTAGTTTAGT TATGTCTTTT 2B80 

TCTCTGTATG TAAGTAGTAT AATTTGTTAC TTTCAAATAC CCGTACTTTG AATGTAGGTT 2940 

TTTTTGTTGT TGTTATCTAT AAAAATTGAG GGAAATGGTT ATGCAAAAAA ATATTTTGCT 3000 

TTGGACCATA TTTCTTAAGC ATAAAAAAAT GCTCAGTTTT GCTTGCATTC CTTGAGAATG 3060 

TATTTATCTG AAGATCAAAA CAAACAATCC AGATGTATAA GTACTAGGCA GAAGCCAATT 3120 

TTAAAATTTC CTTGAATAAT CCATGAAAGG AATAATTCAA ATACAGATAA ACAGAGTTGG 3180 

CAGTATATTA TAGTGATAAT TTTGTATTTT CAAMAAAAAA AAAGTTAAAC TCTTCTTTTC 3240 

TTTTTATTAT AATGACCAGC TTTTGGTATT TCATTGTTAC CAAGTTCTAT TTTTAGATAA 3300 

AATTGTTCTC CTTCTAAAAA AAAAAAAAAA AAAAAAAA 3338 



Seq ID NO: C121 DNA Sequence 
Nucleic Acid Accession « ; NM_00419S 
1 -> Coding sequence : 1 . . 726 



1 11 21 31 41 51 

I I I I I I 

ATGGCACAGC ACGGGGCGAT GGGCGCGTTT CGGGCCCTGT GCGGCCTGGC GCTGCTGTGC 60 

GCGCTCAGCC TGGGTCAGCG CCCCACCGGG GGTCCCGGGT GCGGCCCTGG GCGCCTCCTG 120 

CTTGGGACGG GAACGGACGC GCGCTGCTGC ' CGGGTTCACA CGACGCGCTG CTGCCGCGAT 180 

TACCCGGGCG AGGAGTGCTG TTCCGAGTGG GACTGCATGT GTGTCCAGCC TGAATTCCAC 240 

TGCGGAGACC CTTGCTGCAC GACCTGCCGG CACCACCCTT GTCCCCCAGG CCAGGGGGTA 300 

CAGTCCCAGG GGAAATTCAG TTTTGGCTTC CAGTGTATCG ACTGTGCCTC GGGGACCTTC 360 

TCCGGGGGCC ACGAAGGCCA CTGCAAACCT TGGACAGACT GCACCCAGTT CGGGTTTCTC 420 

ACTGTGTTCC CTGGGAACAA GACCCACAAC GCTGTGTGCG TCCCAGGGTC CCCGCCGGCA 480 

GAGCCGCTTG GGTGGCTGAC CGTCGTCCTC CTGGCCGTGG CCGCCTGCGT CCTCCTCCTG 540 

ACCTCGGCCC AGCTTGGACT GCACATCTGG CAGCTGAGGA GTCAGTGCAT GTGGCCCCGA 600 

GAGACCCAGC TGCTGCTGGA GGTGCCGCCG TCGACCGAAG ACGCCAGAAG CTGCCAGTTC 660 

CCCGAGGAAG AGCGGGGCGA GCGATCGGCA GAGGAGAAGG GGCGGCTGGG AGACCTGTGG 720 

GTGTGA 726 



Seq ID NO: C122 DNA Sequence 
^ Nucleic Acid Accession #: AK091896.1 
J J Coding sequence: 28.. 1572 



1 11 21 31 41 51 

I I I I I I 

AGATCCGCGA GCCCGTCAGC CTGCGCCATG GGCTGCGACG GCCGCGTGTC GGGGCTGCTC 60 

CGCCGCAACC TGCAGCCCAC GCTCACCTAC TGGAGCGTCT TCTTCAGCTT CGGCCTGTGC 120 

ATCGCCTTCC TGGGGCCCAC GCTGCTGGAC CTGCGCTGTC AGACGCACAG CTCGCTGCCC 180 

CAGATCTCCT GGGTCTTCTT CTCGCAGCAG CTCTGCCTCC TGCTGGGCAG CGCCCTCGGG 240 

GGCGTCTTCA AAAGGACCCT GGCCCAGTCA CTATGGGCCC TGTTCACCTC CTCTCTGGCC 300 

ATCTCCCTGG TGTTTGCCGT CATCCCCTTC TGCCGCGACG TGAAGGTGCT GGCCTCAGTC 360 

ATGGCGCTGG CGGGCTTGGC CATGGGCTGC ATOGACACCG TGGCCAACAT GCAGCTGGTA 420 

AGGATGTACC AGAAGGACTC GGCCGTCTTC CTCCAGGTGC TCCATTTCTT CGTGGGCTTT 480 

GGTGCTCTGC TGAGCCCCCT TATTGCTGAC CCTTTCCTGT CTGAGGCCAA CTGCTTGCCT 540 

GCCAATAGCA CGGCCAACAC CACCTCCCGA GGCCACCTGT TCCATGTCTC CAGGGTGCTG 600 

GGCCAGCACC ACGTAGATGC CAAGCCTTGG TCCAACCAGA CGTTCCCAGG GCTGACTCCA 660 

AAGGACGGGG CAGGGACCCG AGTGTCCTAT GCCTTCTGGA TCATGGCCCT CATCGATCTT 720 

CCAGTGCCCA TGGCTGTGCT GATGCTGCTG TCCAAGGAGC GGCTGCTGAC CTGCTGTCCC 780 

CAGAGGAGGC CCCTGCTTCT GTCTGCTGAT GAGCTTGCCT TGGAGACACA GCCTCCTGAG 840 

AAGGAAGATG CCTCCTCACT GCCCCCAAAG TTTCAGTCAC ACCTAGGGCA TGAGGACCTG 900 

TTCAGCTGCT GCCAAAGGAA GAACCTCAGA GGAGCCCCTT ATTCCTTCTT TGCCATCCAC 960 

ATCACGGGCG CCCTGGTACT GTTCATGACG GATGGGTTGA CGGGTGCCTA TTCCGCCTTC 1020 

GTGTACAGCT ATGCTGTGGA GAAGCCCCTG TCTGTGGGAC ACAAGGTGGC TGGCTACCTC 1080 

CCCAGCCTCT TCTGGGGCTT CATCACACTG GGCCGGCTCC TCTCCATTCC CATATCCTCA 1140 

AGAATGAAGC CGGCCACCAT GGTTTTCATC AACGTGGTTG GCGTGGTGGT GACGTTCCTG 1200 

GTGCTGCTTA TTTTCTCCTA CAACGTCGTC TTCCTGTTCG TGGGGACGGC AAGCCTGGGC 1260 

CTGTTTCTCA GCAGCACCTT CCCCAGCATG CTGGCCTACA CGGAGGACTC GCTGCAGTAC 1320 

AAAGGCTGTG CAACCACAGT GCTGGTGACA GGGGCAGGAG TTGGCGAGAT GGTGCTGCAG 1380 

ATGCTGGTTG GTTCGATATT CCAGGCTCAG GGCAGCTATA GTTTCCTGGT CTGTGGOGTG 1440 

ATCTTTGGTT GTCTGGCTTT TACCTTCTAT ATCTTGCTCC TGTTTTTCCA CAGGATGCAC 1500 

CCTGGACTCC CATCAGTTCC TACCCAAGAC AGATCAATTG GAATGGAAAA CTCTGAGTGC 1560 

TACCAGAGGT AAAACTGGGT GAAGAAGGCA AGAGAAGACT TTCAGCCTCT TGATCACCAG 1620 

CACGACCATA CTGTTTCAGA AAGCTGGGTG GTGGTGGAGG CGCTCTCTCA ATGGCTATTC 1680 

AAGTCTTCTC CACTAAAACT TGGTTGGGTA GAGGAAATTA AATTGAGTCC TGGTACCTGG 1740 

TCAAAATCAT TAGAAGTTTA CCTGGCTTCT CAAGTTATCT TCTTCCCTGG TTCAGACTGT 1800 

TGGTAAGAGC TGTCCAGATA CCCAGATGGG AAGGAAGGAG ACAGCCGCGC GCTTCACTCC 1860 

ATTTGTCACC TCATGCATGG ACCATACTCT GGGTTTGAGA TCATTCTTCA TTGAAGTTTG 1920 

TAAAAATAGG TTGAAATTGT AAAGCTCCAT GATCACTGCT ATATGTAGAT ATATTTCAAT 1980 

TTAAGCAAAA CAAGCTGCAA GTTATTCCCT GGCATGCTCA AAGGATTTTC GTGCTTTTCA 2040 

CTTAATAGTC CAAAGTCTCT TAAATTCCTG CTGCAGATAT CAATAGCTTA TCTATATTCT 2100 

CAAACACCAA AAGGAAAAGT TGAATCTTGC TCTCTTTGGT ATACTAATGT AGTGGTATGC 2160 

TAAGCTGGCT CATACCAACT TAGAAAAGCT GATTGTAAAA TTTTCATTTT GACAGCTGGT 2220 

TATTAAATGC AGCCATTATT AAAAATCAAA TCATACAAAC TTATAATTAA ATCAATTACA 2280 

TTTAAAACAA AGGTAATAAA TATTCAAAGC ATATCACTTC CT 2322 

Seq ID NO: C123 DNA Sequence 

Nucleic Acid Accession #: NM_002203.2 

Coding sequence: 43.. 3588 

1 11 21 31 41 51 

I I I I i I 
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CTGCAAACCC AGCGCAACTA CGGTCCCCCG GTCAGACCCA GGATGGGGCC AGAACGQACA 60 

GGGGCCGCGC CGCTGCCGCT GCTGCTGGTG TTAGCGCTCA GTCAAGGCAT TTTAAATTGT 120 

TGTTTGGCCT ACAATGTTGG TCTCCCAGAA GCAAAAATAT TTTCCGGTCC TTCAAGTGAA 180 

CAGTTTGGGT ATGCAGTGCA GCAGTTTATA AATCCAAAAG GCAACTGGTT ACTGGTTGGT 240 

J TCACCCTGGA GTGGCTTTCC TGAGAACCGA ATGGGAGATG TGTATAAATG TCCTGTTGAC 300 

CTATCCACTG CCACATGTGA AAAACTAAAT TTGCAAACTT CAACAAGCAT TCCAAATGTT 360 

ACTGAGATGA AAACCAACAT GAGCCTCGGC TTGATCCTCA CCAGGAACAT GGGAACTGGA 420 

GGTTTTCTCA CATGTGGTCC TCTGTGGGCA CAGCAATGTG GGAATCAGTA TTACACAACG 480 

GGTGTGTGTT CTGACATCAG TCCTGATTTT CAGCTCTCAG CCAGCTTCTC ACCTGCAACT 540 

10 CAGCCCTGCC CTTCCCTCAT AGATGTTGTG GTTGTGTGTG ATGAATCAAA TAGTATTTAT 600 

CCTTGGGATG CAGTAAAGAA TTTTTTGGAA AAATTTGTAC AAGGCCTTGA TATAGGCCCC 660 

ACAAAGACAC AGGTGGGGTT AATTCAGTAT GCCAATAATC CAAGAGTTGT GTTTAACTTG 720 

AACACATATA AAACCAAAGA AGAAATGATT GTAGCAACAT CCCAGACATC CCAATATGGT 780 

GGGGACCTCA CAAACACATT CGGAGCAATT CAATATGCAA GAAAATATGC CTATTCAGCA 840 

I J GCTTCTGGTG GGCGACGAAG TGCTACGAAA GTAATGGTAG TTGTAACTGA CGGTGAATCA 900 

CATGATGGTT CAATGTTGAA AGCTGTGATT GATCAATGCA ACCATGACAA TATACTGAGG 960 

TTTGGCATAG CAGTTCTTGG GTACTTAAAC AGAAACGCCC TTGATACTAA AAATTTAATA 1020 

AAAGAAATAA AAGCGATCGC TAGTATTCCA ACAGAAAGAT ACTTTTTCAA TGTGTCTGAT 1080 

GAAGCAGCTC TACTAGAAAA GGCTGGGACA TTAGGAGAAC AAATTTTCAG CATTGAAGGT 1140 

ZU ACTGTTCAAG GAGGAGACAA CTTTCAGATG GAAATGTCAC AAGTGGGATT CAGTGCAGAT 1200 

TACTCTTCTC AAAATGATAT TCTGATGCTG GGTGCAGTGG GAGCTTTTGG CTGGAGTGGG 1260 

ACCATTGTCC AGAAGACATC TCATGGCCAT TTGATCTTTC CTAAACAAGC CTTTGACCAA 1320 

ATTCTGCAGG ACAGAAATCA CAGTTCATAT TTAGGTTACT CTGTGGCTGC AATTTCTACT 1380 

GGAGAAAGCA CTCACTTTGT TGCTGGTGCT CCTCGGGCAA ATTATACCGG CCAGATAGTG 1440 

ZD CTATATAGTG T GAAT GAGAA TGGCAATATC ACGGTTATTC AGGCTCACCG AGGTGACCAG 1500 

ATTGGCTCCT ATTTTGGTAG TGTGCTGTGT TCAGTTGATG TGGATAAAGA CACCATTACA 1560 

GACGTGCTCT TGGTAGGTGC ACCAATGTAC ATGAGTGACC TAAAGAAAGA GGAAGGAAGA 1620 

GTCTACCTGT TTACTATCAA AAAGGGCATT TTGGGTCAGC ACCAATTTCT TGAAGGCCCC 1680 

GAGGG CATTG AAAACACTCG ATTTGGTTCA GCAATTGCAG CTCTTTCAGA CATCAACATG 1740 

J\) GATGGCTTTA ATGATGTGAT TGTTGGTTCA CCACTAGAAA ATCAGAATTC TGGAGCTGTA 1800 

TACATTTACA ATGGTCATCA GGGCACTATC CGCACAAAGT ATTCCCAGAA AATCTTGGGA 1860 

TCCGATGGAG CCTTTAGGAG CCATCTCCAG TACTTTGGGA GGTCCTTGGA TGGCTATGGA 1920 

GATTTAAATG GGGATTCCAT CACCGATGTG TCTATTGGTG CCTTTGGACA AGTGGTTCAA 1980 

CTCTGGTCAC AAAGTATTGC TGATGTAGCT ATAGAAGCTT CATTCACACC AGAAAAAATC 2040 

J J ACTTTGGTCA ACAAGAATGC TCAGATAATT CTCAAACTCT GCTTCAGTGC AAAGTTCAGA 2100 

CCTACTAAGC AAAACAATCA AGTGGCCATT GTATATAACA TCACACTTGA TGCAGATGGA 2160 

TTTTCATCCA GAGTAACCTC CAGGGGGTTA TTTAAAGAAA ACAATGAAAG GTGCCTGCAG 2220 

AAGAATATGG TAGTAAATCA AGCACAGAGT TGCCCCGAGC ACATCATTTA TATACAGGAG 2280 

. - CCCTCTGATG TTGTCAACTC TTTGGATTTG CGTGTGGACA TCAGTCTGGA AAACCCTGGC 2340 

4U ACTAGCCCTG CCCTTGAAGC CTATTCTGAG ACTGCCAAGG TCTTCAGTAT TCCTTTCCAC 2400 

AAAGACTGTG GTGAGGATGG ACTTTGCATT TCTGATCTAG TCCTAGATGT CCGACAAATA 2460 

CCAGCTGCTC AAGAACAACC CTTTATTGTC AGCAACCAAA ACAAAAGGTT AACATTTTCA 2S20 . 

GTAACACTGA AAAATAAAAG GGAAAGTGCA TACAACACTG GAATTGTTGT TGATTTTTCA 2580 

GAAAACTTGT TTTTTGCATC ATTCTCCCTA CCGGTTGATG GGACAGAAGT AACATGCCAG 2640 

4D GTGGCTGCAT CTCAGAAGTC TGTTGCCTGC GATGTAGGCT ACCCTGCTTT AAAGAGAGAA 2700 

CAACAGGTGA CTTTTACTAT TAACTTTGAC TTCAATCTTC AAAACCTTCA GAATCAGGCG 2760 

TCTCTCAGTT TCCAAGCCTT AAGTGAAAGC CAAGAAGAAA ACAAGGCTGA TAATTTGGTC 2820 

AACCTCAAAA TTCCTCTCCT GTATGATGCT GAAATTCACT TAACAAGATC TACCAACATA 2880 

~ AATTTTTATG AAATCTCTTC GGATGGGAAT GTTCCTTCAA TCGTGCACAG TTTTGAAGAT 2940 

jU GTTGGTCCAA AATTCATCTT CTCCCTGAAG GTAACAACAG GAAGTGTTCC AGTAAGCATG 3000 

GCAACTGTAA TCATCCACAT CCCTCAGTAT ACCAAAGAAA AGAACCCACT GATGTACCTA 3060 

ACTGGGGTGC AAACAGACAA GGCTGGTGAC ATCAGTTGTA ATGCAGATAT CAATCCACTG 3120 

AAAATAGGAC AAACATCTTC TTCTGTATCT TTCAAAAGTG AAAATTTCAG GCACACCAAA 3180 

GAATTGAACT GCAGAACTGC TTCCTGTAGT AATGTTACCT GCTGGTTGAA AGACGTTCAC 3240 

J J ATGAAAGGAG AATACTTTGT TAATGTGACT ACCAGAATTT GGAACGGGAC TTTCGCATCA 3300 

TCAACGTTCC AGACAGTACA GCTAACGGCA GCTGCAGAAA TCAACACCTA TAACCCTGAG 3360 

ATATATGTGA TTGAAGATAA CACTGTTACG ATTCCCCTGA TGATAATGAA ACCTGATGAG 3420 

AAAGCCGAAG TACCAACAGG AGTTATAATA GGAAGTATAA TTGCTGGAAT CCTTTTGCTG 3480 

~ TTAGCTCTGG TTGCAATTTT ATGGAAGCTC GGCTTCTTCA AAAGAAAATA TGAAAAGATG 3540 

OU ACCAAAAATC CAGATGAGAT TGATGAGACC ACAGAGCTCA GTAGCTGAAC CAGCAGACCT 3600 

ACCTGCAGTG GGAACOGGCA GCATCCCAGC CAGGGTTTGC TGTTTGCGTG CATGGATTTC 3660 

TTTTTAAATC CCATATTTTT TTTATCATGT CGTAGGTAAA CTAACCTGGT ATTTTAAGAG 3720 

AAAACTGCAG GTCAGTTTGG ATGAAGAAAT TGTGGGGGGT GGGGGAGGTG CGGGGGGCAG 3780 

GTAGGGAAAT AATAGGGAAA ATACCTATTT TATATGATGG GGGAAAAAAA GTAATCTTTA 3840 

Oj AACTGGCTGG CCCAGAGTTT ACATTCTAAT TTGCATTGTG TCAGAAACAT GAAATGCTTC 3900 

CAAGCATGAC AACTTTTAAA GAAAAATATG ATACTCTCAG ATTTTAAGGG GGAAAACTGT 3960 

TCTCTTTAAA ATATTTGTCT TTAAACAGCA ACTACAGAAG TGGAAGTGCT TGATATGTAA 4020 

GTACT TCCAC TTGTGTATAT TTTAATGAAT ATTGATGTTA ACAAGAGGGG AAAACAAAAC 4080 

ACAGGTTTTT TCAATTTATG CPGCTCATCC AAAGTTGCCA CAGATGATAC TTCCAAGTGA 4140 

IK) TAATTTTATT TATAAACTAG GTAAAATTTG TTGTTGGTTC CTTTTATACC ACGGCTGCCC 4200 

CTTCCACACC CCATCTTGCT CTAATGATCA AAACATGCTT GAATAACTGA GCTTAGAGTA 4260 

TACC TCCTAT ATGTCCATTT AAGTTAGGAG AGGGGGCGAT ATAGAGACTA AGGCACAAAA 4320 

TTTTGTTTAA AACTCAGAAT ATAACATTTA TGTAAAATCC CATCTGCTAG AAGCCCATCC 4380 

TGTGCCAGAG GAAGGAAAAG GAGGAAATTT CCTTTCTCTT TTAGGAGGCA CAACAGTTCT 4440 

/*> CTTCTAGGAT TTGTTTGGCT GACTGGCAGT AACCTAGTGA ATTTTTGAAA GATGAGTAAT 4500 

TTCTTTGGCA ACCTTCCTCC TCCCTTACTG AACCACTCTC CCACCTCCTG GTGGTACCAT 4560 

TATTATAGAA GCCCTCTACA GCCTGACTTT CTCTCCAGCG GTCCAAAGTT ATCCCCTCCT 4620 

TTACCCCTCA T CCAAAG TTC CCACTCCTTC AGGACAGCTG CTGTGCATTA GATATTAGGG 4680 

on GGGAAAGTCA TCTGTTTAAT TTACACACTT GCATGAATTA CTGTATATAA ACTCCTTAAC 4740 

OU TTCAGGGAGC TATTTTCATT TAGTGCTAAA CAAGTAAGAA AAATAAGCTA GAGTGAATTT 4800 

CTAAATGTTG GAATGTTATG GGATGTAAAC AATGTAAAGT AAAACACTCT CAGGATTTCA 4860 

CCAGAAGTTA CAGATGAGGC ACTGGAAACC ACCACCAAAT TAGCAGGTGC ACCTTCTGTG 4920 

GCTGTCTTGT TTCTGAAGTA CTTTTTCTTC CACAAGAGTG AATTTGACCT AGGCAAGTTT 4980 

GTTCAAAAGG TAGATCCTGA GATGATTTGG TCAGATTGGG ATAAGGCCCA GCAATCTGGA 5040 

1279 



WO 03/042661 



PCT/US02/36810 



TTTTAACAAG CACCCCAGTC ACTAGGATGC AGATGGACCA CACTTTGAGA AACACCACCC 5100 

ATTTCTACTT TTTGCACCTT ATTTTCTCTG TTCCTGAGCC CCCACATTCT CTAGGAGAAA S160 

CTTAGATTAA AATTCACAGA CACTACATAT CTAAAGCTTT GACAAGTCCT TGACCTCTAT 5220 

AAACTTCAGA GTCCTCATTA TAAAATGGGA AGACTGAGCT GGAGTTCAGC AGTGATGCTT 5280 

J TTTAGTTTTA AAAGTCTATG ATCTGATCTG GACTTCCTAT AATACAAATA CACAATCCTC 5340 

CAAGAATTTG ACTTGGAAAA G 5361 

Seq ID NO: C124 DNA Sequence 
Nucleic Acid Accession #: NM_031460 ■ 
1U Coding sequence: 103.. 1101 

1 11 21 31 41 51 

I I I 1 I I 

AGCAGGCGTT TGCGAGAGGA GATACGAGCT GGACGCCTGG CCCTTCCCTC CCACCGGGTC 60 

1 J CTAGTCCACC GCTCCCGGCG CCGGCTCCCC GCCTCTCCCG CTATGTACCG ACCGCGAGCC 120 

CGGGCGGCTC CCGAGGGCAG GGTCCGGGGC TGCGCGGTGC CCAGCACCGT GCTCCTGCTG 180 

CTCGCCTACC TGGCTTACCT GGCGCTGGGC ACCGGCGTGT TCTGGACGCT GGAGGGCCGC 240 

GCGGCGCAGG ACTCCAGCCG CAGCTTCCAG CGCGACAAGT GGGAGCTGTT GCAGAACTTC 300 

ACGTGTCTGG ACCGCCCGGC GCTGGACTCG CTGATCCGGG ATGTCGTCCA AGCATACAAA 360 

AACGGAGCCA GCCTCCTCAG CAACACCACC AGCATGGGGC GCTGGGAGCT CGTGGGCTCC 420 

TTCTTCTTTT CTGTGTCCAC CATCACCACC ATTGGCTATG GCAACCTGAG CCCCAACACG 480 

ATGGCTGCCC GCCTCTTCTG CATCTTCTTT GCCCTTGTGG GGATCCCACT CAACCTCGTG 540 

GTGCTCAACC GACTGGGGCA TCTCATGCAG CAGGGAGTAA ACCACTGGGC CAGCAGGCTG 600 

GGGGGCACCT GGCAGGATCC TGACAAGGCG CGGTGGCTGG CGGGCTCTGG CGCCCTCCTC 660 

TCGGGCCTCC TGCTCTTCCT GCTGCTGCCA CCGCTGCTCT TCTCCCACAT GGAGGGCTGG 720 

AGCTACACAG AGGGCTTCTA CTTCGCCTTC ATCACCCTCA GCACCGTGGG CTTCGGCGAC 780 

TACGTGATTG GAATGAACCC CTCCCAGAGG TACCCACTGT GGTACAAGAA CATGGTGTCC 840 

CTGTGGATCC TCTTTGGGAT GGCATGGCTG GCCTTGATCA TCAAACTCAT CCTCTCCCAG 900 

CTGGAGACGC CAGGGAGGGT ATGTTCCTGC TGCCACCACA GCTCTAAGGA AGACTTCAAG 960 

3U TCCCAAAGCT GGAGACAGGG ACCTGACCGG GAGCCAGAGT CCCACTCCCC ACAGCAAGGA 1020 

TGCTATCCAG AGGGACCCAT GGGAATCATA CAGCATCTGG AACCTTCTGC TCACGCTGCA 1080 

GGCTGTGGCA AGGACAGCTA GTTATACTCC ATTCTTTGGT CGTCGTCCTC GGTAGCAAGA 1140 

CCCCTGATTT TAAGCTTTGC ACATGTCCAC CCAAACTAAA GACTACATTT TCCATCCACC 1200 

CTAGAGGCTG GGTGCAGCTA TATGATTAAT TCTGCCCAAT AGGGTATACA GAGACATGTC 1260 

J J CTGGGTGACA TGGGATGTGA CTTTGGGGTG TCGGGGCAGC ATGCCCTTCT CCCCCACTTC 1320 

CTTACTTTAG CGGGCTGCAA TGCCGCCGAT ATGATGGCTG GGAGCTCTGG CAGCCATACG 1380 

GCACCATGAA GTAGCGGCAA TGTTTGAGCG GCACAATTAG ATAGGAAGAG TCTGGATCTC 1440 

TGATGATCAC AGAGCCATCC TAACAAACGG AATATCACCG ACCCTCCTTT ATGTGAGAGA 1500 

GAAATAAACA TCTATGAAA 1519 
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Seq ID NO: C125 DNA Sequence 
Nucleic Acid Accession #: NM_004154 
Coding sequence: 309.. 1295 "~ 



Seq ID NO: CI 2 6 DNA Sequence 
Nucleic Acid Accession #: NM_007197 
Coding sequence: 18.. 1763 

1 11 21 31 41 51 

I I I I I I 
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I I I I I I 

AAGGACAGAG GAGGGGCCCT TCCTGTCAGC TGGCTGGGAG CAGAGGTGGC ri M JK3TCTTTT 60 

CGGAAGAACT GGTTCTGTGG AATTTGTGCT TATTTCCCAT CAAGGATCAA GGACCTGCTC 120 

TGGGGCTACC TCAGGGCCCC ACAGGATGAG GGGCTGGTTT TCAGATGAGT TTTCTGCTTG 180 

CCTGTCATCT GGATAGTGTC TAAAAATTTG CAAACTGCCT TCTTGTCAGT GTCTTGCTCA 240 

TTCTTCATGA CACTCCTGAT ATGTCTCTCA GTTTCCTCAT CTGCTGCCTC TCCAGACTTC 300 

TGCCAGAACA TTGCACGCGA CAGTTTCAGG CACAGAACTG ACTGGCAGCA GGGGCTGCTC 360 

CACGAGTGGG AATTTGCTCC AGCACTTCAC GGACTGCAAG CGAGGCACTT GCTAACTCTT 420 

GGATAACAAG ACCTCTGCCA GAAGAACCAT GGCTTTGGAA GGCGGAGTTC AGGCTGAGGA 480 

GATGGGTGCG GTCCTCAGTG AGCCCCTGCC TCCCTGAACA TAGGAAACCC ACCTGGGCAG 540 

CCATGGAATG GGACAATGGC ACAGGCCAGG CTCTGGGCTT GCCACCCACC ACCTGTGTCT 600 

ACCGCGAGAA CTTCAAGCAA CTGCTGCTGC CACCTGTGTA TTCGGCGGTG CTGGCGGCTG 660 

GCCTGCCGCT GAACATCTGT GTCATTACCC AGATCTGCAC GTCCCGCCGG GCCCTGACCC 720 

GCACGGCCGT GTACACCCTA AACCTTGCTC TGGCTGACCT GCTATATGCC TGCTCCCTGC 780 

CCCTGCTCAT CTACAACTAT GCCCAAGGTG ATCACTGGCC CTTTGGCGAC TTCGCCTGCC 840 

GCCTGGTCCG CTTCCTCTTC TATGCCAACC TGCACGGCAG CATCCTCTTC CTCACCTGCA 900 

TCAGCTTCCA GCGCTACCTG GGCATCTGCC ACCCGCTGGC CCCCPGGCAC AAACGTGGGG 960 

GCCGCCGGGC TGCCTGGCTA GTGTGTGTAG CCGTGTGGCT GGCCGTGACA ACCCAGTGCC 1020 

TGCCCACAGC CATCTTOGCT GCCACAGGCA TCCAGOGTAA CCGCACTGTC TGCTATGACC 1080 

TCAGCCCGCC TGCCCTGGCC ACCCACTATA TGCCCTATGG CATGGCTCTC ACTGTCATCG 1140 

GCTTCCTGCT GCCCTTTGCT GCCCTGCTGG CCTGCTACTG TCTCCTGGCC TGCCGCCTGT 1200 

GCCGCCAGGA TGGCCCGGCA GAGCCTGTGG CCCAGGAGCG GCGTGGCAAG GCGGCCCGCA 1260 

TGGCCGTGGT GGTGGCTGCT GCCTTTGCCA TCAGCTTCCX GCCTTTTCAC ATCACCAAGA 1320 

CAGCCTACCT GGCAGTGCGC TCGACGCCGG GCGTCCCCTG CACTGTATTG GAGGCCTTTG 1380 

/U CAGCGGCCTA CAAAGGCACG CGGCCGTTTG CCAGTGCCAA CAGCGTGCTG GACCCCATCC 1440 

TCTTCTACTT CACCCAGAAG AAGTTCOGCC GGCGACCACA TGAGCTCCTA CAGAAACTCA 1500 

CAGCCAAATG GCAGAGGCAG GGTCGCTGAG TCCTCCAGGT CCTGGGCAGC CTTCATATTT 1560 

GCCATTGTGT CCGGGGCACC AGGAGCCCCA CCAACCCCAA ACCATGCGGA GAATTAGAGT 1620 

TCAGCTCAGC TGGGCATGGA GTTAAGATCC CTCACAGGAC CCAGAAGCTC ACCAAAAACT 1680 

ATTTCTTCAG CCCCTTCTCT GGCCCAGACC CTGTGGGCAT GGAGATGGAC AGACCTGGGC 1740 

CTGGCTCTTG AGAGGTCCCA GTCAGCCATG GAGAGCTGGG GAAACCACAT TAAGGTGCTC 1800 

ACAAAAATAC AGTGTGACGT GTACTGTCAA AA 1832 
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ACACGTCCAA CGCCAGCATG CAGCGCCCGG GCCCCCGCCT GTGGCTGGTC CTGCAGGTGA 60 

TGGGCTCGTG CGCCGCCATC AGCTCCATGG ACATGGAGCG CCCGGGCGAC GGCAAATGCC 120 

AGCCCATCGA GATCCCGATG TGCAAGGACA TCGGCTACAA CATGACTCGT ATGCCCAACC 180 

TGATGGGCCA CGAGAACCAG CGCGAGGCAG CCATCCAGTT GCACGAGTTC GCGCCGCTGG 240 

5 TGGAGTACGG CTGCCACGGC CACCTCCGCT TCTTCCTGTG CTCGCTGTAC GCGCCGATGT 300 

GCACCGAGCA GGTCTCTACC CCCATCCCCG CCTGCCGGGT CATGTGCGAG CAGGCCCGGC 360 

TCAAGTGCTC CCCGATTATG GAGCAGTTCA ACTTCAAGTG GCCCGACTCC CTGGACTGCC 420 

GGAAACTCCC CAACAAGAAC GACCCCAACT ACCTGTGCAT GGAGGCGCCC AACAACGGCT 480 

CGGACGAGCC CACCCGGGGC TCGGGCCTGT TCCCGCCGCT GTTCCGGCCG CAGCGGCCCC 540 

10 ACAGCGCGCA GGAGCACCCG CTGAAGGACG GGGGCCCCGG GCGCGGCGGC TGCGACAACC 600 

CGGGCAAGTT CCACCACGTG GAGAAGAGCG CGTCGTGCGC GCCGCTCTGC ACGCCCGGCG 660 

TGGACGTGTA CTGGAGCCGC GAGGACAAGC GCTTCGCAGT GGTCTGGCTG GCCATCTGGG 120 

CGGTGCTGTG CTTCTTCTCC AGCGCCTTCA CCGTGCTCAC CTTCCTCATC GACCCGGCCC 780 

i GCTTCCGCTA CCCCGAGCGC CCCATCATCT TCCTCTCCAT GTGCTACTGC GTCTACTCCG 840 

15 TGGGCTACCT CATCCGCCTC TTCGCCGGCG CCGAGAGCAT CGCCTGCGAC CGGGACAGCG 900 

GCCAGCTCTA TGTCATCCAG GAGGGACTGG AGAGCACCGG CTGCACGCTG GTCTTCCTGG 960 

TCCTCTACTA CTTCGGCATG GCCAGCTCGC TGTGGTGGGT GGTCCTCACG CTCACCTGGT 1020 

TCCTGGCCGC CGGCAAGAAG TGGGGCCACG AGGCCATCGA AGCCAACAGC AGCTACTTCC 1080 

ACCTGGCAGC CTGGGCCATC CCGGCGGTGA AGACCATCCT GATCCTGGTC ATGCGCAGGG 1140 

20 TGGCGGGGGA CGAGCTCACC GGGGTCTGCT ACGTGGGCAG CATGGACGTC AACGCGCTCA 1200 

CCGGCTTCGT GCTCATTCCC CTGGCCTGCT ACCTGGTCAT CGGCACGTCC TTCATCCTCT 1260 

CGGGCTTCGT GGCCCTGTTC CACATCCGGA GGGTGATGAA GACGGGCGGC GAGAACACGG 1320 

ACAAGCTGGA GAAGCTCATG GTGCGTATCG GGCTCTTCTC TGTGCTGTAC ACCGTGCCGG 1380 

CCACCTGTGT GATCGCCTGC TACTTTTACG AACGCCTCAA CATGGATTAC TGGAAGATCC 1440 

25 TGGCGGCGCA GCACAAGTGC AAAATGAACA ACCAGACTAA AACGCTGGAC TGCCTGATGG 1500 

CCGCCTCCAT CCCCGCCGTG GAGATCTTCA TGGTGAAGAT CTTTATGCTG CTGGTGGTGG 1560 

GGATCACCAG CGGGATGTGG ATTTGGACCT OCAAGACTCT GCAGTCCTGG CAGCAGGTGT 1620 

GCAGCCGTAG GTTAAAGAAG AAGAGCCGGA GAAAACCGGC CAGCGTGATC ACCAGCGGTG 1680 

GGATTTACAA AAAAGCCCAG CATCCCCAGA AAACTCACCA CGGGAAATAT GAGATCCCTG 1740 

30 CCCAGTCGCC CACCTGCGTG TGAACAGGGC TGGAGGGAAG GGCACAGGGG CGCCCGGAGC 1800 

TAAGATGTGG TGCTTTTCTT GGTTGTGTTT TTCTTTCTTC TTCTTCTTTT TTTTTTTTTT I860 

ATAAAAGCAA AAGAGAAATA CATAAAAAAG TGTTTACCCT GAAATTCAGG ATGCTGTGAT 1920 

ACACTGAAAG GAAAAATGTA CTTAAAGGGT TTTGTTTTGT TTTGGTTTTC CAGCGAAGGG 1980 

AAGCTCCTCC AGTGAAGTAG CCTCTTGTGT AACTAATTTG TGGTAAAGTA GTTGATTCAG 2040 

35 CCCTCAGAAG AAAACTTTTG TTTAGAGCCC TCCGTAAATA TACATCTGTG TATTTGAGTT 2100 

GGCTTTGCTA CCCATTTACA AATAAGAGGA CAGATAACTG CTTTGCAAAT TCAAGAGCCT 2160 

CCCCTGGGTT AACAAATGAG CCATCCCCAG GGCCCACCCC CAGGAAGGCC ACAGTGCTGG 2220 

GCGGCATCCC TGCAGAGGAA AGACAGGACC CGGGGCCCGC CTCACACCCC AGTGGATTTG 22B0 

GAGTTGCTTA AAATAGACTC TGGCCTTCAC CAATAGTCTC TCTGCAAGAC AGAAACCTCC 2340 

40 ATCAAACCTC ACATTTGTGA ACTCAAACGA TGTGCAATAC ATTTTTTTCT CTTTCCTTGA 2400 

AAATAAAAAG AGAAACAAGT ATTTTGCTAT ATATAAAGAC AACAAAAGAA ATCTCCTAAC 2460 

AAAAGAACTA AGAGGCCCAG CCCTCAGAAA CCCTTCAGTG CTACATTTTG TGGCTTTTTA 2520 

ATGGAAACCA AGCCAATGTT ATAGACGTTT GGACTGATTT GTGGAAAGGA GGGGGGAAGA 2580 

GGGAGAAGGA TCATTCAAAA GTTACCCAAA GGGCTTATTG ACTCTTTCTA TTGTTAAACA 2640 

45 AATGATTTCC ACAAACAGAT CAGGAAGCAC TAGGTTGGCA GAGACACTTT GTCTAGTGTA 2700 

TTCTCTTCAC AGTGCCAGGA AAGAGTGGTT TCTGCGTGTG TATATTTGTA ATATATGATA 2760 

TTTTTCATGC TCCACTATTT TATTAAAAAT AAAATATGTT CTTTAAAAAA A 2811 

Seq ID NO i C127 DNA Sequence 
50 Nucleic Acid Accession #: NM_005761.l 
Coding sequence : 250 - . 4 956 



1 11 21 31 41 51 

I I I I I I 

GCGAGGAGGA AACGGTGCCG GAGCGCGCAG GGCTT G CT O C CGCCACCGCC GCTGCACAGG 60 

CTGCCGGAGC GAGCCTGCCG CGCGCCGCCC TCCCCGCTCT CCTTCCTGGG CGAGCTGOGG 120 

GGATGGGGCG GCCGCGGGAG CCCGAGCGCG CGCAGGAACC GCCGCCGCGG CCGCCCGCGT 180 

CTCCGTTGCC GOGCGCCTGA GCCGCCGTCG CCGCCGCGCG CCCTGCCCGQ GGGCGGCCCC 240 

CCCAGCCCCA TGGAGGTCTC CCGGAGGAAG GCGCCGCCGC GCCCCCCGCG CCCCGCAGCG 300 

CCACTGCCCC TGCTCGCCTA TCTGCTGGCA CTGGCGGCTC CCGGCCGGGG CGCGGACGAG 360 

CCCGTGTGGC GGTCGGAGCA AGCCATCGGA GCCATCGCGG CGAGOCAGGA GGACGGCGTG 420 

TTTGTGGCGA GCGGCAGCTG CCTGGACCAG CTGGACTACA GCCTGGAGCA CAGCCTCTCG 480 

CGCCTGTACC GGGACCAAGC GGGCAACTGC ACAGAGCCGG TCTCGCTGGC GCCCCCCGCG S40 

CGGCCCCGGC CCGGGAGCAG CTTCAGCAAG CTGCTGCTGC CCTACCGCGA GGGGGCGGCC 600 

GGCCTOGGGG GGCTGCTGCT CACOGGCTGG ACCTTCGACC GGGGCGCCTG CGAGGTGCGG 660 

CCCCTGGGCA ACCTGAGCCG CAACTCCCTG CGCAACGGCA CCGAGGTGGT GTCGTGCCAC 720 

CCGCAGGGCT CGACGGCCGG CGTGGTGTAC CGCGCGGGCC GGAACAACCG CTGGTACCTG 780 

GCGGTGGCCG CCACCTACGT GCTGCCTGAG CCGGAGACGG CGAGCCGCTG CAACCCCGCG 840 

GCATCCGACC ACGACACGGC CATCGCGCTC AAGGACACGG AGGGGCGCAG CCTGGCCAOG 900 

CAGGAGCTGG GGCGCCTCAA GCTGTGCGAG GGCGCGGGCA GCCTGCACTT CGTGGACGCC 960 

TTTCTCTGGA ACGGCAGCAT CTACTTCCCC TACTACCCCT ACAACTATAC GAGCGGCGCT 1020 

GCCACCGGCT GGCCCAGCAT GGCGCGCATC GGGCAGAGCA CCGAGGTGCT GTTCCAGGGC 1080 

CAGGCATCCC TCGACTGCGG CCACGGCCAC CCCGACGGCC GCCGCCTGCT CCTCTCCTCC 1140 

AGCCTAGTGG AGGCCCTGGA CGTCTGGGCG GGAGTGTTCA GCGCGGCCGC TGGAGAGGGC 1200 

CAGGAGCGGC GCTCCCCCAC CACCACGGCG CTCTGCCTCT TCAGAATGAG TGAGATCCAG 1260 

GCGCGCGCCA AGAGGGTCAG CTGGGACTTC AAGACGGCCG AGAGCCACTG CAAAGAAGGG 1320 

GATCAACCTG AAAGAGTCCA ACCAATCGCA TCATCTACCT TGATCCATTC CGACCTGACA 1380 

TCCGTTTATG GCACCGTGGT AATGAACAGG ACTGTTTTAT TCTTGGGGAC TGGAGATGGC 1440 

QA CAGTTACTTA AGGTTATTCT TGGTGAGAAT TTGACTTCAA ATTGTCCAGA GGTTATCTAT 1500 

oO GAAATTAAAG AAGAGACACC TGTTTTCTAC AAACTCGTTC CTGATCCTGT GAAGAATATC 1560 

TACATTTATC TAACAGCTGG GAAAGAGGTG AGGAGAATTC GTGTTGCAAA CTGCAATAAA 1620 

CATAAATCCT GTTCGGAGTG TTTAACAGCC ACAGACCCTC ACTGCGGTTG GTGCCATTCG 1680 

CTACAAAGGT GCACTTTTCA AGGAGATTGT GTACATTCAG AGAACTTAGA AAACTGGCTG 1740 

GATATTTCGT CTGGAGCAAA AAAGTGCCCT AAAATTCAGA TAATTCGAAG CAGTAAAGAA 1800 
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AAGACTACAG TGACTATGGT GGGAAGCTTC TCTCCAAGAC ACTCAAAGTG CATGGTGAAG 1860 

AATGTGGACT CTAGCAGGGA GCTCTGCCAG AATAAAAGTC AGCCCAACCG GACCTGCACC 1920 

TGTAGCATCC CAACCAGAGC AACCTACAAA GATGTTTCAG TTGTCAACGT GATGTTCTCC 1980 • 

TTCGGTTCTT GGAATTTATC AGACAGATTC AACTTTACCA ACTGCTCATC ATTAAAAGAA 2040 

J TGCCCAGCAT GCGTAGAAAC TGGCTGCGCG TGGTGTAAAA GTGCAAGAAG GTGTATCCAC 2100 

CCCTTCACAG CTTGCGACCC TTCTGATTAT GAGAGAAACC AGGAACAGTG TCCAGTGGCT 2160 

GTCGAGAAGA CATCAGGAGG AGGAAGACCC AAGGAGAACA AGGGGAACAG AACCAACCAG 2220 

GCTTTACAGG TCTTCTACAT TAAGTCCATT GAGCCACAGA AAGTATCGAC ATTAGGGAAA 2280 

AGCAACGTGA TAGTAACGGG AGCAAACTTT ACCCGGGCAT CGAACATCAC AATGATCCTG 2340 

10 AAAGGAACCA GTACCTGTGA TAAGGATGTG ATACAGGTTA GCCATGTGCT AAATGACACC 2400 

CACATGAAAT TCTCTCTTCC ATCAAGCOGG AAAGAAATGA AGGATGTGTG TATCCAGTTT 2460 

GATGGTGGGA ACTGCTCTTC TGTGGGATCC TTATCCTACA TTGCTCTGCC ACATTGTTCC 2520 

CTTATATTTC CTGCTACCAC CTGGATCAGT GGTGGTCAAA ATATAACCAT GATGGGCAGA 2580 

AATTTTGATG TAATTGACAA CTTAATCATT TCACATGAAT TAAAAGGAAA CATAAATGTC 2640 

15 TCTGAATATT GTGTGGCGAC TTACTGCGGG TTTTTAGCCC CCAGTTTAAA GAGTTCAAAA 2700 

GTGCGCACGA ATGTCACTGT GAAGCTGAGA GTACAAGACA CCTACTTGGA TTGTGGAACC 2760 

CTGCAGTATC GGGAGGACCC CAGATTCACG GGGTATCGGG TGGAATCCGA GGTGGACACA 2820 

GAACTGGAAG TGAAAATTCA AAAAGAAAAT GACAACTTCA ATATTTCCAA AAAAGACATT 2880 

GAAATTACTC TCTTCCATGG GGAAAATGGG CAATTAAATT GCAGTTTTGA AAATATTACT 2940 

20 AGAAATCAAG ATCTTACCAC CATCCTTTGC AAAATTAAAG GCATCAAGAC TGCAAGCACC 3000 

ATTGCCAACT CTTCTAAGAA AGTTCGGGTC AAGCTGGGAA ACCTGGAGCT CTACGTCGAG 3060 

CAGGAGTCAG TTCCTTCCAC ATGGTATTTT CTGATTGTGC TCCCTGTCTT GCTAGTGATT 3120 

GTCATTTTTG CGGCCGTGGG GGTGACCAGG CACAAATCGA AGGAGCTGAG TCGCAAACAG 3180 

AGTCAACAAC TAGAATTGCT GGAAAGCGAG CTCCGGAAAG AGATACGTGA CGGCTTTGCT 3240 

25 GAGCTGCAGA TGGATAAATT GGATGTGGTT GATAGTTTTG GAACTGTTCC CTTCCTTGAC 3300 

TACAAACATT TTGCTCTGAG AACTTTCTTC CCTGAGTCAG GTGGCTTCAC CCACATCTTC 3360 

ACTGAAGATA TGCATAACAG AGACGCCAAC GACAAGAATG AAAGTCTCAC AGCTTTGGAT 3420 

GCCCTAATCT GTAATAAAAG CTTTCTTGTT ACTGTCATCC ACACCCTTGA AAAGCAGAAG 3480 

AACTTTTCTG TGAAGGACAG GTGTCTGTTT GCCTCCTTCC TAACCATTGC ACTGCAAACC 3540 

30 AAGCTGGTCT ACCTGACCAG CATCCTAGAG GTGCTGACCA GGGACTTGAT GGAACAGTGT 3600 

AGTAACATGC AGCCGAAACT CATGCTGAGA CGCACGGAGT CCGTCGTCGA AAAACTCCTC 3660 

ACAAACTGGA TGTCCGTCTG CCTTTCTGGA TTTCTCCGGG AGACTGTCGG AGAGCCCTTC 3720 

TATTTGCTGG TGACGACTCT GAACCAGAAA ATTAACAAGG GTCCCGTGGA TGTAATCACT 3780 

TGCAAAGCCC TGTACACACT TAATGAAGAC TGGCTGTTGT GGCAGGTTCC GGAATTCAGT 3840 

35 ACTGTGGCAT TAAACGTCGT CTTTGAAAAA ATCCCGGAAA ACGAGAGTGC AGATGTCTGT 3900 

CGGAATATTT CAGTCAATGT TCTCGACTGT GACACCATTG GCCAAGCCAA AGAAAAGATT 3960 

TTCCAAGCAT TCTTAAGCAA AAATGGCTCT CCTTATGGAC TTCAGCTTAA TGAAATTGGT 4020 

CTTGAGCTTC AAATGGGCAC ACGACAGAAA GAACTTCTGG ACATCGACAG TTCCTCCGTG 4080 

ATTCTTGAAG ATGGAATCAC CAAGCTAAAC ACCATTGGCC ACTATGAGAT ATCAAATGGA 4140 

40 TCCACTATAA AAGTCTTTAA GAAGATAGCA AATTTTACTT CAGATGTGGA GTACTCGGAT 4200 

GACCACTGCC ATTTGATTTT ACCAGATTCG GAAGCATTCC AAGATGTGCA AGGAAAGAGA 4260 

CATOGAGGGA AGCACAAGTT CAAAGTAAAA GAAATGTATC TGACAAAGCT GCTGTCGACC 4320 

AAGGTGGCAA TTCATTCTGT GCTTGAAAAA CTTTTTAGAA GCATTTGGAG TTTACCCAAC 4380 

AGCAGAGCTC CATTTGCTAT AAAATACTTT TTTGACTTTT TGGACGCCCA GGCTGAAAAC 4440 

45 ■ AAAAAAATCA CAGATCCTGA CGTCGTACAT ATTTGGAAAA CAAACAGCCT TCCTCTTCGC 4500 

TTCTGGGTAA ACATCCTGAA GAACCCTCAG TTTGTCTTTG ACATTAAGAA GACACCACAT 4560 

ATAGACGGCT GTTTGTCAGT GATTGCCCAG GCATTCATGG ATGCATTTTC TCTCACAGAG 4620 

CAGCAACTAG GGAAGGAAGC ACCAACTAAT AAGCTTCTCT ATGCCAAGGA TATCCCAACC 4680 

TACAAAGAAG AAGTAAAATC TTATTACAAA GCAATCAGGG ATTTGCCTCC ATTGTCATCC 4740 

50 TCAGAAATGG AAGAATTTTT AACTCAGGAA TCTAAGAAAC ATGAAAATGA ATTTAATGAA 4800 

GAAGTGGCCT TGACAGAAAT TTACAAATAC ATCGTAAAAT ATTTTGATGA GATTCTAAAT 4860 

AAACTAGAAA GAGAACGAGG GCTGGAAGAA GCTCAGAAAC AACTCTTGCA TGTAAAAGTC 4920 

TTATTTGATG AAAAGAAGAA ATGCAAGTGG ATGTAAGCAC TCTGGGGCCT GGCTTAATCT 4980 

GGCAAAGTTC TTCAGACGAC TTGGGAGCAA AATGGCTGCT TGAGCTACTC TGTGTCGTTA 5040 

55 ATTTGTTGTT TGCACATAGG TTCCACTTTG GGCACTGTCT TTTTAAGAGA CCAAGGCACA 5100 

TGCACAGCTT TTAGAAAGCA A 5121 
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Seq ID NO: C128 UNA Sequence 

Nucleic Acid Accession #: NM_002185.1 

Coding sequence: 23.. 1402 



1 
I 

CTCTCTCTCT 
TTTACTTCAA 
AGAACTGGAT 
TTCACTGACC 
ATGTGGGGCC 
CATCGAGACA 
AAAGAGTCTA 
TGACCTGAGT 
ACACTTGCAA 
GGATGAAAAC 
AAAGCTCCAA 
TAAAGGCTTC 
TAGCTCAGGG 
CGCTCTGTTG 
GCCCAGTCTC 
TTTAAATGTG 
CATTCAAGCT 
AGAATCTGAG 
TGTAGTCGTC 
TGTCAGTGCA 
TGGCAAGAAT 



11 
I 

ATCTCTCTCA 
GTCGTTTCTG 
G ACT ACT CAT 
TGTGCTTTTG 
CTCGTGGAGG 
AAGAAATTCT 
ACCTGCAAAA 
GTCATCTATC 
AAGAAGTATG 
AAATGGACGC 
CCGGCAGCAA 
TGGAGTGAAT 
GAGATGGATC 
GTCATCTTGG 
CCCGATCATA 
AGTTTCAATC 
AGAGATGAAG 
AAGCAGAGGC 
ACTCCAGAAA 
TGTGACGCCC 
GGGCCTCATG 



21 
I 

GAATGACAAT 
GAGAAAGTGG 
TCTCATGCTA 
AGGACCCAGA 
TAAAGTGCCT 
TACTGATTGG 
AAATAGACCT 
GGGAAGGAGC 
TAAAAGTTTT 
ATGTGAATTT 
TGTATGAGAT 
GGAGTCCAAG 
CTATCTTACT 
CCTGTGTGTT 
AGAAGACTCT 
CTGAAAGTTT 
TGGAAGGTTT 
TTGGAGGGGA 
GCTTTGGAAG 
CTATTCTCTC 
TGTACCAGGA 



31 
I 

TCTAGGTACA 
CTATGCTCAA 
TAGCCAGTTG 
TGTCAACACC 
GAATTTCAGG 
AAAGAGCAAT 
AACCACTATA 
CAATGACTTT 
AATGCATGAT 
ATCCAGCACA 
TAAAGTTCGA 
TTATTACTTC 
AACCATCAGC 
ATGGAAAAAA 
GGAACATCTT 
CCTGGACTGC 
TCTGCAAGAT 
TGTGCAGAGC 
AGATTCATCC 
CTCTTCCAGG 
CCTCCTGCTT 



41 

I 

ACTTTTGGCA 
AATGGAGACT 
GAAGTGAATG 
ACCAATCTGG 
AAACTACAAG 
ATATGTGTGA 
GTTAAACCTG 
GTGGTGACAT 
GTAGCTTACC 
AAGCTGACAC 
TCCATCCCTG 
AGAACTCCAG 
ATTTTGAGTT 
AGGATTAAGC 
TGTAAGAAAC 
CAGATTCATA 
ACGTTTCCTC 
CCCAACTGCC 
CTCACATGCC 
TCCCTAGACT 
AGCCTTGGGA 



51 

I 

TGGTTTTTTC 
TGGAAGATGC 
GATCGCAGCA 
AATTTGAAAT 
AGATATATTT 
AGGTTGGAGA 
AGGCTCCTTT 
TTAATACATC 
GCCAGGAAAA 
TCCTGCAGAG 
ATCACTATTT 
AGATCAATAA 
TTTTCTCTGT 
CTATCGTATG 
CAAGAAAAAA 
GGGTGGATGA 
AGCAACTAGA 
CATCTGAGGA 
TGGCTGGGAA 
GCAGGGAGAG 
CTACAAACAG 
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CACGCTGCCC CCTCCATTTT CTCTCCAATC TGGAATCCTG ACATTGAACC CAGTTGCTCA 1320 

GGGTCAGCCC ATTCTTACTT CCCTGGGATC AAATCAAGAA GAAGCATATG TCACCATGTC 1380 

CAGCTTCTAC CAAAACCAGT GAAGTGTAAG AAACCCAGAC TGAACTTACC GTGAGCGACA 1440 

AAGATGATTT AAAAGGGAAG TCTAGAGTTC CTAGTCTCCC TCACAGCACA GAGAAGACAA 1500 

5 AATTAGCAAA ACCCCACTAC ACAGTCTGCA AGATTCTGAA ACATTGCTTT GACCACTCTT 1560 

CCTGAGTTCA GTGGCACTCA ACATGAGTCA AGAGCATCCT GCTTCTACCA TGTGGATTTG 1620 

GTCACAAGGT TTAAGGTGAC CCAATGATTC AGCTATTT 1658 

Seq ID NO: C129 DNA Sequence 
10 Nucleic Acid Accession 8: NM_002722.1 
Coding sequence: 15.. 302 

1 11 21 31 41 51 

K I I I I I I 

ID ACTCTGGACT COGGATGGCT GCCGCACGCC TCTGCCTCTC CCTGCTGCTC CTGTCCACCT 60 

GCGTGGCTCT GTTACTACAG CCACTGCTGG GTGCCCAGGG AGCCCCACTG GAGCCAGTGT 120 

ACCCAGGGGA CAATGCCACA CCAGAGCAGA TGGCCCAGTA TGCAGCTGAT CTCCGTAGAT 180 

ACATCAACAT GCTGACCAGG CCTAGGTATG GGAAAAGACA CAAAGAGGAC ACGCTGGCCT 240 

TCTCGGAGTG GGGGTCCCCG CATGCTGCTG TCCCCAGGGA GCTCAGCCCG CTGGACTTAT 300 

20 AATGCCACCT TCTGTCTCCT ACGACTCCAT GAGCAGCGCC AGCCCAGCTC TCCCCTCTGC 360 

ACCCTTGGCT CTGGCCAAAG CTTGCTCCCT GCTCCCACAC AGGCTCAATA AAGCAAGTCA 420 

AAGCC 425 

Seq ID NO: C130 DNA Sequence 
25 Nucleic Acid Accession #: NM_03254S.l 
Coding sequence : 47 . . 718 

1 11 21 31 41 51 

« n I I I I I I 

JU AAACTGATCT TCAATGCACT AAGAGAAGGA GACTCTCAAA CCAAAAATGA CCTGGAGGCA 60 

CCATGTCAGG CTTCTGTTTA CGGTCAGTTT GGCATTACAG ATCATCAATT TGGGAAACAG 120 

CTATCAAAGA GAGAAACATA ACGGCGGTAG AGAGGAAGTC ACCAAGGTTG CCACTCAGAA 180 

GCACCGACAG TCACCGCTCA ACTGGACCTC CAGTCATTTC GGAGAGGTGA CTGGGAGCGC 240 

CGAGGGCTGG GGGCCGGAGG AGCCGCTCCC CTACTCCCGG GCTTTCGGAG AGGGTGCGTC 300 

35 CGCGCGGCCG CGCTGCTGCA GGAACGGCGG TACCTGCGTG CTGGGCAGCT TCTGCGTGTG 360 

CCCGGCCCAC TTCACCGGCC GCTACTGCGA GCATGACCAG AGGCGCAGTG AATGCGGCGC 420 

CCTGGAGCAC GGAGCCTGGA CCCTCCGCGC CTGCCACCTC TGCAGGTGCA TCTTCGGGGC 480 

CCTGCACTGC CTCCCCCTCC AGACGCCTGA CCGCTGTGAC CCGAAAGACT TCCTGGCCTC 540 

CCACGCTCAC GGGCCGAGCG CCGGGGGCGC GCCCAGCCTG CTACTCTTGC TGCCCTGCGC 600 

40 ACTCCTGCAC CGCCTCCTGC GCCCGGATGC GCCCGCGCAC CCTCGGTCCC TGGTCCCTTC 660 

CGTCCTCCAG CGGGAGCGGC GCCCCTGOGG AAGGCCGGGA CTTGGGCATC GCCTTTAATT 720 

TTCTATGTTG TAAATAATAG ATGTGTTTAG TTTACCGTAA GCTGAAGCAC TGGGTGAATA 780 
TTTTTATTGG GTAATAAATA TTTTCATGAA AGCGCCAAAA AAAAAAAAAA AAAAAAAAAA 840 

846 
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Seq ID NO: C131 DNA Sequence 

Nucleic Acid Accession #: NM_006533.l 

Coding sequence: 72.. 467 

1 11 21 31 41 51 

I I I I I I 

AGGGAGAGAG GGAGGGGAGG AAATTGGAGA CCCCAGCACC CCCTTGCTCA CTCTCTTGCT 60 

CACAGTCCAC GATGGCCCGG TCCCTGGTGT GCCTTGGTGT CATCATCTTG CTGTCTGCCT 120 

TCTCCGGACC TGGTGTCAGG GGTGGTCCTA TGCCCAAGCT GGCTGACCGG AAGCTGTGTG 180 

CGGACCAGGA GTGCAGCCAC CCTATCTCCA TGGCTGTGGC CCTTCAGGAC TACATGGCCC 240 

CCGACTGCCG ATTCCTGACC ATTCACCGGG GCCAAGTGGT GTATGTCTTC TCCAAGCTGA 300 

AGGGCCGTGG GCGGCTCTTC TGGGGAGGCA GCGTTCAGGG AGATTACTAT GGAGATCTGG 360 

CTGCTCGCCT GGGCTATTTC CCCAGTAGCA TTGTCCGAGA GGACCAGACC CTGAAACCTG 420 

GCAAAGTCGA TGTGAAGACA GACAAATGGG ATTTCTACTG CCAGTGAGCT CAGCCTACCG 480 

CTGGCCCTGC CGTTTCCCCT CCTTGGGTTT ATGCAAATAC AATCAGCCCA GTGCAAAC 538 

Seq ID NO: C132 DNA Sequence 
Nucleic Acid Accession ft : AB064272 
Coding sequence: 1..708 

1 11 21 31 41 51 

I I t I I I 

ATGACACAAG TCACAGAAAA GTCCACAGAA CACCCAGAAA AGACCACGTC AACCACAGAG 60 

AAAACCACAA GAACCCCAGA AAAGCCTACG CTATACTCAG AGAAGACCAT ATGCACCAAA 120 

GGGAAAAACA CACCAGTCCC AGAAAAGCCT ACAGAAAACC TGGGGAACAC CACACTGACC 180 

ACTGAGACCA TAAAAGCCCC AGTAAAGTCC ACAGAAAACC CAGAAAAAAC AGCAGCAGTC 240 

ACAAAGACTA TAAAACCTTC AGTCAAGGTC ACAGGAGACA AATCTCTCAC TACTACCTCT 300 

TCTCATCTAA ATAAAACTGA AGTTACTCAT CAGGTGCCCA CTGGTTCTTT CACCCTCATT 360 

ACATCTAGAA CGAAGCTGAG TTCTATCACA TCAGAAGCCA CAGGAAACGA GAGCCATCCA 420 

TACCTCAATA AAGATGGCTC ACAGAAAGGT ATCCACGCTG GACAGATGGG AGAGAATGAT 480 

TCATTCCCTG CATGGGCCAT AGTTATTGTG GTCCTGGTGG CTGTGATTCT CCTCCTGGTG 540 

TTCCTTGGCC TGATCTTCTT GGTCTCCTAT ATGATGCGGA CACGCCGCAC ACTAACCCAG 600 

AACACCCAGT ACAATGATGC AGAGGATGAG GGTGGCCCCA ATTCCTACCC GGTCTACCTG 660 

ATGGAGCAGC AGAATCTTGG CATGGGCCAG ATCCCTTCCC CACGGTGA 70 B 



Seq ID NO: C133 DNA Sequence 

Nucleic Acid Accession ft: NM_080870.1 

Coding sequence : 3 . . 710 
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1 
I 

AGATGACACA 
AGAAAACCAC 
AAGGGAAAAA 
CCACTGAGAC 
TCACAAAGAC 
CTTCTCATCT 
TTACATCTAG 
CATACCTCAA 
ATTCATTCCC 
TGTTCCTTGG 
AGAACACCCA 
TGATGGAGCA 
GGCGCCCAGC 
AATTTCTATT 
CTTCATCTGT 
AGGGGACAAA 
CTGAGAATGA 
TGAGTCCTAA 
GGTTTATGGG 
TTATTTCCAT 
AGGACCCTTG 
AAGAAGAAAA 
CCAGAAGAAA 
TTAGTTTTCC 
GGACACGCAA 
GACCTTCCCT 
GTAGCTTTAT 
CCTTCACCCC 
ATTTTCTCTC 
' AGGAGAATGA 
ATGTT 




41 
I 

AAAGACCACG 
AGAGAAGACC 
CCTGGGGAAC 
CCCAGAAAAA 
CAAATCTCTC 
CACTGGTTCT 
CACAGGAAAC 
TGGACAGATG 
GGCTGTGATT 
GACACGCCGC 
CAATTCCTAC 
CCCACGGTGA 
GGATGAGGAA 
GACGGTTACC 
AGCAAGGAGG 
TCTCTGTAGA 
ACCAATGCAG 
CCAGACTGAC 
GGTTTCAGGA 
AGGCTCATCT 
AAAACAAAAA 
TGCCCTGGAG 
TCCTCTTCCT 
TGGCTTGCTG 
ACCCACTGAT 
TCTTCCACCA 
CCCCATTCTC 
CATCTGTGTG 
TGATTGGGTA 
TTTTTATAAT 



SI 
I 

TCAACCACAG 
ATATGCACCA 
ACCACACTGA 
ACAGCAGCAG 
ACTACTACCT 
TTCACCCTCA 
GAGAGCCATC 
GGAGAGAATG 
CTCCTCCTGG 
ACACTAACCC 
CCGGTCTAOC 
TCTTGGAGTA 
CCGGACTCAC 
AGTATTAACC 
GTGTAAGTTT 
AGGTAATGGT 
AACACTGCAC 
TTGTGAGTGG 
CAGCATTATG 
CTGAGTTCTC 
AAACGGATCC 
AAAAAAAAGT 
CCCAAATCCC 
GGTTGTGGGT 
CTCCACCCCA 
AAAGCCAGCT 



GAGGGGGAAA 
AAAGTATAAC 



Seq ID MO: C134 DNA Sequence 

Nucleic Acid Accession fgenesh predicted 

Coding sequence: 1. • 10674 



1 
I 

ATGTGGCCTC 
TTTCAGCAGA 
GGGGCCCCCG 
GTGGAGCGGC 
CGCCTGGAGC 
AGCGAGCTCA 
CGCGTGGCCA 
TCCACCCGCC 
TCCTACCGAG 
CTTCATGCTA 
GGGGGAGACC 
TTTGGCATAT 
GAGCACTGTT 
CATGAAGATC 
TGTGATGAAG 
GGCCATTTTG 
ACAGCTTGCC 
ATTCCATGTC 
GTCTGCAGAG 
CTGAAGCCTC 
GCCTGTGGGG 
CTACCCAATG 
CTCCGCCAGC 
ACATGTTTGG 
CAAGGAAACA 
TTTCAGATGC 
TTTGGGAOGA 
ATGCTGAGAT 
GACGTGGAGG 
CAAGATTCTG 
GTGTCAGTCC 
GCTATCGTAT 
AAGGTTATTG 
GTCTCGGAGA 
GCTGAATTGG 
ATAGTACAGT 
GTCATAAAAG 
ACTCCAGATA 
GAAGGGTCTA 
ACCACTGAAT 
GAGATGTTCT 
GCATTTGAGA 




GGAGTATCCC 
TGGGCCAGGC 
TTGTCTTCCT 
TGTTCGTCCG 
TCGTGACCTT 
GCGCGCGCCA 
GTGGCGGCAC 
GAGAAAACTC 
CTAGACCAAT 
GGCAAGGGAA 
ACCTGCTACA 
TACCTTCTGG 
GCAAGGACTG 
AGTGCATCTG 
CATCGGGGAC 
CTGATGAAAA 
AGGGATACAG 
CGGAAAATGG 
TCCGATGTCA 
GTTTGTGGTC 
CGAAACATGG 
TTGCCTGTGA 
GCCAGTGGGA 
CCAAAGATGT 
TCTGCTATGT 
GTACCACTTC 
CTCCTCAAAT 
CCAATGTTAC 
ACGTTCATCC 
ACACGGCAAC 
ATGCAGAACC 
AGGTACATGC 
TCATTACCAG 
ATACAGCCAC 
GTTCTCCCTG 
ATACTGGAGT 
CTGACAAGTA 
GGCCAGACTG 
ACAAAGCAGC 
CGACCCTGGG 



GTTCCGGOGA 
GGTGGATGAT 
CAAGCTGCTG 
CTCGTCCAAG 
GCACAAGTGC 
CTACACCAAG 
AACAAAAGTT 
TGCAGCGTCA 
CATTCGAGAG 
CAGTTTTGAA 
GAGTTTTATT 
CTGTGACCGA 
TGAAAAGGGG 
ATACAAACCT 
TCACACCTCT 
GGCATCTGGC 
TTACTTTATC 
CCCTGGATTT 
CGGTTCAGAG 
CCACATCAGC 
TGAAGGGTAC 
TGGGCCAGAA 
CATCATATCC 
AAGTTGCCGC 
TGGAAAATGG 
CAACTGTCCT 
CTGGCAGATT 
AGCTTTCACC 
TGACCTATCC 
ACCTGTCATA 
CGCAAGCTGG 
AAGTCATACA 
TGACCCCTCA 
TGAAATTCCA 
CAACTGTACA 
TTATTGTGCT 
TGCCAAAAAA 
TCGTTGTGAT 
AAAAATGGTC 



GGTCTGGCGC 
AGCTTCCGCC 
GCTCCTGGCG 
CGCGTGCGGC 
TCGTCCAGCG 
TCCGACTTCC 
AACTACGTGG 
GCGCTGCTCC 
GGCGCCTTCC 
GTATTTCTCA 
CTGCGAGATT 
CTGAATGACA 
GAATTTGAGG 
CAAGATGATA 
ATGGGAAGCT 
TATTACGGGA 
GAAGGCTCAC 
OCACCTGGAA 
CAGACCTGTG 
CAAAAGACTT 
GATCTTGTGG 
AGCTACTGCA 
TGTTCTACAA 
AGACTAGAAG 
CCCOGGTGTG 
CCCCACAACT 
CAAGGGTTCA 
AATGTCGGAG 
AAGGACATAG 
CCAACAGCTA 
CCACCTTACC 
GGCAACCAGG 
GACTGGTGCA 
GATGAGCCTC 
CAAGGAGACC 
GGCAATAACA 
TTCACACCTG 
TTAACTTGCT 
TATGAAGATG 
CGTTTTGCAA 
GACACAGATC 
CCATCATTTT 



41 
I 

TCGTTTCGGG 
TCTTCCCCGA 
ACGAAGCGGC 
TGCTGCGGGA 
TGGGCGAAGT 
CCGTGGTGCC 
TGCCGCGCGT 
TCCAAGAGAT 
AGCAAGCCGC 
TCACTGATGG 
CAGGAGTGGA 
TGGCTTCCAC 
CTTTAGCTCG 
TGGTCCACTG 
GCAAATGTGG 
AAGGTCTGCA 
CAGGAGGAAT 
GCACATCCCC 
AACTTGTCCA 
GCAACAACCA 
GAAGCAGCAT 
GAGTAAGAAC 
GGGAAATGTT 
GCAGTGATAA 
TGGAGCGCCA 
GTGGCAAGCA 
TTTTATCTGG 
TTCAGGCAGC 
AGGCTAAGAC 
AAGACAACTC 
TTTTCCCAAT 
CCAGCTGCAT 
GATCTCCACC 
AGTTCTCAGA 
TTTTCCCTCA 
GGACATGTGA 
TAAATGGGGA 
TGGAGGGCTA 
GCGTCTGGAA 
ACCACGGGTT 
TGATGAAGAA 
GTAGTGATGC 
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51 
I 

CTGGGCGACC 
GACCGCGCCC 
GGGGAGCAGA 
GCTCAGCGAG 
CAACTTCCGC 
CACGGCCACG 
CGATTACATC 
CCCTGCCATC 
GCAAATTCTT 
ATATTCCAAT 
GATCTTCACT 
CCCAAAGGAG 
CCGGGCATTG 
CTCATATCTT 
GACACACACA 
GTATGAATGC 
CAGCAGTTGC 
TGAAGACTGT 
CTGCCCTGCC 
CTTCAATGCA 
CATCTTATGT 
ATGTCCTCAT 
ATATAAGACA 
GCTTACTTGT 
CTGTTCCACC 
GCCAGCCAAA 
AGTCAAAGAA 
TGTGTGTAAA 
TCTGGAACAG 
TGGTGAAAAG 
TGGAGATGTT 
TTTCCATATC 
TCCCGTCCAG 
CAACTCAGGG 
AGGGGAGACT 
TATCCATATT 
TTTTATATGC 
TGATTTCACA 
ACCAACATAT 
CAAGTCCTTT 
GTTTTCTGAA 
AGAGGACATT 
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GACTGCAGAC TGGAGGAGAA CCTGACCXAA AAATATTGCC TAGAATATAA TTATGACTAT 2580 

GAAAATGGCT TTGCAATTGG ACCAGGTGGC TGGGGTGCAG CTAATAGGCT GGATTACTCT 2640 

TACGATGACT TCCTGGACAC TGTGCAAGAA ACAGCCACAA GCATCGGCAA TGCCAAGTCC 2700 

TCACGGATTA AAAGAAGTGC CCCATTATCT GACTATAAAA TTAAGTTAAT TTTTAACATC 2760 

5 ACAGCTAGTG TGCCATTACC CGATGAAAGA AATOATACCC TTGAATGGGA AAATCAGCAA 2820 

CGACTCCTTC AGACATTGGA AACTATCACA AATAAACTGA AAAGGACTCT CAACAAAGAC 2880 

CCCATGTATT CCTTTCAGCT TGCATCAGAA ATACTTATAG CCGACAGCAA TTCATTAGAA 2940 

ACAAAAAAGG CTTCCCCCTT CTGCAGACCA GGCTCAGTGC TGAGAGGGOG TATGTGTGTC 3000 

AATTGCCCTT TGGGAACCTA TTATAATCTG GAACATTTCA CCTGTGAAAG CTGCCGGATC 3060 

10 GGATCCTATC AAGATGAAGA AGGGCAACTT GAGTGCAAGC TTTGCCCCTC TGGGATGTAC 3120 

ACGGAATATA TCCATTCAAG AAACATCTCT GATTGTAAAG CTCAGTGTAA ACAAGGCACC 3180 

TACTCATACA GTGGACTTGA GACTTGTGAA TCGTGTCCAC TGGGCACTTA TCAGCCAAAA 3240 

TTTGGTTCCC GGAGCTGCCT CTCGTGTCCA GAAAACACCT CAACTGTGAA AAGAGGAGCC 3300 

GTGAACATTT CTGCATGTGG AGTTCCTTGT CCAGAAGGAA AATTCTCGCG TTCTGGGTTA 3360 

15 ATGCCCTGTC ACCCATGTCC TOGTGACTAT TACCAACCTA ATGCAGGGAA GGCCTTCTGC 3420 

CTGGCCTGTC CCTTTTATGG AACTACCCCA TTCGCTGGTT CCAGATCCAT CACAGAATGT 3480 

TCAACTTCAG TTCTGAATAT TACTATTTTC GGTGGATTTG GGCATCTGGA GTTGTTAAAT 3540 

TGTCCTTCTG AGGTTTTCCA TGAATGCTTC TTTAACCCTT GCCACAATAG TGGAACCTGC 3600 

CAGCAACTTG GGCGTGGTTA TGTTTGTCTC TGTCCACTTG GATATACAGG CTTAAAGTGT 3660 

20 GAAACAGACA TCGATGAGTG CAGCCCACTG CCTTGCCTCA ACAATGGAGT TTGTAAAGAC 3720 

CTAGTTGGGG AATTCATTTG TGAGTGCCCA TCAGGTTACA CAGGTCAGCG GTGTGAAGAA 3780 

AATATAAATG AGTGTAGCTC CAGTCCTTGT TTAAATAAAG GAATCTGTGT TGATGGTGTG 3840 

GCTGGCTATC GTTGCACATG TGTGAAAGGA TTTGTAGGCC TGCATTGTGA AACAGAAGTC 3900 

AATGAATGCC AGTCAAACCC ATGCTTAAAT AATGCAGTCT GTGAAGACCA GGTTGGGGGA 3960 

25 TTCTTGTGCA AATGCCCACC TGGATTTTTG GGTACCCGAT GTGGAAAGAA CGTCGATGAG 4020 

TGTCTCAGTC AGCCATGCAA AAATGGAGCT ACCTGTAAAG ACGGTGCCAA TAGCTTCAGA 4080 

TGCCTGTGTG CAGCTGGCTT CACAGGATCA CACTGTGAAT TGAACATCAA TGAATGTCAG 4140 

TCTAATCCAT GTAGAAATCA GGCCACCTGT GTGGATGAAT TAAATTCATA CAGTTGTAAA 4200 

TGTCAGCCAG GATTTTCAGG CAAAAGGTGT GAAACAGAAC AGTCTACAGG CTTTAACCTG 4260 

30 GATTTTGAAG TTTCTGGCAT CTATGGATAT GTCATGCTAG ATGGCATGCT CCCATCTCTC 4320 

CATGCTCTAA CCTGTACCTT CTGGATGAAA TCCTCTGACG ACATGAACTA TGGAACACCA 4380 

ATCTCCTATG CAGTTGATAA CGGCAGCGAC AATACCTTGC TOCTGACTGA TTATAACGGC 4440 

TGGGTTCTTT ATGTGAATGG CAGGGAAAAG ATAACAAACT GTCCCTCGGT GAATGATGGC 4500 

AGATGGCATC ATATTGCAAT CACTTGGACA AGTGCCAATG GCATCTGGAA AGTCTATATC 4560 

35 GATGGGAAAT TATCTGACGG TGGTGCTGGC CTCTCTGTTG GTTTGCCCAT ACCTGGTGGT 4620 

GGTGCGTTAG TTCTGGGGCA AGAGCAAGAC AAAAAAGGAG AGGGATTCAG CCCAGCTGAG 4680 

TCTTTTGTGG GCTCCATAAG CCAGCTCAAC CTCTGGGACT ATGTCCTGTC TCCACAGCAG 4740 

GTGAAGTCAC TGGCTACCTC CTGCCCAGAG GAACTCAGTA AAGGAAACGT GTTAGCATGG 4800 

CCTGATTTCT TGTCAGGAAT TGTGGGGAAA GTGAAGATCG ATTCTAAGAG CATATTTTGT 4860 

40 TCTGATTGCC CAOGCTTAGG AGGGTCAGTG CCTCATCTGA GAACTGCATC TGAAGATTTA 4920 

AAGCCAGGTT CCAAAGTCAA TCTGTTCTGT GATCCAGGCT TCCAGCTGGT CGGGAACCCT 4980 

GTGCAGTACT GTCTGAATCA AGGACAGTGG ACACAACCAC TTCCTCACTG TGAACGCATT 5040 

AGCTGTGGGG TGCCACCTCC TTTGGAGAAT GGCTTCCATT CAGCCGATGA CTTCTATGCT 5100 

GGCAGCACAG TAACCTACCA GTGCAACAAT GGCTACTATC TATTGGGTGA CTCAAGGATG 5160 

45 TTCTGTACAG ATAATGGGAG CTGGAACGGC GTTTCACCAT CCTGCCTTGA TGTCGATGAG 5220 

TGTGCAGTTG GATCAGATTG TAGTGAGCAT GCTTCTTGCC TGAACGTAGA TGGATCCTAC 5280 

ATATGTTCAT GTGTCCCACC GTACACAGGA GATGGGAAAA ACTGTGCAGA ACCTATAAAA 5340 

TGTAAGGCTC CAGGAAATCC GGAAAATGGC CACTCCTCAG GTGAGATTTA TACAGTAGGT 5400 

GCCGGAGTCA CATTTTCGTG TCAGGAAGGA TACCAGTTGA TGGGAGTAAC CAAAATCACA 5460 

50 TGTTTGGAGT CTGGAGAATG GAATCATCTA ATACCATATT GTAAAGCTGT TTCATGTGGT 5520 

AAACCGGCTA TTCCAGAAAA TGGTTGCATT GAGGAGTTAG CATTTACTTT TGGCAGCAAA 5580 

GTGACATATA GGTGTAATAA AGGATATACT CTGGCCGGTG ATAAAGAATC ATCCTGTCTT 5640 

GCTAACAGTT CTTGGAGTCA TTCCCCTCCT GTGTGTGAAC CAGTGAAGTG TTCTAGTCCG 5700 

GAAAATATAA ATAATGGAAA ATATATTTTG AGTGGGCTTA CCTACCTTTC TACTGCATCA 5760 

55 TATTCATGCG ATACAGGATA CAGCTTACAG GGCCCTTCCA TTATTGAATG CAOGGCTTCT 5820 

GGCATCTGGG ACAGAGCGCC ACCTGCCTGT CACCTCGTCT TCTGTGGAGA ACCACCTGCC 5880 

ATCAAAGATG CTGTCATTAC GGGGAATAAC TTCACTTTCA GGAACACCGT CACTTACACT 5940 

TGCAAAGAAG GCTATACTCT TGCTGGTCTT GACACCATTG AATGCCTGGC CGACGGCAAG 6000 

TGGAGTAGAA GTGACCAGCA GTGCCTGGCT GTCTCCTGTG ATGAGCCACC CATTGTGGAC 6060 

60 CACGCCTCTC CAGAGACTGC CCATCGGCTC TTTGGAGACA TTGCATTCTA CTACTGCTCT 6120 

GATGGTTACA GCCTAGCAGA CAATTCCCAG CTTCTCTGCA ATGCCCAGGG CAAGTGGGTA 6180 

CCCCCAGAAG GTCAAGACAT GCCCCGTTGT ATAGCTCATT TCTGTGAAAA ACCTCCATCG 6240 

GTTTCCTATA GCATCTTGGA ATCTGTGAGC AAAGCAAAAT TTGCAGCTGG CTCAGTTGTG 6300 

- AGCTTTAAAT GCATGGAAGG CTTTGTACTG AACACCTCAG CAAAGATTGA ATGTATGAGA 6360 

65 GGTGGGCAGT GGAACCCTTC CCCCATGTCC ATCCAGTGCA TCCCTGTGCQ GTGTGGAGAG 6420 

CCACCAAGCA TCATGAATGG CTATGCAAGT GGATCAAACT ACAGTTTTGG AGCCATGGTG 6480 

GCTTACAGCT GCAACAAGGG GTTCTACATC AAAGGGGAAA AGAAGAGCAC CTGCGAAGCC 6540 

ACAGGGCAGT GGAGTAGTCC TATACCGAGG TGCCACCCGG TATCTTGTGG TGAACCACCT 6600 

AAGGTTGAGA ATGGCTTTCT GGAGCATACA ACTGGCAGGA TCTTTGAGAG TGAAGTGAGG 6660 

70 TATCAGTGTA ACCOGGGCTA TAAGTCAGTC GGAAGTCCTG TATTTGTCTG CCAAGCCAAT 6720 

CGCCACTGGC ACAGTGAATC CCCTCTGATG TGTGTTCCTC TCGACTGTGG AAAACCTCCC 6780 

CCGATCCAGA ATGGCTTCAT GAAAGGAGAA AACTTTGAAG TAGGGTCCAA GGTTCAGTTT 6840 

TTCTGTAATG AGGGTTATGA GCTTGTTGGT GACAGTTCTT GGACATGTCA GAAATCTGGC 6900 

AAATGGAATA AGAAGTCAAA TCCAAAGTGC ATGCCTGCCA AGTGCCCAGA GCCGCCCCTC 6960 

75 TTGGAAAACC AGCTAGTATT AAAGGAGTTG ACCACCGAGG TAGGAGTTGT GACATTTTCC 7020 

TGTAAAGAAG GGCATGTCCT GCAAGGCCCC TCTGTCCTGA AATGCTTGCC ATCCCAGCAA 7080 

TGGAATGACT CTTTCCCTGT TTGTAAGATT GTTCTTTGTA CCCCACCTCC CCTAATTTCC 7140 

TTTGGTGTCC CCATTCCTTC TTCTGCTCTT CATTTTGGAA GTACTGTCAA GTATTCTTGT 7200 

GTAGGTGGGT TTTTCCTAAG AGGAAATTCT ACCACCCTCT GCCAACCTGA TGGCACCTGG 7260 

80 AGCTCTCCAC TGCCAGAATG TGTTCCAGTA GAATGTCCCC AACCTGAGGA AATCCCCAAT 7320 

GGAATCATTG ATGTGCAAGG CCTTGCCTAT CTCAGCACAG CTCTCTATAC CTGCAAGCCA 7380 

GGCTTTGAAT TGGTGGGAAA TACTACCACC CTTTGTGGAG AAAATGGTCA CTGGCTTGGA 7440 

GGAAAACCAA CATGTAAAGC CATTGAGTGC CTGAAACCCA AGGAGATTTT GAATGGCAAA 7500 

TTCTCTTACA CGGACCTACA CTATGGACAG ACCGTTACCT ACTCTTGCAA CCGAGGCTTT 7560 
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CGGCTCGAAO GTCCCAGTGC CTTGACCTGT TTAGAGACAG GTGATTGGGA TGTAGATGCC 
CCATCTTGCA ATQCCATCCA CTGTGATTCC CCACAACCCA TTGAAAATGG TTTTGTAGAA 
GGTGCAGATT ACAGCTATGG TGCCATAATC ATCTACAGTT GCTTCCCTGG GTTTCAGGTG 
GCTGGTCATG CCATGCAGAC CTGTGAAGAG TCAGGATGGT CAAGTTCCAT CCCAACATGT 
ATGCCAATAG ACTGTGGCCT CCCTCCTCAT ATAGATTTTG GAGACTGTAC TAAACTCAAA 
GATGACCAGG GATATTTTGA GCAAGAAGAC GACATGATGG AAGTTCCATA TGTGACTCCT 
CACCCTCCTT ATCATTTGGG AGCAGTGGCT AAAACCTGGG AAAATACAAA GGAGTCTCCT 
GCTACACATT CATCAAACTT TCTGTATGGT ACCATGGTTT CATACACCTG TAATCCAGGA 
TATGAACTTC TGGGGAACCC TGTGCTGATC TGCCAGGAAG ATGGAACTTG GAATGGCAGT 
GCACCATCCT GCATTTCAAT TGAATGTGAC TTGCCTACTG CTCCTGAAAA TGGCTTTTTG 
CGTTTTACAQ AGACTAGCAT GGGAAGTGCT GTGCAGTATA GCTGTAAACC TGGACACATT 
CTAGCAGGCT CTGACTTAAG GCTTTGTCTA GAGAATAGAA AGTGGAGTGG TGCCTCCCCA 
CGCTGTGAAG CCATTTCATG CAAAAAGCCA AATCCAGTCA TGAATGGATC CATCAAAGGA 
AGCAACTACA CATACCTGAG CACGTTGTAC TATGAGTGTG ACCCCGGATA TGTGCTGAAT 
GGCACTGAGA GGAGAACATG CCAGGATGAC AAAAACTGGG ATGAGGATGA GCCCATTTGC 
ATTCCTGTGG ACTGCAGTTC ACCCCCAGTC TCAGCCAATG GCCAGGTGAG AGGAGACGAG 
TACACATTCC AAAAAGAGAT TGAATACACT TGCAATGAAG GGTTCTTGCT TGAGGGAGCC 
AGGAGTCGGG TTTGTCTTGC CAATGGAAGT TGGAGTGGAG CCACTCCCGA CTGTGTGCCT 
GTCAGATGTG CCACCCCGCC ACAACTGGCC AATGGGGTGA CGGAAGGCCT GGACTATGGC 
TTCATGAAGG AAGTAACATT CCACTGTCAC GAGGGCTACA TCTTGCACGG TGCTCCAAAA 
CTCACCTGTC AGTCAGATGG CAACTGGGAT GCAGAGATTC CTCTCTGTAA ACCAGTCAAC 
TGTGGACCTC CTGAAGATCT TGCCCATGGT TTCCCTAATG GTTTTTCCTT TATTCATGGG 
GGCCATATAC AGTATCAGTG CTTTCCTGGT TATAAGCTCC ATGGAAATTC ATCAAGAAGG 
TGCCTCTCCA ATGGCTCCTG GAGTGGCAGC TCACCTTCCT GCCTGCCTTG CAGATGTTCC 
ACACCAGTAA TTGAATATGG AACTGTCAAT GGGACAGATT TTGACTGTGG AAAGGCAGCC 
CGGATTCAGT GCTTCAAAGG CTTCAAGCTC CTAGGACTTT CTGAAATCAC CTGTGAAGCC 
GATGGCCAGT GGAGCTCTGG GTTCCCCCAC TGTGAACACA CTTCTTGTGG TTCTCTTCCA 
ATGATACCAA ATGCGTTCAT CAGTGAGACC AGCTCTTGGA AGGAAAATGT GATAACTTAC 
AGCTGCAGGT CTGGATATGT CATACAAGGC AGTTCAGATC TGATTTGTAC AGAGAAAGGG 
GTATGGAGCC AGCCTTATCC AGTCTGTGAG CCCTTGTCCT GTGGGTCCCC ACCGTCTGTC 
GCCAATGCAG TGGCAACTGG AGAGGCACAC ACCTATGAAA GTGAAGTGAA ACTCAGATGT 
CTGGAAGGTT ATACGATGGA TACAGATACA GATACATTCA CCTGTCAGAA AGATGGTCGC 
TGGTTCCCTG AGAGAATCTC CTGCAGTCCT AAAAAATGTC CTCTCCCGGA AAACATAACA 
CATATACTTG TACATGGGGA CGATTTCAGT GTGAATAGGC AAGTTTCTGT GTCATGTGCA 
GAAGGGTATA CCTTTGAGGG AGTTAACATA TCAGTATGTC AGCTTGATGG AACCTGGGAG 
CCACCATTCT CCGATGAATC TTGCAGTCCA GTTTCTTGTG GGAAACCTGA AAGTCCAGAA 
CATGGATTTG TGGTTGGCAG TAAATACACC TTTGAAAGCA CAATTATTTA TCAGTGTGAG 
CCTGGCTATG AACTAGAGGG GAACAGGGAA CGTGTCTGCC AGGAGAACAG ACAGTGGAGT 
GGAGGGGTGG CAATATGCAA AGAGACCAGG TGTGAAACTC CACTTGAATT TCTCAATGGG 
AAAGCTGACA TTGAAAACAG GACGACTGGA CCCAACGTGG TATATTCCTG CAACAGAGGC 
TACAGTCTTG AAGGGCCATC TGAGGCACAC TGCACAGAAA ATGGAACCTG GAGCCACCCA 
GTCCCTCTCT GCAAACCAAA TCCATGCCCT GTTCCTTTTG TGATTCCCGA GAATGCTCTG 
CTGTCTGAAA AGGAGTTTTA TGTTGATCAG AATGTGTCCA TCAAATGTAG GGAAGGTTTT 
CTGCTGCAGG GCCACGGCAT CATTACCTGC AACCCCQACG AGACGTGGAC ACAGACAAGC 
GCCAAATGTG AAAAAATCTC ATGTGGTCCA CCAGCTCACG TAGAAAATGC AATTGCTCGA 
GGCGTACATT ATCAATATGG AGACATGATC ACCTACTCAT GTTACAGTGG ATACATGTTG 
GAGGGTTTCC TGAGGAGTGT TTGTTTAGAA AATGGAACAT GGACATCACC TCCTATTTGC 
AGAGCTGTCT GTCGATTTCC ATGTCAGAAT GGGGGCATCT GCCAACGCCC AAATGCTTGT 
TCCTGTCCAG AGGGCTGGAT GGGGCGCCTC TGTGAAGAAC CAATCTGCAT TCTTCCCTGT 
CTGAACGGAG GTCGCTGTGT GGCCCCTTAC CAGTGTGACT GCCCGCCTGG CTGGACGGGG 
TCTCGCTGTC ATACAGCTGT TTGCCAGTCT CCCTGCTTAA ATGGTGGAAA ATGTGTAAGA 
CCAAACCGAT GTCACTGTCT TTCTTCTTGO ACGGGACATA ACTGTTCCAG GTAA 

Seq ID NO: CI 3 5 DNA Sequence 

Nucleic Acid Accession #t FGBNESH predicted 

Coding sequence: 1..390 



1 
I 

ATGAGGTTCA 
TATGTGTCAG 
GAACCATGGC 
CAGTGCTGTT 
TGCACCTTCT 
TTTGTTGTGA 
AGTAAATGTG 



11 
I 

GTGTCTCAGG 

TGTGCCAGCC 
ACAATGACGC 
GGCCCTGCTT 
AGCTGAAGGT 
AAAGAGGCCG 



21 
I 

CATGAGGACC 
CCTCTTGTGT 
GGCACCCAGG 
CATCGTGTCC 
TGAGCTCTGC 
TCAGGGTGTG 
GATATGTTAG 



31 
I 

GACTACCCCA 
CCAAGGGAAG 
TGTGGAGACA 
CTGAGCGAGA 
TGTCTTGATT 
AATTCCCAGT 



41 

I 

GGAGTGTGCT 
TCATCGCTCC 
AGATCTACAA 
CCCGCCAATG 
CCTTTGGCCT 
GCCACTCATC 



51 
I 

GGCTCCTGCT 
CGCTGGCTCA 
CCCCTTGGAG 
TGGTCCCCCC 
CACAAACGAT 
TCCCATCTCC 



Seq ID NO: C136 DNA Sequence 
Nucleic Acid Accession BC035671.1 
Coding sequence : 126 . . 1745 



GGCAGCGACT 
GGCAGCTGCG 
GGGTCATGGT 
TCCTGGCTTT 
ATGGCTGTGG 
CCGGGACCTA 
GACTGATTCT 
TTCTCTTCAC 
AAGAACTCTX 
CTGGCCGGGG 



11 
I 

GCGCCCCGTC 
GCTCGGGATC 
GCCCGGCGCC 
GCTGCTCGCG 
ACACCTAGTG 
CCCCAATCAC 
GAGGTTGGGA 
CAGCTCTTCA 
GTTGAACACA 
TTTTTTGCTG 



21 
I 

CCGGCGCCGC 
CGTCGAGGGG 
CGCGGCGGCG 
GTCTCCGCCC 
ACTTATCAGG 
ACTGTTTGCG 
GATTTGGATA 
GATCAATATG 
AGTGAAGTAA 
ACCTATGCGA 



31 
I 

GCTCGTCCGC 
AGGCCGAGCT 
GCGCACTGGC 
CGCTCCGGCT 
ATAGTGGCAC 
AAAAGACAAT 
TCGAATCCCA 
GTCCATACTG 
CCGTCCGCTT 
GCAGCGACCA 



41 

1 

AGAGGAGGCG 
TGCCAAGCTG 
GCGGGCTGCC 
GCAGGCGGAG 
AATGACATCT 
TACAGTACCA 
GACCTGTGCT 
TGGAAGTATG 
TGAGAGTGGA 
TCCAGATTTA 



7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 

B5B0 

8640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 

9300 

9360 

9420 

9480 

9540 

9600 

9660 

9720 

9780 

9840 

9900 

9960 - 

10020 

10080 

10140 

10200 

10260 

10320 

10380 

10440 

10500 

10560 

10620 

10674 



51 
I 

GCCCGGCCCG 
GCGCCCAGCG 
GGGCGGGGCC 
GAGCTGGGTG 
AAGAATTATC 
AAGGGGAAAA 
TCTGACTATC 
ACTGTTCCCA 
TCCCACATTT 
ATAACATGTT 



60 

120 

180 

240 

300 

360 

390 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 
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TGGAACGAGC TAGCCATTAT TTGAAGACAG AATACAGCAA ATTCTGCCCA GCTGGTTQTA 660 

GAGACGTAGC AGGAGACATT TCTGGGAATA TGGTAGATGG ATATAGAGAT ACCTCTTTAT 720 

TGTGCAAAGC TGCCATCCAT GCAGGAATAA TTGCTGATGA ACTAGGTGGC CAGATCAGTG 780 

TGCTTCAGCG CAAAGGGATC AGTCGATATG AAGGGATTCT GGCCAATGGT GTTCTTTCGA 840 

GGGATGGTTC CCTGTCAGAC AAGCGATTTC TGTTTACCTC CAATGGTTGC AGCAGATCCT 900 

TGAGTTTTGA ACCTGACGGG CAAATCAGAG CTTCTTCCTC ATGGCAGTCG GTCAATGAGA 960 

GTGGAGACCA AGTTCACTGG TCTCCTGGCC AAGCCCGACT TCAGGACCAA GGCCCATCAT 1020 

GGGCTTCGGG CGACAGTAGC AACAACCACA AACCAOGAGA GTGGCTGGAG ATCGATTTGG 1080 

GGGAGAAAAA GAAAATAACA GGAATTAGGA CCACAGGATC TACACAGTCG AACTTCAACT 1140 

TTTATGTTAA GAGTTTTGTG ATGAACTTCA AAAACAATAA TTCTAAGTGG AAGACCTATA 1200 

AAGGAATTGT GAATAATGAA GAAAAGGTGT TTCAGGGTAA CTCTAACTTT CGGGACCCAG 1260 

TGCAAAACAA TTTCATCCCT CCCATCGTGG CCAGATATGT GCGGGTTGTC CCCCAGACAT 1320 

GGCACCAGAG GATAGCCTTG AAGGTGGAGC TCATTGGTTG CCAGATTACA CAAGGTAATG 1380 

ATTCATTGGT GTGGCGCAAG ACAAGTCAAA GCACCAGTGT TTCAACTAAG AAAGAAGATG 1440 

AGACAATCAC AAGGCCCATC CCCTCGGAAG AAACATCCAC AGGAATAAAC ATTACAACGG 1500 

TGGCTATTCC ATTGGTGCTC CTTGTTGTCC TGGTGTTTGC TGGAATGGGG ATCTTTGCAG 1560 

CCTTTAGAAA GAAGAAGAAG AAAGGAAGTC CGTATGGATC AGCAGAGGCT CAGAAAACAG 1620 

ACTGTTGGAA GCAGATTAAA TATCCCTTTG CCAGACATCA GTCAGCTGAG TTTACCATCA 1680 

GCTATGATAA TGAGAAGGAG ATGACACAAA AGTTAGATCT CATCACAAGT GATATGGCAG 1740 

GTTAACTCCG TTGACTGCCA AAATAGCATC CCCAACGTGC AGCCCTCCGC ATCTATCAGC 1800 

AGGTTGCCCC GGATGGATCT CAGAGATGAG GATCGGAACA CCATGTTCTT TCCCACCCTA 1860 

ACAACAACAA AGGGCAGTAA ATTAAAGTAC TCTTTGTAAG GTACAGTTAC CGATTAATCT 1920 

AGAGATAAAA TATTTTCTTA AAAATATATT TCATTAAACA CCTATGCTGT CTCTATAAAA 1980 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 2010 

Seq ID NO: Cl 3 7 DNA Sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..1761 

1 11 21 31 41 51 

I I I I I I 

ATGGGATTCG GTGCGGGGCA GCGACTGCGC CCCGTCCCGG CGCCGCGCTC GTCCGCAGAG 60 

GAGGCGGCCC GGCCCGGGCA GCTGCGGCTC GGGATCCGTC GAGGGGAGGC CGAGCTTGCC 120 

AAGCTGGCGC CCAGCGGGGT CATGGTGCCC GGCGCCCGCG GCGGCGGCGC ACTGGCGOGG 180 

GCTGCCGGGC GGGGCCTCCT GGCTTTGCTG CTCGCGGTCT CCGCCCCGCT CCGGCTGCAG 240 

GCGGAGGAGC TGGGTGATGG CTGTGGACAC CTAGTGACTT ATCAGGATAG TGGCACAATG 300 

ACATCTAAGA ATTATCCCGG GACCTACCCC AATCACACTG TTTGCGAAAA GACAATTACA 360 

GTACCAAAGG GGAAAAGACT GATTCTGAGG TTGGGAGATT TGGATATCGA ATCCCAGACC 420 

TGTGCTTCTG ACTATCTTCT CTTCACCAGC TCTTCAGATC AATATGGTCC ATACTGTGGA 480 

AGTATGACTG TTCCCAAAGA ACTCTTGTTG AACACAAGTG AAGTAACCGT CCGCTTTGAG 540 

AGTGGATCCC ACATTTCTGG CCGGGGTTTT TTGCTGACCT ATGCGAGCAG CGACCATCCA 600 

GATTTAATAA CATGTTTGGA ACGAGCTAGC CATTATTTGA AGACAGAATA CAGCAAATTC 660 

TGCCCAGCTG GTTGTAGAGA CGTAGCAGGA GACATTTCTG GGAATATGGT AGATGGATAT 720 

AGAGATACCT CTTTATTGTG CAAAGCTGCC ATCCATGCAG GAATAATTGC TGATGAACTA 780 

GGTGGCCAGA TCAGTGTGCT TCAGCGCAAA GGGATCAGTC GATATGAAGG GATTCTGGCC 840 

AATGGTGTTC TTTCGAGGGA TGGTTCCCTG TCAGACAAGC GATTTCTGTT TACCTCCAAT 900 

GGTTGCAGCA GATCCTTGAG TTTTGAACCT GACGGGCAAA TCAGAGCTTC TTCCTCATGG 960 

CAGTCGGTCA ATGAGAGTGG AGACCAAGTT CACTGGTCTC CTGGCCAAGC CCGACTTCAG 1020 

GACCAAGGCC CATCATGGGC TTCGGGCGAC AGTAGCAACA ACCACAAACC ACGAGAGTGG 1080 

CTGGAGATCG ATTTGGGGGA GAAAAAGAAA ATAACAGGAA TTAGGACCAC AGGATCTACA 1140 

CAGTCGAACT TCAACTTTTA TGTTAAGAGT TTTGTGATGA ACTTCAAAAA CAATAATTCT 1200 

AAGTGGAAGA CCTATAAAGG AATTGTGAAT AATGAAGAAA AGGTGTTTCA GGGTAACTCT 1260 

AACTTTCGGG ACCCAGTGCA AAACAATTTC ATCCCTCCCA TCGTGGCCAG ATATGTGCGG 1320 

GTTGTCCCCC AGACATGGCA CCAGAGGATA GCCTTGAAGG TGGAGCTCAT TGGTTGCCAG 1380 

ATTACACAAG GTAATGATTC ATTGGTGTGG CGCAAGACAA GTCAAAGCAC CAGTGTTTCA 1440 

ACTAAGAAAG AAGATGAGAC AATCACAAGG CCCATCCCCT CGGAAGAAAC ATCCACAGGA 1500 

ATAAACATTA CAACGGTGGC TATTCCATTG GTGCTCCTTG TTGTCCTGGT GTTTGCTGGA 1560 

ATGGGGATCT TTGCAGCCTT TAGAAAGAAG AAGAAGAAAG GAAGTCCGTA TGGATCAGCA 1620 

GAGGCTCAGA AAACAGACTG TTGGAAGCAG ATTAAATATC CCTTTGCCAG ACATCAGTCA 1680 

GCTGAGTTTA CCATCAGCTA TGATAATGAG AAGGAGATGA CACAAAAGTT AGATCTCATC 1740 

ACAAGTGATA TGGCAGGTTA A 1761 



Seq ID NO: C138 DNA Sequence 
Nucleic Acid Accession #i PGENESH predicted 
OD Coding sequence: 1..2310 



1 11 21 31 41 51 

I I I I I I 

ATGTTCCAGC GGCAGGAAAG ATTTCTTGAC TTATCTTCAG CTGAAGCAGT GGCAGCTTGG 60 

ATATTACATC AACATCCTGA CATTATTAAC AAAGGTGATG GCTGTGGACA CCTAGTGACT 120 

TATCAGGATA GTGGCACAAT GACATCTAAG AATTATCCCG GGACCTACCC CAATCACACT 180 

GTTTGCGAAA AGACAATTAC AGTACCAAAG GGGAAAAGAC TGATTCTGAG GTTGGGAGAT 240 

TTGGATATCG AATCCCAGAC CTGTGCTTCT GACTATCTTC TCTTCACCAG CTCTTCAGAT 300 

CAATATGGAA TGCAGAAGGA GGAGGAGACA GAAGTGCTTT GTCTTTCAGT GGCTGGCGCT 360 

CAGAGAGTGG ACATTCCTGT GCAGCTGTTG CCCAGCTTCC TGGAAGGGTG GAAGGGTCAT 420 

GCTGATGCAA GAGGTCCATA CTGTGGAAGT ATGACTGTTC CCAAAGAACT CTTGTTGAAC 480 

ACAAGTGAAG TAACCGTCCG CTTTGAGAGT GGATCCCACA TTTCTGGCCG GGGTTTTTTG 540 

CTGACCTATG CGAGCAGCGA CCATCCAGAT TTAATAACAT GTTTGGAACG AGCTAGCCAT 600 

TATTTGAAGA CAGAATACAG CAAATTCTGC CCAGCTGGTT GTAGAGACGT AGCAGGAGAC 660 

ATTTCTGGGA ATATGGTAGA TGGATATAGA GATACCTCTT TATTGTGCAA AGCTGCCATC 720 

CATGCAGGAA TAATTGCTGA TGAACTAGGT GGCCAGATCA GTGTGCTTCA GCGCAAAGGG 780 

ATCAGTCGAT ATGAAGGGAT TCTGGCCAAT GGTGTTCTTT CGAGGGATGG TTCCCTGTCA 840 

GACAAGCGAT TTCTGTTTAC CTCCAATGGT TGCAGCAGAT CCTTGAGTTT TGAACCTGAC 900 

GGGCAAATCA GAGCTTCTTC CTCATGGCAG TCGGTCAATG AGAGTGGAGA CCAAGTTCAC 960 
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TGGTCTCCTG 
AGCAACAACC 
ACAGGAATTA 
GTGATGAACT 
GAAGAAAAGG 
CCTCCCATCG 
TTGAAGGTGG 
AAGACAAGTC 
ATCCCCTCGG 
CAGATGATCT 
ACAGCGGAAA 
TTCTTAGCAC 
CCCTTTGCCA 
ACACAAAAGT 
GGCACCGGGA 
GAGGCAGGGG 
CACGAGTACG 
CGGCACGTGC 
CAGCCCGGCC 
GCCCAGGACG 
CGGCCCAAAG 
CCAACGCATC 
AACCAGACGG 



GCCAAGCCCG 
ACAAACCACG 
GGACCACAGG 
TCAAAAACAA 
TGTTTCAGGG 
TGGCCAGATA 
AGCTCATTGG 
AAAGCACCAO 
AAG AAA CATC 
CACAAAGGGA 
GCATGGTTAG 
ACACACCTGA 
GACATCAGTC 
TAGATCTCAT 
CAGTCACGAG 
TGAGCACCGA 
CGCTGCCCCT 
TGCGCGCCCA 
ACAAACACTC 
GAGACTATCA 
CTGTCAGCGC 
CCGGGACGAO 
CCATGACTGC 



ACTTCAGGAC 
AGAGTGGCTG 
ATCTACACAG 
TAATTCTAAG 
TAACTCTAAC 
TGTGCGGGTT 
TTGCCAGATT 
TGTTTCAACT 
CACAGATGCC 
GAATCTGGGA 
AGTAGTGTTT 
GGAGGACATT 
AGCTGAGTTT 
CACAAGTGAT 
GAAGGGCTCC 
TGCCGGCGGC 
GGCGCCCCCG 
CACGTTCTCT 
CCTCTCCTCG 
AAGGCCACAC 
CCTCGCCACC 
TGACAGCTAT 
CCTTTTGTGA 



CAAGGCCCAT 
GAGATCGATT 
TCGAACTTCA 
TGGAAGACCT 
TTTCGGGACC 
GTCCCCCAGA 
ACACAAGGTA 
AAGAAAGAAG 
ATGCCAGTGC 
CCTGATGAGG 
GCTGTTGTGG 
GATCACTACT 
ACCATCAGCT 
ATGGCAGATT 
ACCTTCCGGC 
CACTATGACT 
GAGCCCGAGT 
GCGCAGAGCG 
GGCGGCTTCT 
AGCGCACAGC 
GAAAGCGGGC 
TCTQCCCCCA 



CATGGGCTTC 
TGGGGGAGAA 
ACTTTTATGT 
ATAAAGGAAT 
CAGTGCAAAA 
CATGGCACCA 
ATGATTCATT 
ATGAGACAAT 
AGATTGTCGG 
GCAAAATACC 
TTAATGACCT 
GTTGGAAGCA 
ATGATAATGA 
ACCAGCAGCC 
CCATGGACAC 
GCCCGCAGCG 
ACGCCACGCC 
GCTACCGCGT 
CCCCCGTAGC 
CTGCGGACAG 
ACCCTGACTC 
GAGACTGCCT 



GGGCGACAGT 
AAAGAAAATA 
TAAGAGTTTT 
TGTGAATAAT 
CAATTTCATC 
GAGGATAGCC 
GGTGTGGCGC 
CACAAGGCCC 
AGACCATACC 
TTTTAAAGGC 
TGGCATGCTG 
GATTAAATAT 
GAAGGAGATG 
CCTCATGATT 
GGATGCCGAG 
GGCCGGCCGC 
CATCGTGGAG 
CCCAGGGCCC 
GGGTGTGGGC 
GGGCTACGAC 
TCAGAAGCCC 
CACACCCCTC 



Seq ID NO: CI 3 9 DMA Sequence 

Nucleic Acid Accession #: NMJJ04616.2 

Coding sequence: 180.. 893 



1 
I 

AGTGCCCCAG 
ATTTAAATGT 
GACAAGCCTG 
TGGCAGGTGT 
TATGTGGTAT 
CAATTTTTGG 
CTGTAGGTGC 
GTCGCTGCAT 
CGACAGGTAT 
TCTATGAAAA 
CCATAATTGT 
GGGGAAATAA 
GCCAAAGCTA 
TCTTGGCAAA 
TACTGGGTTT 
GATGCATCAA 
AAATATGTAA 
AGACCACAGA 
AATGATATGA 



11 
I 

GAGCTATGAC 
CCGTGGATAC 
TAACGAATAG 
GAGTGCCTGT 
CTTGATCCTA 
TTCTGAAGAT 
CATCATCATG 
GCTTCTGTTG 
CCTAGGAGCT 
CACAAAGCTT 
GTTTCAAGAA 
TTTTCAACAC 
TAATGGAAAA 
AAATTTGATT 



CCTATCGTCA 
GTGCTATATA 
TATCTTCTAG 
ATGTGTATTT 



21 
I 

AAGCAAAGGA 
AGAAATCTCT 
TTAAATTCAC 
ATAAAATATT 
GCATTAGCAA 
GTAGGCTCTA 
ATTCTGGGCT 
TTTTTCATAG 
GTTTTCAAAT 
TTGAGCGCCA 
GAGTTTAAAT 
TATCCTGAAT 
CAAGTTTACA 
ATAGTTATTG 
ATGGTCCTGT 
GTCAAACCCC 
AGTCAGGAGC 
ACATATTGAA 
TTACTCAAAA 



31 
I 

ACATACTTGC 
GCAGGCAAGT 
GGCATCTGGA 
CTATGTTTAC 
TATGGGTACG 
GCTCCTACGT 
TCCTGGGATG 
GCTTGCTTCT 
CTAAGTCTGA 
CAGGGGAAAG 
GCTGCGGTTT 
TATGTGCCTG 
AAGAGACCTG 
GAATATCATT 
ATTGCCAGAT 
TTTAAAATGT 
AGCTGTCTTT 
CACATTTAAG 
TAAAAGTAAC 



41 
I 

CTGGAGATAG 
TGCTCCAGAG 
TTCCTAATCC 
CTTCAACTTC 
AGTAAGCAAT 
TGCTGTGGAC 
CTGCGGTGCT 
GATCCTGCTC 
TCGCATTGTG 
TGAAAAACAA 
GGTCAATGGA 
TCTAGATAAG 
TATTTCTTTC 
TGGACTGGCA 
CGGGAACAAA 
TGCTTTGGCT 
TTAAAATGTC 
ATTTGAGGGA 
TGTTTACGTT 



51 
I 

CCTTTGCGAT 
CATATTGCAG 
TTTTCCGAAA 
TTGTTCTGGC 
GACTCTCAAG 
ATATTGATTG 
ATAAAAGAAA 
CTGCAGGTGG 
AATGAAACTC 
TTCCAGGAAG 
GCTGCTGATT 
CAGAGACCAT 
ATAAAAGACT 
GTTATTGAGA 
TGAATCTGTG 
TTGTAAATTT 
TCGGCTAGCT 
TATAAGGGAA 
AAAAAAAAAA 



Seq ID NO » CI 40 DNA Sequence 
Nucleic Acid Accession #» NM_004617.2 
Coding sequence: 232.. 840 



CTTCAGGTCA 
AGCTGAAGCA 
ATTGAATTGG 
AGGAGGCAGT 
GGGGGCTGTG 
GCTAACATCC 
CAAGAGATCT 
CTGGTGTTCT 
GGGAAGCGAT 
GGATACTCGT 
AATAGTACAT 
AACAAGTGCC 
CTGGTCGTAG 
GGGACCCTCT 
ACCTCCGAGA 
CTTCTTCTCT 
AGGCAGTTAT 
TAAAAAATAG 
TTAGAATTTA 
ATGTATAAAG 
TGGAAGTTTT 
TTCTGTTTTT 
TTTTTTGCTG 
GGGTTCTTTT 



11 
I 

GGGAGAATGT 
ACTCCAAGGA 
AGACAATTAC 
GAGGAGCTTT 
CCAGATGCCT 
TGTTATTTTT 
GGTTTTTCGG 
TGGGCCTGAA 
TTGCGATGTT 
TTATCATCTC 
GGGGCTACCC 
GAGAGCCTCT 
GAGGAATCCA 
GTGGGGACTG 
TGAGCTGCTC 
TCTTGGAATT 
TCCTTCTTTC 
TTTGGCCACT 
CCAACAGGTT 
TCACATGTAC 
ACATAAATCA 
AAGAGGAAAA 
CCTTGCTTGA 
ATGTATGTTA 



21 
I 

ATAAATGTCC 
CACAGTTCAC 
AAGGACTCTC 
TGATTGCTGA 
GGGGGGGACC 
TCCTGGAGGA 
AGGAATATTA 
GAACAATGAC 
CACCTCCACG 
AGCCATTTCA 
CTTCCACGAC 
CAATGTGGTT 
GATGGTTCTC 
CCAGTGTTGT 
AGACTCTACA 
ATTAATTCCT 
CAACCAGCTT 
TAACAAATTT 
CAAAGCATAC 
TGCCATACTA 
AAGGAAGAAA 
AGAATGATTG 
GTTGCTTGTG 
AATTAAAACC 



31 
I 

ATTGCCATCG 
AGAAATTTGG 
TGGCCAAAAA 
CCTGTGTCGT 
CTCATTCCCC 
AAAGTGATAG 
GGAAGCGGTG 
TGCTGTGGGT 
ATATTTGCTG 
ATCAACAAGG 
GGGGATTATC 
CCCTGGAATC 
TGCGCCATCC 
GGCTGCTGTG 
GCATGACGAC 
ATCTGCTTCC 
TGCTCGAGTT 
GATTTATAAA 
TTTTCATGAT 
CTTCTTTGTA 
GCACATTTAA 
ATGTATCCTA 
ACTGATCTTT 
TGAATTCAGA 



41 

I 

AGGTTCTGCT 
TTCTCAGCCC 
CCCTTGAAGA 
ACCACCCCAG 
TTGCTTTTTT 
ATGACAACGA 
TCTTGATGAT 
GCTGCGGCAA 
TGGTTGGATT 
GTCCTAAATG 
TCAATGATGA 
TGACCCTCTT 
AGGTGGTCAA 
GGGGAGATGG 
TACAATTTCT 
TAGCTGATAA 
AGAATTTTGT 
TCTTTCAAAT 
TTTTTTATTA 
TATAAAGATG 
AATGAGAAAC 
AGTATTGTTA 
TGAGGCTGTC 
GGTAAOGT 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2310 



51 
I 

ATTTTTGAGA 
CAAAATACTG 
GGCCCOGTGA 
AATGTGCACT 
TGGCTTCCTG 
CCACCTTTCC 
CTTCCCTGCG 
CGAGGGCTGT 
CTTGGGAGCT 
CCTCATGGCC 
GGCCTTATGG 
CTCCATCCTG 
TGGCCTCCTG 
ACCCGTTTAA 
TTTCATAAAA 
AGCTTAGAAA 
TATTTTCAAA 
TAGTTCCTTT 
CAAATGTAAA 
TTTATATCTT 
TAAGACCAAT 
TTTGTTGTCT 
ATCATGGCTA 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1159 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1428 



Seq ID NO: CI 41 DMA Sequence 

Nucleic Acid Accession fti NM_002381.2 



1288 



WO 03/042661 



PCT/US02/36810 



Coding sequence: 64.. 1524 

1 11 21 31 41 SI 

-I I I I I I 

J AAATCCGAGC CTCGCGTGGG CTCCTGGCCC CCGACGGACA CCACCAGGCC CACGGAGCCC 60 

ACCATGCCGC GCCCGGCCCC CGCGCGCCGC CTCCCGGGAC TCCTCCTGCT GCTCTGGCCG 120 

CTGCTGCTGC TGCCCTCCGC CGCCCCCGAC CCCGTGGCCC GCCCGGGCTT CCGGAGGCTG 180 

GAGACCCGAG GTCCCGGGGG CAGCCCTGGA CGCCGCCCCT CTCCTGCGGC TCCCGACGGC 240 

GCGCCCGCTT CCGGGACCAG CGAGCCTGGC CGCGCCCGCG GTGCAGGTGT TTGCAAGAGC 300 

10 AGACCCTTGG ACCTGGTGTT TATCATTGAT AGTTCTCGTA GCGTACGGCC CCTGGAATTC 360 

ACCAAAGTGA AAACTTTTGT CTCCCGGATA ATCGACACTC TGGACATTGG GCCAGCCGAC 420 

ACGCGGGTGG CAGTGGTGAA CTATGCTAGC ACTGTGAAGA TCGAGTTCCA ACTCCAGGCC 480 

TACACAGATA AGCAGTCCCT GAAGCAGGCT GTGGGTCGAA TCACACCCTT GTCAACAGGC 540 

ACCATGTCAG GCCTAGCCAT CCAGACAGCA ATGGACGAAG CCTTCACAGT GGAGGCAGGG 600 

15 GCTCGAGAGC CCTCTTCTAA CATCCCTAAG GTGGCCATCA TTGTTACAGA TGGGAGGCCC 660 

CAGGACCAGG TGAATGAAGT GGOGGCTCGG GCCCAAGCAT CTGGTATTGA GCTCTATGCT 720 

GTGGGCGTGG ACCGGGCAGA CATGGCGTCC CTCAAGATGA TGGCCAGTGA GCCCCTAGAG 780 

GAGCATGTTT TCTACGTGGA GACCTATGGG GTCATTGAGA AACTTTCCTC TAGATTCCAG 840 

GAAACCTTCT GTGCGCTGGA CCCCTGTGTG CTTGGAACAC ACCAGTGCCA GCACGTCTGC 900 

20 ATCAGTGATG GGGAAGGCAA GCACCACTGT GAGTGTAGCC AAGGATACAC CTTGAATGCC 960 

GACAAGAAAA CGTGTTCAGC TCTTGATAGG TGTGCTCTTA ACACCCACGG ATGTGAGCAC 1020 

ATCTGTGTGA ATGACAGAAG TGGCTCTTAT CATTGTGAGT GCTATGAAGG TTATACCTTG 1080 

AATGAAGACA GGAAAACTTG TTCAGCTCAA GATAAATGTG CTTTGGGTAC CCATGGGTGT 1140 

CAGCACATTT GTGTGAATGA CAGAACAGGG TCCCATCATT GTGAATGCTA TGAGGGCTAC 1200 

25 ACTCTGAATG CAGATAAAAA AACATGTTCA GTCCGTGACA AGTGTGCCCT AGGCTCTCAT 1260 

GGTTGCCAGC ACATTTGTGT GAGTGATGGG GCCGCATCCT ACCACTGTGA TTGCTATCCT 1320 

GGCTACACCT TAAATGAGGA CAAGAAAACA TGTTCAGCCA CTGAGGAAGC ACGAAGACTT 1380 

GTTTCCACTG AAGATGCTTG TGGATGTGAA GCTACACTGG CATTCCAGGA CAAGGTCAGC 1440 

TCGTATCTTC AAAGACTGAA CACTAAACTT GATGACATTT TGGAGAAGTT GAAAATAAAT 1500 

30 GAATATGGAC AAATACATCG TTAAATTGCT OCAATTTCTC ACCTGAAAAT GTGGACAGCT 1560 

TGGTGTACTT AATACTCATG CATTCTTTTG CACACCTGTT ATTGCCAATG TTCCTGCTAA 1620 

TAATTTGCCA TTATCTGTAT TAATGCTTGA ATATTACTGG ATAAATTGTA TGAAGATCTT 1680 

CTGCAGAATC AGCATGATTT TTCCAAGGAA ATACATATGC AGATACTTAT TAAGAGCAAA 1740 

CTTTAGTGTC TCTAAGTTAT GACTGTGAAA TGATTGGTAG GAAATAGAAT GAAAAGTTTA 1800 

35 GTGTTTCTTT ATCTACTAAT TGAGCCATTT AATTTTTAAA TGTTTATATT AGATAACCAT 1860 

ATTCACAATG GAAACTTTAG GTCTAGTTTC TTTTGATAGT ATTTATAATA TAAATCAATC 1920 

TTATTACTGA GAGTGCAAAT TGTACAAGGT ATTTACACAT ACAACTTCAT ATAACTGAGA 1980 

TGAATGTAAT TTTGAACTGT TTAACACTTT TTGTTTTTTG CTTATTTTGT TGGAGTATTA 2040 

TTGAAGATGT GATCAATAGA TTGTAATACA CATATCTAAA AATAGTTAAC ACAGATCAAG 2100 

40 TGAACATTAC ATTGCCATTT TTAATTCATT CTGGTCTTTG AAAGAAATGT ACTACTAAAG 2160 

AGCACTAGTT GTGAATTTAG GGTGTTAAAC TTTTTACCAA GTACAAAAAT CCCAAATTCA 2220 

CTTTATTATT TTGCTTCAGG ATCCAAGTGA CAAAGTTATA TATTTATAAA ATTGCTATAA 2280 

ATCGACAAAA TCTAATGTTG TCTTTTTAAT GTTAGTGATC CACCTGCCTC AGCCTCCCAA 2340 

AGTGCTGGGA TTACAGGCTT GAAAGTCTAA CTTTTTTTTA CTTATATATT TGATACATAT 2400 

AATTCTTTTG GCTTTGAAAC TTGCAACTTT GAGAACAAAA CAGTCCTTTA AATTTTGCAC 2460 

TGCTCAATTC TGTTTTTCGT TTGCATTGTC TTTAATATAA TAAAAGTTAT TACCTTTACA 2520 

TATTATCATG TCTATTTTTG ATGACTCATC AATTTTGTCT ATTAAAGATA TTTCTTTAAA 2580 

TTAAAAAAAA AAAAAAAAA 2599 



45 
50 
55 
60 
65 
70 



Seq ID NO: C142 DNA Sequence 
Nucleic Acid Accession #: NM_016639.l 
Coding sequence: 40.. 42 9 

1 11 21 31 41 51 

I I I I I I 

GCGGCGGGCG CAGACAGCGG CGGGCGCAGG ACGTGCACTA TGGCTCGGGG CTCGCTGCGC 60 

CGGTTGCTGC GGCTCCTCGT GCTGGGGCTC TGGCTGGCGT TGCTGCGCTC CGTGGCCGGG 120 

GAGCAAGCGC CAGGCACCGC CCCCTGCTCC CGCGGCAGCT CCTGGAGCGC GGACCTGGAC 180 

AAGTGCATGG ACTGCGCGTC TTGCAGGGOG CGACCGCACA GCGACTTCTG CCTGGGCTGC 240 

GCTGCAGCAC CTCCTGCCCC CTTCCGGCTG CTTTGGCCCA TCCTTGGGGG CGCTCTGAGC 300 

CTGACCTTCG TGCTGGGGCT GCTTTCTGGC TTTTTGGTCT GGAGACGATG CCGCAGGAGA 360 

GAGAAGTTCA CCACCCCCAT AGAGGAGACC GGCGGAGAGG GCTGCCCAGC TGTGGOGCTG 420 

ATCCAGTGAC AATGTGCCCC CTGCCAGCCG GGGCTCGCCC ACTCATCATT CATTCATCCA 480 

TTCTAGAGCC AGTCTCTGCC TCCCAGACGC GGCGGGAGCC AAGCTCCTCC AACCACAAGG 540 

GGGGTGGGGG GCGGTGAATC ACCTCTGAGG CCTGGGCCCA GGGTTCAGGG GAACCTTCCA 600 

AGGTGTCTGG TTGCCCTGCC TCTGGCTCCA GAACAGAAAG GGAGCCTCAC GCTGGCTCAC 660 

ACAAAACAGC TGACACTGAC TAAGGAACTG CAGCATTTGC ACAGGGGAGG GGGGTGCCCT 720 

CCTTCCTTAG GACCTGGGGG CCAGGCTGAC TTGGGGGGCA GACTTGACAC TAGGCCCCAC 780 

TCACTCAGAT GTCCTGAAAT TCCACCACGG GGGTCACCCT GGGGGGTTAG GGACCTATTT 840 

TTAACACTAG GGGCTGGCCC ACTAGGAGGG CTGGCCCTAA GATACAGACC CCCCCAACTC 900 

CCCAAAGCGG GGAGGAGATA TTTATTTTGG GGAGAGTTTG GAGGGGAGGG AGAATTTATT 960 

AATAAAAGAA TCTTTAACTT T AAAAAAAAA AAAAAAAA 998 

75 Seq ID NO: C143 DNA Sequence 

Nucleic Acid Accession th NM_001819 
Coding sequence: 113.. 2146 

1 11 21 31 41 51 

80 | | | | | | 

CCAGGAGGCA CGCTGGTTTT CCGGGGCCGC TCCATCGCGC CTTCCTCCTG CGCCTCGCTT 60 
CTCCGGTCCA GCCGCCATCT TCCTTTCCGC ACAGGGGCCG CCGAGCGGGG CCATGCAGCC 120 
AACGCTGCTT CTCAGCCTCC TGGGAGCCGT GGGGCTGGCG GCTGTCAATT CCATGCCAGT 180 
GGATAACAGG AACCACAATG AAGGAATGGT GACTCGCTGC ATCATTGAGG TCCTCTCAAA 240 

1289 



WO 03/042661 



PCT/US02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



TGCCTTGTCG 
GAGTAGAAAA 
AAQATTQTTA 
GGCAGGAGCC 
GGCAGAGGGA 
CTCCGACAGC 
GGATGAGGAG 
TGAAGAGAAA 
GCAGGCTTCA 
GCATTCTCAG 
AGGGAGCCAG 
TGAGGAAGGT 
CCACCACGGG 
AAAGGGACAC 
AAAGAGGGAC 
AGAAATAAAG 
GGGCAGAGGA 
GGAGGACAAG 
AGAAAGTGAG 
AGGGGAGCCA 
AGGACACCAC 
GTGGAAAGAG 
GCAGCAGCAG 
TAAACAATAT 
CCCATACTAC 
TGACAATTTT 
CCCAGAATAC 
GTATGAGAAG 
GGTGGCCCAA 
CTATGATTCT 
TGACCAGACA 
TTTGGAACTA 
GTGGGCACTG 
ACCATTTATA 
TTGGAATTGT 
GCATATTCTT 
TCTTTGGAAA 



AAGTCCAGCG 
GACGTCAAAG 
AGAGACCCAG 
CCAGGGGAGG 
GGCGGGCACA 
CAAGTCTCTG 
GAGGAGGAGG 
CACCTTGAAG 
GCTATAAAAA 
GAGAAGACAC 
GAGAATCACC 
GAGGAAGATG 
AGGAGCAGGC 
CCCCAGGAGG 
CACCATTCAA 
GGTTATCCAG 
AGTGAAGAAT 
AGAAACTACC 
GAAGAGAGGG 
CGTGCCTATT 
CGTGTCCAAG 
CTGGACAGAA 
GGAGACCTGC 
AGCTCCCATC 
GACCCTCTCC 
CTCGAGGGTG 
AACTATGACT 
AGAAACCTCG 
CTGGACCAGC 
GAGGAGCCGG 
GTCCTGACAG 
CAGAAGATAG 
TTAAGAAGCA 
TACCCAAGGG 
CTTTAATTTC 
TTCTGCAAAA 
AATGTTTGTC 



CTCCACCCAT 
ACAAAGAGAC 
CTGATGCCTC 
AGGACATCCA 
GCCGAGAGCG 
AAGAAGTGAA 
GAGAGAACTA 
AGCCAGGAGA 
AAGAGGAGTT 
ATAGCCGAGA 
CCCAGGAGTC 
CCACCTCTGA 
CCGACAGGTC 
AATCTGAGGA 
CCCACTACAG 
GCGTCCAGGC 
ACAGGGCTCC 
CCAGCTTAGA 
GCCTTGAGCC 
TCATGTCTGA 
AAAACCAGAT 
ATTATCTCAA 
AGGACACTAA 
ACACAGCTGA 
AGTGGAAGAG 
AGGAGGAAAA 
GGTGGGAGAA 
CCAGGGTCCC 
TCCTTCACTA 
TGAGCACCCA 
AGGACGAGAA 
CTGAGAAATT 
GCCATCACAT 
CAGAAAGTAG 
TGTCAGAATG 
TAGACATATT 
TCAGTTGGAA 



CACCCCTGAG 
AACTGAAAAT 
GGAAGCCCAC 
AGGCCCAACA 
AGCGGATGAG 
GACACGCCAT 
TCAAAAAGGG 
GACACAAAAC 
AGTGGCCAGA 
GAAGAGTAGC 
TAAAGGCCAA 
GGTGGACAAA 
CTCTCAAGGA 
GTCAAACGTC 
GGCTTCAGAG 
CCCTGAGGAC 
AAGACCTCAG 
GCTTGATAAG 
GGGAAAGGGA 
CACCAGAGAA 
GGACAAGGCA 
CTACGGTGAG 
AGAAAACAGG 



CAGCCATTTT 
TGAGCTGACC 
AAAGCCCTTC 
CAAGCTGGAC 
CAGGAAGAAG 
CCAGGAGGCA 
AAAAGAACTC 
CAGCCAAAGG 
GATCTGTTTT 
AACTTACTAT 
CTATTGAAAA 
AACATGCTTA 
ATAATAAAAG 



TGCCGCCAAG 
GAAAACACAA 
GAGTCCTCCA 
AAGGCAGACA 
CCCCAGTGGA 
TCTGAGAAGA 
GAG CGAGGGG 
GCTTTTCTCA 
TCGGAAACAC 
CAGGAGAGTG 
CCCCGAAGCC 
CGACGCACGA 
GGGAGTCTTC 
AGCATGGCCA 
GAAGAACCTG 
CTGGAGTGGG 
AGTGAGGAGA 
ATGGCACATG 
CGCCATCACA 
GAGAAAAGGT 
AGGAGGCATC 
GAAGGAGCCC 
GAGGAAGCTA 
AGATTAGGGG 
GAAAGAAGAG 
TTGAACGAGA 
TCTGAGGATG 
CTGAAAAGGC 
TCAGCTGAGT 
GAAAATGAAA 
GAAAACTTGG 
GGCTGACTGT 
TCACCACTTC 
TCATTAAATG 
TGTGAATTGC 
TGACAATGAC 
ATTCACCTGA 



TCCTGAAGAC 
AGTTTGAAGT 
GCAGGGGAGA 
CAGAGAAATG 
GCCTCTATCC 
GCCAGAGAGA 
AAGATAGCAG 
ATGAAAGAAA 
ATGCTGCCGG 
GAGAGGAGGC 
AGGAAGAATC 
GGCCCAGACA 
CCTCTGAGGA 
GTTTAGGGGA 
AATATGGAGA 
AGCGCTATAG 
GTTGGGATGA 
GATATGGTGA 
GAGGCAGGGG 
TCTTGGGTGA 
CACAAGGTGC 
CAGGGAAGTG 
GGTTTCAAGA 
AACTGTTCAA 
ACAACATGAA 
AGAATTTCTT 
TGAACTGGGG 
AATATGACAG 
TTCCAGACTT 
AGGACAGGGC 
CTGCAATGGA 
CATTGGAGCG 
ACTGAAAGAC 
TTTGACACAA 
ATGACTTGTA 
TGTGCTACTG 
GACC 



Seq ID NO: C144 DNA Sequence 

Nucleic Acid Accession ft: XM_093082.1 

Coding sequence: 93.. 198 B 



1 
I 

CTTCTTGTGG 
AATTTGTAAA 
AGCCCAACTG 
GTGGGGCTCT 
AGACAAAACT 
GCAAATTACA 
TTGGAATGGT 
ACTGTCAGGA 
TTGCTGCTGC 
CAGGAAGACT 
ACCCAGCATC 
ACTTAGAAGT 
CCCTACAAGT 
TGCCTAATTC 
ACGTGACTGT 
GAGAACAGCT 
CTCCTTGGGA 
AGCAGGCTGA 
TTAAAGATCG 
TGCAGCCAGC 
GCCAAAACCA 
GCGTTCAAGG 
GAACACCTTC 
AAGAAAACTT 
AAGGTTATTA 
TCACTTCTTC 
GTGCCGCCAT 
AAGAAAGAAA 
GAGAAAGCGA 
CCATTCATGA 
AGGAGCCTGC 
TCGAGCTGGA 
CAGAGTCAGA 
AGGCATAG 



11 
I 

TAGGGACCTC 
AAATACCCTC 
TGTATTACAG 
AGCAGCTGTT 
GGAAGCTTCA 
GAAGCTCAAC 
TGGAAATTGT 
CAGTTCTTTC 
AAAGGATTGT 
TGAGGCAGTG 
TCAGGGTGGG 
CAAGGCAGAA 
TGTCCTCCTT 
TCCTGGTCAG 
TGGATCTAAT 
TTCCATCCAG 
GGAGGGGAAG 
ACTCCAGATT 
AATTACAGGG 
AGACAGTGGA 
AGGCATCCTC 
AAGACCAGAA 
CCCTGTGTAC 
CAACCCAACC 
CCAGTGTACT 
ACATCCAGAA 
CATCATCTCT 
TTCTAAGACC 
AGCAATGCCA 
GACTGGCCCT 
CCCAGAGCCT 
GCTGGAGCCA 
GCCTGGGGTT 



21 
I 

TCCTCAGTAT 
GAAGATTCAG 
ACATTGAGGT 
CTTAGCACCA 
GCTTTGAAAT 
TTGCAGTTTT 
TCTGGTTTCT 
AAATGTGATC 
GGGAGTCCTA 
GAGGAGGTTT 
GGGCATTGTG 
CCAAGCCTGA 
TGCAAAATCT 
GTTAGTGTGG 
GTCACTCTCA 
TGGTCTTTCT 
TGGCCAGATG 
TACTTTTCTC 
TCCAACGATC 
AT TTACATCT 
AACGTCAGTG 
ACTGGCCACA 
TACTGGCATA 
ACCGGGATTT 
GCCATCAACA 
GTTGGAATCA 
GTTGTGTGCT 
ATCGCGGAAC 
AGAGAAGACG 
GATACCATCC 
GCCCCAGGAT 
GAAAOGCAGT 
GTAGTTGAGC 



31 
I 

TTGAAACTAA 
GAATGAAGCT 
GGTACCGGTG 
GTCAGTGGCT 
TGCTCTATGG 
CTTTATCTGT 
CGGGATCATT 
TTTGTAAGCT 
AGTCCTTCCT 
TGGGGTTGGG 
AAAACTATGG 
GAAAAGGTGG 
TCTCCCTCAA 
TGCAAOTGAC 
TCTGCATCTA 
TCCATAAGAA 
TTGAGGCTGT 
AAGGTGGACA 
CAGGTAATGC 
GOGATGTTAA 
TGTTAGTGAA 
CTATTTCCCT 
AACTTGAGGG 
TGGTCATTGG 
GACTTGGCAA 
TTGTTGGGGC 
TCGCAAGGAA 
TTGAGCCAAT 
CTACCCAACT 
AAGAACCAGA 
CAGAGCCTAT 
CGGAATTGGA 
CCTTAAGTGA 



41 
I 

CCAGCATCTG 
TCTGTGTGAA 
CCTTATCTCT 
CACTGAACTG 
AGGCTTAAAA 
AACCGCTGCA 
GGTGCAATCT 
GCTCTGGCCT 
ATCAGAAGGG 
GGTGCTTGTA 
GTCTTTTAGA 
TATGGATCTC 
ACTATTTCTC 
CATCCCAGAC 
CACCACCACT 
GGAGATGGAG 
GAAGGGCACT 
AGCTGTAGCC 
ATCTATCACT 
CAACCCCCCA 
ACCTTCTAAG 
TTCCTGTCTC 
AAGAGACATC 
AAATCTGACA 
TAGTTCCTGC 
CTTGATTGGT 
TAAGGCAAAA 
GACAAAGATA 
AGAAGTAACT 
CTATGAGCCA 
GGCAGTGCCT 
GCCAGAGCCA 
AGATGAAAAG 



51 
I 

ACAGATTTCG 
GGATTAAAAC 
TCTGCTTCTT 
GAATTTAGTG 
GATCCAAATT 
AAACTTCCAG 
CATTTTGGCT 
TCCACCAGAG 
CTGAACTGGG 
CAGCCCGGTG 
GACTTGGTGG 
CAGAGACCCA 
TTTATTGCAT 
GGTTTOGTGA 
GTGGCCTCCC 
CCAATTTCTT 
CTTGATGGAC 
ATCGGGCAAT 
ATCTCGCATA 
GAC TTTC TCG 
CCCCTTTGTA 
TCTGCGCTTG 
GTGCGAGTGA 
AATTTTGAAC 
GAAATCGATC 
AGCCTGGTAG 
GCAAAGGCAA 
AACCCAAGGG 
CTACCATCTT 
AAGCCTACTC 
GACCTTGACA 
GAGCCAGAGC 
GGAGTGGTTA 
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Seq ID NO: C145 DNA Sequence 

Nucleic Acid Accession ft: FGENESH predicted 

Coding sequence: 1. .1242 



11 
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31 



51 

1290 



WO 03/042661 



PCT7US02/36810 
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ATGGTGTTCG 
GTGGTGCAAG 
CTCATCTGCA 
TTCTTCCATA 
GATGTTGAGG 
TCTCAAGGTG 
GATCCAGGTA 
ATCTGCGATG 
AGTGTGTTAG 
CACACTATTT 
CATAAACTTG 
ATTTTGGTCA 
AACAGACTTG 
ATCATTGTTG 
TGCTTCGCAA 
GAACTTGAGC 
GACGCTACCC 
ATCCAAGAAC 
GGATCAGAGC 
CAGTCGGAAT 
GAGCCCTTAA 



CATTTTGGAA 
TGACCATCCC 
TCTACACCAC 
AGAAGGAGAT 
CTGTGAAGGG 
GACAAGCTGT 
ATGCATCTAT 
TTAACAACCC 
TGAAACCTTC 
CCCTTTCCTG 
AGGGAAGAGA 
TTGGAAATCT 
GCAATAGTTC 
GGGCCTTGAT 
GGAATAAGGC 
CAATGACAAA 
AACTAGAAGT 
CAGACTATGA 
CTATGGCAGT 
TGGAGCCAGA 
GTGAAGATGA 



GGTCTTTCTG 
AGACGGTTTC 
CACTGTGGCC 
GGAGCCAATT 
CACTCTTGAT 
AGCCATCGGG 
CACTATCTCG 
CCCAGACTTT 
TAAGCCCCTT 
TCTCTCTGCG 
CATCGTGCCA 
GACAAATTTT 
CTGCGAAATC 
TGGTAGCCTG 
AAAAGCAAAG 
GATAAACCCA 
AACTCTACCA 
GCCAAAGCCT 
GCCTGACCTT 
GCCAGAGCCA 
AAAGGGAGTG 



ATCCTAAGCT 
GTGAACGTGA 
TCCCGAGAAC 
TCTTCTCCTT 
GGACAGCAGG 
CAATTTAAAG 
CATATGCAGC 
CTCGGCCAAA 
TGTAGCGTTC 
CTTGGAACAC 
GTGAAAGAAA 
GAACAAGGTT 
GATCTCACTT 
GTAGGTGCCG 
GCAAAAGAAA 
AGGGGAGAAA 
TCTTCCATTC 
ACTCAGGAGC 
GACATCGAGC 
GAGCCAGAGT 
GTTAAGGCAT 



GCCTTGCAGG 
CTGTTGGATC 
AGCTTTCCAT 
GGGAGGAGGG 
CTGAACTCCA 
ATCGAATTAC 
CAGCAGACAG 
ACCAAGGCAT 
AAGGAAGACC 
CTTCCCCTGT 
ACTTCAACCC 
ATTACCAGTG 
CTTCACATCC 
CCATCATCAT 
GAAATTCTAA 
GCGAAGCAAT 
ATGAGACTGG 
CTGCCCCAGA 
TGGAGCTGGA 
CAGAGCCTGG 
AG 



TCAGGTTAGT 
TAATGTCACT 
CCAGTGGTCT 
GAAGTGGCCA 
GATTTACTTT 
AGGGTCCAAC 
TGGAATTTAC 
CCTCAACGTC 
AGAAACTGGC 
GTACTACTGG 
AACCACCGGG 
TACTGCCATC 
AGAAGTTGGA 
CTCTGTTGTG 
GACCATCGCG 
GOCAAGAGAA 
CCCTGATACC 
GCCTGCCCCA 
GCCAGAAACG 
GGTTGTAGTT 



Seq ID NO: CI 4 6 DNA Sequence 

Nucleic Acid Accession #i NM_003020.1 

Coding sequence : 2 9 . . 6 6 4 



1 
I 

CGCTCCTCGG 
TGGCCTACTG 
CCCTGACCGG 
GGGCATTGCC 
CCAGAGCATT 
CCCCAACATC 
GGGGTACCCA 
AAACACCCCT 
TCCGGAACAT 
GAAGGGAGGA 
GGATAATGTT 
GTAAAGAGAA 
TGCACGTGTA 
TACAAAGCAG 
AAATTAGAAT 
TAAAAATTAA 
TGCAGTTTAA 
TTTGATTTTG 
GCTGTACTCA 
GCAAGCATTG 




GTCTCAGAAG 
AGGCCCCGAG 
GAAGGTGGAG 
GTGGCAGAGT 
GACCCTCCAA 
GACACTGCAG 
GACTATCCAG 
GAGAGACGAA 
GTTGCAAAGA 
GATGCTAGAC 
AATGGAGTCC 
CTGTATGTAG 
AAGAGCTTTT 
AATGTGAATG 
AATGGTGTCT 
ATTATGTAGT 
AGAGGAGGAG 
GC 



21 
I 

GGTTGACAAT 
CATCTGGATG 
CAGATATCCA 
TGGAATATCC 
CTCATGAAGG 
TGACTGGAGA 
ATCCCTGTCC 
AGTTCAGTOG 
GCTTGGGCAA 
AGCGGAGGAG 
AGTCTGTCCC 
GAAAACCCAC 
CTGTGAATGA 
ATAGTGTATT 
TTGTTTCTTG 
TCAACAATAA 
GAGGTTGTAC 
TCATCCAGCC 
CTGACACATT 



31 

I 

GGTCTCCAGG 
GACTCCAGCA 
GAGGCTGCTT 
AGCTCACCAG 
ACTTCAGCAT 
CAACATTCCT 
TGTTGGAAAA 
AGAGTTCCAG 
GTGGAACAAG 
TGTCAATCCA 
CCATTTTTCA 
ATTACCTGTT 
CAGCATGTTT 
GTCTTCACAC 
GGTTTTTAAA 
AAAGCAAGAC 
TATTTTGGCC 
CTTGGGCATT 
TCACTTGGCT 



41 
I 

ATGGTCTCTA 
TTTGCTTACA 
CATGGTGTTA 
GCCATGAATC 
TTGGGTCCTT 
AAGGACTTTA 
ACAGATGATG 
TTGCACCAGC 
AAACTCCTTT 
TATCTACAAG 
GATGAGGATA 
AGGCCTCAGC 
CTTACATAGA 
CGATGATTCT 
ATGTGAATCT 
TATGAAAGGC 
AAGTCTGTAG 
GTTATACACC 
GOGTCTTAAT 



SI 
I 

CCATGCTATC 
GCCCCCGGAC 
TGGAGCAATT 
TTGTGGGCCC 
TTGGCAACAT 
GTGAGGATCA 
GATGTCTAGA 
ATCTCTTTGA 
ACGAGAAGAT 
GACAGAGACT 
AGGATCCAGA 
ATGGCTTATG 
TAATTATGGA 
GCTTTTTGCT 
GCAATGATCA 
TCAGATTTCT 
AAAGCTGTCA 
AGTAAAGAAG 
AAACATGAAT 



Seq ID NO: C147 DNA Sequence 

Nucleic Acid Accession #i NM_024021.2 

Coding sequence : 144 ..806 



AACATTCCTQ 
CTTTGGAACA 
ATTCGAGCAC 
GGGCTGGCCC 
AAGGATTGCA 
TGACTGCCCT 
ATGGAAGTAA 
TTATTTCAGG 
GTAGTCTAGG 
CATTTAGCTT 
ATAATTGTCA 
GTGTGCTGGA 
GTACCCCTGG 
CTCCCACACC 
AATCTATGCT 
TACTGATAGA 
AGAGAGATAA 
TTCACTGGCT 
GCAACAGCAC 
GCAAATAAGG 
GCAAAA CACA 
AAAGTGCTTT 
CTCCTGCTTT 
GCACGAGCTA 
TCAACTTATA 
AATTATATCT 
TTTTATTGCA 



11 
I 

CAAATGGTTT 
ACTTAAATAA 
CTTTTCTGCT 
TGGTGTGCCC 
AGAGAAGTTC 
GATGAGCCTT 
CCCTATTTCC 
ATCCTTGTCA 
AATGAATATC 
GGCGTTTTAT 
TGGGACTATG 
ATTCTGCATT 
TGGGGTTGTG 
ACTTAATGAG 
GACTGTGACA 
CTTGTTGATA 
TAAATTCAAA 
CTTTATCGAG 
TTGAAAGTTT 
TTTGGAAGCA 
TACTTCCCTC 
TTCCTTCAGG 
TCTTTCCCCA 
TCATTTGAGT 
TTTTCTATTT 
TAATCATATA 
GAGCAAAAAT 



21 
I 

CAATATATGC 
GTCAAATATA 
GCCATGACAA 
CAGCTGGGAA 
TTGAAGGGAG 
AGCATGGGAA 
GTGTATATCG 
ATTGCAGCAG 
ACCAGCTCTG 
TCATTCCATC 
TCCATCTTAA 
GCTGTGTCCC 
TTAATTCTGC 
GTTTGAGGCC 
CAAGAGCCTC 
TTATTATTAT 
ATTATGTTCT 
AGTACTAGAA 
TTCATTCATC 
GAAGAGCAAA 
CCATTTATTT 
AAGTGGAGAT 
TAGAAAGTAC 
AAAAGTATAC 
CATCTTCTTC 
TGGAAATGTG 
AAATCAAATT 



31 
I 

AGATGTCTCG 
CTTGGAGCTT 
CCATGCAAGG 
ACATGGCTGT 
AACCCAAAGT 
TAACAATGAT 
GGTACACAAT 
GAATTAGAAC 
TACTGGCTGC 
ACCCTTACTG 
TGGGTCTGGA 
TCTCTGCCTT 
CATCACATTC 
ACCAAAAGAT 
ACATGAGAAA 
ATGTAATCCA 
CATTTTTTTC 
GTTAAATTAA 
ATAAGAACTT 
AAAAAGATAT 
AACTTTTTTT 
GCATGGCCAT 
CTTGAAGTAG 
ATGGAGTAAA 
CTTTCCCTTC 
CAACATATGG 
AGAAGCAATA 



41 
I 

ATATAGGAAT 
TAAAAATTAA 
AATGGAACAG 
CATACATTCA 

GTGTATGGCA 
TTGGGGGTCA 
TACAAAAGGC 
ATCAGGGATC 
TAACTACTAT 
TGGCATGGTG 
TGGATGTAAA 
TCACATGGCA 
CAACAGACAA 
TTACCAGTAT 
ATTATGAACT 
CCTGGAACTC 
TAAATAATGC 
TATATAAAGG 
TGTTAAAATG 
TTCTCCTACC 
CTCCCCCTCC 
CACAGTCCGT 
AATCATATTA 
TCCCACCTTC 
TATTTGTTAA 
AAAAAAAAAA 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1242 



51 
I 

GAAATTACGT 
AAGGAGAGAG 
GCCATGCCAG 
CATCTGTGGA 
GTGCAGATTC 
TCTAATACTT 
GTAATGTTTA 
CTGGTCGGAG 
TTAATCAACA 
GGCAACTCAA 
CTCCTCTTAA 
GTGCTCTGTT 
GAAACAGCAT 
ATGCTCCAGA 
CCAACTTCGA 
GTGTGTGTAT 
AATAACTCAT 
ATTTAATGAG 
CATTACATTG 
AGGCCTCCAT 
TATGGGGACC 
CTTTTTCCTT 
CCTTGCATGT 
AGCATCAGAT 
TACTGGGCAT 
ATACGTTTGT 
AAAAAAAAA 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1152 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1619 



1291 



WO 03/042661 



PCT/US02/36810 



Seq ID MO: C148 DNA Sequence 
Nucleic Acid Accession fl: NM_002091.1 
^ Coding sequence: 56.. 502 

1 11 21 31 41 51 

I I I I 1 I 

AGTCTCTGCT CTTCCCAGCC TCTCCGGCGC GCTCCAAGGG CTTCCCGTCG GGACCATGCG 60 

CGGCAGTGAG CTCCCGCTGG TCCTGCTGGC GCTGGTCCTC TGCCTAGCGC CCCGGGGGCG 120 

10 AGCGGTCCCG CTGCCTGCGG GCGGAGGGAC CGTGCTGACC AAGATGTACC CGCGCGGCAA 1B0 

CCACTGGGCG GTGGGGCACT TAATGGGGAA AAAGAGCACA GGGGAGTCTT CTTCTGTTTC 240 

TGAGAGAGGG AGCCTGAAGC AGCAGCTGAG AGAGTACATC AGGTGGGAAG AAGCTGCAAG 300 

GAATTTGCTG GGTCTCATAG AAGCAAAGGA GAACAGAAAC CACCAGCCAC CTCAACCCAA 360 

GGCCTTGGGC AATCAGCAGC CTTCGTGGGA TTCAGAGGAT AGCAGCAACT TCAAAGATGT 420 

15 AGGTTCAAAA GGCAAAGTTG GTAGACTCTC TGCTCCAGGT TCTCAACGTG AAGGAAGGAA 480 

CCCCCAGCTG AACCAGCAAT GATAATGATG GCCTCTCTCA AAAGAGAAAA ACAAAACCCC 540 

TAAGAGACTG AGTTCTGCAA GCATCAGTTC TACGGATCAT CAACAAGATT TCCTTGTGCA 600 

AAATATTTGA CTATTCTGTA TCTTTCATCC TTGACTAAAT TCGTGATTTT CAAGCAGCAT 660 

CTTCTGGTTT AAACTTGTTT GCTGTGAACA ATTGTCGAAA AGAGTCTTCC AATTAATGCT 720 

20 TTTTTATATC TAGGCTACCT GTTGGTTAGA TTCAAGGCCC CGAGCTGTTA CCATTCACAA 780 

TAAAAGCTTA AACACAT 797 



25 
30 
35 
40 
45 
50 
55 
60 



Seq ID NO: C149 DNA Sequence 

Nucleic Acid Accession #: NM_0 122 61.1 

Coding sequence: 203.. 1045 

1 11 21 31 41 51 

I I I I I I 

GATTTGCTCT GCCAGCAGCT GTCGGTGCCG CGCTCGACAC CGAGTCCTAG CTAGGCGCTC 60 

ACAGAATACG CGCTCCCTCC CTCCCCCTTC TCTGTCCCCC GCCTCTCGCT CACCCCGGCC 120 

CACTCCAGCG GCGACTTTGA GGGATTCCCT CTCTGGCGGC CTCTGCAGCA GCACAGCCGG 180 

CCTCATTCGG GGCACTGCGA GTATGGATCT CCAAGGAAGA GGGGTCCCCA GCATCGACAG 240 

ACTTCGAGTT CTCCTGATGT TGTTCCATAC AATGGCTCAA ATCATGGCAG AACAAGAAGT 300 

GGAAAATCTC TCAGGCCTTT CCACTAACCC TGAAAAAGAT ATATTTGTGG TGCGGGAAAA 360 

TGGGACGACG TGTCTCATGG CAGAGTTTGC AGCCAAATTT ATTGTACCTT ATGATGTGTG 420 

GGCCAGCAAC TACGTAGATC TGATCACAGA ACAGGCCGAT ATCGCATTGA CCCGGGGAGC 480 

TGAGGTGAAG GGCCGCTGTG GCCACAGCCA GTCGGAGCTG CAAGTGTTCT GGGTGGATCG 540 

CGCATATGCA CTCAAAATGC TCTTTGTAAA GGAAAGCCAC AACATGTCCA AGGGACCTGA 600 

GG CGACTTGG AGGCTGAGCA AAGTGCAGTT TGTCTAOGAC TCCTCGGAGA AAACCCACTT 660 

CAAAGACGCA GTCAGTGCTG GGAAGCACAC AGCCAACTCG CAOCACCTCT CTGCCTTGGT 720 

CACCCCCGCT GGGAAGTCCT ATGAGTGTCA AGCTCAACAA ACCATTTCAC TGGCCTCTAG 780 

TGATCCGCAG AAGACGGTCA CCATGATCCT GTCTGCGGTC CACATCCAAC CTTTTGACAT 840 

TATCTCAGAT TTTGTCTTCA GTGAAGAGCA TAAATGCCCA GTGGATGAGC GGGAGCAACT 900 

GGAAGAAACC TTGCCCCTGA TTTTGGGGCT CATCTTGGGC CTCGTCATCA TGGTAACACT 960 

CGCGATTTAC CACGTCCACC ACAAAATGAC TGCCAACCAG GTGCAGATCC CTCGGGACAG 1020 

ATCCCAGTAT AAGCACATGG GCTAGAGGCC GTTAGGCAGG CACCCCCTAT TCCTGCTCCC 1080 

CCAACTGGAT CAGGTAGAAC AACAAAAGCA CTTTTCCATC TTGTACACGA GATACACCAA 1140 

CATAGCTACA ATCAAACAGG CCTGGGTATC TGAGQCTTGC TTGGCTTGTG TCCATGCTTA 1200 

AACCCACGGA AGGGGGAGAC TCTTTCGGAT TTGTAGGGTG AAATGGCAAT TATTCTCTCC 1260 

ATGCTGGGGA GGAGGGGAGG AGGGTCTCAG ACAGCTTTCG TGCTCATGGT GGCTTGGCTT 1320 

TGACTCTCCA AAGAGCAATA AATGCCACTT GGAGCTGTAT CTGGCCCCAA AGTTTAGGGA 1380 

TTGAAAACAT GCTTCTTTGA GGAGGAAACC CCTTTAGGTT CAGAAGAATA TGGGGTGCTT 1440 

TGCTCCCTTG GACACAGCTG GCTTATCCTA TACAGTTGTC AATGCACACA GAATACAACC 1500 

TCATGCTCCC TGCAGCAAGA CCCCTGAAAG TGATTCATGC TTCTGGCTGG CATTCTGCAT 1560 

GTTTAGTGAT TGTCTTGGGA ATGTTTCACT GCTACCOGCA TCCAGCGACT GCAGCACCAQ 1620 

AAAACGACTA ATGTAACTAT GCAGAGTTGT TTGGACTTCT TCCTGTGCCA GGTCCAAGTC 1680 

GGGGGACCTG AAGAATCAAT CTGTGTGAGT CTGTTTTTCA AAATGAAATA AAACACACTA 1740 

TTCTCTGGC 1749 

Seq ID NO i C150 DNA Sequence 

Nucleic Acid Accession ft: NM_003226.1 

Coding sequence! 2.. 226 



^_ 1 11 21 31 41 51 

65 | | | | | | 

GATGCTGGGG CTGGTCCTGG CCTTGCTGTC CTCCAGCTCT GCTGAGGAGT ACGTGGGCCT 60 

GTCTGCAAAC CAGTGTGCCG TGCCGGCCAA GGACAGGGTG GACTGCGGCT ACCCCCATGT 120 

CACCCCCAAG GAGTGCAACA ACCGGGGCTG CTGCTTTGAC TCCAGGATCC CTGGAGTGCC 180 

TTGGTGTTTC AAGCCCCTGA CTAGGAAGAC AGAATGCACC TTCTGAGGCA CCTCCAGCTG 240 

70 CCCCTGGGAT GCAGGCTGAG CACCCTTGCC CGGCTGTGAT TGCTGCCAGG CACTGTTCAT 300 

CTCAGTTTTT CTGTCCCTTT GCTCCCGGCA AGCTTTCTGC TGAAAGTTCA TATCTGGAGC 360 

CTGATGTCTT AACGAATAAA GGTCCCATGC TCCACCCG 398 

75 Seq ID NO: C151 DNA Sequence 

Nucleic Acid Accession #: NM_002993.l 
Coding sequence: 64.. 408 

rtA 1 11 21 31 41 51 

80 i i | | | | 

GGCACGAGCC AGTCTCCGCG CCTCCACCCA GCTCAGGAAC CCGCGAACCC TCTCTTGACC 60 

ACTATGAGCC TCCCGTCCAG CCGCGCGGCC CGTGTCCCGG GTCCTTCGGG CTCCTTGTGC 120 

GCGCTGCTCG CGCTGCTGCT CCTGCTGAOG CCGCCGGGGC CCCTCGCCAG CGCTGGTCCT 180 

GTCTCTGCTG TGCTGACAGA GCTGCGTTGC ACTTGTTTAC GCGTTACGCT GAGAGTAAAC 240 

1292 



WO 03/042661 



PCT/US02/36810 



10 



15 



20 



CCCAAAACGA 
GTGGTAGCCT 
AAGAAAQTCA 
ACCATGCATC 
CAGTAAGAAT 
GAAGAGTGTG 
CTAATATAGT 
CAATTGACCA 
TGAAGATAAC 
ATTTCGTATG 
ACTCACTCTT 
TCTTTACCCT 
TTATCTGTGC 
CTAATATATT 
CATGATTTAC 
ATTCTGGTCA 
TGATTGCTAA 
AATGATCTGT 
CATTTAGTCC 
TTTAAAGGTT 
AAATTGCACT 
TGGAGAAACA 



TTGGTAAACT 
CCCTGAAGAA 
TCCAGAAAAT 
ATAAAATTGC 
AAGAAGGAAG 
GGGGAAAGCC 
ATTTCCACTA 
TATTGTGAGC 
TATTGTATTT 
GAAATAATGT 
CTCATAAAAT 
AGGATGCTAT 
AGAATATATT 
CTCTTCCTAT 
TCATTAAACT 
CTAAATATAC 
TTTACATAGA 
GCTCTGCAAA 
TCAAAATATA 
TTGACCATTT 
TTTATTTTTT 
ATAAAAGATT 



GCAGGTGTTC 
CGGGAAGCAA 
TTTGGACAGT 
CCAGTCTTCA 



TACGCTTCTC 
TTTACTGTTA 
AAAGAATCAC 
CTATCATACA 
TTTATTAGTG 
AGGAAATATT 
TTAAGTTGTA 
TCCTTATTCA 
GGTTTTAGAT 
TTGATTTTGT 
ACTTTAGATA 
AATGTATTCT 
GTTTTGAAAA 
TACAGCATTG 
TGTTATGAGG 
CCTGTGTGTC 
TCTAAACCAA 



CCCGCAGGCC 
GTTTGTCTGG 
GGAAACAAGA 
GCGGAGCAGT 
TTTCCATTTT 
CCTGAAGTTT 
TTTTACCTGA 
TGGTTATTAG 
TTCCTTAAAG 
TGCTGTTGAG 
TTAGTTCTGT 
CTGTATTAGA 
GAATTTCTAA 
GTTTGATGTC 
ATGCTATTTT 
GATGAAGAAG 
CTTGGTTTTT 
TATATTTGAA 
CTAAGATTTT 
AATTATACAT 
ATGTTGGTTT 
AAAAAAAAAA 



CGCAGTGCTC 
ACCCGGAAGC 
AAAACTGAGT 
TTTCTGGAGA 
CTACATGGAT 
ACAGCTCAGC 
TAAGTTATTG 
TCTTTCAATG 
TCTTACCGAA 
GGAGGTATCC 
TTTCTTGGGG 
AGACTGGGTG 
AAATTTAAGT 
TTCTTAGTAT 
TTCACTATAG 
CCCAAAAACA 
TAAATAAAAG 
CAATTTGAAT 
CAGATATCTA 
GTATCACATT 
TTGGTACTTG 
AAAAAAA 



CAAGGTGGAA 
CCCTTTTCTA 
AACAAAAAAG 
TCCCTGGACC 
TCCCTACTTT 
TAATGAAGTA 
AACCCTTTGG 
AATATTGAAT 
AAGGCTGTGG 

AATATGTTAC 
TGTCATACCG 
TCTGTAAGGG 
GGCATAATGT 
GATGACTATA 
GATAAATTCC 
CAAAATTAAC 
ATAAATTCAT 
TTGTGGATCT 
CACTATATTA 
TATTGTCATT 



300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

10BO 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1547 



25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



Seq ID NO: C152 DNA Sequence 

Nucleic Acid Accession #: NM_005242.2 

Coding sequence: 148.. 1341 



I 

CGGCCCGCCC 
TTTCTCTCGG 
GCGCCCGGCG 
GCCGCCATCC 
TCCTCTAAAG 
GGAGTTACAG 
AAACTGACCA 
AGTAACGGCA 
ATTTACATGG 
ATTGCCTATC 
ATTGGCTTTT 
CAGAGGTATT 
ATTGGCATCT 
GTGAAGCAGA 
GAGCAGCTCT 
CTGTTCCCAG 
TCTGCCATGG 
GTCCTGGCCA 
TTTCTGATTA 
CTCTCTACCC 
AGGGATCATG 
GTATCCCTCA 
ACTGTTAAGA 
TGGAACCPGT 
ACATACCACC 



11 

I 

TGGGGAGGCG 
TGCGTCCAGT 
TCGGGGCTTC 
TGCTAGCAGC 
GAAGAAGCCT 
TTGAAACAGT 
CGGTCTTCCT 
TGGCCCTGTG 
CCAATCTGGC 
ACATACATGC 
TCTATGGCAA 
GGGTCATCGT 
CCCTGGCAAT 
CCATCTTCAT 
TGGTGGGAGA 
OCTTCCTCAC 
ATGAAAACTC 
TGTACCTGAT 
AGAGCCAGGG 
TTAACAGCTG 
CAAAGAACGC 
CCTCAAAGAA 
CCTCCTATTG 
TTAATGTTAT 
G 



21 

I 

CGCAGCAGAG 
GGAGCTCTGA 
CAGGAGGATG 

TATTGGTAAG 

TCCAATTGTC 
GGTCTTTCTT 
CTTGGCTGAC 
CAACAACTGG 
CATGTACTGT 
GAACCCCATG 
ATGGCTGCTG 
TCCTGCCCTG 
CATGTTCAAT 
AGCCTCTGCC 
AGAGAAGAAA 
CTGCTTCACT 
CCAGAGCCAT 
CATCGACCCC 
TCTCCTTTGC 
ACACTCCAGG 
AGTTTTCCAG 
GAGGACGTGT 



31 
I 

GCTCCGATTC 
GTTTCGAATC 
CGGAGCCCCA 
TGCAGTGGCA 
GTTGATGGCA 
GATGAGTTTT 
TACACAATTG 
TTCCGAACTA 
CTCCTCTCTG 
ATTTATGGGG 
TCCATTCTCT 
GGGCACTCCA 
ATTCTGCTGG 
AACATCACGA 
TACTTCCTCT 
TATGTGCTGA 
AGGAAGAGGG 
CCTAGTAACC 
GTCTATGCCC 
TTTGTCTATT 
CGAAGTGTCC 
AAATCCAGCT 
GTCCTCAGAT 
CTGTTATTTC 



41 
I 

GGGGCAGGTG 
GGTGGCGGCG 
GCGCGGCGTG 
CCATCCAAGG 
CATCCCACGT 
CTGCATCTGT 
TGTTTGTGGT 
AGAAGAAGCA 
TCATCTGGTT 
AAGCTCTTTG 
TCATGACCTG 
GGAAGAAGGC 
TCACCATCCC 
CCTGTCATGA 
CTCTGGCCAT 
TGATCAGAAT 
CCATCAAACT 
TTCTGCTTGT 
TGTACATTGT 
ACTTTGTTTC 
GCACTGTAAA 
CTTACTCTTC 
GGGAATTGCA 
CTAATCAAAA 



51 
I 

AGAGGCTGAC 
GATTCCCCGC 
GCTGCTGGGG 
AACCAATAGA 
CACTGGAAAA 
CCTCACTGGA 
GGGTTTGCCA 
CCCTGCTGTG 



TAATGTGCTT 
CCTCAGTGTG 
AAACATTGCC 
TTTGTATGTC 
TGTTTTGCCT 
TGGGGTCTTT 
GCTGCGATCT 
CATTGTCACT 
GGTGCATTAT 
AGCCCTCTGC 
ACATGATTTC 
GCAGATGCAA 
AAGTTCAACC 
CAGTAGGATG 
AGGTCTCACC 



Seq ID NO: C153 DNA Sequence 

Nucleic Acid Accession #: NM_003469.2 

Coding sequence i 92 . . 1945 



GAAACGGCCC 
CATATAAACA 
AGCAGCCCTG 
TCAGAGAAAC 
TCCCAGTCCT 
TAAGGAAGAA 
AAAAGAAAAT 
GATGAGAATA 
AGAAAATAAG 
TTATGAGACA 
TGAAGAGAAT 
TACTCCTCAA 
ACCAAACAAC 
AGATGATATC 
GAACCCAGTA 
GAATATAGGA 
CCAGGAAGAA 
AATTGCCTAT 
AAATGGGGAA 
GCTGATTGAA 
AACTGGGGAG 
CCTAGATGAC 



11 

! 

GAGAAGCTCG 
AAAAGAGGAA 
TCTCTTATCC 
CAGCTGCTTC 
GAAATGATCA 
AGCAGCCCAG 
GGCGATGAAA 
ATACTCGAAG 
CCCTATGCCT 
CAGCAGTGGC 
TCCAGGGATA 
AGCCTTGCTA 
CAGAAACGTG 
TACAAGGCTA 
GAGGAGAAAA 
AAAAATGAAC 
GATCTTCGGA 
TTGAAAAGGT 
AGGGCCACCA 
ATCTCAAGGA 
AAGCGGAATG 
ATCTCAGAGG 



21 
I 

CCCGGAGAAC 
ATCTTTCAAA 
CTTTAATTTT 
AGAAAGAACC 
GGGCTTTGGA 
ATTATAATCC 
GOCACTTGCC 
CTTTGAGACA 
TGAATTCAGA 
CAGAAAGAAA' 
ACCCCTTTAA 
CATTGGAATC 
AGAGGATGGA 
ATAACATTGC 
TAGAGAGTCA 
AAATCAACGA 
AAGAGAGTAA 
TAGTAAATGC 
GGCTTTTTGA 
ATTTACAGAT 
GATCAGTGGA 
CTGACTTAGA 



31 
I 

GGGGAGGAAT 
CATGGCTGAA 
CCTCATCTCT 
AGACCTCAGG 
GTACATAGAA 
CTACCAAGGT 
CGAGAGGGAT 
GGCTGAAAAT 
AAAGAACTTT 
GCTTAAGCAC 
ACGCACAAAT 
TGTCTTCCAA 
TGAGGAGCAA 
CTATGAAGAT 
AACCCAGGAA 
TGAGATGAAA 
AGACCAACTC 
TGCAGGAAGT 
GAAACCTCTT 
ACCCCCAGAA 
ACCGGAGCGG 
CCATCCAGAC 



41 

I 

ATGCTGTGGA 
GCAAAGACCC 
GGGGCTGAAG 
TTGGAAAATG 
AACCTCOGAC 
GTCTCTGTCC 
TCACTGAGTG 
GAGCCTCAGT 
CCAATGGACA 
ATGCAATTCC 
GAAATAGTGG 
GAGCTGGGGA 
AAACTTTATA 
GTGGTCGGGG 
GAGGTGAGAG 
CGCTCAGGGC 
TCAGATGATG 
GGGAGGTTAC 
GATTCTCAGT 
GACTTAATTG 
GAGCTTGACC 
CTGTTCCAAA 



51 
I 

GCTCCTCTGC 
ACTGGCTTGG 
CAGCTTCATT 
TCCAAAAGTT 
AACAAGCTCA 
CCCTTCAGCA 
AAGAAGACTG 
CTGCACCAAA 
TGAGTGATGA 
CTCCTATGTA 
AGGAACAATA 
AACTGACAGG 
CGGATGATGA 
GAGAAGACTG 
ACAGCAAAGA 
AGCTTGGCAT 
TCTCCAAAGT 
AGAATGGGCA 
CTATTTATCA 
AGATGCTCAA 
TTCCTGTTGA 
ATAGGATGCT 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1451 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

760 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 



1293 



WO 03/042661 



PCT/US02/36810 



10 



15 



CTCCAAGAGT GGCTACCCTA AAACACCTGQ TCGTGCTGGG ACTGAGGCCC TACCAGACGG 13 BO 

GCTCAGTGTT GAGGATATTT TAAATCTTTT AGGGA7GGAG AGTGCAGCAA ATCAGAAAAC 1440 

GTCGTATTTT CCCAATCCAT ATAACCAGGA GAAAGTTCTG CCAAGGCTCC CTTATGGTGC 1500 

TGGAAGATCT AGATCGAACC AGCTTCCCAA AGCTGCCTGG ATTCCACATG TTGAAAACAG 1560 

J ACAGATGGCA TATGAAAACC TGAACGACAA GGATCAAGAA TTAGGTGAGT ACTTGGCCAG 1620 

GATGCTAGTT AAATACCCTG AGATCATTAA TTCAAACCAA GTGAAGCGAG TTCCTGGTCA 1680 

AGGCTCATCT GAAGATGACC TGCAGGAAGA GGAACAAATT GAGCAGGCCA TCAAAGAGCA 1740 

TTTGAATCAA GGCAGCTCTC AGGAGACTGA CAAGCTGGCC CCGGTGAGCA AAAGGTTCCC 1800 

TGTGGGGCCC CCGAAGAATG ATGATACCCC AAATAGGCAG TACTGGGATG AAGATCTGTT 1860 

AATGAAAGTG CTGGAATACC TCAATCAAGA AAAGGCAGAA AAGGGAAGGG AGCATATTGC 1920 

TAAGAGAGCA ATGGAAAATA TGTAAGCTGC TTTCATTAAT TACCCTACTT TCATTCCTCC 1980 

CACCCCAAGC AAATCCCAAC ATTTCTCTTC AGTGTGTTGA CTTCTATCCT GTTAACACTG 2040 

TAATATCTTT AAATGATGTA CAGGCAGATG AAACCAGGTC ACTGGGGAGT CTGCTTCATT 2100 

TCCTCTGAGC TGTTATCTTG TGTATGGATA TGTGTAAATG TTATGACTCC TTGATAAAAA 2160 

ATTTATTATG TCCATTATTC AAGAAAGATA TCTATGACTG TGTTTAATAG TATATCTAAT 2220 

GGCTGTGGCA TTGTTGATGC TCACATATGA TAAAAAAGTG TCCTATAATT CTATTGAAAG 2280 

TTTTTAATAT TTATTGAATT ATTTTGTTAC TGTCTGTAGC GTTTTGTGGA GTACTGGACC 2340 

AAAAAAATAA AGCATTATAA ATATA 2365 

20 Seq ID NO: C154 DNA Sequence 

Nucleic Acid Accession #* Nrtj)30955 
Coding sequence: 327. .5108 

~- 1 11 21 31 41 51 

25 | | | | | | 

GAATTCCGGG AGCGGGCGGG CTGCGAGGCC GCGGGGCATG CGGGAGGCGG AGGGGTGGGA 60 

CCGGGTGGCT GCGCCCATTC CACACCCGCC GAAAGCGGAC ACTGTCAGCT GAATCACTCC 120 

CCTTTTAGGA GGAGGGAGGG GGAAAAGGTG TCTAGCTAAT TTCTGCTTAA AAAAGCACAG 180 

GAGATCGCGG GTCAGCTTTG CAGTCGCTGC CTTCTCGCGC CTGACCATGC ACCCCTGCAT 240 

CTTCCTGCTG GGCACAGGCG AGCGCTTTAT TTCTGGAGCT GAGGGCTAAA ACTTTTTTCA 300 

CTTTTCTTCT CCTCAACATC TGAATCATGC CATGTGCCCA GAGGAGCTGG CTTGCAAACC 360 

TTTCCGTGGT GGCTCAGCTC CTTAACTTTG GGGCGCTTTG CTATGGGAGA CAGCCTCAGC 420 

CAGGCCCGGT TOGCTTCCCG GACAGGAGGC AAGAGCATTT TATCAAGGGC CTGCCAGAAT 480 

ACCACGTGGT GGGTCCAGTC CGAGTAGATG CCAGTGGGCA TTTTTTGTCA TATGGCTTGC 540 

ACTATCCCAT CACGAGCAGC AGGAGGAAGA GAGATTTGGA TGGCTCAGAG GACTGGGTGT 600 

ACTACAGAAT TTCTCACGAG GAGAAGGACC T G TTTTTTAA CTTGACGGTC AATCAAGGAT 660 

TTCTTTCCAA TAGCTACATC ATGGAGAAGA GATATGGGAA CCTCTCCCAT GTTAAGATGA 720 

TGGCTTCCTC TGCCCCCCTC TGCCATCTCA GTGGCACGGT TCTACAGCAG GGCACCAGAG 780 

TTGGGACGGC AGCCCTCAGT GCCTGCCATG GACTGACTGG ATTTTTCCAA CTACCACATG 840 

GAGACTTTTT CATTGAACCC GTGAAGAAGC ATCCACTGGT TGAGGGAGGG TACCACCCGC 900 

ACATCGTTTA CAGGAGGCAG AAAGTTCCAG AAACCAAGGA GCCAACCTGT GGATTAAAGG 960 

ACAGTGTTAA CATCTCCCAG AAGCAAGAGC TATGGCGGGA GAAGTGGGAG AGGCACAACT 1020 

TGCCAAGCAG AAGCCTCTCT CGGCGTTCCA TCAGCAAGGA GAGATGGGTG GAGACACTGG 1080 

TGGTGGCOGA CACAAAGATG ATTGAATACC ATGGGAGTGA GAATGTGGAG TCCTACATCC 1140 

TCACCATCAT GAACATGGTC ACTGGGTTGT TCCATAACCC AAGCATTGGC AATGCAATTC 1200 

ACATTGTTGT GGTTCGGCTC ATTCTACTCG AAGAAGAAGA GCAAGGACTG AAAATAGTTC 1260 

ACCATGCAGA AAAGACACTG TCTAGCTTCT GCAAGTGGCA GAAGAGTATC AATCCCAAGA 1320 

GTGACCTCAA TCCTGTTCAT CACGACGTGG CTGTCCTTCT CACCAGAAAG GACATCTGTG 1380 

CTGGTTTCAA TCGCCCCTGC GAGACCCTGG GCCTGTCTCA CCTTTCAGGA ATGTGTCAGC 1440 

CTCACCGCAG TTGTAACATC AATGAAGATT CGGGACTCCC TCTGGCTTTC ACAATTGCCC 1500 

ATGAGCTAGG ACACAGCTTC GGCATCCAGC ATGATGGGAA AGAAAATGAC TGTGAGCCTG 1560 

TGGGCAGACA TCCGTACATC ATGTCCCGCC AGCTCCAGTA CGATCCCACT CCGCTGACAT 1620 

GGTCCAAGTG CAGCGAGGAG TACATCACCC GCTTCTTGGA CCGAGGCTGG GGGTTCTGTC 1680 

TTGATGACAT ACCTAAAAAG AAAGGCTTGA AGTCCAAGGT CATTGCCCCC GGAGTGATCT 1740 

ATGATGTTCA CCACCAGTGC CAGCTACAAT ATGGACCCAA TGCTACCTTC TGCCAGGAAG 1800 

TAGAAAACGT CTGCCAGACA CTGTGGTGCT CCGTGAAGGG CTTTTGTCGC TCTAAGCTGG 1860 

ACGCTGCTGC AGATGGAACT CAATGTGGTG AGAAGAAGTG GTGTATGGCA GGCAAGTGCA 1920 

TCACAGTGGG GAAGAAACCA GAGAGCATTC CTGGAGGCTG GGGCCGCTGG TCACCCTGGT 1980 

CCCACTGTTC CAGGACCTGT GGGGCTGGAG TCCAGAGCGC AGAGAGGCTC TGCAACAACC 2040 

CCGAGCCAAA GTTTGGAGGG AAATATTGCA CTGGAGAAAG AAAACGCTAT CGCTTGTGCA 2100 

ACGTCCACCC CTGTCGCTCA GAGGCACCAA CATTTCGGCA GATGCAGTGC AGTGAATTTG 2160 

ACACTGTTCG CTACAAGAAT GAACTCTACC ACTGGTT T CC CATTTTTAAC CCAGCACATC 2220 

CTTGTGAGCT CTACTGCOGA CCCATAGATG GCCAGTTTTC TGAGAAAATG CTGGATGCTG 2280 

TCATTGATGG TACCCCTTGC TTTGAAGGCG GCAACAGCAG AAATGTCTGT ATTAATGGCA 2340 

TATGTAAGAT GGTTGGCTGT GACTATGAGA TCGATTCCAA TGCCACCGAG GATCGCTGCG 2400 

GTGTGTGCCT GGGAGATGGC TCTTCCTGCC AGACTGTGAG AAAGATGTTT AAGCAGAAGG 2460 

AAGGATCTGG TTATGTTGAC ATTGGGCTCA TTCCAAAAGG AGCAAGGGAC ATAAGAGTGA 2520 

TGGAAATTGA GGGAGCTGGA AACTTCCTGG CCATCAGGAG TGAAGATCCT GAAAAATATT 2580 

ACCTGAATGG AGGGTTTATT ATCCAGTGGA ACGGGAACTA TAAGCTGGCA GGGACTGTCT 2640 

TTCAGTATGA CAGGAAAGGA GACCTGGAAA AGCTGATGGC CACAGGTCCC ACCAATGAGT 2700 

CTGTGTGGAT CCAGCTTCTA TTCCAGGTGA CTAACCCTGG CATCAAGTAT GAGTACACAA 2760 

TCCAGAAAGA TGGCCTTGAC AATGATGTTG AGCAGATGTA CTTCTGGCAG TACGGCCACT 2820 

GGACAGAGTG CAGTGTGACC TGCGGGACAG" GTATCCGCCG CCAAACTGCC CATTGCATAA 2880 

AGAAGGGCCG CGGGATGGTG AAAGCTACAT TCTGTGACCC AGAAACACAG CCCAATGGGA 2940 

GACAGAAGAA GTGCCATGAA AAGGCTTGTC CACCCAGGTG GTGGGCAGGG GAGTGGGAAG 3000 

CATGCTCGGC GACATGCGGG CCCCACGGGG AGAAGAAGCG AACOGTGCTG TGCATCCAGA 3060 

CCATGGTCTC TGACGAGCAG GCTCTCCCGC CCACAGACTG CCAGCACCTG CTGAAGCCCA 3120 

AGACCCTCCT TTCCTGCAAC AGAGACATCC TGTGCCCCTC GGACTGGACA GTGGGCAACT 3180 

GGAGTGAGTG TTCTGTTTCC TGTGGTGGTG GAGTGCGGAT TCGCAGTGTC ACATGTGCCA 3240 

AGAACCATGA TGAACCTTGC GATGTGACAA GGAAACCCAA CAGCCGAGCT CTGTGTGGCC 3300 

TCCAGCAATG CCCTTCTAGC CGGAGAGTTC TGAAACCAAA CAAAGGCACT ATTTCCAATG 3360 

GAAAAAACCC ACCAACACTA AAGCCCGTCC CTCCACCTAC ATCCAGGCCC AGAATGCTGA 3420 

CCACACCCAC AGGGCCTGAG TCTATGAGCA CAAGCACTCC AGCAATCAGC AGCCCTAGTC 3480 

CTACCACAGC CTCCAAAGAA GGAGACCTGG GTGGGAAACA GTGGCAAGAT AGCTCAACCC 3540 
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AACCTQAGCT GAGCTCTCGC TATCTCATTT CCACTGGAAG CACTTCCCAG CCCATCCTCA 3600 

CTTCCCAATC CTTGAGCATT CAGCCAAGTG AGGAAAATGT TTCCAGTTCA GATACTGGTC 3660 

CTACCTCGGA GGGAGGCCTT GTAGCTACAA CAACAAGTGG TTCTGGCTTG TCATCTTCCC 3720 

GCAACCCTAT CACTTGGCCT GTGACTCCAT TTTACAATAC CTTGACCAAA GGTCCAGAAA 3780 

5 TGGAGATTCA CAGTGGCTCA GGGGAAGAAA GAGAACAGCC TGAGGACAAA GATGAAAGCA 3840 

ATCCTGTAAT ATGGACCAAG ATCAGAGTAC CTGGAAATGA CGCTCCAGTG GAAAGTACAG 3900 

AAATGCCACT TGCACCTCCA CTAACACCAG ATCTCAGCAG GGAGTCCTGG TGGCCACCCT 3960 

TCAGCACAGT AATGGAAGGA CTGCTCCCCA GCCAAAGGCC CACTACTTCC GAAACTGGGA 4020 

CACCCAGAGT TGAGGGGATG GTTACTGAAA AGCCAGCCAA CACTCTGCTC CCTCTGGGAG 4080 

10 GAGACCACCA GCCAGAACCC TCAGGAAAGA CGGCAAACCG TAACCACCTG AAACTTCCAA 4140 

ACAACATGAA CCAAACAAAA AGTTCTGAAC CAGTCCTGAC TGAGGAGGAT GCAACAAGTC 4200 

TGATTACTGA GGGCTTTTTG CTAAATGCCT CCAATTACAA GCAGCTCACA AACGGCCACG 4260 

GCTCTGCACA CTGGATCGTC GGAAACTGGA GCGAGTGCTC CACCACATGT GGCCTGGGGG 4320 

CCTACTGGAA AAGGGTGGAG TGCACCACCC AGATGGATTC TGACTGTGCG GCCATCCAGA 4380 

15 GACCTGACCC TGCAAAAAGA TGCCACCTCC GTCCCTGTGC TGGCTGGAAA GTGGGAAACT 4440 

GGAGCAAGTG CTCCAGAAAC TGCAGTGGGG GCTTCAAGAT ACGCGAGATT CAGTGCGTGG 4500 

ACAGCCGGGA CCACCGGAAC CTGAGGCCAT TTCACTGCCA GTTCCTGGCC GGCATTCCTC 4560 

CCCCATTGAG CATGAGCTGT AACCCGGAGC CCTGTGAGGC GTGGCAGGTG GAGCCTTGGA 4620 

GCCAGTGCTC CAGGTCCTGT GGAGGTGGAG TTCAGGAGAG AGGAGTGTTC TGTCCAGGAG 4680 

20 GCCTCTGTGA TTGGACAAAA AGACCCACAT CCACCATGTC TTGCAATGAG CACCTGTGCT 4740 

GTCACTGGGC CACTGGGAAC TGGGACCTGT GTTCCACTTC CTGTGGAGGT GGCTTTCAGA 4800 

AGAGGATTGT CCAATGTGTG CCCTCAGAGG GCAATAAAAC TGAAGACCAA GACCAATGTC 4860 

TATGTGATCA CAAACCCAGA CCTCCAGAAT TCAAAAAATG CAACCAGCAG GCCTGCAAGA 4920 

AAAGTGCCGA TTTACTTTGC ACTAAGGACA AACTGTCAGC CAGTTTCTGC CAGACACTGA 4980 

25 AAGCCATGAA GAAATGTTCT GTGCCCACCG TGAGGGCTGA GTGCTGCTTC TCGTGTCCCC 5040 

AGACACACAT CACACACACC CAAAGGCAAA GAAGGCAACG GTTGCTCCAA AAGTCAAAAG 5100 

AACTCTAAGC CCAAA SI 15 
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Seq ID NO i CI 5 5 DNA Sequence 

Nucleic Acid Accession #s NMJ)01062.1 

Coding sequence : 76 1380 

1 11 21 31 41 51 

I I I I I I 

GCTCTCATTA CCTTCTGCCC ATCACTTAAT AAATAGCCAG CCAATTCATC AACATTCTGG 60 

TACACTGTTG GAGAGATGAG ACAGTCACAC CAGCTGCCCC TAGTGGGGCT CTTACTGTTT 120 

TCTTTTATTC CAAGCCAACT ATGCGAGATT TGTGAGGTAA GTGAAGAAAA CTACATCCGC 180 

CTAAAACCTC TGTTGAATAC AATGATCCAG TCAAACTATA ACAGGGGAAC CAGCGCTGTC 240 

AATGTTGTGT TGTCCCTCAA ACTTGTTGGA ATCCAGATCC AAACCCTGAT GCAAAAGATG 300 

ATCCAACAAA TCAAATACAA TGTGAAAAGC AGATTGTCAG ATGTAAGCTC GGGAGAGCTT 3 60 

GCCTTGATTA TACTGGCTTT GGGAGTATGT CGTAACGCTG AGGAAAACTT AATATATGAT 420 

TACCACCTGA CTGACAAGCT AGAAAATAAA TTCCAAGCAG AAATTGAAAA TATGGAAGCA 480 

CACAATGGCA CTCCCCTGAC TAACTACTAC CAGCTCAGCC TGGACGTTTT GGCCTTGTGT 540 

CTGTTCAATG GGAACTACTC AACOGCCGAA GTTGTCAACC ACTTCACTCC TGAAAATAAA 600 

AACTATTATT TTGGTAGCCA GTTCTCAGTA GATACTGGTG CAATGGCTGT CCTGGCTCTG 660 

ACCTGTGTGA AGAAGAGTCT AATAAATGGG CAGATCAAAG CAGATGAAGG CAGTTTAAAG 720 

AACATCAGTA TTTATACAAA GTCACTGGTA GAAAAGATTC TGTCTGAGAA AAAAGAAAAT 780 

GGTCTCATTG GAAACACATT TAGCACAGGA GAAGCCATGC AGGCCCTCTT TGTATCATCA 840 

GACTATTATA ATGAAAATGA CTGGAATTGC CAACAAACTC TGAATACAGT GCTCACGGAA 900 

ATTTCTCAAG GAGCATTCAG TAATCCAAAC GCTGCAGCCC AGGTCTTACC TGCCCTGATG 960 

GGAAAGACCT TCTTGGATAT TAACAAAGAC TCTTCTTGCG TCTCTGCTTC AGGTAACTTC 1020 

AACATCTCCG CTGATGAGCC TATAACTGTG ACACCTCCTG ACTCACAATC ATATATCTCC 1080 

GTCAATTACT CTGTGAGAAT CAATGAAACA TATTTCACCA ATGTCACTGT GCTAAATGGT 1140 

TCTGTCTTCC TCAGTGTGAT GGAGAAAGCC CAGAAAATGA ATGATACTAT ATTTGGTTTC 1200 

ACAATGGAGG AGCGCTCATG GGGGCCCTAT ATCACCTGTA TTCAGGGCCT ATGTGCCAAC 1260 

AATAATGACA GAACCTACTG GGAACTTCTG AGTGGAGGCG AACCACTGAG CCAAGGAGCT 1320 

GGTAGTTACG TTGTCCGCAA TGGAGAAAAC TTGGAGGTTC GCTGGAGCAA ATACTAATAA 1380 

GCCCAAACTT TCCTCAGCTG CATAAAATCC ATTTGCAGTG GAGTTCCATG TTTATTGTCC 1440 

TTATGCCTTC TTCTTCATTT ATCCCAGTAC GAGCAGGAGA GTTAATAACC TCCCCTTCTC 1500 

TCTCTACATG TTCAATAAAA GTTGTTGAAA GATTAAC 1537 

Seq ID NO; C156 DNA Sequence 
Nucleic Acid Accession #: NM 004591 
Coding sequence : 5 9 . . 34 9 

1 11 21 31 41 51 

I I I I I I 

CACTCCCAAA GAACTGGGTA CTCAACACTG AGCAGATCTG TTCTTTGAGC TAAAAACCAT 60 

GTGCTGTACC AAGAGTTTGC TCCTGGCTGC TTTGATGTCA GTGCTGCTAC TCCACCTCTG 120 

CGGCGAATCA GAAGCAGCAA GCAACTTTGA CTGCTGTCTT GGATACACAG ACCGTATTCT 180 

TCATCCTAAA TTTATTGTGG GCTTCACACG GCAGCTGGCC AATGAAGGCT GTGACATCAA 240 

TGCTATCATC TTTCACACAA AGAAAAAGTT GTCTGTGTGC GCAAATCCAA AACAGACTTG 300 

GGTGAAATAT ATTGTGCGTC TCCTCAGTAA AAAAGTCAAG AACATGTAAA AACTGTGGCT 360 

TTTCTGGAAT GGAATTGGAC ATAGCCCAAG AACAGAAAGA ACCTTGCTGG GGTTGGAGGT 420 

TTCACTTGCA CATCATGGAG GGTTTAGTGC TTATCTAATT TGTGCCTCAC TGGACTTGTC 480 

CAATTAATGA AGTTGATTCA TATTGCATCA TAGTTTGCTT TGTTTAAGCA TCACATTAAA 540 

GTTAAACTGT ATTTTATGTT ATTTATAGCT GTAGGTTTTC TGTGTTTAGC TATTTAATAC 600 

TAATTTTCCA TAAGCTATTT TGGTTTAGTG CAAAGTATAA AATTATATTT GGGGGGGAAT 660 

AAGATTATAT GGACTTTCTT GCAAGCAACA AGCTATTTTT TAAAAAAACT ATTTAACATT 720 

CTTTTGTTTA TATTGTTTTG TCTCCTAAAT TGTTGTAATT GCATTATAAA ATAAGAAAAA 780 

CATTAATAAG ACAAATATT 799 



Seq ID NO: C157 DNA Sequence 

Nucleic Acid Accession #: NM 013271.1 
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Coding sequence t 27.. 80 9 

1 11 21 31 41 51 

< I I I I I I 

J TCCGGAGCCA GGCTCGCTGG GGCAGCATGG CGGGGTCGCC GCTGCTCTGQ GGGCCGCGGG 60 

CCGGGGGCGT CGGCCTTTTG GTGCTGCTGC TGCTCGGCCT GTTTCGGCCG CCCCCCGCGC 120 

TCTGCGCGCG GCCGGTAAAG GAACCCCGCG GCCTAAGCGC AGCGTCTCCG CCCTTGGCTG 180 

AGACTGGCGC TCCTCGCCGC TTCCGGCGGT CAGTGCCCCG AGGTGAGGCG GCGGGGGCGG 240 

TGCAGQAGCT GGCGCGGGCG CTGGCGCATC TGCTGGAGGC CGAACGTCAG GAGCGGGCGC 300 

10 GGGCCGAGGC GCAGGAGGCT GAGGATCAGC AGGCGCGCGT CCTGGCGCAG CTGCTGCGCG 360 

TCTGGGGCGC CCCCCGCAAC TCTGATCCGG CTCTGGGCCT GGACGACGAC CCCGACGCGC 420 

CTGCAGCGCA GCTCGCTCGC GCTCTGCTCC GCGCCCGCCT TGACCCTGCC GCCCTAGCAG 480 

CCCAGCTTGT CCCCGCGCCC GTCCCCGCCG CGGCGCTCCG ACCCCGGCCC CCGGTCTACG 540 

ACGACGGCCC CGCGGGCCCG GATGCTGAGG AGGCAGGCGA CGAGACACCC GACGTGGACC 600 

CCGAGCTGTT GAGGTACTTG CTGGGACGGA TTCTTGCGGG AAGCGCGGAC TCCGAGGGGG 660 

TGGCAGCCCC GCGCCGCCTC CGCCGTGCCG CCGACCACGA TGTGGGCTCT GAGCTGCCCC 720 

CTGAGGGCGT GCTGGGGGCG CTGCTGCGTG TGAAACGCCT AGAGACCCCG GCGCCCCAGG 780 

TGCCTGCACG CCGCCTCTTG CCACCCTGAG CACTGCCCGG ATCCCGTGCA CCCTGGGACC 840 

CAGAAGTGCC CCCGCCATCC CGCCACCAGG ACTTCTCCCC GCCAGCACGT CCAGAGCAAC 900 

20 TTACCCCGGC CAGCCAGCCC TCTCACCCGA GGATCCCTAC CCCCTGGCCC ACAATAACAT 960 

GATCTGAGC 969 

,Seq ID NO: C158 DNA Sequence 
Nucleic Acid Accession #: NM_002245.2 
25 Coding sequence: 183.. 1193 

1 11 21 31 41 51 

I I I I I I 

GGGCAGGAAG ACGGCGCTGC CCGGAGGAGC GGGGCGGGCG GGCGCGCGGG GGAGCGGGCG 60 

30 GCGGGCGGGA GCCAGGCCCG GGCGGGGGCG GGGGCGGCGG GGCCAGAAGA GGCGGCGGGC 120 

CGCGCTCCGG CCGGTCTGCG GCGTTGGCCT TGGCTTTGGC TTTGGCGGCG GCGGTGGAGA 180 

AGATGCTGCA GTCCCTGGCC GGCAGCTCGT GCGTGCGCCT GGTGGAGCGG CACCGCTCGG 240 

CCTGGTGCTT CGGCTTCCTG GTGCTGGGCT ACTTGCTCTA CCTGGTCTTC GGCGCAGTGG 300 

TCTTCTCCTC GGTGGAGCTG CCCTATGAGG ACCTGCTGCG CCAGGAGCTG CGCAAGCTGA 360 

35 AGCGACGCTT CTTGGAGGAG CACGAGTGCC TGTCTGAGCA GCAGCTGGAG CAGTTCCTGG 420 

GCCGGGTGCT GGAGGCCAGC AACTACGGCG TGTCGGTGCT CAGCAACGCC TCGGGCAACT 480 

GGAACTGGGA CTTCACCTCC GCGCTCTTCT TCGOCAGCAC CGTGCTCTCC ACCACAGGTT 540 

ATGGCCACAC CGTGCCCTTG TCAGATGGAG GTAAGGCCTT CTGCATCATC TACTCCGTCA 600 

TTGGCATTCC CTTCACCCTC CTGTTCCTGA CGGCTGTGGT CCAGCGCATC ACCGTGCACG 660 

40 TCACCCGCAG GCCGGTCCTC TACTTCCACA TCCGCTGGGG CTTCTCCAAG CAGGTGGTGG 720 

CCATCGTCCA TGCCGTGCTC CTTGGGTTTG TCACTGTGTC CTGCTTCTTC TTCATCCOGG 780 

CCGCTGTCTT CTCAGTCCTG GAGGATGACT GGAACTTCCT GGAATCCTTT TATTTTTGTT 840 

TTATTTCCCT GAGCACCATT GGCCTGGGGG ATTATGTGCC TGGGGAAGGC TACAATCAAA 900 

AATTCAGAGA GCTCTATAAG ATTGGGATCA CGTGTTACCT GCTACTTGGC CTTATTGCCA 960 

TGTTGGTAGT TCTGGAAACC TTCTGTGAAC TCCATGAGCT GAAAAAATTC AGAAAAATGT 1020 

TCTATGTGAA GAAGGACAAG GACGAGGATC AGGTGCACAT CATAGAGCAT GACCAACTGT 1080 

CCTTCTCCTC GATCACAGAC CAGGCAGCTG GCATGAAAGA GGACCAGAAG CAAAATGAGC 1140 

CTTTTGTGGC CACCCAGTCA TCTGCCTGCG TGGATGGCCC TGCAAACCAT TGAGCGTAGG 1200 

ATTTGTTGCA TTATGCTAGA GCACCAGGGT CAGGGTGCAA GGAAGAGGCT TAAGTATGTT 1260 

CATTTTTATC AGAATGCAAA AGCGAAAATT ATGTCACTTT AAGAAATAGC TACTGTTTGC 1320 

AATGTCTTAT TAAAAAACAA CAAAAAAAGA CACATGGAAC AAAGAAGCTG TGACCCCAGC 1380 

AGGATGTCTA ATATGTGAGG AAATGAGATG TCCACCTAAA ATTCATATGT GACAAAATTA 1440 

TCTCGACCTT ACATAGGAGG AGAATACTTG AAGCAGTATG CTGCTGTGGT TAGAAGCAGA 1500 

TTTTATACTT TTAACTGGAA ACTTTGGGGT TTGCATTTAG ATCATTTAGC TGATGGCTAA 1560 

ATAGCAAAAT TTATATTTAG AAGCAAAAAA AAAAAGCATA GAGATGTGTT TTATAAATAG 1620 

GTTTATGTGT ACTGGTTTGC ATGTACCCAC CCAAAATGAT TATTTTTGGA GAATCTAAGT 1680 

CAAACTCACT ATTTATAATG CATAGGTAAC CATTAACTAT GTACATATAA AGTATAAATA 1740 

TGTTTATATT CTGTACATAT GGTTTAGGTC ACCAGATCCT AGTGTAGTTC TGAAACTAAG 1800 

ACTATAGATA TTTTGTTTCT TTTGATTTCT CTTTATACTA AAGAATCCAG AGTTGCTACA 1860 

ATAAAATAAG GGGAATAATA AACTTGAGAG TGAATAACCA T 1901 
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Seq ID NO: C159 DNA Sequence 

Nucleic Acid Accession » : NM_005472 . 1 

Coding sequence: 93.. 404 

1 11 21 31 41 51 

I I I I I I 

AAAGGGACTC CTTGAAACTG ATTGAGAGCC CAGTGGATTT GCCAGCAGTT TGAGCTTCTA 60 

CCGAGTCTTC CCCCACCTCA ATCCCTGTTG CTATGGAGAC TACCAATGGA ACGGAGACCT 120 

GGTATGAGAG CCTGCATGCC GTGCTGAAGG CTCTAAATGC CACTCTTCAC AGCAATTTGC 180 

TCTGCCGGCC AGGGCCAGGG CTGGGGCCAG ACAACCAGAC TGAAGAGAGG CGGGCCAGCC 240 

TACCTGGCCG TGATGACAAC TCCTACATGT ACATTCTCTT TGTCATGTTT CTATTTGCTG 300 

TAACTGTGGG CAGCCTCATC CTGGGATACA CCCGCTCCCG CAAAGTGGAC AAGCGTAGTG 360 

ACCCCTATCA TGTGTATATC AAGAACCGTG TGTCTATGAT CTAACACGAG AGGGCTGGGA 420 

OGGTGGAAGA CCAAGACACC TGGGGATTGC GTCTGGGGCC TCCAGAACTC TGCTGTGGAC 480 

TGCATCAGGT CT 492 



Seq ID NO: CI 60 DNA Sequence 
Nucleic Acid Accession #: NM_005245.l 
oO Coding sequence: 187.. 13 959 

1 11 21 31 ' 41 51 

I I I I I I 

CTGGGCGGCC GGGCGCGGGG AGAGGGCGCG GGAGCGGCTC GTGCGGCAGG TACCATGCGG 60 
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ACGCGCGAGC CCGGCGAGGC CCCGGCAGGC CCGTCCCTGC TCGGGGGOGC GCTGAGACGG 120 

CGGGTGAGCT CCACGAGAGC GCCGTCGCCA CTTCGGGCCA ACTTTGCGAT TCCCGACAGT 180 

TAAGCAATGG GGAGACATTT GGCTTTGCTC CTGCTTCTGC TCCTTCTCTT CCAACATTTT 240 

GGAGACAGTG ATGGCAGCCA ACGACTTGAA CAGACTCCTC TGCAGTTTAC ACACCTCGAG 300 

TACAACGTCA CCGTGCAGGA GAACTCTGCA GCTAAGACTT ATGTGGGGCA TCCTGTCAAG 360 

ATGGGTGTTT ACATTACACA TCCAGCGTGG GAAGTAAGGT ACAAAATTGT TTCCGGAGAC 420 

AGTGAAAACC TGTTCAAAGC TGAAGAQTAC ATTCTCGGAG ACTTTTGCTT TCTAAGAATA 480 

AGGACCAAAG GAGGAAATAC AGCTATTCTT AATAGAGAAG TGAAGGATCA CTACACATTG S40 

ATAGTGAAAG CACTTGAAAA AAATACTAAT GTGGAGGCGC GAACAAAGGT CAGGGTGCAG 600 

GTGCTGGATA CAAATGACTT GAGACCGTTA TTCTCACCCA CCTCATACAG CGTTTCTTTA 660 

CCTGAAAACA CAGCTATAAG GACCAGTATC GCAAGAGTCA GCGCCACGGA TGCAGACATA 720 

GGAACCAACG GGGAATTTTA CTACAGTTTT AAAGATCGAA CAGATATGTT TGCTATTCAC 780 

CCAACCAGTG GTGTGATAGT GTTAACTGGT AGACTTGATT ACCTAGAGAC CAAGCTCTAT 840 

GAGATGGAAA TCCTCGCTGC GGACCGTGGC ATGAAGTTGT ATGGGAGCAG TGGCATCAGC 900 

AGCATGGCCA AGCTAACGGT GCACATCGAA CAGGCCAATG AATGTGCTCC GGTGATAACA 960 

GCAGTGACAT TGTCACCATC AGAACTGGAC AGGGACCCAG CATATGCAAT TGTGACAGTG 1020 

GATGACTGCG ATCAGGGTGC CAATGGTGAC ATAGCATCTT TAAGCATCGT GGCAGGTGAC 1080 

CTTCTCCAGC AGTTTAGAAC AGTGAGGTCC TTTCCAGGGA GTAAGGAGTA TAAAGTCAAA 1140 

GCCATCGGTG ACATTGATTG GGACAGTCAT CCTTTCGGCT ACAATCTCAC ACTACAGGCT 1200 

AAAGATAAAG GAACTCCGCC CCAGTTCTCT TCTGTTAAAG TCATTCACGT GACTTCTCCA 1260 

CAGTTCAAAG CCGGGCCAGT CAAGTTTGAA AAGGATGTTT ACAGAGCAGA AATAAGTGAA 1320 

TTTGCTCCTC CCAACACACC TGTGGTCATG GTAAAGGCCA TTCCTGCTTA TTCCCATTTG 1380 

AGGTATGTTT TTAAAAGGAC ACCTGGAAAA GCTAAATTCA GTTTAAATTA CAACACTGGT 1440 

CTCATTTCTA TTTTAGAACC AGTTAAAAGA CAGCAGGCAG CCCATTTTGA ACTTGAAGTA 1500 

ACAACAAGTG ACAGAAAAGC GTCCACCAAG GTCTTGGTGA AAGTCTTAGG TGCAAATAGC 1560 

AATCCCCCTG AATTTACCCA GACAGCGTAC AAAGCTGCTT TTGATGAGAA CGTGCCCATT 1620 

GGTACTACTA TCATGAGCCT GAGTGCCGTA GACCCTGATG AGGGTGAGAA TGGGTACGTG 1680 

ACATAGAGTA TCGCAAATTT AAATCATGTG CCGTTTGCGA TTGACCATTT CACTGGTGCC 1740 

GTGAGTACGT CAGAAAACCT GGACTACGAA CTGATGCCTC GGGTTTATAC TCTGAGGATT 1800 

CGTGCATCAG ACTGGGGCTT GCCGTACCGC CGGGAAGTCG AAGTCCTTGC TACAATTACT 1860 

CTCAATAACT TGAATGACAA CACACCTTTG TTTGAGAAAA TAAATTGTGA AGGGACAATT 1920 

CCCAGAGATC TAGGCGTGGG AGAGCAAATA ACCACTGTTT CTGCTATTGA TGCAGATGAA 1980 

CTTCAGTTGG TACAGTATCA GATTGAAGCT GGAAATGAAC TGGATTTGTT TAGTTTAAAC 2040 

CCCAACTCGG GGGTATTGTC ATTAAAGCGA TCGCTAATGG ATGGCTTAGG TGCAAAGGTG 2100 

TCTTTCCACA GTCTGAGAAT CACAGCTACA GATGGAGAAA ATTTTGCCAC ACCATTATAT 2160 

ATCAACATAA CAGTGGCTGC CAGTCACAAG CTGGTAAACT TGCAGTGTGA AGAGACTGGT 2220 

GTTGCCAAAA TGCTGGCAGA GAAGCTCCTG CAGGCAAATA AATTACACAA CCAGGGAGAG 2280 

GTGGAGGATA TTTTCTTCGA TTCTCACTCT GTCAATGCTC ACATACCGCA GTTTAGAAGC 2340 

ACTCTTCCGA CTGGTATTCA GGTAAAGGAA AACCAGCCTG TGGGTTCCAG TGTAATTTTC 2400 

ATGAACTCCA CTGACCTTGA CACTGGCTTC AATGGAAAAC TGGTCTATGC TGTTTCTGGA 2460 

GGAAATGAGG ATAGTTGCTT CATGATTGAT ATGGAAACAG GAATGCTGAA AATTTTATCT 2520 

CCTCTTGACC GTGAAACAAC AGACAAATAC ACCCTGAATA TTACCGTCTA TGACCTTGGG 2580 

ATACCCCAGA AGGCTGCGTG GCGTCTTCTA CATGTCGTGG TTGTCGATGC CAATGATAAT 2640 

CCACCCGAGT TTTTACAGGA GAGCTATTTT GTGGAAGTGA GTGAAGACAA GGAGGTACAT 2700 

AGTGAAATCA TCCAGGTTGA AGCCACAGAT AAAGACCTGG GGCCCAACGG ACACGTGACG 2760 

TACTCAATTC TTACAGACAC AGACACATTT TCAATTGACA GCGTGACGGG TGTTGTTAAC 2820 

ATCGCACGCC CTCTGGATCG AGAGCTGCAG CATGAGCACT CCTTAAAGAT TGAGGCCAGG 2B80 

GACCAAGCCA GAGAAGAGCC TCAGCTGTTC TCCACTGTCG TTGTGAAAGT ATCACTAGAA 2940 

GATGTTAATG ACAACCCACC TACATTTATT CCACCTAATT ATCGTGTGAA AGTCCGAGAG 3000 

GATCTTCCAG AAGGAACCGT CATCATGTGG TTAGAAGCCC ACGATCCTGA TTTAGGTCAG 3060 

TCTGGTCAGG TGAGATACAG CCTTCTGGAC CACGGAGAAG GAAACTTCGA TGTGGATAAA 3120 

CTCAGTGGAG CAGTTAGGAT CGTCCAGCAG TTGGACTTTG AGAAGAAGCA AGTGTATAAT 3180 

CTCACTGTGA GGGCCAAAGA CAAGGGAAAG CCAGTTTCTC TGTCTTCTAC TTGCTATGTT 3240 

GAAGTTGAGG TGGTTGATGT GAATGAGAAC CTGCACCCAC CCGTGTTTTC CAGCTTTGTG 3300 

GAAAAGGGGA CAGTGAAAGA AGATGCACCT GTTGGTTCAT TGGTAATGAC GGTGTCGGCT 3360 

CATGATGAGG ACGCCGGAAG AGATGGGGAG ATCCGATACT CCATTAGAGA TGGCTCTGGC 3420 

GTTGGTGTTT TCAAAATAGG TGAAGAGACA GGTGTCATAG AGACGTCAGA TCGACTGGAC 3480 

CGTGAATCGA CCTCCCATTA TTGGCTAACA GTCTTTGCAA CCGATCAGGG TGTCGTGCCT 3540 

CTTTCATCGT TCATAGAGAT CTACATAGAG GTTGAGGATG TCAATGACAA TGCACCACAG 3600 

ACATCAGAGC CTGTTTATTA CCCAGAAATC ATGGAAAATT CTCCTAAAGA TGTATCTGTG 3660 

GTCCAGATCG AGGCATTTGA TCCAGATTCG AGCTCTAATG ACAAGCTCAT GTACAAAATT 3720 

ACAAGTGGAA ATCCACAAGG ATTCTTTTCA ATACATCCTA AAACAGGTCT CATCACAACT 3780 

ACGTCAAGGA AGCTAGACCG AGAACAGCAA GATGAACACA TATTAGAGGT TACTGTGACA 3840 

GACAATGGTA GTCCCCCCAA ATCAACCATT GCAAGAGTCA TTGTGAAAAT CCTTGATGAA 390O 

AATGACAACA AACCTCAGTT TCTGCAAAAG TTCTACAAAA TCAGACTCCC TGAGCGGGAA 3960 

AAGCCAGACC GAGAAAGAAA TGCCAGACGG GAGCCGCTCT ATCGCGTCAT AGCCACCGAC 4020 

AAGGATGAGG GCCCCAATGC AGAAATCTCC TACAGCATCG AAGACGGGAA TGAGCATGGC 4080 

AAATTTTTCA TCGAACCGAA AACTGGAGTG GTTTOGTCCA AGAGGTTTTC AGCAGCTGGA 4140 

GAATATGATA TTCTTTCAAT TAAGGCAGTT GACAATGGTC GCCCTCAAAA GTCATCAACC 4200 

ACCAGACTCC ATATTGAATG GATCTCCAAG CCCAAACAGT CCCTGGAGCC CATTTCATTT 4260 

GAAGAATCAT TTTTTACCTT TACTGTGATG GAAAGTGACC CCGTTGCTCA CATGATTGGA 4320 

GTAATATCTG TGGAGCCTCC TGGCATACCC CTTTGGTTTG ACATCACTGG TGGCAACTAC 4380 

GACAGTCACT TCGATGTGGA CAAGGGAACT GGAACCATCA TTGTTGCCAA ACCTCTTGAT 4440 

GCAGAACAGA AGTCAAACTA CAACCTCACA GTCGAGGCTA CAGATGGAAC CACCACTATC 4500 

CTCACTCAGG TATTCATCAA AGTAATAGAC ACAAATGACC ATCGTCCTCA GTTTTCTACA 4560 

TCAAAGTATG AAGTTGTTAT TCCTGAAGAT ACAGCGCCAG AAACAGAAAT TTTGCAAATC 4620 

AGTGCTGTGG ATCAGGATGA GAAAAACAAA CTAATCTACA CTCTGCAGAG CAGTAGAGAT 4680 

CCACTGAGTC TCAAGAAATT TCGTCTTGAT CCTGCAACCG GCTCTCTCTA TACTTCTGAG 4740 

AAACTGGATC ATGAAGCTGT TTCACCAGCA CACCTCACGG TCATGGTACG AGATCAAGAT 4800 

GTGCCTGTAA AACGCAACTT TGCAAGGATT GTGGTCAATG TCAGCGACAC GAATGACCAC 4860 

GCCCCGTGGT TCACCGCTTC CTCCTACAAA GGGCGGGTTT ATGAATCGGC AGCCGTTGGC 4920 

TCAGTTGTGT TGCAGGTGAC GGCTCTGGAC AAGGACAAAG GGAAAAATGC TGAAGTGCTG 4980 

TACTCGATCG AGTCAGGAAA TATTGGAAAT ATTGGAAATT CTTTTATGAT TGATCCTGTC 5040 

TTGGGCTCTA TTAAAACTGC CAAAGAATTA GATCGAAGTA ACCAAGCGGA GTATGATTTA 5100 
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ATGGTAAAAG CTACAGATAA GGGCAGTCCA CCAATGAGTG AAATAACTTC TGTGCGTATC 5160 

TTTGTCACAA TTGCTGACAA CGCCTCTCCG AAGTTTACAT CAAAAGAATA TTCTGTTGAA 5220 

CTTAGTGAAA CTGTCAGCAT TGGGAGTTTC GTTGGGATGG TTACAGCCCA TAGTCAATCA 5280 

TCAGTGGTGT ATGAAATAAA AGATGGAAAT ACAGGTGATG CTTTTGATAT TAATCCACAT 5340 

TCTGGAACTA TCATCACTCA GAAAGCCCTG GACTTTGAAA CTTTGCCCAT TTACACATTG 5400 

ATAATACAAG GAACTAACAT GGCTGGTTTG TCCACTAATA CAACGGTTCT AGTTCACTTG 5460 

CAGGATGAGA ATGACAACGC GCCAGTTTTT ATGCAGGCAG AATATACAGG ACTCATTAGT 5520 

GAATCAGCCT CAATTAACAG CGTGGTCCTA ACAGACAGGA ATGTCCCACT GGTGATTCGA S580 

GCAGCTGATG CTGATAAAGA CTCAAATGCT TTGCTTGTAT ATCACATTGT TGAACCATCT 5640 

GTACACACAT ATTTTGCTAT TGATTCTAGC ACTGGTGCTA TTCATACAGT ACTAAGTCTG 5700 

GACTATGAAG AAACAAGTAT TTTTCACTTT ACCGTCCAAG TGCATGACAT GGGAACCCCA S760 

CGTTTATTTG CTGAGTATGC AGCGAATGTA ACAGTACATG TAATTGACAT TAATGACTGC 582 0 

CCCCCTGTGT TTGCCAAGCC ATTATATGAA GCATCTCTTT TGTTACCAAC ATACAAAGGA 5880 

GTAAAAGTCA TCACAGTAAA TGCTACAGAT GCTGATTCAA GTGCATTCTC ACAGTTGATT 5940 

TACTCCATCA CCGAAGGCAA CATCGGGGAG AAGTTTTCTA TGGACTACAA GACTGGTGCT 6000 

CTCACTGTCC AAAACACAAC TCAGTTAAGA AGCCGCTACG AGCTAACCGT TAGAGCTTCC 6060 

GATGGCAGAT TTGCCGGCCT TACCTCTGTC AAAATTAATG TGAAAGAAAG CAAAGAAAGT 6120 

CACCTAAAGT TTACCCAGGA TGTCTACTCT GCGGTAGTGA AAGAGAATTC CACCGAGGCC 6180 

GAAACATTAG CTGTCATTAC TGCTATTGGG AGTCCAATCA ATGAGCCTTT GTTTTATCAC 6240 

ATCCTCAACC CAGATCGCAG ATTTAAAATA AGCCGCACTT CAGGGGTTCT GTCAACCACT 6300 

GGCACGCCCT TCGATCGTGA GCAGCAGGAG GCGTTTGATG TGGTTGTAGA AGTGATAGAG 6360 

GAACATAAGC CTTCTGCAGT GGCCCACGTT GTCGTGAAGG TCATTGTAGA AGACCAAAAT 6420 

GATAATGOGC CGGTGTTTGT CAACCTTCCC TACTACGCCG TTGTTAAAGT GGACACTGAG 6480 

GTGGGCCATG TCATTCGCTA TGTCACTGCT GTAGACAGAG ACAGTGGCAG AAACGGGGAA 6540 

GTGCATTACT ACCTCAAGGA ACATCATGAA CACTTTCAAA TTGGACCCTT GGGTGAAATT 6600 

TCACTGAAAA AGCAATTTGA GCTTGACACC TTAAATAAAG AATATCTTGT TACAGTGGTT 6660 

GCAAAAGATG GAGGGAACCC GGCCTTTTCA GOGGAAGTTA TCGTTCCGAT CACTGTCATG 6720 

AATAAAGCCA TGCCTGTGTT TGAAAAACCT TTCTACAGTG CAGAGATTGC AGAGAGCATC 6780 

CAGGTGCACA GCCCTGTGGT CCACGTGCAG GCTAACAGCC CGGAAGGCCT GAAAGTGTTC 6840 

TACAGCATCA CAGACGGAGA CCCTTTCAGC CAGTTCACTA TTAACTTCAA TACTGGAGTT 6900 

ATCAATGTCA TAGCTCCTCT GGACTTTGAG GCCCACCCGG CATATAAGCT GAGCATACGC 6960 

GCAACTGACT CCTTGACGGG CGCTCATGCT GAAGTATTTG TGGACATCAT AGTAGACGAC 7020 

ATCAATGATA ACCCTCCTGT GTTTGCTCAG CAGTCTTATG CGGTGACCCT GTCTGAGGCA 7080 

TCTGTAATTG GAACGTCTGT TGTTCAAGTT AGAGCCACCG ATTCTGATTC AGAACCAAAT 7140 

AGAGGAATCT CATACCAGAT GTTTGGGAAT CACAGCAAGA GTCATGATCA TTTTCATGTA 7200 

GACAGCAGCA CTGGCCTCAT CTCACTACTC AGAACCCTGG ATTACGAGCA GTCCCGGCAG 7260 

CACACGATTT TTGTGAGGGC AGTTGATGGT GGTATGCCCA CGCTGAGCAG TGATGTGATT 7320 

GTCACGGTGG ACGTTACCGA CCTCAATGGT AATCCACCAC TCTTTGAACA ACAGATTTAT 7380 

GAAGCCAGAA TTAGCGAGCA CGCCCCTCAT GGGCATTTCG TGACCTGTGT AAAAGCCTAT 7440 

GATGCAGACA GTTCAGACAT AGACAAGTTG CAGTATTCCA TTCTGTCTGG CAATGATCAT 7500 

AAACATTTTG TCATTGACAG TGCAACAGGG ATTATCACCC TCTCAAACCT GCACCGGCAC 7560 

GCCCTGAAGC CATTTTACAG TCTTAACCTG TCAGTGTCTG ATGGAGTTTT TAGAAGTTCC 7620 

ACCCAGGTTC ATGTAACTGT AATTGGAGGC AATTTGCACA GTCCTGCTTT CCTTCAGAAC 7680 

GAATATGAAG TGGAACTAGC TGAAAACGCT CCCCTACATA CCCTGGTGAT GGAGGTGAAA 7740 

ACTACGGATG GGGATTCTGG TATTTATGGT CACGTTACTT ACCATATTGT AAATGACTTT 7800 

GCCAAAGACA GATTTTACAT AAATGAGAGA GGACAGATAT TTACTTTGGA AAAACTTGAT 7860 

CGAGAAACCC CGGCGGAGAA AGTGATCTCA GTCCGTTTAA TGGCTAAGGA TGCTGGAGGA 7920 

AAAGTTGCTT TCTGCACCGT GAATGTCATC CTTACAGATG ACAATGACAA TGCACCACAA 7980 

TTTOGAGCAA CCAAATACGA AGTGAATATC GGGTCCAGTG CTGCTAAAGG GACTTCAGTC 8040 

GTAAAGTCTG CAAGTGATGC OGATGAGGGC TCCAATGCCG ACATCACCTA TGCCATTGAA 8100 

GCAGACTCTG AAAGTGTAAA AGAGAATTTG GAAATTAACA AACTGTCOGG CGTAATCACT 8160 

ACAAAGGAGA GCCTCATTGG CTTGGAAAAT GAATTCTTCA CTTTCTTTGT TAGAGCTGTG 8220 

GATAATGGGT CTCCATCAAA AGAATCTGTT GTTCTTGTCT ATGTTAAAAT CCTTCCACCG 8280 

GAAATGCAGC TTCCAAAATT TTCAGAACCT TTCTATACCT TTACAGTGTC AGAGGACGTG 8340 

CCTGTTGGAA CAGAGATAGA TCTCATCCGA GCAGAACATA GTGGGACTGT TCTTTACAGC 8400 

CTGGTCAAAG GGAATACTCC AGAAAGCAAT AGGGATGAGT CCTTTGTGAT TGACAGACAG 8460 
AGCGGGAGAC TGAAGTTGGA GAAGAGTCTT GATCATGAGA CAACTAAGTG GTATCAGTTT 8520 
TCCATACTGG CCAGGTGCAC TCAAGATGAC CATGAGATGG TGGCTTCTGT AGATGTTAGT 8580 
ATCCAAGTGA AAGATGCAAA TGACAACAGC CCGGTCTTTG AATCTAGTCC ATATGAGGCA 8640 
TTCATTGTTG AAAACCTGCC AGGGGGAAGT AGAGTAATTC AGATCAGGGC ATCTGATGCT 8700 
GACTCAGGAA CCAACGGCCA AGTTATGTAT AGCCTGGATC AGTCACAAAG TGTGGAAGTC 8760 
ATTGAATCCT TTGCCATTAA CATGGAAACA GGCTGGATTA CAACTTTAAA GGAACTTGAC 8820 
CATGAAAAGA GAGACAATTA CCAGATTAAA GTGGTTGCAT CAGATCATGG TGAAAAGATC 8880 
CAGCTATCCT CCACAGCCAT TGTGGATGTT ACCGTCACCG ATGTCAACGA TAGTCCACCA 8940 
CGATTCACGG CCGAGATCTA TAAAGGGACT GTGAGTGAGG ATGACCCCCA AGGTGGGGTG 9000 
ATTGCCATCT TAAGTACCAC GGATGCTGAT TCTGAAGAGA TCAACAGACA AGTTACATAT 9060 
TTCATAACAG GAGGGGATCC TTTAGGACAG TTTGCCGTTG AAACTATACA GAATGAATGG 9120 
AAGGTATATG TGAAGAAACC TCTAGACAGG GAAAAAAGGG ACAATTACCT TCTTACTATC 9180 
ACGGCAACTG ATGGCACCTT CTCATCAAAA GCGATAGTTG AAGTGAAAGT TCTGGATGCA 9240 
AATGACAACA GTCCAGTTTG TGAAAAGACT TTATATTCAG ACACTATTCC TGAAGACGTC 9300 
CTTCCTGGAA AATTGATCAT GCAGATCTCT GCTACAGACG CAGACATCCG CTCTAAOGCT 9360 
GAAATTACTT ACACGTTATT GGGTTCAGGT GCAGAAAAAT TCAAACTAAA TCCAGACACA 9420 
GGTGAACTGA AAACGTCAAC CCCCCTTGAT CGTGAGGAGC AAGCTGTTTA TCATCTTCTC 9480 
GTCAGGGCCA CAGATGGAGG AGGAAGATTC TGCCAAGCCA GTATTGTCGT CACGCTAGAA 9540 
GATGTGAACG ATAACGCCCC CGAATTCTCT GCCGATCCTT ATGCCATCAC CGTGTTTGAA 9600 
AACACAGAGC CGGGAACGCT GCTGACAAGA GTGCAGGCCA CAGATGCCGA CGCAGGATTA 9660 
AATCGGAAGA TTTTATACTC ACTGATTGAC TCTGCTGATG GGCAGTTCTC CATTAACGAA 9720 
TTATCTGGAA TTATTCAGTT AGAAAAACCT TTGGACAGAG AACTCCAGGC AGTATACACC 9780 
CTCTCTTTGA AAGCTGTGGA TCAAGGCTTG CCAAGGAGGC TGACTGCCAC TGGCACTGTG 9840 
ATTGTATCAG TTCTTGACAT AAATGACAAC CCCCCTGTGT TTGAGTACCG TGAATATGGT 9900 
GCCACCGTGT CTGAGGACAT TCTTGTTGGA ACTGAAGTTC TTCAAGTGTA TGCAGCAAGT 9960 
CGGGATATTG AAGCAAATGC AGAAATCACC TACTCAATAA TAAGTGGAAA TGAACATGGG 10020 
AAATTCAGCA TAGATTCTAA AACAGGGGCC GTATTTATCA TTGAGAATCT GGATTATGAG 10080 
AGCTCTCATG AGTATTACCT AACAGTAGAG GCCACTGATG GAGGCACGCC TTCACTGAGC 10140 
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GACGTTGCCA CTGTGAACGT TAATGTAACA GATATCAACG ATAATACCCC TGTGTTCAGC 10200 

CAAGACACCT ACACGACAGT CATCAGTGAA GATGCCGTTC TTGAGCAGTC TGTCATCACG 10260 

GTTATGGCCG ATGATGCCGA TGGACCTTCC AACAGCCACA TCCACTACTC AATTATAGAT 10320 

GGCAACCAAG GAAGCTCGTT CACAATTGAC CCCGTCAGGG GAGAAGTCAA AGTGACCAAA 103 BO 

CTTCTCGACC GAGAAACGAT TTCAGGTTAC ACGCTCACGG TTCAAGCTTC TGATAATGGC 10440 

AGTCCACCCA GAGTCAACAC GACGACCGTG AACATCGATG TGTCCGATGT CAATGACAAC 1050 0 

GCGCCCGTCT TCTCCAGGGG AAACTACAGT GTCATTATCC AGGAAAATAA GCCAGTGGGC 10560 

TTCAGCGTGC TGCAGCTGGT AGTAACAGAT GAGGATTCTT CCCATAACGG TCCACCCTTC 10620 

TTCTTTACTA TTGTAACTGG AAATGATGAG AAGGCTTTTG AAGTTAACCC GCAAGGAGTC 10680 

CTCCTGACAT CATCTGCCAT CAAGAGGAAG GAGAAAGATC ATTACTTACT GCAGGTGAAG 10740 

GTGGCAGATA ATGGAAAGCC TCAGTTGTCA TCTTTGACAT ACATTGACAT TAGGGTAATT 10800 

GAG GAG AG CA TCTATCCGCC TGCGATTTTG CCCCTGGAGA TTTTCATCAC CTCTTCTGGA 10860 

GAAGAATACT CAGGTGGCGT CATTGGGAAG ATCCATGCCA CAGACCAGGA CGTGTATGAT 10920 

ACTCTAACCT ACAGTCTCGA CCCTCAGATG GACAACCTGT TCTCTGTTTC CAGCACAGGG 10980 

GGCAAGCTGA TAGCACACAA AAAGCTAGAC ATAGGGCAAT ACCTTCTCAA TGTCAGCGTA 11040 

ACAGATGGGA AGTTCACGAC GGTGGCCGAC ATCACAGTGC ATATCAGACA AGTCACACAG 11100 

GAGATGTTGA ACCACACCAT CGCGATCCGC TTTGCCAACC TCACTCCGGA AGAATTCGTT 11160 

GGTGACTACT GGCGCAACTT CCAGCGAGCT TTACGGAACA TCCTGGGTGT GAGGAGGAAC 11220 

GACATACAGA TTGTTAGTTT GCAGTCCTCT GAACCTCACC CACATCTGGA CGTCTTACTT 11280 

TTTGTAGAGA AACCAGGTAG TGCTCAGATC TCAACAAAAC AACTTCTGCA CAAGATTAAC 11340 

TCTTCCGTGA CTGACATTGA GGAAATCATT GGAGTTAGGA TACTGAATGT ATTCCAGAAA 11400 

CTCTGCGCGG GACTGGACTG CCCCTGGAAG TTCTGCGATG AAAAGGTGTC TGTGGATGAA 11460 

AGTGTGATGT CAACACACAG CACAGCCAGA CTGAGTTTTG TGACTCCCCG CCACCACAGG 11520 

GCAGCGGTGT GTCTCTGCAA AGAGGGAAGG TGCCCACCTG TCCACCATGG CTGTGAAGAT 11580 

GATCCGTGCC CTGAGGGATC CGAATGTGTG TCTGATCCCT GGGAGGAGAA ACACACCTGT 11640 

GTCTGTCCCA GCGGCAGGTT TGGTCAGTGC CCAGGGAGTT CATCTATGAC ACTGACTGGA 11700 

AACAGCTACG TGAAATACCG TCTGACGGAA AATGAAAACA AATTAGAGAT GAAACTGACC 11760 

ATGAGGCTCA GAACATATTC CACGCATGCG GTTGTCATGT ATGCTCGAGG AACTGACTAT 11820 

AGCATCTTGG AGATTCATCA TGGAAGGCTG CAGTACAAGT TTGACTGTGG AAGTGGCCCT 11880 

GGAATTGTCT CTGTTCAGAG CATTCAGGTC AATGATGGGC AGTGGCACGC AGTGGCCCTG 11940 

GAAGTGAATG GAAACTATGC TCGCTTGGTT CTAGACCAAG TTCATACTGC ATCGGGCACA 12000 

GCCCCAGGGA CTCTGAAAAC CCTGAACCTG GATAACTATG TQTTTTTTGO TGGCCACATC 12060 

CGTCAGCAGG GAACAAGGCA TGGAAGAAGT CCTCAAGTTG GTAATGGTTT CAGGGGTTGT 12120 

ATGGACTCCA TTTATTTGAA TGGGCAGGAG CTCCCTTTAA ACAGCAAACC CAGAAGCTAT 12180 

GCACACATCG AAGAGTCGGT GGATGTATCT CCAGGCTGCT TCCTGACGGC CAOGGAAGAC 12240 

TGCGCCAGCA ACCCTTGCCA GAATGGAGGC GTTTGCAATC CGTCACCTGC TGGAGGTTAT 12300 

TACTGCAAAT GCAGTGCCTT GTACATAGGG ACCCACTGTG AGATAAGCGT CAATCCGTGT 12360 

TCCTCCAACC CATGCCTCTA TGGGGGCACG TGTGTTGTCG ACAACGGAGG CTTTGTTTGC 12420 

CAGTGTAGAG GATTATATAC TGGTCAGAGG TGTCAGCTTA GTCCATACTG CAAAGATGAA 12480 

CCCTGTAAGA ATGGCGGAAC ATGCTTTGAC AGTTTGGATG GCGCCGTTTG TCAGTGTGAT 12540 

TCGGGTTTTA GGGGAGAAAG GTGTCAGAGT GATATCGACG AGTGCTCTGG AAACCCTTGC 12600 

CTGCACGGGG CCCTCTGTGA GAACACGCAC GGCTCCTATC ACTGCAACTG CAGCCACGAG 12660 

TACAGGGGAC GTCACTGCGA GGATGCTGCG CCCAACCAGT ATGTGTCCAC GCCGTGGAAC 12720 

ATTGGGTTGG CGGAAGGAAT TGGAATCGTT GTGTTTGTTG CAGGGATATT TTTACTGGTG 12780 

GTGGTGTTTG TTCTCTGCCG TAAGATGATT AGTCGGAAAA AGAAGCATCA GGCTGAACCT 12840 

AAAGACAAGC ACCTGGGACC CGCTACGGCT TTCTTGCAAA GACCGTATTT TGATTCCAAG 12900 

CTAAATAAGA ACATTTACTC AGACATACCA CCCCAGGTGC CTGTCCGGCC TATTTCCTAC 12960 

ACCCCGAGTA TTCCAAGTGA CTCAAGAAAC AATCTGGACC GAAATTCCTT CGAAGGATCT 13020 

GCTATCCCAG AGCATCCCGA ATTCAGCACT TTTAACCCCG AGTCTGTGCA CGGGCACCGA 13080 

AAAGCAGTGG CGGTCTGCAG CGTGGCGCCA AACCTGCCTC CCCCACCCCC TTCAAACTCC 13140 

CCTTCTGACA GCGACTCCAT CCAGAAGCCT AGCTGGGACT TTGACTATGA CACAAAAGTG 13200 

GTGGATCTTG ATCCCTGTCT TTCCAAGAAG CCTCTAGAGG AAAAGCCTTC CCAGCCATAC 13260 

AGTGCCCGGG AAAGCCTGTC TGAAGTGCAG TCCCTGAGCT CCTTCCAGTC CGAATCGTGC 13320 

GATGACAATG GGTATCACTG GGATACATCA GATTGGATGC CAAGCGTTCC TCTGCCGGAC 13380 

ATACAAGAGT TCCCCAACTA TGAGGTGATT GATGAGCAGA CACCCCTGTA CTCAGCAGAT 13440 

CCAAACGCCA TCGATACGGA CTATTACCCT GGAGGCTACG ACATCGAAAG TGATTTTCCT 13500 

CCACCCCCAG AAGACTTCCC CGCAGCTGAT GAGCTACCAC CGTTACCGCC CGAATTCAGC 13560 

AATCAGTTTG AATCCATCCA CCCTCCTAGA GACATGCCTG CCGCGGGTAG CTTGGGTTCT 13620 

TCATCAAGAA ACCGGCAGAG GTTCAACTTG AATCAGTATT TGCCCAATTT TTATCCCCTC 13680 

GATATGTCTG AACCTCAAAC AAAAGGCACT GGTGAGAATA GTACTTGTAG AGAACCCCAT 13740 

GCCCCTTACC CGCCAGGGTA TCAAAGACAC TTCGAGGCGC CCGCTGTCGA GAGCATGCCC 13800 

ATGTCTGTGT ACGCCTCCAC CGCCTCCTGC TCTGACGTGT CAGCCTGCTG CGAAGTGGAG 13860 

TCCGAGGTCA TGATGAGTGA CTATGAGAGC GGGGACGACG GCCACTTCGA AGAGGTGACG 13920 

ATCCCGCCCC TGGATTCCCA GCAGCACACG GAAGTCTGAC TCTCAACTCC CCCCAAAGTG 13980 

CCTGACTTTA GTGAACCTAG AGGTGATGTG AGTAATCCGC GCTGTTCTTT GCAGCAGTGC 14040 

TTCCAAGCTT TTTTTGGTGA GCCGAATGGG CATGGCTGCG CTGGATCCTG CGCCTCTGGA 14100 

CGTGCTAGCC ATTTCCAGTG TCCCAACTAC TGTCATCGTG AGGTTTTCAT CGGCTGTGCC 14160 

ATTTCCCAAC GTCTTTTGGG ATTTACATCT GTCTGTGTTA AAATAATCAA ACGAAAAATC 14220 

AGTCCTGTGT TGTCAGCATG ATTCATGTAT TTATATAGAT TTGATTATTT TAATTTTCCT 14280 

GTCTCTTTTT TTTGTAAATT TTATGTACAG ATTTGATTTT TCATAGTTTT AACTAGATTT 14340 

CCAAGATATT TTGTGCATTT GTTTCAACTG AATTTTGGTG GTGTCAGTGC CATTATCTAG 14400 

CACCCTGATT TTTTTTTTTT TACTATAACC AGGGTTTCAT TCTGTCTTTT TCCACTGAAG 14460 

TGTGACATTT TGTTAGTACA TTTCAGTGTA GTCATTCATT TCTAGCTGTA CATAGGATGA 14520 

AGGAGAGATC AGATACATGA ACATGTCTTA CATGGGTTGC TGTATTTAGA ATTATAAACA 14580 

TTTTTCATTA TTGGAAAGTG TAACGGGGAC CTTCTGCATA CCTGTTTAGA ACCAA AACCA 14640 

CCATGACACA GTTTTTATAG TGTCTGTATA TTTGTGATGC AATGGTCTTG TAAAGGTTTT 14700 

TAATGAAAAC TACCATTAGC CAGTCTTTCT TACTGACAAT AAATTATTAA TAAAAT 14756 

Seq ID NO: CI 61 DNA Sequence 

Nucleic Acid Accession 8: NM_014220.1 

Coding sequence: 102.. 710 
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GTGGTGTTTG 
CTGAAGACAA 
CACGATGCAT 
TGCTTTACTT 
TGTGGTTCTT 
TCATTGGGCT 
GATGTGCGAT 
GTGTCATTGT 
AGTGGAACTA 
CCGAGTGCAC 
TGGCTCTTGG 
GAGGCATATG 
CCAGGACAGA 
ATTTGTAAAA 
TAAAGACTGG 
GTTTATTTGT 
TTACAGACTG 
AAATAACATT 
AGTTGCTTTT 
CACTGCTTGT 
AACTGCCTTG 
TGTTCCAATC 
GAGATTCAAA 
ATCATGTGTT 
AATAAAGCAG 
AGGAAATACC 
AAAAAAAAAA 



CTTTCTCCAC 
AGAGAAGGGG 
CGGACATTCT 
TCCCAATGGG 
TTCTGGCATC 
GGAACAGGAT 
GCTTTCTTCT 
GGCAGCCCTT 
CACCTTTGCC 
TGAACCCAAG 
TGGAATTGAA 
TGGCTTTTGC 
GCCACAATCT 
CTTTGTATTA 
CATCTTCACA 
TTTTGTTTTT 
AGTGACAGTA 
CCAATCACTA 
TATAAGACCA 
ATGATGTTTC 
TGTTCTGTGA 
CAAATGAATG 
TTTTTCTAAC 
TAAAAAAAGA 
TTGATCAGCA 
CTCAAAACTA 
AAAAAAAAAA 



CAGAAGGGCA 
GAGAAAACCT 
CTGGTGGGGC 
GAAACAAAGT 
GTAGGAGGTG 
GACTGCTGTG 
GTATTGGCTG 
GGCTTAGCAG 
AGCACCGAGG 
CACATTGTGG 
TTCATCTTGT 
TGCTCTCACC 
TCCTCTATTT 
GTGTAACATA 
GGATGTCAGT 
TTTTAAGGAA 
CTCAGTATAT 
TTGTATATAT 
AGAAGGAGAA 
CCATTCATAC 
GAAACAAATA 
CATCACAACT 
ATATGGAAAG 
AAGGCTACGA 
TCATTGGAAC 
ACTTGTTTAC 
AAA 



CACTTTCATC 
AGCAGACCAC 
TCGCCCTCCT 
ATGCCTCCGA 
GCCTGCTGAT 
GCTGCTGTGG 
CTCTCATTGG 
AAGGACCACT 
GCCAGTACCT 
AATGGAATGT 
GTCTTATTCA 
AACAGCAATA 
CATTGTAATT 
CTCCCCACAG 
GTTTAAATTT 
TGAGGAAACA 
CTGAGATAAA 
GTGCATGTAT 
AATCCGACAA 
ACCTATAAAT 
TTTACTTAGA 
TACAATGCTG 
CCTTTTGTCC 
TGACTGGGCA 
ATGGGGACGA 
AACAAAATAA 



TAATTTGGGG 
CATGTGCTAT 
GTGCATCGCG 
AAACCACCTC 
GCTCCTGCCA 
CCATGAAAAC 
AATTGCAGGA 
ATGTCTTGAT 
TCTGGATACC 
ATCTCTGTTT 
AGTAATAAAT 
TGACTGCTAA 
TATATATTTC 
TCTACTTTTA 
AGTAAACTTC 
AACCACCCTC 
CTCTATAATG 
TTTTTAAATT 
CCTGGAAAGA 
CTCTAACAAG 
GTGGAAGGAC 
CTCATTGTTG 
TCCAAAGATG 
AGAAGAAAGA 
GTGACGGCAG 
AGTATTCACT 



TATCACTGAG 
GGGAAGTGTG 
GCTAATATTT 
AGCCGCTTCG 
GCATTTGTCT 
TGTGGCAAAC 
TCTGGCTACT 
TCCCTCGGCC 
TCCACATGGT 
TCTATCCTCT 
GGAGTGCTTG 
AAGAACCAAC 
ACTTGTATTC 
CAAAOGCCTG 
TTTTTTGTTT 
TGGGGGTAGT 
TTTTGGATAA 
AAAGATGTCT 
TTTTTGTTTT 
AGGCCCTTTG 
TGATTGAGAA 
TGAGTACTAT 
AGTACTAGGG 
TGGGAAACTG 
GAGGACCACG 
ACGAAAAAAA 



Seq ID NO: C162 UNA Sequence 

Nucleic Acid Accession #: NMJJ03759.1 

Coding sequence: 150.. 3257 
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1 11 21 31 41 51 

I I I I 1 I 

GTTCTTTGTG ACACATCACA CAGAATTGGA GTGCTGTCCT TCTGGAGAGT GGTGGAGAAC 60 

CAAGATACAG TTCAGAACCA AAGGAATAGA GAAGGGCTTT GATTTCTTTT TGGCTTTAGA 120 

TTGGGGATTT GGGAGGCTTA GCAGGAAAGA TGTCCACTGA AAATGTGGAA GGGAAGCCCA 180 

GTAACCTTGG GGAGAGAGGA AGAGCCCGGA GCTCCACTTT CCTCAGGGTT GTCCAGCCAA 240 

TGTTTAACCA CAGTATTTTC ACTTCTGCAG TCTCTCCTGC TGCAGAACGC ATCCGATTCA 300 

TCTTGGGAGA GGAGGATGAC AGCCCAGCTC CCCCTCAGCT CTTCACGGAA CTGGATGAGC 360 

TGCTGGCCGT GGATGGGCAG GAGATGGAGT GGAAGGAAAC AGCCAGGTGG ATCAAGTTTG 420 

AAGAAAAAGT GGAACAGGGT GGGGAAAGAT GGAGCAAGCC CCATGTGGCC ACATTGTCCC 480 

TTCATAGTTT ATTTGAGCTG AGGACATGTA TGGAGAAAGG ATCCATCATG CTTGATCGGG 540 

AGGCTTCTTC TCTCCCACAG TTGGTGGAGA TGATTGTTGA CCATCAGATT GAGACAGGCC 600 

TATTGAAACC TGAACTTAAG GATAAGGTGA CCTATACTTT GCTCCGGAAG CACCGGCATC 660 

AAACCAAGAA ATCCAACCTT CGGTCCCTGG CTGACATTGG GAAGACAGTC TCCAGTGCAA 720 

GTAGGATGTT TACCAACCCT GATAATGGTA GCCCAGCCAT GACCCATAGG AATCTGACTT 780 

CCTCCAGTCT GAATGACATT TCTGATAAAC CGGAGAAGGA CCAGCTGAAG AATAAGTTCA 840 

TGAAAAAATT GCCACGTGAT GCAGAAGCTT CCAACGTGCT TGTTGGGGAG GTTGACTTTT 900 

TGGATACTCC TTTCATTGCC TTTGTTAGGC TACAGCAGGC TGTCATGCTG GGTGCCCTGA 960 

CTGAAGTTCC TGTGCCCACA AGGTTCTTGT TCATTCTCTT AGGTCCTAAG GGGAAAGCCA 1020 

AGTCCTACCA CGAGATTGGC AGAGCCATTG CCACCCTGAT GTCTGATGAG GTGTTCCATG 1080 

ACATTGCTTA TAAAGCAAAA GACAGGCACG ACCTGATTGC TGGTATTGAT GAGTTCCTAG 1140 

ATGAAGTCAT CGTCCTTCCA CCTGGGGAAT GGGATCCAGC AATTAGGATA GAGOCTCCTA 1200 

AGAGTCTTCC ATCCTCTGAC AAAAGAAAGA ATATGTACTC AGGTGGAGAG AATGTTCAGA 1260 

TGAATGGGGA TACGCCCCAT GATGGAGGTC AOGGAGGAGG AGGACATGGG GATTGTGAAG 1320 

AATTGCAGCG AACTGGACGG TTCTGTGGTG GACTAATTAA AGACATAAAG AGGAAAGCGC 1380 

CATTTTTTGC CAGTGATTTT TATGATGCTT TAAATATTCA AGCTCTTTCG GCAATTCTCT 1440 

TCATTTATCT GGCAACTGTA ACTAATGCTA TCACTTTTGG AGGACTGCTT GGGGATGCCA 1500 

CTGACAACAT GCAGGGCGTG TTGGAGAGTT TCCTGGGCAC TGCTGTCTCT GGAGCCATCT 1560 

TTTGCCTTTT TGCTGGTCAA CCACTCACTA TTCTGAGCAG CACCGGACCT GTCCTAGTTT 1620 

TTGAGAGGCT TCTATTTAAT TTCAGCAAGG ACAATAATTT TGACTATTTG GAGTTTCGCC 1680 

TTTGGATTGG CCTGTGGTCC GCCTTCCTAT GTCTCATTTT GGTAGCCACT GATGCCAGCT 1740 

TCTTGGTTCA ATACTTCACA CGTTTCACGG AGGAGGGCTT TTCCTCTCTG ATTAGCTTCA 1800 

TCTTTATCTA TGATGCTTTC AAGAAGATGA TCAAGCTTGC AGATTACTAC CCCATCAACT 1860 

CCAACTTCAA AGTGGGCTAC AACACTCTCT TTTCCTGTAC CTGTGTGCCA CCTGACCCAG 1920 

CTAATATCTC AATATCTAAT GACACCACAC TGGCCCCAGA GTATTTGCCA ACTATGTCTT 1980 

CTACTGACAT GTACCATAAT ACTACCTTTG ACTGGGCATT TTTGTCGAAG AAGGAGTGTT 2040 

CAAAATACGG AGGAAACCTT GTCGGGAACA ACTGTAATTT TGTTCCTGAT ATCACACTCA 2100 

TGTCTTTTAT CCTCTTCTTG GGAACCTACA CCTCTTCCAT GGCTCTGAAA AAATTCAAAA 2160 

CTAGTCCTTA TTTTCCAACC ACAGCAAGAA AACTGATCAG TGATTTTGCC ATTATCTTGT 2220 

CCATTCTCAT CTTTTGTGTA ATAGATGCCC TAGTAGGCGT GGACACCCCA AAACTAATTG 2280 

TGCCAAGTGA GTTCAAGCCA ACAAGTCCAA ACCGAGGTTG GTTCGTTCCA CCGTTTGGAG 2340 
AAAACCCCTG GTGGGTGTGC CTTGCTGCTG CTATCCCGGC TTTGTTGGTC ACTATACTGA 2400 
TTTTCATGGA CCAACAAATT ACAGCTGTGA TTGTAAACAG GAAAGAACAT AAACTCAAGA 2460 
AAGGAGCAGG GTATCACTTG GATCTCTTTT GGGTGGCCAT CCTCATGGTT ATATGCTCCC 2520 
TCATGGCTCT TCCGTGGTAT GTAGCTGCTA CGGTCATCTC CATTGCTCAC ATCGACAGTT 2580 
TGAAGATGGA GACAGAGACT TCTGCACCTG GAGAACAACC AAAGTTTCTA GGAGTGAGGG 2640 
AACAAAGAGT CACTGGAACC CTTGTGTTTA TTCTGACTGG TCTGTCAGTC TTTATGGCTC 2700 
CCATCTTGAA GTTTATACCC ATGCCTGTAC TCTATGGTGT GTTCCTGTAT ATGGGAGTAG 2760 
CATCCCTTAA TGGTGTGCAG TTCATGGATC GTCTGAAGCT GCTTCTGATG CCTCTGAAGC 2820 
ATCAGCCTGA CTTCATCTAC CTGCGTCATG TTCCTCTGCG CAGAGTCCAC CTGTTCACTT 2880 
TCCTGCAGGT GTTGTGTCTG GCCCTGCTTT GGATCCTCAA GTCAACGGTG GCTGCTATCA 2940 
TTTTTCCAGT AATGATCTTG GCACTTGTAG CTGTCAGAAA AGGCATGGAC TACCTCTTCT 3000 
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CCCAGCATGA CCTCAGCTTC CTGGATGATG TCATTCCAGA AAAGGACAAG AAAAAGAAGG 3060 

AGGATGAGAA GAAAAAGAAA AAGAAGAAGG GAAGTCTGGA CAGTGACAAT GATGATTCTG 3120 

ACTGCCCATA CTCAGAAAAA GTTCCAAGTA TTAAAATTCC AATGGACATC ATGGAACAGC 3180 

AACCTTTCCT AAGCGATAGC AAACCTTCTG ACAGAGAAAG ATCACCAACA TTCCTTGAAC 3240 

5 GCCACACATC ATGCTGATAA AATTCCTTTC CTTCAGTCAC TCGGTATGCC AAGTCCTCCT 3300 

AGAACTCCAG TAAAAGTTGC CTCAAATTAG ACTAGAACTT GAACCTGAAG ACAATGATTA 3360 

TTTCTGGAGG AGCAAGGGAA CAGAAACTAC ATTGTAACCT GTTTGTCTTT CTTAAAACTG 3420 

ACATTTGTTG TTAATGTCAT TTGTTTTTGT TTGGCTGTTT GTTTATTTTT TAACTTTTAT 3480 

TTCGTCTCAG TTTTTGGTCA CAGGCCAAAT AATACAGCGC TCTCTCTGCT TCTCTCTTGC 3 540 

10 ATAGATACAA TCAAGACAAT AGTGCACOGT TCCTTAAAAA CAGCATCTGA GGAATCCCCC 3600 

TTTTGTTCTT AAACTTTCAG ATGTGTCCTT TGATAACCAA ATTCTGTCAC TCAAGACACA 3660 

GACACCCACA GACCCTGTCC TTTGCCTCTA TTAAGCAGAG GATGGAAGTA TTAAGGATTT 3720 

TGTAACACCT TTTATGAAAA TGTTGAAGGA ACTTAAAACT TTAGCTTTGG AGCTGTGCTT 3780 

ACTGGCTTGT CTTTGTCTGG TAGAACAAAC CTTGACCTCC AGACAGAGTC CCTTCTCACT 3840 

15 TATAGAGCTC TCCAGGACTG GAAAAAGTGC TGCTATTTTA ACTTGCTCTT GCTTGTAAAT 3900 

CCTAATCTTA GAGTTATCAA AAGAAGAAAA AACTGAAGGT ACTTTACTCC CTATAGAGAA 3960 

ACCATTGCCA TCATTGTAGC AAGTGCTGGA ATGTCCCTTT TTTCCTATGC AACTTTTTTA 4020 

TAACCCTTTA ATGAACTTAT CTGTGGAGTA CATTGAAGAA TATTTTTCTT CCTAGATTTT 4080 

GTTGTTTAAA TTATGGGGCC TAACCTGCCA CTTATTTTTT GTCAATTTTT AAAACTTTTT 4140 

20 TTTAATTACT GTAAAGAAAA TGAATTTTTT CCTGCAGCAG GAAACATAGT TTTCAGTAGT 4200 

TCTACCTCTT ATTTGTAGCT GCCAGGCTTT CTGTAAAAAT TGTATTGTAT ATAATGTGAT 4260 

TTTTACACAT ACATACACAC ACAAATACAC AATCTCTAGG GTAAGCCAGA AGGCAAGATC 4320 

AGATTAAAAA CACCATGTTT CTAAGCATCC ATTTTTCCCT TTCTTTAAAA GAAACTTAAC 4380 

TGTTCTATGA AGGAGATTGA GGGAGAAGAG ACAAACTCCT ATGTCATGAG AATAACCGAT 4440 

25 GTTCTGATAA TAGTAGCATC TAGGTACAGA TGCTGGTTGT ATTACCACGT CAATGTCCTA 4500 

TGCAGTATTG TTAGACATTT TCTCATTTTG AAATATTTGT GTGTTTGTGT ATGTGCTCTG 4560 

TGCCATGGCT GGTGTATATA TGTGCAATGT TAGAAGGCAA AAGAGTGATG GTAGGCAGAG 4620 

GGCAAAGTCA TTGAATCTCT TATGCCAGTT TTCATAAAAC CCAAACCACA TATGAAAAAA 4680 

TCCATTAAGG GTCCAAGAAG TCTGTCCATA TGAAAATGAG GGTAAATATA GTTTATTTCC 4740 

30 CAGGTATCAG TCATTATAAT TGATATAATA GCTCTAACAT GCAATATAAA ATTCATAGGA 4800 

GTATTAATAG CCCATTTACA CATCTATAAA ATGTAATGGG ATTGCAGAGC TGCAGAGTAC 4860 

AGTGTAACAG TACTCTCATG CAATTTTTTT CAGGATGCAA AGGCAATTAT TCTTTGTAAG 4920 

CGGGACATTT AGATATATTT GTGTACATAT TATATGTATG TATATTTCAA AGTACCACAC 4980 

TGAAAATTAG ACATTTATTA ACCAAATTTA ACGTGGTATT TAAAGGTAAT ATTTTTAATA 504O 

35 TGATACATTA CATATTGTGA ATGTATACTA AAAAAACATT TTAAATGTTA AAATTATAAT 5100 

TTCAGATTCA TATAACCACA ACTGTGATAT ATCCTAACTA TAACCAGTTG TTGAGGGGTA 5160 

TACTAGAAGC AGAATGAAAC CACATTTTTT GGTTTGATAA TATGCACTTA TTGACTCCCA 5220 

CTCATTGTTA TGTTAATTAA GTTATTATTC TGTCTCCTTG TAATTTTGAT TACAAAAATT 5280 

TTATTATCCT GAGTTAGCTG TTACTTTTAC AGTACCTGAT ACTCCTAAAA CTTTTAACTT 5340 

40 ATACAAATTA GTCAATAATG ACCCCAATTT TTTCATTAAA ATAATAGTGG TGAATTATAT 5400 

GTTATTGTGT TAAAACCTCA CTTGCCAAAT TCTGGCTTCA CATTTGTATT TAGGGCTATC 5460 

CTTAAAATGA TGAGTCTATA TTATCTAGCT TTCTATTACC CTAATATAAA CTGGTATAAG 5520 

AAGACTTTCC TTTTTTCTTT ATGCATGGAA GCATCAATAA ATTGTTTAAA AACCATGTAT 5580 

AGTAAATTCA GCTTAACCCG TGATCTTCTT AAGTTAAAGG TACTTTTGTT TTATAAAAGC 5640 

45 TCTAGATAAA ACTTTCTTTT CTGATCATGA ATCAAGTATC TGTGGTTTCA TGCCCCTCTC 5700 

TATACCTTTC AAAGAACTCC TGAAGCAACT TAACTCATCA TTTCAGCCTC TGAGTAGAGG 5760 

TAAAACCTAT GTGTACTTCT GTTTATGATC CATATTGATA TTTATGACAT GAACACAGAA 5820 

TAGTACCTTA CATTTGCTAA ACAGACAGTT AATATCAAAT CCTTTCAATA TTCTGGGAAC 5880 

CCAGGGAAGT TTTTAAAAAT GTCATTACTT TCAAAGGAAC AGAAGTAGTT AACCAAACTA 5940 

50 ACAAGCAAAA CCTGAGGTTT ACCTAGTGAC ACCAAATTAT CGGTATTTTA ACTGAATTTA 6000 

CCCATTGACT AAGAATGAAC CGGATTTGGT GGTGGTTTTG TTTCTATGCA AACTGGACAC 60 60 

AAATTACAAC AGTAAATTTT TTTATAAGTG CTTCTCCCTT CTCCATGATG TGACTTCCGG 6120 

AGATAAAGGA TTCAAAAGAT AAAGACAAAG TACGCTCAGA GTTGTTAACC AGAAAGTCCT 6180 

GGCTGTGGTT GCAGAAACAC TGTTGGAAGA AAAGAGATGA CTAAGTCAAG TGTCTGCCTT 6240 

55 ATCAAAAGAG CAAAAATGCC TCTGGTTTTG TGTTTGGGAG AAAAATATCT TGGACGCACT 6300 

GTTTTCCTTG ATAAAAGTCA TCTTCTCTAC TGTGTGAAAT GAATACTTGG AATTCTAATT 6360 

GTTTTGTGTG CCAGGGGCAG TAATGTCCCT GCCTCTTCTC CCAATCAAGG TTGAGGAGTG 6420 

GGGCTGGGGA GAGGACTTAA CTGACTTAAG AAGTAGGAAA ACAAAAACCT CTCTCCTCAG 6480 

r CCTTCCACCT CCAAGAGAGG AGGAAAAACA GTTGTCTGCT GTCTGTAATT CAGTTTGCGT 6540 

60 GTATTTTATG CTCATGCACC AACCCATACA GAGTAAATCT TTTATCAACT ATATACTGGT 6600 

GTTTAATAGA GAATGATTGT CTTCCGAGTT TTTTGGTTCC TTTTTTAACT GTGTTAAAGT 6660 

ACTTGAAATG TATTGACTGC TGACTATATT TTAAAAACAA AATGAAATAA TTTGAGTTGT 6720 

ATTACAGAGG TTGACATTGT TCAGGGATGG GACAAAGCCT TCTTCAATCC TTTTCATACT 6780 

^ _ ACTTAATGAT TTTGGTGCAG GAACCTGAGA TTTTCTGATT TATATTTCAT GATATTTCAC 6840 

65 ATTTGCTCTT CACAGCATGA GCATGAAGCC CAGTGGCACC AAATGGCTGG GTACAATCAA 6900 

GTGATATTTT GTAGCACCTC ACTATCTGAA AGGCCATGAG TTTTCAGATG ATTTCATTGA 6960 

GCTTCATTGC AGCCTGAAAT TTTAAAAAAG TTGTGTAATA CGCCAACCAG TCAAGTTGTG 7020 

TTTTGGCCAG AGATTTAGAT ATGTCCAATT TCCTGGCTCA TTTCATTGTG CTCTATGGGT 7080 

ACGTATAAAA AGCAAGAATT CTGTTTCCTA GGCAAACATT GCAACTCAGG GCTAAAGTCA 7140 

70 TCCAGTGAAA CTTTTAGAGC CAGAAGTAAC TTTGTCCCAG TCCTACAATG TGAAAAGAGT 7200 

GAATAGTTGC CTCTTTTTAG CCATTTTCAT GGCTGGTACA TATTCGTACG CATTACTTTT 7260 

CAGAATCAAT ACGCACTTTC AGATATTCTT ATTTTTATTC TCTTAAGTCT TTATTAACTT 7320 

TGGAGAGAGA AATGATGCAT CTTTTTATTT TAAATGAAGT AGATCAACAT GGTGGAACAA 7380 

AATGATAAAG AACAGAAAAC ATTTCAATAT ATTACTAATA ACTTTTTCCA ATATAAATCC 7440 

#5 TAAAATTCCT ATAACATAGT ATTTTACAGT TTTATGAAGC TTTCTATTGT GACTTTTATG 7500 

GAATTAAGAG ATGAAGAAGA TGAGATATTT TAGCATTTAT ATTTTTCAAA ATTATATGTA 7560 
TACTTAAAAA TAAAGTAACT TTATGC 7586 

Seq ID NO: C163 UNA Sequence 
80 Nucleic Acid Accession #: NM_000958 
Coding sequence: 389.. 1855 ~" 
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CGGCACAGCC 
GCTGGGACTC 
CTGACCCATC 
CGGTGTCCAA 
TCCAGACTGA 
TGACCCTGGG 
CCAAGGCTGC 
CTTGAGCCCC 
GGTGGTGGGC 
GACGACCTTC 
GGTGAGCCCG 
GTGCGAGTAC 
CGCCATGAGT 
GGACAAGCGA 
CGCGCTGCCC 
CATCGACTGG 
CAGCTCCTTC 
CCGCATGCAC 



CAGCGACTTT 
CATCTTACTC 
AGTATTCGTC 
TTTGCAGGCC 
CCTGAGAAAG 
TGGCGGGTCC 
TGCCATGTCA 
ATCTCAGACC 
CAGGAATTTG 
GAGGACTTTG 
CTTACTGGTG 
CCTGCAAGTC 
AAGGAAAGAA 
CATGTCACAT 



TCACACCTGA ACGCTGTCCT 
GTCTTTGAAG GAAAAAAAAT 
CCGCTGCACC TCTTGTTTCC 
AAATCGACAG CCACTGAGAC 
GCAGGACAAG GTGAAAGCAG 
GGCTGGTGAG GCTGCCACCG 
GCACCGCCAG CCACTATCAT 
GACCGGCTGA ACAGCCCAGT 
AACCTGGTGG CCATCGTGGT 
TACACGCTGG TATGTGGGCT 
GTGACCATCG CCACGTACAT 
AGCACCTTCA TTCTGCTCTT 
GTCGAGCGCT ACCTGGCCAT 
TTGGCGGGCC TCACGCTCTT 
AACATGGGTC TOGGTAGCTC 
ACCACCAACG TGACGGCGCA 
CTCATTCTCG CCACCGTCCT 
CGCCAGTTCA TGCGCCGCAC 
CCCGGGGCCA 
GGAGCTTCCG 
ATTGCCACCT CCCTGGTGGT 
AACCAGTTAT ATCAGCCAAG 
ATCCGAATTG CTTCTGTGAA 
ACAGTGCTCA GTAAAGCAAT 
CGCAGGGAGC GCTCCGGACA 
GGCCACTCTC GCTCCTTCAT 
CTCCTGCCAG ACCTCTCACT 
CTTCCAGGTG TGCCTGGCAT 
CGAATATCAG AGACCTCAGA 
GATGAGGCTG GTGGGAGCGG 
ACATTTCCCA GTGAAACACT 
ATACAGTACT GTTTCTGGAC 
TTAGCTGTGC TCAGAAGGGC 



CCCGCAGACG AGACCGGCGG 
AGCGAGTAAG AAATCCAGCA 
CAAGTTTTTG AAAGCTGGCA 
CGGCTTTGAG AAGCCGAAGA 
GTTGGAGGCG GGTCCAGGAC 
CTGCTGCCGC TACAGACCCA 
GTCCACTCCC GGGGTCAATT 
GACCATCCCG GCGGTGATGT 
GCTGTGCAAG TCGCGCAAGG 
GGCTGTCACC GACCTGTTGG 
GAAGGGCCAA TGGCCCGGGG 
CTTCAGCCTG TCCGGCCTCA 
CAACCATGCC TATTTCTACA 
TGCAGTCTAT GCGTCCAACG 
GCGGCTGCAG TACCCAGACA 
CGCCGCCTAC TCCTACATGT 
CTGCAACGTG CTTGTGTGCG 
CTCGCTGGGC ACCGAGCAGC 
CCCCGCTGCC TCCCCAGCCT 
CCGCATCGCG GGCGCCGAGA 
GCTCATCTGC TCCATCCCGC 
TTTGGAGCGA GAAGTCAGTA 
CCCCATCCTA GACCCCTGGA 
AGAGAAGATC AAATGCCTCT 
GCACTGCTCA GACAGTCAAA 
CTCCGGGGAG CTGAAGGAGA 
GCCAGACCTC AGTGAAAATG 
GGGCCTGGCC CAGGAAGACA 
CTCTTCACAG GGTCAGGACT 
CAGGGCTGGG CCTGCCCCTA 
GAACTTATCA GAAAAATGTA 
CCTTATAAAA TCCTGTGCAA 
TATCATCA 



GCACTGCAAA 
CCATTCTTCA 
ACTCTGACCT 
TTTGGCAGTT 
ATCTGAGGGC 
GCCTTGCACT 
CGTCCGCCTC 
TCATCTTCGG 
AGCAGAAGGA 
GCACTTTGTT 
GCCAGCCGCT 
GCATCATCTG 
GCCACTACGT 
TGCTCTTTTG 
CCTGGTGCTT 
ACGCGGGCTT 
GCGCGCTGCT 
ACCACGCGGC 
TGCCGCGCCT 
TCCAGATGGT 
TCGTGGTGCG 
AAAATCCAGA 
TATATATCCT 
TCTGCCGCAT 
GGACATCTTC 
TCAGCAGTAC 
GCCTTGGAGG 
CCACCTCACT 
CAGAGAGTGT 
AGGGGAGCTC 
TATAATAGGC 
TAGACACATA 



Seq ID NO: C164 DNA Sequence 

Nucleic Acid Accession ft: NM_Q02659.1 

Coding sequence: 427.. 1434 



1 
I 

CAGTATCCCT 

TATTTACCGT 

CCACGTTAGG 

ATCACAACTC 

GGAAGGAAGT 

GGGGCTGACT 

TCCTTCCTGA 

CGCGACATGG 

GCCTCTTGGG 

TGCGCCCTGG 

GAGCTGGAGC 

TATCGGACTG 

AACCAGGGCA 

TCCTGTGGCT 

AGCCCTGAAG 

CGTCCAAAGG 

AATGGTTTCC 

AACGAGGGCC 

TGCAAGGGGA 

GGCCCCATGA 

ATGGTAAGAG 

AGCATGAACC 

GATGTCCAGT 

ATCACCCTGC 

AATCCCCCTC 

GCCCTACAGA 

TCCCAGCTAT 

GAAGGCCGTG 

GTTGTTATTA 

TGG 



11 
1 

CCTGACAAAA 
CAAAGTTTTT 
AAGAGAGAGA 
CATGAGTCAG 

CGCTCTTTCG 
GGCCAGAAGG 
GTCACCCGCC 
GCCTGCGGTG 
GACAGGACCT 
TGGTGGAGAA 
GCTTGAAGAT 
ACTCTGGCCG 
CATCAGACAT 
AACAGTGCCT 
ATGACCGCCA 
ACAACAACGA 
CAATCCTGGA 
ACAGCACCCA 
ATCAATGTCT 
GCTGTGCAAC 
ACATTGATGT 
ACCGCAGTGG 
TAATGACTGC 
TCTGCCCTGG 
CTTGCTGTGT 
GAAAACAGCT 
GGCAATGGGA 
ATTAATATTC 



21 
I 

CTAACAAAAA 
ATCCTCATTT 
ACTGGGATTT 
GGCCGAGCCA 
GGAGGTTCGT 
CAAAACGTCT 
AGAGAAGACG 
GCTGCTGCCG 
CATGCAGTGT 
CTGCAGGACC 
AAGCPGTACC 
CACCAGCCTT 
GGCTGTCACC 
GAGCTGTGAG 
GGATGTGGTG 
CCTCCGTGGC 
CACCTTCCAC 
GCTTGAAAAT 
TGGATGCTCC 
GGTAGCCACC 
CGCCTCAATG 
CTCCTGCTGT 
GGCTGCTCCT 
CAGACTGTGG 
CTGGATCCGG 
GACCTCAGGC 
ATCTCACAAA 
GAGCTCTTGT 
ATATTATTTA 



31 
I 

TCCTGTTAGC 
TACAGCAGTG 
GCACCCAGGC 
GCCCCTTCAC 
ACGGGAGGAG 
GGGAGGAGTC 
TGCAGGGACC 
CTGCTGCTGC 
AAGACCAACG 
ACGATCGTGC 
CACTCAGAGA 
ACCGAGGTTG 
TATTCCCGAA 
AGGGGCCGGC 
ACCCACTGGA 
TGTGGCTACC 
TTCCTGAAAT 
CTGCCGCAGA 
TCTGAAGAGA 
GGCACTCACG 
TGCCAACATG 
ACTAAAAGTG 
CAGCCTGGCC 
GGAGGCACTC 
GGGACCCCTT 
CAGTGTGCCG 
GTTGTGTGAA 
TATTATTAAT 
TTTTATACTT 



41 
I 

CAAATAATCA 
GAGAGCGATT 
AATCTGGGGA 
CACCAGCCGG 
GGGGAGGCGC 
CCTGGGGCCA 
CCGCGCACAG 
TGCTCCACAC 
GGGATTGCCG 
GCTTGTGGGA 
AGACCAACAG 
TGTGTGGGTT 
GCCGTTACCT 
ACCAGAGCCT 
TCCAGGAAGG 
TTCCCGGCTG 
GCTGCAACAC 
ATGGCCGCCA 
CTTTCCTCAT 
AACCGAAAAA 
CCCACCTGGG 
GCTGTAACCA 
CTGCCCATCT 
TCCTCTGGAC 
TGCCCTTCCC 
ACCTCTCTGG 
GCAGAAGAGA 
ATTGTTGCCG 
ACATAAAGAT 




CAGAGCTGTG 
CCGCGCCCCG 
CCACGCATCT 
CAAAACTGCC 
GAGCTGCCCT 
CTGCGTCCCA 
TGTGGAAGAG 
AGAAGGAGAA 
GACCCTGAGC 
AGACTTGTGC 
CGAATGCATT 
GCAGTGCCGC 
TGAAGAAGGG 

CACCAAATGC 
GTGTTACAGC 
TGACTGCCGA 
CCAAAGCTAT 
TGACGCCTTC 
CCCAGACCTG 
CAGCCTCACC 
CTAAACCTGA 
TCGGCTCCCA 
GCCTCAGTTT 
AAAGCTGGAG 
CTGTTGTGTT 
TTTGTACCAG 



Seq ID NO: CI 6 5 DNA Sequence 
Nucleic Acid Accession ft: AK027843. 
Coding sequence: 193.. 1731 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1958 



TTGCTTGAGT 
CTAAATAGCA 
CTGTTACCAG 
TCGTATTTCC 
CCTCCAAACT 
TTTACTTTCT 
GTGAGTTATG 



11 
I 

CATCTTCTGA 
CATCACATGT 
GGACAAATGC 
AGATGGATTT 
TACTTGAGAA 
TCAACAAAAC 
TGATGGCGTG 



21 
1 

AGCTTTAAAA 
GAATATTACA 
AATTTCAAAT 
TGAGAGTGGA 
TTTAAGTCCA 
TGGACTTTTC 
CAGTATTGGA 



31 
I 

ACAATTGATG 
ACTCGGAACT 
TTTAGCATTG 
CAAGTGGATC 
GAAGATTCTG 
CAGGATGTAG 
AACATTACTA 



41 

I 

AATTGGCCTT 
TGGCTCTCAG 
GTCTTCCAAG 
CACTGGCATC 
TATTAGTTAG 
GACCCCAAAG 
TCCAGAATCT 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1743 



51 
I 

CAAGATAGAC 
CGTATCATCC 
CAATAATGAA 
TGTAATTTTG 
AAGAGCACAG 
AAAAACTTTA 
GAAGGATCCT 



60 

120 

180 

240 

300 

360 

420 
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GTTCAAATAA AAATCAAACA TACAAGAACT CAGGAAGTGC ATCATCCCAT CTGTGCCTTC 4 BO 

TGGGATCTGA ACAAAAACAA AAGTTTTGGA GGATGGAACA CGTCAGGATG TGTTGCACAC 540 

AGAGATTCAG ATGCAAGTGA GACAGTCTGC CTGTGTAACC ACTTCACACA CTTTGGAGTT 600 

CTGATGGACC TTCCAAGAAG TGCCTCACAG TTAGATGCAA GAAACACTAA AGTCCTCACT 660 

5 TTCATCAGCT ATATTGGGTG TGGAATATCT GCTATTTTTT CAGCAGCAAC TCTCCTGACA 720 

TATGTTGCTT TTGAGAAATT GCGAAGGGAT TATCCCTCCA AAATCTTGAT GAACCTGAGC 780 

ACAGCCCTGC TGTTCCTGAA TCTCCTCTTC CTCCTAGATG GCTGGATCAC CTCCTTCAAT 840 

GTGGATGGAC TTTGCATTGC TGTTGCAGTC CTGTTGCATT TCTTCCTTCT GGCAACCTTT 900 

ACCTGGATGG GGCTAGAAGC AATTCACATG TACATTGCTC TAGTTAAAGT ATTTAACACT 960 

10 TACATTCGCC GATACATTCT AAAATTCTGC ATCATTGGCT GGGGTTTGCC TGCCTTAGTG 1020 

GTGTCAGTTG TTCTAGCGAG CAGAAACAAC AATGAAGTCT ATGGAAAAGA AAGTTATGGG 1080 

AAAGAAAAAG GTGATGAATT CTGTTGGATT CAAGATCCAG TCATATTTTA TGTGACCTGT 1140 

GCTGGGTATT TTGGAGTCAT GTTTTTTCTG AACATTGCCA TGTTCATTGT GGTAATGGTG 1200 

CAGATCTGTG GGAGGAATGG CAAGAGAAGC AACCGGACCC TGAGAGAAGA AGTGTTAAGG 1260 

15 AACCTGCGCA GTGTGGTTAG CTTGACCTTT CTGTTGGGCA TGACATGGGG TTTTGCATTC 1320 

TTTGCCTGGG GACCCTTAAA TATCCCCTTC ATGTACCTCT TCTCCATCTT CAATTCATTA 1380 

CAAGGCTTAT TTATATTCAT CTTCCACTGT GCTATGAAGG AGAATGTTCA GAAACAGTGG 1440 

CGGCGGCATC TCTGCTGTGG TAGATTTCGG TTAGCAGATA ACTCAGATTG GAGTAAGACA 1500 

GCTACCAATA TCATCAAGAA AAGTTCTGAT AATCTAGGAA AATCTTTGTC TTCAAGCTCC 1560 

20 ATTGGTTCCA ACTCAACCTA TCTTACATCC AAATCTAAAT CCAGCTCTAC CACCTATTTC 1620 

AAAAGGAATA GCCACACAGA TAATGTCTCC TATGAGCATT CCTTCAACAA AAGTGGATCA 1680 

CTCAGACAGT GCTTCCATGG ACAAGTCCTT GTCAAAACTG GCCCATGCTG ATGGAGATCA 1740 

AACATCAATC ATCCCTGTCC ATCAGGTCAT TGATAAGGTC AAGGGTTATT GCAATGCTCA 1800 

TTCAGACAAC TTCTATAAAA ATATTATCAT GTCAGACACC TTCAGCCACA GCACAAAGTT 1860 

25 TTAATGTCTT TAAGAAAAAG AAATCAATCT GCAGAAATGT GAAGATTTGC AAGCAGTGTA 1920 

AACTGCAACT AGTGATGTAA ATGTGCTATT ACCTAGGTAA CTGCATATAT ATAAGGAATG 1980 

TATTTTGTTA AGAAGGCTTT TGTGAAATTC AGAATTTTTC TTTTTAATAT ATTTCTTCCA 2040 

TGGAAGAGTT GTCATCACTA AAACTTCAGT ACTGAGAGTA ACATGACTCA GTAGCCACAG 2100 

AAGCTATGAT TTGTAAAATA TATAATTGAA TCAGAGTAAT CATAATGCAG GGGAGACATT 2160 

30 CAAATTAGAG ACAAGGGAGA AGCAATGCTG AGGAAGACCC TAGATAGAGC TCATTTTACT 2220 

CCACCTAATC GTTATATCTG GATATACCCA TTTTCTGCAT CTTCTTTCTC AACAATAAAC 2280 

TGTCCTTGCT TTGGAGACTT TAAGACATTT CCTAAAGCAC AAATAAAAGC CTCGTATTTC 2340 

CCCATTGAGA GTTTTGTTCC AAGGAATATG AAGTGAGACA TATGGGTGAG TCATAATAAT 2400 

CAAAATAATT TATGAAGAGC TGGGTCTGCA ATAGCTAGTC TAAAAACTAC TTGTGTGTCA 2460 

35 GTCCTCTGGT TATAGTATAT AAGAGCCTGA GGAGGTCTGG CAAGATAGAT GGTGTATTAT 2520 

TTATGGATCA GGCTGCTGCA TACAAACCTT GCATACTATT ATGCAGCTTA CCTAACTCTC 2580 

AGACTATTCT GAGTAATGCT TGCTTGCTAA TGAATGTATA GGAGACCACA TTGTAATTGT 2640 

TCTTAGATGA TGGAGTCCAT GCAGTTTCTT AGAAATCGGT CTCAGTGCAT GCTGTGCTTT 2700 

TTCACATTTG CTCTGGGTTA TCTGGGAAGT ATCAGGTTCT GGGAGGCAAC AGCATTAAGT 2760 

40 GATAAGAAAA GGAGACATTC TGGCAAAGCC AATCTGCTTA AAGGCAAAGT CCAGAACCTG 2820 

GAACCTAGAG GCCTTTCTCT CTGCACGAAA AACAGGTAGT TTGCAGTCTG AGATATGGGA 2880 

GAGCTTTTAG GCTACACAGC AACCCAAGGG ACCTCTCACC TTTTGCTGAG CTTCAATCAG 2940 

GAAGCTATTT GCCTGGCTCC AGCAGATGAT GAGATAATGA GGTAGTGGGT TTTTTATTAC 3000 

TGTTCCATTT TGCAACATCC TGCAACACCA TCCTGGGAGA CAAGAGCATT ACCCAGCTTG 3060 

45 GCTTTCACGG GGGAGGGTTG TATTCAGT 3088 
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Seq ID NO: C166 DNA Sequence 

Nucleic Acid Accession ft: NM__000574.1 

Coding sequence! 6 6.. 12 11 



1 
I 

CCGCTGGGCG 
GCGCCATGAC 
CCCGGCTGCT 
CCCCAGATGT 
CTGTAATAAC 
TGATCTGCCT 
AGGTGCCAAC 
TTCCAGTCGG 
TATCACCAAA 
AAAAGAAATC 
GCATATTATT 
CGACTTCTAG 
AGTGCAGAGA 
AACGTGACCA 
TGATTGGAGA 
CACCACCTGA 
CTACCACAGT 
AAACCACCAC 
ATTTTCATGA 
TATCTGGGCA 
TGCTGACTTA 
AGTTTCTTAG 
ATGCTTTCAT 
GTCCTGGAAT 
TGAGAGTGAT 
GGGATCACGA 
CCACTTATAA 
CCCAATTCAG 
TCTTTGGCTG 
ACATGCTGGT 
AATCTTCCTT 
CTTAATGTCT 



11 
I 

TAGCTGCGAC 
CGTCGCGCGG 
GCTGCTGGTG 
ACCTAATGCC 
GTACAAATGT 
TAAGGGCAGT 
AAGGCTAAAT 
TACTGTTGTG 
ACTAACTTGC 
ATGCCCTAAT 
TGGTGCAACC 
TTTTTGTCTT 
AATTTATTGT 
TTATGGATAT 
GCACTCTATT 
ATGCAGAGGA 
AAATGTTCCA 
ACCAAATGCT 
AACAACCCCA 
CACGTGTTTC 
GCCAAAGAAG 
ACTTATCTGC 
TGTCTTTAAG 
CACATTCTTA 
TCCTTTCCTA 
GGAAAAGAGA 
AGGAAATAAA 
TCTCTTCTAA 
TAAGGCATTT 
GAACCAGGGG 
TGTTGSACAA 
TTAAAAGTAT 



21 
I 

TCGGCGGAGT 
CCGAGCGTGC 
CTGTTGTGCC 
CAGCCAGCTT 
GAAGAAAGCT 
CAATGGTCAG 
TCTGCATCCC 
GAATATGAGT 
CTTCAGAATT 
CCGGGAGAAA 
ATCTCCTTCT 
ATTTCAGGCA 
CCAGCACCAC 
AGACAGTCTG 
TATTGTACTG 
AAATCTCTAA 
ACTACAGAAG 
CAAGCAACAC 
AATAAAGGAA 
ACGTTGACAG 
AGTTAAGAAG 
ATATTGGATA 
ATGTGTTAGG 
GCACACCTAC 
AAAGTGTAAG 
AGGAAAGTGA 
AAATGAAAAA 
GCAAAATTGC 
TCATCTTTCC 
TGTTGATGGT 
ATAGAGTTTG 
CCAGAGATAC 



31 
I 

CCCGGCGGCG 
CCGOGGCGCT 
TGCCGGCCGT 
TGGAAGGCCG 
TTGTGAAAAT 
ATATTGAAGA 
TCAAACAGCC 
GCCGTCCAGG 
TAAAATGGTC 
TACGAAATGG 
CATGTAACAC 
GCTCTGTCCA 
CACAAATTGA 
TAACGTATGC 
TGAATAATGA 
CTTCCAAGGT 
TCTCACCAAC 
GGAGTACACC 
GTGGAACCAC 
GTTTGCTTGG 
AAAATACACA 
AAATAAATGC 
AATGTCAACA 
ACCTCTTGAA 
AAAGCATAGA 
TTTTTTTCCA 
CATTATTTGG 
TAAAGAGAGA 
TTCGGGTTGG 
GATAAGGGAG 
GAAAAAGCCT 
TACAATATTA 



41 
i 

OGTCCTTGTT 
GCCCCTCCTC 
GTGGGGTGAC 
TACAAGTTTT 
TCCTGGCGAG 
GTTCTGCAAT 
TTATATCACT 
TTACAGAAGA 
CACAGCAGTC 
TCAGATTGAT 
AGGGTACAAA 
GTGGAGTGAC 
CAATGGAATA 
ATGTAATAAA 
TGAAGGAGAG 
CCCACCAACA 
TTCTCAGAAA 
TGTTTCCAGG 
TTCAGGTACT 
GACGCTAGTA 
CAAGTATACA 
AATTGTGCTC 
GAGCAAGGAG 
AATAGAACAA 
GATTTGTTCG 
CAAGATCTGT 
ATATCAAAAG 
TGAACCACAT 
CAAAATATTT 
GAATATAGAA 
GTGAAAGGTG 
ACATAAGAAA 




CCCGAGGATA 
AAGGACTCAG 
CGTAGCTGCG 
CAGAATTATT 
GAACCTTCTC 
GAATTTTGTA 
GTACCAGGTG 
TTATTTGGCT 
CCGTTGCCAG 
ATTCAAGGGG 
GGATTCACCA 
TGGAGTGGCC 
GTTCAGAAAC 
ACCACCACAA 
ACAACCAAGC 
ACCCGTCTTC 
ACCATGGGCT 
GACTGTTCCT 
TTCATTTAGG 
AAAAAAGGCA 
CTTGCAGAAT 
TATTTAGAAT 
AATGTTATTT 
CAAATAAAAA 
TATAAAGTAA 
TAAAGGTAAA 
TGAAAGACTG 
TCTTCTTTGA 
AGATTATATA 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 



1303 



WO 03/042661 PCT7US02/36810 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 



70 
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TTATTTCTGA ATCGAGATGT CCATAGTCAA ATTTGTAAAT CTTATTCTTT TGTAATATTT 1980 

ATTTATATTT ATTTATGACA GTGAACATTC TGATTTTACA TGTAAAACAA GAAAAGTTGA 2040 

AGAAGATATG TGAAGAAAAA TGTATTTTTC CTAAATAGAA ATAAATGATC CCATTTTTTG 2100 

GT 2102 

Seq ID NO: C167 DNA Sequence 

Nucleic Acid Accession tf: Eos sequence 

Coding sequence : 1 . . 2651 

1 11 21 31 41 51 

I I I 1 I I 

ATGGACACCT CCCGGCTCGG TGTGCTCCTG TCCTTGCCTG TGCTGCTGCA GCTGGCGACC 60 

GGGGGCAGCT CTCCCAGGTC TGGTGTGTTG CTGAGGGGCT GCCCCACACA CTGTCATTGC 120 

GAGCCCGACG GCAGGATGTT GCTCAGGGTG GACTGCTCCG ACCTGGGGCT CTCGGAGCTG 180 

CCTTCCAACC TCAGCGTCTT CACCTCCTAC CTAGACCTCA GTATGAACAA CATCAGTCAG 240 

CTGCTCCCGA ATCCCCTGCC CAGTCTCCGC TTCCTGGAGG AGTTACGTCT TGCGGGAAAC 300 

GCTCTGACAT ACATTCCCAA GGGAGCATTC ACTGGCCTTT ACAGTCTTAA AGTTCTTATG 360 

CTGCAGAATA ATCAGCTAAG ACACGTACCC ACAGAAGCTC TGCAGAATTT GCGAAGCCTT 420 

CAATCCCTGC GTCTGGATGC TAACCACATC AGCTATGTGC CCCCAAGCTG TTTCAGTGGC 480 

CTGCATTCCC TGAGGCACCT GTGGCTGGAT GACAATGCGT TAACAGAAAT CCCCGTCCAG 540 

GCTTTTAGAA GTTTATCGGC ATTGCAAGCC ATGACCTTGG CCCTGAACAA AATACACCAC 600 

ATACCAGACT ATGCCTTTGG AAACCTCTCC AGCTTGGTAG TTCTACATCT CCATAACAAT 660 

AGAATCCACT CCCTGGGAAA GAAATGCTTT GATGGGCTCC ACAGCCTAGA GACTTTAGAT 720 

TTAAATTACA ATAACCTTGA TGAATTCCCC ACTGCAATTA GGACACTCTC CAACCTTAAA 780 

GAACTACATT TCTATGACAA TCCCATCCAA TTTGTTGGGA GATCTGCTTT TCAACATTTA 840 

CCTGAACTAA GAACACTGAC TCTGAATGGT GCCTCACAAA TAACTGAATT TCCTGATTTA 900 

ACTGGAACTG CAAACCTGGA GAGTCTGACT TTAACTGGAG CACAGATCTC ATCTCTTCCT 960 

CAAACCGTCT GCAATCAGTT ACCTAATCTC CAAGTGCTAG ATCTGTCTTA CAACCTATTA 1020 

GAAGATTTAC CCAGTTTTTC AGTCTGCCAA AAGCTTCAGA AAATTGACCT AAGACATAAT 1080 

GAAATCTACG AAATTAAAGT TGACACTTTC CAGCAGTTGC TTAGCCTCCG ATCGCTGAAT 1140 

TTGGCTTGGA ACAAAATTGC TATTATTCAC CCCAATGCAT TTTCCACTTT GCCATCCCTA 1200 

ATAAAGCTGG ACCTATCGTC CAACCTCCTG TCX3TCTTTTC CTATAACTGG GTTACATGGT 1260 

TTAACTCACT TAAAATTAAC AGGAAATCAT GCCTTACAGA GCTTGATATC ATCTGAAAAC 1320 

TTTCCAGAAC TCAAGGTTAT AGAAATGCCT TATGCTTACC AGTGCTGTGC ATTTGGAGTG 1380 

TGTGAGAATG CCTATAAGAT TTCTAATCAA TGGAATAAAG GTGACAACAG CAGTATGGAC 1440 

GACCTTCATA AGAAAGATGC TGGAATGTTT CAGGCTCAAG ATGAACGTGA CCTTGAAGAT 1500 

TTCCTGCTTG ACTTTGAGGA AGACCTGAAA GCCCTTCATT CAGTGCAGTG TTCACCTTCC 1560 

CCAGGCCCCT TCAAACCCTG TGAACACCTG CTTGATGGCT GGCTGATCAG AATTGGAGTG 1620 

TGGACCATAG CAGTTCTGGC ACTTACTTGT AATGCTTTGG TGACTTCAAC AGTTTTCAGA 1680 

TCCCCTCTGT ACATTTCCCC CATTAAACTG TTAATTGGGG TCATCGCAGC AGTGAACATG 1740 

CTCACGGGAG TCTCCAGTGC CGTGCTGGCT GGTGTGGATG CGTTCACTTT TGGCAGCTTT 1800 

GCACGACATG GTGCCTGGTG GGAGAATGGG GTTGGTTGCC ATGTCATTGG TTTTTTGTCC 1860 

ATTTTTGCTT CAGAATCATC TGTTTTCCTG CTTACTCTGG CAGCCCTGGA GCGTGGGTTC 1920 

TCTGTGAAAT ATTCTGCAAA ATTTGAAACG AAAGCTCCAT TTTCTAGCCT GAAAGTAATC 1980 

ATTTTGCTCT GTGCCCTGCT GGCCTTGACC ATGGCCGCAG TTCCCCTGCT GGGTGGCAGC 2040 

AAGTATGGCG CCTCCCCTCT CTGCCTGCCT TTGCCTTTTG GGGAGCCCAG CACCATGGGC 2100 

TACATGGTCG CTCTCATCTT GCTCAATTCC CTTTGCTTCC TCATGATGAC CATTGCCTAC 2160 

ACCAAGCTCT ACTGCAATTT GGACAAGGGA GACCTGGAGA ATATTTGGGA CTGCTCTATG 2220 

GTAAAACACA TTGCOCTGTT GCTCTTCACC AACTGCATCC TAAACTGCCC TGTGGCTTTC 2280 

TTGTCCTTCT CCTCTTTAAT AAACCTTACA TTTATCAGTC CTGAAGTAAT TAAGTTTATC 2340 

CTTCTGGTGG TAGTCOCACT TCCTGCATGT CTCAATCCCC TTCTCTACAT CTTGTTCAAT 2400 

CCTCACTTTA AGGAGGATCT GGTGAGCCTG AGAAAGCAAA CCTACGTCTG GACAAGATCA 2460 

AAACACCCAA GCTTGATGTC AATTAACTCT GATGATGTCG AAAAACAGTC CTGTGACTCA 2520 

ACTCAAGCCT TGGTAACCTT TACCAGCTCC AGCATCACTT ATGACCTGCC TCCCAGTTCC 2580 

GTGCCATCAC CAGCTTATCC AGTGACTGAG AGCTGCCATC TTTCCTCTGT GGCATTTGTC 2640 

CCATGTCTTA A 2651 

Seq ID NO: C168 DNA Sequence 

Nucleic Acid Accession #: NM_003667.2 

Coding sequence: 4 9.. 27 72 



1 11 21 31 41 51 

I I I I I I 

TGCTGCTCTC CGCCCGCGTC CGGCTCGTGG CCCCCTACTT CGGGCACCAT GGACACCTCC 60 

CGGCTCGGTG TGCTCCTGTC CTTGCCTGTG CTGCTGCAGC TGGOGACCGG GGGCAGCTCT 120 

CCCAGGTCTG GTGTGTTGCT GAGGGGCTGC CCCACACACT GTCATTGOGA GCCCGACGGC 180 

AGGATGTTGC TCAGGGTGGA CTGCTCCGAC CTGGGGCTCT CGGAGCTGCC TTCCAACCTC 240 

AGCGTCTTCA CCTCCTACCT AGACCTCAGT ATGAACAACA TCAGTCAGCT GCTCCCGAAT 300 

CCCCTGCCCA GTCTCCGCTT CCTGGAGGAG TTACGTCTTG CGGGAAACGC TCTGACATAC 360 

ATTCCCAAGG GAGCATTCAC TGGCCTTTAC AGTCTTAAAG TTCTTATGCT GCAGAATAAT 420 

CAGCTAAGAC ACGTACCCAC AGAAGCTCTG CAGAATTTGC GAAGCCTTCA ATCCCTGCGT 480 

CTGGATGCTA ACCACATCAG CTATGTGCCC CCAAGCTGTT TCAGTGGCCT GCATTCCCTG 540 

AGGCACCTGT GGCTGGATGA CAATGCGTTA ACAGAAATCC CCGTCCAGGC TTTTAGAAGT 600 

TTATCGGCAT TGCAAGCCAT GACCTTGGCC CTGAACAAAA TACACCACAT ACCAGACTAT 660 

GCCTTTGGAA ACCTCTCCAG CTTGGTAGTT CTACATCTCC ATAACAATAG AATCCACTCC 720 

CTGGGAAAGA AATGCTTTGA TGGGCTCCAC AGCCTAGAGA CTTTAGATTT AAATTACAAT 780 

AACCTTGATG AATTCCCCAC TGCAATTAGG ACACTCTCCA ACCTTAAAGA ACTAGGATTT 840 

CATAGCAACA ATATCAGGTC GATACCTGAG AAAGCATTTG TAGGCAACCC TTCTCTTATT 900 

ACAATACATT TCTATGACAA TCCCATCCAA TTTGTTGGGA GATCTGCTTT TCAACATTTA 960 

OU CCTGAACTAA GAACACTGAC TCTGAATGGT GCCTCACAAA TAACTGAATT TCCTGATTTA 1020 

ACTGGAACTG CAAACCTGGA GAGTCTGACT TTAACTGGAG CACAGATCTC ATCTCTTCCT 1080 

CAAACCGTCT GCAATCAGTT ACCTAATCTC CAAGTGCTAG ATCTGTCTTA CAACCTATTA 1140 

GAAGATTTAC CCAGTTTTTC AGTCTGCCAA AAGCTTCAGA AAATTGACCT AAGACATAAT 1200 

GAAATCTACG. AAATTAAAGT TGACACTTTC CAGCAGTTGC TTAGCCTCCG ATCGCTGAAT 1260 
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TTGGCTTGGA 
ATAAAGCTGG 
TTAACTCACT 
TTTCCAQAAC 
TGTGAGAATG 
GACCTTCATA 
TTCCTGCTTG 
CCAGGCCCCT 
TGQACCATAG 
TCCCCTCTGT 
CTCACGGGAG 
GCACGACATG 
ATTTTTGCTT 
TCTGTGAAAT 
ATTTTGCTCT 
AAGTATGGCG 
TACATGGTCG 
ACCAAGCTCT 
GTAAAACACA 
TTGTCCTTCT 
CTTCTGGTGG 
CCTCACTTTA 
AAACACCCAA 
ACTCAAGCCT 
GTGCCATCAC 
CCATGTCTCT 
GAGATTGAGT 



ACAAAATTGC 
ACCTATCGTC 
TAAAATTAAC 
TCAAGGTTAT 
CCTATAAGAT 
AGAAAGATGC 
ACTTTGAGGA 
TCAAACCCTG 
CAGTTCTGGC 
ACATTTCCCC 
TCTCCAGTGC 
GTGCCTGGTG 
CAGAATCATC 
ATTCTGCAAA 
GTGCCCTGCT 



CTCTCATCTT 
ACTGCAATTT 
TTGCCCTGTT 
CCTCTTTAAT 
TAGTCCCACT 
AGGAGGATCT 
GCTTGATGTC 
TGGTAACCTT 
CAGCTTATCC 
AATTAATATG 
ATATCAGAGC 



TATTATTCAC 
CAACCTCCTG 
AGGAAATCAT 
AGAAATGCCT 
TTCTAATCAA 
TGGAATGTTT 
AGACCTGAAA 
TGAACACCTG 
ACTTACTTGT 
CATTAAACTG 
CGTGCTGGCT 
GGAGAATGGG 
TGTTTTCCTG 
ATTTGAAACG 
GGCCTTGACC 
CTGCCTGCCT 
GCTCAATTCC 
GGACAAGGGA 
GCTCTTCACC 
AAACCTTACA 
TCCTGCATGT 
GGTGAGCCTG 
AATTAACTCT 
TACCAGCTCC 
AGTGACTGAG 
TGAAGGAAAA 
AGTAATTAAT 



CCCAATGCAT 
TCGTCTTTTC 
GCCTTACAGA 
TATGCTTACC 
TGGAATAAAG 
CAGGCTCAAG 
GCCCTTCATT 
CTTGATGGCT 
AATGCTTTGG 
TTAATTGGGG 
GGTGTGGATG 



CTTACTCTGG 
AAAGCTCCAT 
ATGGCCGCAG 
TTGCCTTTTG 
CTTTGCTTCC 
GACCTGGAGA 
AACTGCATCC 
TTTATCAGTC 
CTCAATCCCC 
AGAAAGCAAA 
GATGATGTOG 
AGCATCACTT 
AGCTGCCATC 
TGTTTTCAAA 
AAGAAGAGCT 



TTTCCACTTT 
CTATAACTGG 
GCTTGATATC 
AGTGCTGTGC 
GTGACAACAG 
ATGAACGTGA 
CAGTGCAGTG 
GGCTGATCAG 
TGACTTCAAC 
TCATCGCAGC 
CGTTCACTTT 
ATGTCATTGG 
CAGCCCTGGA 
TTTCTAGCCT 
TTCCCCTGCT 
GGGAGCCCAG 
TCATGATGAC 
ATATTTGGGA 
TAAACTGCCC 
CTGAAGTAAT 
TTCTCTACAT 
CCTACGTCTG 
AAAAACAGTC 
ATGACCTGCC 
TTTCCTCTGT 
GGTTGAGAAC 
GAGGTGAAAC 



GCCATCCCTA 
GTTACATGGT 
ATCTGAAAAC 
ATTTGGAGTG 
CAGTATGGAC 
CCTTGAAGAT 
TTCACCTTCC 
AATTGGAGTG 
AGTTTT CAGA 
AGTGAACATG 
TGGCAGCTTT 
TTTTTTGTCC 
GCGTGGGTTC 
GAAAGTAATC 
GGGTGGCAGC 
CACCATGGGC 
CATTGCCTAC 
CTGCTCTATG 
TGTGGCTTTC 
TAAGTTTATC 
CTTGTTCAAT 
GACAAGATCA 
CTGTGACTCA 
TCCCAGTTCC 
GGCATTTGTC 
CTGAAAATGT 
TCGGTTTAAA 



Seq ID NO: CI 6 9 DNA Sequence 
Nucleic Acid Accession #: NMJ>03506. 
Coding sequence: 259.-2379 

31 



1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 



1 11 21 31 41 51 

I I I I I I 

GCAGCTCCAG TCCCGGACGC AACCCCGGAG CCGTCTCAGG TCCCTGGGGG GAACGGTGGG 60 

TTAGACGGGG ACGGGAAGGG ACAGCGGCCT TCGACCGCCC CCCGAGTAAT TGACCCAGGA 120 

CTCATTTTCA GGAAAGCCTG AAAATGAGTA AAATAGTGAA ATGAGGAATT TGAACATTTT 180 

ATCTTTGGAT GGGGATCTTC TGAGGATGCA AAGAGTGATT CATCCAAGCC ATGTGGTAAA 240 

ATCAGGAATT TGAAGAAAAT GGAGATGTTT ACATTTTTGT TGACGTGTAT TTTTCTACCC 300 

CTCCTAAGAG GGCACAGTCT CTTCACCTGT GAACCAATTA CTGTTCCCAG ATGTATGAAA 360 

ATGGCCTACA ACATGACGTT TTTCCCTAAT CTGATGGGTC ATTATGACCA GAGTATTGCC 420 

GCGGTGGAAA TGGAGCATTT TCTTCCTCTC GCAAATCTGG AATGTTCACC AAACATTGAA 480 

ACTTTCCTCT GCAAAGCATT TGTACCAACC TGCATAGAAC AAATTCATGT GGTTCCACCT 540 

TGTCGTAAAC TTTGTGAGAA AGTATATTCT GATTGCAAAA AATTAATTGA CACTTTTGGG 600 

ATCCGATGGC CTGAGGAGCT TGAATGTGAC AGATTACAAT ACTGTGATGA GACTGTTCCT 660 

GTAACTTTTG ATCCACACAC AGAATTTCTT GGTCCTCAGA AGAAAACAGA ACAAGTCCAA 720 

AGAGACATTG GATTTTGGTG TCCAAGGCAT CTTAAGACTT CTGGGGGACA AGGATATAAG 780 

TTTCTGGGAA TTGACCAGTG TGCGCCTCCA TGCCCCAACA TGTATTTTAA AAGTGATGAG 840 

CTAGAGTTTG CAAAAAGTTT TATTGGAACA GTTTCAATAT TTTGTCTTTG TGCAACTCTG 900 

TTCACATTCC TTACTTTTTT AATTGATGTT AGAAGATTCA GATACCCAGA GAGACCAATT 960 

ATATATTACT CTGTCTGTTA CAGCATTGTA TCTCTTATGT ACTTCATTGG ATTTTTGCTG 1020 

GGCGATAGCA CAGCCTGCAA TAAGGCAGAT GAGAAGCTAG AACTTGGTGA CACTGTTGTC 1080 

CTAGGCTCTC AAAATAAGGC TTGCACCGTT TTGTTCATGC TTTTGTATTT TTTCACAATG 1140 

GCTGGCACTG TGTGGTGGGT GATTCTTACC ATTACTTGGT TCTTAGCTGC AGGAAGAAAA 1200 

TGGAGTTGTG AAGCCATCGA GCAAAAAGCA GTGTGGTTTC ATGCTGTTGC ATGGGGAACA 1260 

CCAGGTTTCC TGACTGTTAT GCTTCTTGCT CTGAACAAAG TTGAAGGAGA CAACATTAGT 1320 

GGAGTTTGCT TTGTTGGCCT TTATGACCTG GATGCTTCTC GCTACTTTGT ACTCTTGCCA 13 B0 

CTGTGCCTTT GTGTGTTTGT TGGGCTCTCT CTTCTTTTAG CTGGCATTAT TTCCTTAAAT 1440 

CATGTTCGAC AAGTCATACA ACATGATGGC CGGAACCAAG AAAAACTAAA GAAATTTATG 1500 

ATTCGAATTG GAGTCTTCAG OGGCTTGTAT CTTGTGCCAT TAGTGACACT TCTCGGATGT 1560 

TACGTCTATG AGCAAGTGAA CAGGATTACC TGGGAGATAA CTTGGGTCTC TGATCATTGT 1620 

CGTCAGTACC ATATCCCATG TCCTTATCAG GCAAAAGCAA AAGCTCGACC AGAATTGGCT 1680 

TTATTTATGA TAAAATACCT GATGACATTA ATTGTTGGCA TCTCTGCTGT CTTCTGGGTT 1740 

GGAAGCAAAA AGACATGCAC AGAATGGGCT GGGTTTTTTA AACGAAATCG CAAGAGAGAT 1800 

CCAATCAGTG AAAGTCGAAG AGTACTACAG GAATCATGTG AGTTTTTCTT AAAGCACAAT 1860 

TCTAAAGTTA AACACAAAAA GAAGCACTAT AAACCAAGTT CACACAAGCT GAAGGTCATT 1920 

TCCAAATCCA TGGGAACCAG CACAGGAGCT ACAGCAAATC ATGGCACTTC TGCAGTAGCA 1980 

ATTACTAGCC ATGATTACCT AGGACAAGAA ACTTTGACAG AAATCCAAAC CTCACCAGAA 2040 

ACATCAATGA GAGAGGTGAA AGCGGACGGA GCTAGCACCC CCAGGTTAAG AGAACAGGAC 2100 

TGTGGTGAAC CTGCCTCGCC AGCAGCATCC ATCTCCAGAC TCTCTGGGGA ACAGGTCGAC 2160 

GGGAAGGGCC AGGCAGGCAG TGTATCTGAA AGTGCGCGGA GTGAAGGAAG GATTAGTCCA 2220 

AAGAGTGATA TTACTGACAC TGGCCTGGCA CAGAGCAACA ATTTGCAGGT CCCCAGTTCT 2280 

TCAGAACCAA GCAGCCTCAA AGGTTCCACA TCTCTGCTTG TTCACCCAGT TTCAGGAGTG 2340 

AGAAAAGAGC AGGGAGGTGG TTGTCATTCA GATACTTGAA GAACATTTTC TCTCGTTACT 2400 

CAGAAGCAAA TTTGTGTTAC ACTGGAAGTG ACCTATGCAC TGTTTTGTAA GAATCACTGT 2460 

TACGTTCTTC TTTTGCACTT AAAGTTGCAT TGCCTACTGT TATACTGGAA AAAATAGAGT 2520 

TCAAGAATAA TATGACTCAT TTCACACAAA GGTTAATGAC AACAATATAC CTGAAAACAG 2SB0 

AAATGTGCAG GTTAATAATA TTTTTTTAAT AGTGTGGGAG GACAGAGTTA GAGGAATCTT 2640 

CCTTTTCTAT TTATGAAGAT TCTACTCTTG GTAAGAGTAT TTTAAGATGT ACTATGCTAT 2700 

TTTACCTTTT TGATATAAAA TCAAGATATT TCTTTGCTGA AGTATTTAAA TCTTATCCTT 2760 

GTATCTTTTT ATACATATTT GAAAATAAGC TTATATGTAT TTGAACTTTT TTGAAATCCT 2820 

ATTCAAGTAT TTTTATCATG CTATTGTGAT ATTTTAGCAC TTTGGTAGCT TTTACACTGA 2880 

ATTTCTAAGA AAATTGTAAA ATAGTCTTCT TTTATACTGT AAAAAAAGAT ATACCAAAAA 2940 

GTCTTATAAT AGGAATTTAA CTTTAAAAAC CCACTTATTG ATACCTTACC ATCTAAAATG 3000 
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TGTGATTTTT ATAGTCTOGT TTTAGGAATT TCACAGATCT AAATTATGTA ACTGAAATAA 
GGTGCTTACT CAAAGAGTGT CCACTATTQA TTGTATTATG CTGCTCACTG ATCCTTCTGC 
ATATTTAAAA TAAAATGTCC TAAAGGGTTA GTAGACAAAA TGTTAGTCTT TTGTATATTA 
GGCCAAGTGC AATTGACTTC CCTTTTTTAA TGTTTCATGA CCACCCATTG ATTGTATTAT 
AACCACTTAC AGTTGCTTAT ATTTTTTGTT TTAACTTTTG TTTCTTAACA TTTAGAATAT 
TACATTTTGT ATTATACAGT ACCTTTCTCA GACATTTTGT AG 

Seq ID NO: C170 DNA Sequence 
Nucleic Acid Accession #: NMJ>00582 
Coding sequence: 6 8-990 



3060 
3120 
3180 
3240 
3300 
3342 



GCAGAGCACA 
AACGCCGACC 
GGCATCACCT 
CTTTACAACA 
CAGAATCTCC 
GATGATATGG 
AACGACTCTG 
TCTGATGAAT 
TTCACTCCAG 
GGACTGAGGT 
GAOGAGGACA 
CCCGTTGCCC 
GAAACGAGTC 
TATAAGCGGA 
CTTTCCAAAG 
GTTGTAGACC 
TTAGATAGTG 
ATTTAGTCAA 
CTCAGTTTAT 
ATTAGTTTAG 
CTATGTTCAT 
TCATGAATAG 
ATAACATTTT 
TATCTTTTTG 
AATTGCTTAT 
GCCTAAAAAA 



11 

I 

GCATCGTCGG 
AAGGAAAACT 
GTGCCATACC 
AATACCCAGA 
TAGCCCCACA 
ATGATGAAGA 
ATGATGTAGA 
CTGATGAACT 
TTGTCCCCAC 
CAAAATCTAA 
TCACCTCACA 
AGGACCTGAA 
AGCTGGATGA 
AAGCCAATGA 
TCAGCCGTGA 
CCAAAAGTAA 
CATCTTCTGA 
AAGAAAAAAT 
TGGTTGAATG 
TTTGTGGCTT 
TCTATAGAAG 
AAATTTATGT 
ATGTCACTAT 
TGGTGTGAAT 
TTGTTTTCCC 
AAAAAAAAAA 



21 
I 

GACCAGACTC 
CACTACCATG 
AGTTAAACAG 
TGCTGTGGCC 
GACCCTTCCA 
TGATGATGAC 
TGACACTGAT 
GGTCACTGAT 
AGTAGACACA 
GAAGTTTCGC 
CATGGAAAGC 
CGCGCCTTCT 
CCAGAGTGCT 
TGAGAGCAAT 
ATTCCACAGC 
GGAAGAAGAT 
GGTCAATTAA 
GCTTTATAGC 
TGTATCTATT 
CATGGAAACT 
AAATGCAAAC 
AGAAGCAAAC 
AATCTTTTGT 
AAATCTTTTA 
ACGGTTGTCC 
AAAA 



31 
I 

GTCTCAGGCC 
AGAATTGCAG 
GCTGATTCTG 
ACATGGCTAA 
AGTAAGTCCA 
CATGTGGACA 
GATTCTCACC 
TTTCCCACGG 
TATGATGGCC 
AGACCTGACA 
GAGGAGTTGA 
GATTGGGACA 
GAAACCCACA 
GAGCATTCCG 
CATGAATTTC 
AAACACCTGA 



AAAATGAAAG 
TGAGTCTGGA 
CCCTGTAAAC 
TATCACTGTA 
AAAATACTTT 
TTTTTAAGTT 
TCTTGAATGT 
AGCAATTAAT 



41 
I 

AGTTGCAGCC 
TGATTTGCTT 
GAAGTTCTGA 
ACCCTGACCC 
ACGAAAGCCA 
GCCAGGACTC 
AGTCTGATGA 
ACCTGCCAGC 
GAGGTGATAG 
TCCAGTACCC 
ATGGTGCATA 
GCCGTGGGAA 
GCCACAAGCA 
ATGTGATTGA 
ACAGCCATGA 
AATTTCGTAT 
AATACAATTT 
AGAACATGAA 
AATAACTAAT 
TAAAAGCTTC 
TTTTAATATT 
TACCCACTTA 
AGTGTATATT 
AATAAGAATT 
AAAACATAAC 



51 
I 

TTCTCAGCCA 
TTGCCTCCTA 
GGAAAAGCAG 
ATCTCAGAAG 
TGACCACATG 
CATTGACTOG 
GTCTCACCAT 
AACCGAAGTT 
TGTGGTTTAT 
TGATGCTACA 
CAAGGCCATC 
GGACAGTTAT 
GTCCAGATTA 
TAGTCAGGAA 
AGATATGCTG 
TTCTCATGAA 
CTCACTTTGC 
ATGCTTCTTT 
GTGTTTGATA 
AGGGTTATGT 
TGTTATTCTC 
AAAAGAGAAT 
TTGTTGTGAT 
TGGTGGTGTC 
CTTTTTTACT 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1524 



Seq ID NO: CI 71 DNA Sequence 
Nucleic Acid Accession #: NM_002B21 
Coding sequence: 150.. 3362 

1 11 21 31 41 51 

I I I I I 1 

AACTCCCGCC TCGGGAOGCC TCGGGGTCGG GCTCCGGCTG CGGCTGCTGC TGCGGCGCCC 60 

GCGCTCCGGT GCGTCCGCCT CCTGTGCCOG CCGCGGAGCA GTCTGCGGCC CGCCGTGCGC 120 

CCTCAGCTCC TTTTCCTGAG CCOGCCGOGA TGGGAGCTGC GCGGGGATCC CCGGCCAGAC 180 

CCCGCCGGTT GCCTCTGCTC AGCGTCCTGC TGCTGCCGCT GCTGGGCGGT ACCCAGACAG 240 

CCATTGTCTT CATCAAGCAG CCGTCCTCCC AGGATGCACT GCAGGGGCGC CGGGCGCTGC 300 

TTCGCTGTGA GGTTGAGGCT CCGGGCCCGG TACATGTGTA CTGGCTGCTC GATGGGGCCC 360 

CTGTCCAGGA CACGGAGCGG CGTTTCGCCC AGGGCAGCAG CCTGAGCTTT GCAGCTGTGG 420 

ACCGGCTGCA GGACTCTGGC ACCTTCCAGT GTGTGGCTCG GGATGATGTC ACTGGAGAAG 480 

AAGCCCGCAG TGCCAACGCC TCCTTCAACA TCAAATGGAT TGAGGCAGGT CCTGTGGTCC 540 

TGAAGCATCC AGCCTCGGAA GCTGAGATCC AGCCACAGAC CCAGGTCACA CTTCGTTGCC 600 

ACATTGATGG GCACCCTCGO CCCACCTACC AATGGTTCCG AGATGGGACC CCCCTTTCTG 660 

ATGGTCAGAG CAACCACACA GTCAGCAGCA AGGAGCGGAA CCTGAOGCTC CGGCCAGCTG 720 

GTCCTGAGCA TAGTGGGCTG TATTCCTGCT GCGCCCACAG TGCTTTTGGC CAGGCTTGCA 780 

GCAGCCAGAA CTTCACCTTG AGCATTGCTG ATGAAAGCTT TGCCAGGGTG GTGCTGGCAC 840 

CCCAGGACGT GGTAGTAGCG AGGTATGAGG AGGCCATGTT CCATTGCCAG TTCTCAGCCC 900 

AGCCACCCCC GAGCCTGCAG TGGCTCTTTG AGGATGAGAC TCCCATCACT AACCGCAGTC 960 

GCCCCCCACA CCTCCGCAGA GCCACAGTGT TTGCCAACGG GTCTCTGCTG CTGACCCAGG 1020 

TCCGGCCACG CAATGCAGGG ATCTACCGCT GCATTGGCCA GGGGCAGAGG GGCCCACCCA 1080 

TCATCCTGGA AGCCACACTT CACCTAGCAG AGATTGAAGA CATGCCGCTA TTTGAGCCAC 1140 

GGGTGTTTAC AGCTGGCAGC GAGGAGCGTG TGACCTGCCT TCCCCCCAAG GGTCTGCCAG 1200 

. AGCCCAGCGT GTGGTGGGAG CACGCGGGAG TCCGGCTGCC CACCCATGGC AGGGTCTACC 1260 

AGAAGGGCCA CGAGCTGGTG TTGGCCAATA TTGCTGAAAG TGATGCTGGT GTCTACACCT 1320 

GCCACGCGGC CAACCTGGCT GGTCAGCGGA GACAGGATGT CAACATCACT GTGGCCACTG 1380 

TGCCCTCCTG GCTGAAGAAG CCCCAAGACA GCCAGCTGGA GGAGGGCAAA CCCGGCTACT 1440 

TGGATTGCCT GACCCAGGCC ACACCAAAAC CTACAGTTGT CTGGTACAGA AACCAGATGC 1500 

TCATCTCAGA GGACTCACGG TTCGAGGTCT TCAAGAATGG GACCTTGCGC ATCAACAGCG 1560 

TGGAGGTGTA TGATGGGACA TGGTACCGTT GTATGAGCAG CACCCCAGCC GGCAGCATOG 1620 

AGGCGCAAGC CCGTGTCCAA GTGCTGGAAA AGCTCAAGTT CACACCACCA CCCCAGCCAC 1680 

AGCAGTGCAT GGAGTTTGAC AAGGAGGCCA CGGTGCCCTG TTCAGCCACA GGCCGAGAGA 1740 

AGCCCACTAT TAAGTGGGAA CGGGCAGATG GGAGCAGCCT CCCAGAGTGG GTGACAGACA 1800 

ACGCTGGGAC CCTGCATTTT GCCCGGGTGA CTCGAGATGA CGCTGGCAAC TACACT TGCA 1860 

TTGCCTCCAA OGGGCOGCAG GGCCAGATTC GTGCCCATGT CCAGCTCACT GTGGCAGTTT 1920 

TTATCACCTT CAAAGTGGAA CCAGAGCGTA CGACTGTGTA CCAGGGCCAC ACAGCCCTAC 1980 

TGCAGTGCGA GGCCCAGGGG GACCCCAAGC CGCTGATTCA GTGGAAAGGC AAGGACCGCA 2040 

TCCTGGACCC CACCAAGCTG GGACCCAGGA TGCACATCTT CCAGAATGGC TCCCTGGTGA 2100 

TCCATGACGT GGCCCCTGAG GACTCAGGCC GCTACACCTG CATTGCAGGC AACAGCTGCA 2160 

ACATCAAGCA CACGGAGGCC CCCCTCTATG TCGTGGACAA GCCTGTGCCG GAGGAGTCGG 2220 

AGGGCCCTGG CAGCCCTCCC CCCTACAAGA TGATCCAGAC CATTGGGTTG TCGGTGGGTG 2280 

CCGCTGTGGC CTACATCATT GCCGTGCTGG GCCTCATGTT CTACTGCAAG AAGCGCTGCA 2340 

1306 



WO 03/042661 



PCT/US02/36810 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 

80 



AAGCCAAGCG 
GAGGGCCTTT 
GCTTGGGCTC 
TCCCACGGTC 
TCCTGGCAAA 
GCCTGCAGAC 
GGAAGCTGAA 
ACTACATGGT 
AGAGCAAGGA 
GCACCCAGGT 
TGGCTGCGOG 
TCAGCAAGGA 
GCTGGATGTC 
CCTTOGGTGT 
CAGATGATGA 
GCTGCCCTTC 
GGCCCTCCTT 
GAGGAGGGAG 
CAGCATGATG 
TTGCTGAGGT 
GGCTGACTTG 
CTCTTCCTCT 
TTCTCCCCTT 
AGGCTTGGGA 
AGGGTTAATG 
ACACAGCAAG 
CCCCACCCTT 
CTTTTGACAC 
TGCAGCGTGG 
GCCATCCTTA 
TGTTTTTGTT 



GCTGCAGAAG 
GCAGAACGGG 
CGGCCCCGCG 
TAGCCTGCAG 
GGCTCAGGGC 
GAAGGATGAG 
CCACGCCAAC 
GCTGGAATAT 
TGAAAAATTG 
AGCCCTGGGC 
TAACTGCCTG 
TGTGTACAAC 
CCCCGAGGCC 
GCTGATGTGG 
AGTACTGGCA 
CAAACTCTAT 
CAGTGAGATT 
CCCGCTCAGG 
GGCAAGATCC 
CTGAGCAGGG 
GACCCAAACT 
ATCAGGGACA 
GACCGGGTCC 
TGAGCTGGGT 
AGTCTCTTGC 
TGAGTCCTCC 
CTCTCCTTTC 
TATATAAACC 
GGTGGGTGGG 
CCCCACACTT 
TTTACACTCG 



CAGCCCGAGG 
CAGCCCTCAG 
GCCACCAACA 
CCCATCACCA 
TTGGAGGAGG 
CAGCAGCAGC 
GTGGTGCGGC 
GTGGATCTGG 
AAGTCACAGC 
ATGGAGGACC 
GTCAGTGCCC 
AGTGAGTACT 
ATCCTGGAGG 
GAAGTGTTTA 
GATTTGCAGG 
CGGCTGATGC 
GCCAGCGCCC 
ATGGCCTGGG 
CTGTCCTCCT 
CCTGGCCTTT 
GGGCGACTAG 
GTGTGGGTGC 
AACTCTGCCA 
TTGTGGGGAG 
CCACTGGTCC 
CCACTCTGGG 
CTCATCCTAA 
GCCCTTTTTG 
CATGGGAGGT 
TTATTGTTGT 
CTGCTCTCAA 



GCGAGGAGCC 
CAGAGATCCA 
AACGCCACAG 
CGCTGGGGAA 
GAGTGGCAGA 
TGGACTTCCG 
TCCTGGGGCT 
GAGACCTCAA 
CCCTCAGCAC 
TGTCCAACAA 
AGAGACAAGT 
ACCACTTCCG 
GTGACTTCTC 
CACATGGAGA 
CTGGGAAGGC 
AGCGCTGCTG 
TGGGAGACAG 
CAGGGGAGGA 
GGGCCCTGAG 
CCTCCTCTTC 
GGCTTTGAGC 
CACAGGTAAC 
CTCATCTGCC 
TTCCTTAATA 
ACTTGGGGGT 
CTTGTGCACA 
GTGCCTGGCA 
TATGCACCAC 
AGGGGTGGGC 
CGTTTTTTGT 
TAAATAAGCC 



AGAGATGGAA 
AGAAGAAGTG 
CACAAGTGAT 
GAGTGAGTTT 
GACCCTGGTA 
GAGGGAGTTG 
GTGCCGGGAG 
GCAGTTCCTG 
CAAGCAGAAG 
CCGCTTTGTG 
GAAGGTGTCT 
CCAGGCCTGG 
TACCAAGTCT 
GATGCCCCAT 
TAGACTTCCT 
GGCCCTCAGC 
CACCGTGGAC 
CATCTCTAGA 
GTGCCCTAGT 
CTCACCCTCA 
TGGGCAGTTT 
CCCAATTTCT 
AACTTTGCCT 
TTCTCAAGTT 
CTAGACCAGG 
CTGACCCAGA 
GATGAAGGAG 
GGGCGGCTTT 
CCTGGAGATG 
TTGTTTTGTT 
TTTTTTA 



TGCCTCAACG 
GCCTTGACCA 
AAGATGCACT 
GGGGAGGTGT 
CTTGTGAAGA 
GAGATGTTTG 
GCTGAGCCCC 
AGGATTTCCA 
GTGGCCCTAT 
CATAAGGACT 
GCCCTGGGCC 
GTGCCGCTGC 
GATGTCTGGG 
GGTGGGCAGG 
CAGCCCGAGG 
CCCAAGGACC 
AGCAAGCCGT 
GGGAAGCTCA 
GCAACAGGCA 
TCCTTTGGGA 
CCCCTGCCAC 
GGCCTTCAAC 
GGGGAGGGCT 
CTGGGCACAC 
ATTATAGAGG 
CCCACGTCTT 
TTTTCAGGAG 
TATATGTAAT 
AGGAGGGTGG 
TTTTTGTTTT 



Seq ID NO s C172 DNA Sequence 

Nucleic Acid Accession #: NM_002309.2 

Coding sequence : 65 . . 673 



1 
I 

ATGAACCTCT 
CATAATGAAG 
TGGGGCGGGG 
ATGTCACAAC 
CAATGCCCTC 
CAAGCTATGT 
GGCCAAGCTG 
CACCCGGGAC 
CACCGCCGAC 
CCACGTGGGC 
GAAGAAGAAG 
CCAGGCCTTC 
GTCCAGATGT 
GGGGGCTGCT 
TGATGAAGCT 
CCTCTGGGAG 
GGCCCCTTGC 
CACAGGGTGA 



GGACTCCTTG 
AGGCTGTCTT 
TTCCAGAAGG 
CCTGGGAAGG 
CTCTTCCCOG 
GCAGGGGTTT 
TGAGGGACCC 
AGTAGCAGCT 
. GGGGACTGGA 
TCTCAGGGAA 
TTATGATATC 
TCCCTGGTCC 
GCTTAGCAGG 
AGCTCAGAGA 
GCTCCACCCC 
CTGGGCCAAT 
CACTGTTCTT 
GCCCCCTGGA 
CCCATGCTGC 
GACACAGCAG 
CTGGGCCAGA 
GAGGAGGAGA 
GCTCCACCCC 
TCCTTTGTGC 
AAGGAGGCAG 
TGAGCGGGCT 
TTCACTCACT 



11 
I 

GAAAACTGCC 
GTCTTGGCGG 
AGCCCCCTCC 
AACCTCATGA 
TTTATTCTCT 
GGCCCCAACG 
GTGGAGCTGT 
CAGAAGATCC 
ATCCTGCGAG 
CATGTGGACG 
CTGGGCTGTC 
TAGCAGGAGG 

GGCAGACCCC 
GAGCAGAGTG 
ATTGTGGACT 
TCACTTTGTG 
GACAGCAGGG 
CTGCCCCTTA 
TCCCCTAGAA 
CTTTTGAGGA 
AGGTCACTTG 
AAGACACTGG 
TGATCTTCCC 
GTGGAGGTGG 
AGGCTCTTCC 
CACATCTGAG 
GGCCGTCTCT 
CCCTGAGCTC 
CTATTTATAT 
CTACTCAACA 
GCCTGGTCTG 
AGGGTCGGAT 
CATCCCCTAC 
TTGTGGAGAG 
GGGCGCCCCA 
GCAGTGGGCA 
CTCTGAAATA 
GGAAGCTCCT 
CCCCAGGAAG 
GGGAGGCTGC 
CATCCCACTC 
TGGGGCCAGG 
CAGAGTTGGG 
GCCGGCCTCC 
TTGCACCTTT 



21 
I 

GGCATCTGAG 
CAGGAGTTGT 
CCATCACCCC 
ACCAGATCAG 
ATTACACAGC 
TGACGGACTT 
ACCGCATAGT 
TCAACCCCAG 
GCCTCCTTAG 
TGACCTACGG 
AACTCCTGGG 
TCTTGAAGTG 
CCAAGGGTGG 
GAGGGTGCCT 
GAAACTTCCA 
GGGGAGCGTG 
CAGTGAACAA 
CCCAGGGGAG 
GGTTGGGACT 
GTTTGATGAG 
TGATCAGAGA 
GCATTCAGGC 
GAGGAGGAGA 
TGCAGCCTGG 
GTAGGGCAGG 
TCTGGGTAAA 
GCCCTGGGAG 
GGTCCCCAGG 
CAAGCGTGCT 
TAACTTATTG 
AAATATGATG 
GAAGTCGACA 
CTGAGAGAAT 
TGTGACTTGC 
GCCGGGTGCC 
GCATTGTGTT 
AGACAGTCCT 
GCGCCCTGGA 
CCTGTGGCCC 
GTGGGGTAGA 
AGACGCCTGG 
CCACCCATGT 
CGAGTGGAGA 
GCAGCTGCTC 
AAGTTCCTCC 
CCCTGTCGCT 



31 41 

I I 
GTTTCCTCCA AGGCCCTCTG 
GCCCCTGCTG TTGGTTCTGC 
TGTCAACGCC ACCTGTGCCA 
GAGCCAACTG GCACAGCTCA 
CCAGGGGGAG CCGTTCCCCA 
CCCGCCCTTC CACGCCAACG 
CGTGTACCTT GGCACCTCCC 
TGCCCTCAGC CTCCACAGCA 
CAACGTGCTG TGCCGCCTGT 
CCCTGACACC TCGGGTAAGG 
GAAGTATAAG CAGATCATCG 
TGCTGTGAAC CGAGGGATCT 
CTGGGGCCCA GGGCATCGCT 
GGCCAGTCCA CTCCACTCTG 
TAGGGAGGGA GCTAGAAGAA 
GGCTGGACTT CTGCCTCTAC 
ACTACACAAG TCATCTACAA 
TGGACCAGCC CCCAGCAAAT 
TAGGTGGGCC AGAGGGGCTA 
TGGAAGATAG AGAGGGGCCT 
ACTTGGGCAT AGGAACAATC 
TCTTGGGGAG GCAGAGAAGC 
GGCCTGGAAA GCTTTGGTAG 
GATGGCCAGG GTCTGATGGC 
GGCAGGTTGC TAAGTCAGGT 
GGTCTGTAAG AAGGGGCTGG 
GTCTTGTGAG GTCACACAGA 
GCAAGGGAAC AGCAGAACTT 
GTGCGTCTGA CCTGGCATGA 
GTGCTTTCAG TGGCCAAGTT 
ATGGCTCCCG ACACAAGCGC 
ATGTTACAAG TGGAATAAGC 
GGGGAGGCCT GAGTGGGAGT 
TTTAGCGTGT CAGGGTCCAG 
TTTCTGTCTT GCTTCCAGGG 
GTGAGGCGCA CTGTTCCTGG 
TGTGGCCCAC CCTGTCCTTG 
ACAACCCTGC CCCTGCACCC 
GGACACCCAT AGACGGTGCG 
CTGGGGGGAT CAGCTGCCCA 
GACTCAGACC AGGAAGCTGT 
CTGGGCTCCC AGGCAGGGAA 
AAOGCCCTCC AGTCTGAGAG 
AGAGCAGTGT TCTGGCTTCT 
GACAAGATGA TGGTACTAAT 
CTCTAAGCAC TTTACCTGGA 



2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4187 



51 
! 

AAGTGCAGCC 
ACTGGAAACA 
TACGCCACCC 
ATGGCAGTGC 
ACAACCTGGA 
GCACGGAGAA 
TGGGCAACAT 
AGCTCAACGC 
GCAGCAAGTA 
ATGTCTTCCA 
CCGTGTTGGC 
CAGGAGTTGG 
AAACCCAAAT 
GGCTGGGCTG 
GGTGCCCCTT 
TTGTCCCTTT 
GAGCCCTGAC 
TATCACCATC 
GGATCCCAAA 
CTGGGATGGA 
TGGCAGAAGT 
CACCTTCAGG 
GTTCTTCGTT 
TGGACCTGCA 
GCAGAGGTTC 
GGTAGCTCAG 
GGTACTTGAG 
AGGGTCAGGG 
TTTCTATTTA 
AATTCCCCTT 
CAGGGCCAGG 
TTACGGGTGA 
GGGGGGCCTT 
GCTGCAGGGG 
GGCTGGTTCA 
CAGATATTGT 
TTTCTGTGTC 
AGCATGCTCC 
GGGGGCCTGG 
TTGCTCCCAA 

CCCGATCTCT 
CAGGGGAGGG 
TCTCAAACCC 
TATGGTACTT 
TGGCGCGTGG 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

640 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 
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GCAGTGTGCA 
CCATCTGTCC 
CCCTTGTCCA 
CTGGCTGCAT 
GCTGATTCCA 
GGAGCTCTGT 
GGACTTGGCC 
CGTCCACCCT 
CCGGGAGCTG 
GGGAGGAAGA 
TTTTTTTAAA 
CTTGTCCCTO 
CTCTCCAGAG 
CCCTGCAGAT 
CGGCATCCAG 
GAGGTTTCCC 
TGATGAATGT 
CATATTTAAA 
GAGAAATGAA 



GGCAGGTCCT 
ATCCCAACAG 
GGATGCAGGG 
TCCCCCAGGA 
CCCGGGGGGC 
GCCAGGCCCA 
CCATGGGACC 
GCACAGCATC 
GGTTTCTCTT 
GAGAGTTTAC 
AAGCACTGCT 
ACTTGGGGCA 
TAGACATAGT 
GGTACAGATG 
CCTTCAGTCC 
TCCTTCCTTT 
ACCCTGTGGG 
ATATTTATTT 
TAAAGAATCT 



GAGGCCTGGG 
CAAGACGAGG 
ACTGCCTTCT 
TGGGCTTCGA 
CCGGCTGACT 
CCTTGCGGCC 
CATCCTCAGT 
ACTGAATCAC 
CCCTTTTTAT 
CAAGAGAGAT 
AGTTTACTTG 
CTTCCACCCT 



TTCCTGCCTT 
CGCCCACGTG 
CCACTGAAAA 
GATGTTTCAT 
TTTATACCAA 
ACTCTTCG 



ATGTGGCTGT 
CCTTCCTGCT 
GAAAGACAAA 
CGCCCATCAC 
CTGGCTCTGA 
GCTCCCTCCA 
AGAGCCTTTG 
CTGCTGGTGT 
GTCTCCGGGC 
TCTCTCCTCC 
GACCCAGCCA 
GGAGCTCTGG 
AGAGTCATCT 
CTAGCTCCGT 
GCACATGGCC 
ACTGACAGAT 
ATGAATCACT 



AGGGTGCGGC 
TGAGATGTGG 
TCATCCGGCT 
CTTGTCTGGA 
CTCATCTCCC 
GTCGCTCTCC 
GATCCCGTCC 
CGTGAAACAG 
GGACCACACC 
CCTTATTTAT 
CCATCGTCCC 
GTCCAGCTCT 
CACCCGGGGA 
CTAGTTCCCC 
GGGCCCACCG 
TTGGGTGACA 
TATTTTTATT 
TTTTTTTTTA 



CCGGAGTTGT 
GCCACACTCA 
TAGCTTGGGG 
AACCAGAGTT 
TGTGGACTTG 
CACCCAGCCT 
GGCAGCTTGG 
CTCTGCCAGG 
TGGGCCTGGC 
TATTTAAACA 
CATCGTCCTC 
GCCTTGCCGG 
GGTAGCATTT 
ACCTCAATCC 
TGCGGCCTTA 
AATTCCTCTT 
TATTCAATGT 
AGAAAAAAAA 



2820 
2B80 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3868 



Seq ID NO: C173 DNA Sequence 
Nucleic Acid Accession ft: XM_097508 
Coding sequence: 44.. 2788 

1 11 21 31 41 51 

I 1 I I I I 

TGAAAGGCGG TTGTGGTGCA AAGGAAAACC CACAGGCCAA GGAATGGGAA GACCAAGGTT 60 

GACACTTGTT TGTCACGTGT CAATAATCAT CTCTGCCCGG GACCTCAGCA TGAACAACCT 120 

CACAGAGCTT CAGCCTGGCC TCTTCCACCA CCTGCGCTTC TTGGAGGAGC TGCGTCTCTC 180 

TGGGAACCAT CTCTCACACA TCCCAGGACA AGCATTCTCT GGTCTCTACA GCCTGAAAAT 240 

CCTGATGCTG CAGAACAATC AGCTGGGAGG AATCCCCGCA GAGGCGCTGT GGGAGCTGCC 300 

GAGCCTGCAG TCGCTGCGCC TAGATGCCAA CCTCATCTCC CTGGTCCCGG AGAGGAGCTT 360 

TGAGGGGCTG TCCTCCCTCC GCCACCTCTG GCTGGACGAC AATGCACTCA CGGAGATCCC 420 

TGTCAGGGCC CTCAACAACC TCCCTGCCCT GCAGGCCATG ACCCTGGCCC TCAACCGCAT 480 

CAGCCACATC CCCGACTACG CGTTCCAGAA TCTCACCAGC CTTGTGGTGC TGCATTTGCA 540 

TAACAACCGC ATCCAGCATC TGGGGACCCA CAGCTTCGAG GGGCTGCACA ATCTGGAGAC 600 

ACTAGACCTG AATTATAACA AGCTGCAGGA GTTCCCTGTG GCCATCCGGA CCCTGGGGAG 660 

ACTGCAGGAA CTGGGGTTCC ATAACAACAA CATCAAGGCC ATCCCAGAAA AGGCCTTCAT 720 

GGGGAACCCT CTGCTACAGA CGATACACTT TTATGATAAC CCAATCCAGT TTGTGGGAAG 780 

ATCGGCATTC CAGTACCTGC CTAAACTCCA CACACTATCT CTGAATGGTG CCATGGACAT 840 

CCAGGAGTTT CCAGATCTCA AAGGCACCAC CAGCCTGGAG ATCCTGACCC TGACCCGCGC 900 

AGGCATCCGG CTGCTCCCAT CGGGGATGTG CCAACAGCTG CCCAGGCTCC GAGTCCTGGA 960 

ACTGTCTCAC AATCAAATTG AGGAGCTGCC CAGCCTGCAC AGGTGTCAGA AATTGGAGGA 1020 

AATCGGCCTC CAACACAACC GCATCTGGGA AATTGGAGCT GACACCTTCA GCCAGCTGAG 1080 

CTCCCTGCAA GCCCTGGATC TTAGCTGGAA CGCCATCCGG TCCATCCACC CCGAGGCCTT 1140 

CTCCACCCTG CACTCCCTGG TCAAGCTGGA CCTGACAGAC AACCAGCTGA CCACACTGCC 1200 

CCTGGCTGGA CTTGGGGGCT TGATGCATCT GAAGCTCAAA GGGAACCTTG CTCTCTCCCA 1260 

GGCCTTCTCC AAGGACAGTT TCCCAAAACT GAGGATCCTG GAGGTGCCTT ATGCCTACCA 1320 

GTGCTGTCCC TATGGGATGT GTGCCAGCTT CTTCAAGGCC TCTGGGCAGT GGGAGGCTGA 1380 

AGACCTTCAC CTTGATGATG AGGAGTCTTC AAAAAGGCCC CTGGGCCTCC TTGCCAGACA 1440 

AGCAGAGAAC CACTATGACC AGGACCTGGA TGAGCTCCAG CTGGAGATGG AGGACTCAAA 1500 

GCCACACCCC AGTGTCCAGT GTAGCCCTAC TCCAGGCCCC TTCAAGCCCT GTGAGTACCT 1560 

CTTTGAAAGC TGGGGCATCC GCCTGGCCGT GTGGGCCATC GTGTTGCTCT CCGTGCTCTG 1620 

CAATGGACTG GTGCTGCTGA CCGTGTTCGC TGGCGGGCCT GTCCCCCTGC CCCGGGTCAA 1680 

GTTTGTGGTA GGTGCGATTG CAGGOGCCAA CACCTTGACT GGCATTTCCT GTGGCCTTCT 1740 

AGCCTCAGTC GATGCCCTGA CCTTTGGTCA GTTCTCTGAG TAOGGAGCCC GCTGGGAGAC 1800 

GGGGCTAGGC TGCCGGGCCA CTGGCTTCCT GGCAGTACTT GGGTCGGAGG CATCGGTGCT 1860 

GCTGCTCACT CTGGCCGCAG TGCAGTGCAG CGTCTCCGTC TCCTGTGTCC GGGCCTATGG 1920 

GAAGTCCCCC TCCCTGGGCA GCGTTCGAGC AGGGGTCCTA GGCTGCCTGG CACTGGCAGG 1980 

GCTGGCCGCC GCGCTGCCCC TGGCCTCAGT GGGAGAATAC GGGGCCTCCC CACTCTGCCT 2040 

GCCCTACGCG CCACCTGAGG GTCAGCCAGC AGCCCTGGGC TTCACCGTGG CCCTGGTGAT 2100 

GATGAACTCC TTCTGTTTCC TGGTCGTGGC CGGTGCCTAC ATCAAACTGT ACTGTGACCT 2160 

GCGGCGGGGC GACTTTGAGG CCGTGTGGGA CTGCGOCATG GTGAGGCACG TGGCCTGGCT 2220 

CATCTTCGCA GACGGGCTCC TCTACTGTCC CGTGGCCTTC CTCAGCTTTG CCTCCATGCT 2280 

GGGCCTCTTC CCTGTCACGC CCGAGGCCGT CAAGTCTGTC CTGCTGGTGG TGCTGCCCCT 2340 

GCCTGCCTGC CTCAACCCAC TGCTGTACCT GCTCTTCAAC CCCCACTTCC GGGATGACCT 2400 

TCGGCGGCTT CGGCCCCGCG CAGGGGACTC AGGGCCCCTA GCCTATGCTG CGGCCGGGGA 2460 

GCTGGAGAAG AGCTCCTGTG ATTCTACCCA GGCCCTGGTA GCCTTCTCTG ATGTGGATCT 2520 

CATTCTGGAA GCTTCTGAAG CTGGGCGGCC CCCTGGGCTG GAGACCTATG GCTTCCCCTC 2580 

AGTGACCCTC ATCTCCTGTC AGCAGCCAGG GGCCCCCAGG CTGGAGGGCA GCCATTGTGT 2640 

AGAGCCAGAG GGGAACCACT TTGGGAACCC CCAACCCTCC ATGGATGGAG AACTGCTGCT 2700 

GAGGGCAGAG GGATCTACGC CAGCAGGTGG AGGCTTGTCA GGGGGTGGCG GCTTTCAGCC 2760 

CTCTGGCTTG GCCTTTGCTT CACACGTGTA AATATCCCTC CCCATTCTTC TCTTCCCCTC 2820 

TCTTCCCTTT CCTCTCTCCC CCTCGGTGAA TGATGGCTGC TTCTAAAACA AATACAACCA 2880 

AAACTCAGCA GTGTGATCTA TAGCAGGATG GCCCAGTCCC TGGCTCCACT GATCACCTCT 2940 

CTCCTGTGAC CATCACCAAC GGGTGCCTCT TGGCCTGGCT TTCCCTTGGC CTTCCTCAGC 3000 

TTCACCTTGA TACTGGGCCT CTTCCTTGTC ATGTCTGAAG CTGTGGACCA GAGACCTGGA 3060 

CTTTTGTCTG CTTAAGGGAA ATGAGGGAAG TAAAGACAGT GAAGGGGTGG AGGGTTGATC 3120 

AGGGCACAGT GGACAGGGAG ACCTCACAGA GAAAGGCCTG GAAGGTGATT TCCCGTGTGA 3180 

CTCATGGATA GGATACAAAA TGTGTTCCAT GTACCATTAA TCTTGACATA TGCCATGCAT 3240 

AAAGACTTCC TATTAAAATA AGCTTTGGAA GAG 3273 



Seq ID NO: C174 DNA Sequence 
Nucleic Acid Accession #: NM_130849 
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1 11 21 31 

I I I I 

AGTCTGGCCC TGGACAACCC CAGCAAAGCC GCCCTCAGCC 
TGGCCACAGC AACACCCACT GAGCACGCTG GGAGCTGAGT 
GGAGCTGGGG CTGCTTCTGG CTGTGCTGGT GGTGACGGCG 
TCTGCTGAGC CTGCTCACCT CTGGCCAGGG CGCTCTGQAT 
GTTAAATACG CTGGCGGACC GTGTGCACTG CACCAACGGG 
TGTGGAGGAC GCCCTGGGCC TGGGCGAGCC TGAGGGGTCA 
CCTGGAGGCC AGGTACGTCG CCCGCCTCAG TGCCGCCGCC 
CGAGGGCACC TGTGAGGACA CTCGGGCTGG CCTCTGGGCC 
GGCCCTGCTC GAGAGCCCCA AGGCCCTGAC CCCGGGCCTG 
GCAGGCCCGG GCTGCCGGCC AGACCCCCAA GACGGCCTGC 
GGAGGAGGCG GTGGGGGCGG GGGCTCCGGG CAGTGCTGGC 
GGACCATGTC AGGAGCGGGT CTTGCTTCCA CGCCTTGCCG 
CTTTGTGTTC CAGCAGCACA GCAGCGAGGT CCCTATGACG 
GATGCAGCGC CTGGGGGTGG GCAGGGAGGC CCACAGTGAC 
AGCCAGCAGC CGGGACCCTG TGCCCCTCAT CAGCTCCAGC 
CACGGTATGC CTGAGTGCCA GGGACGTGAT GGCTGCATAT 
GGTGACCCCG GAGGCCTGGG CCCAACTGAG CCCTGCCCTG 
AGCCTGCACC TCCCAGTCCA GGCCCCCCGT CCAGGACCAG 
TCTGTACGGC TCCCTGGCCA CGCTGCTCAT CTGCCTCTGC 
GCTGACCTGC ACTGGCTGCA GGGGGGTCGC CCACTACATC 
GGCAGTGGGT GCACTCACTG GGGACGCTGT CCTGCATCTG 
GCATACACAC AGCGAAGAGG GCCTCAGCCC ACAGCCCACC 
GGCCGGGCTC TACGCCTTCT TCCTGTTTGA GAACCTCTTC 
CCCGGAGGAC CTGGAGGACG GGCCCTGCGG CCACAGCAGC 
CCACGGTGTG TCCCTGCAGC TGGCACCCAG CGAGCTCCGG 
GGGCTCCCGC GCAGACCTGG TGGCGGAGGA GAGCCCGGAG 
GAGACTGAGC CCAGAGTTGA GGCTACTGCC CTATATGATC 
CAACTTCGCC GACGGGCTGG CCGTGGGCGC CGCCTTCGCG 
GGCCACCTCG CTGGCCGTGT TCTGCCACGA GTTGCCACAC 
CTTGCTGCAC GCGGGGCTGT CCGTGCGCCA AGCACTGCTG 
CACGGCCTTC GCTGGTCTCT ACGTGGCACT CGCGGTTGGA 
CTGGATCCTG GCAGTGGCCA CCGGCCTGTT CCTCTACGTA 
GGCGATGTTG AAAGTACGGG ACCCGCGGCC CTGGCTCCTC 
CCTGCTGGGC GGCTGGACCG TCCTGCTGCT GCTGTCCCTG 
CTGATACCCT GCCCTAGTCC CCCACCTTTG ACTTAAGATC 
GCCCAGAAAC CAGAAGCCCC TATAGAGGCC CCAGTCCCAA 
TCCTTGGAAA AAAAAAAAAA AAAAAAAAAA AA 

Seq ID NO: CI 7 5 DMA Sequence 
Nucleic Acid Accession #x NM_01897l 
Coding sequence : 1 . . 112 8 



41 
I 

AGCCCAGAAG 
ATGGCGTCCC 
ACGGCGTCCC 
CAAGAGGCTC 
CCGTGTGGAA 
GGGCTGCCCC 
GTCCTGTACC 
TCTCATGCAG 
AGCTGGCTGC 
GTAGATATCC 
GGCGTCCTGG 
AGCCCTCAGT 
CTGGCCGAGC 
CACAGTCATC 
AACAGCTCCA 
GGACTGTCGG 
CTCCAACAGC 
CTCAGCCAGT 

CTGCAGACCT 
ACGCCCAAGG 
TGGCGCCTCC 
AATCTCCTGC 
CATAGCCACG 
CAGCCCAAGC 
CTGCTGAACC 
ACTCTGGGCG 
TCCTCCTGGA 
GAGCTGGGGG 
CTGAACCTGG 
GTCAGCGAGG 
GCACTCTGCG 
TTCCTGCTGC 
TACGAGGATG 
CCACACCTCA 
CTCCAGTAAA 



51 
I 

CACTGGGCCT 
TGGTCTCGCT 
CGCCTGCTGG 
TGGGCGGCCT 
AGTGCCTGTC 
CGGGCCCGGT 
TCAGCAACCC 
ACCACCTCCT 
TGCAGAGGAT 
CTCAGCTGCT 
CTGCCCTGCT 
ACTTCGTGGA 
TGTCAGCCTT 
GGCACAGGGG 
GTGTGTGGGA 
AACAGGCTGG 
AGCTGAGTGG 
CAGAGAGGTA 
GCCTCCTGCT 
TCCTGAGCCT 
TGCTGGGGCT 
TGGCTATGCT 
TGCCCAGGGA 
GGGGCCACAG 
CCCCCCACGA 
CTGAGCCCAG 
ACGCCGTGCA 
AGACCGGGCT 

CCTCCGCGCT 
AGAGCGAGGC 
ACATGCTCCC 
ACAACGTGGG 
ACATCACCTT 
CAAACCTACA 
GACACTCTTG 



1 
1 

ATGGCGAACG 
AAGCTGGCCA 
CTGCTGATOG 
TGCCTGGCCG 
CGTGCGGCGG 
CTGGCCGCGC 
TACCTGGCCA 
GCCATGCTGG 
GACGGCGGTG 
CCCGGCGCGC 
TACCTCCGCC 
CCCGCCGTCA 
AACTGGACGQ 
GCAGGGCCGG 
AGGCTGTGCA 
GTCGTGGCCA 
ACGGCCTCCG 
TTCAACAGGG 
ACCACCCAGG 



11 
1 

CGAGCGAGCC 
CGCTCAGCCT 
TGCGGGAGCG 
ACGGGCTGCG 
CCGCGGCGGG 
TCTTCTGCTT 
TCGCGCACCA 
TGTGCGCCGC 
GCGACGACGA 
TGGGCTTCCT 
TGCTCTTCTT 
GCCACGACTG 
CGGGCTTCGG 
GCCGCGGCGC 
AGATGTTCTA 
GCTACCTGCG 
TGTGGCTGAC 
AGCTGAGGGA 
CGACCCATCC 



21 
I 

GGGTGGCAGC 
GCTGCTGTGC 
CAGCCTGCAC 
CGCGCTCGCC 
GGCGCCGCCG 
CCACGCCGCC 
CCGCTTCTAT 
CTGGGCGCTG 
GGACGCGCCG 
GCTGCTGCTG 
CATCCACGAC 
GACCTTCCAC 
COGCGGGCCC 
GCGCCGCCTC 
CGCCGTCACG 
GGTCCTGGTG 
CTTCGCGCAG 
CTGCTTCAGG 
CTGCGACCTG 



31 
I 

GGCGGCGGCG 
GTGAGCCTAG 
CGCGCCCCGT 
TGCCTCCCGG 
GGCGCGCTGG 
TTCCTGCTGC 
GCAGAGCGCC 
GCGCTGGCCG 
TGCGCCCTGG 
GCCGTGGTGG 
CGCCGCAAGA 
GGCCCGGGCG 
ACGCCGCCCG 
CTCGTGCTGG 
CTGCTCTTCC 
CGGCCCGGCG 
GCCGGCATCA 
GCCCAGTTCC 
AAAGGCATTG 



41 
I 

AGGCGGCCGC 
CGGGCAACGT 
ACTACCTGCT 
CCGTCATGCT 
GCTGCAAGCT 
TGGGCGTGGG 
TGGCCGGCTG 
CGGCCTTCCC 
AGCAGCGGCC 
TGGGCGCCAC 
TGCGGCCCGC 
OCACCGGCCA 
OGCTTGTGGG 
AAGAATTCAA 
TGCTCCTCTG 
CCGTCCCCCA 
ACCCCGTCGT 
CCTGCTGCCA 
GTTTATGA 



51 
I 

CCTGGGCCTC 
GCTGTTCGCG 
GCTCGACCTG 
GGCGGCGCGG 
GCTCGCCTTC 
CGTCACCCGC 
GCCGTGCGCC 
GCCAGTGCTG 
CGACGGCGCC 
GCACCTCGTC 
GCGCCTGGTG 
GGCGGCCGCC 
CATCCGGCCC 
GACGGAGAAG 
GGGGCCCTAC 
GGCCTACCTG 
GTGCTTCCTC 
GAGCCCCCGG 



Seq IO NO: CI 7 6 DNA Sequence 
Nucleic Acid Accession #: NM_005631 
Coding sequence: 290.. 2653 



1 
I 

GGCACGAGGG 
CCGTTCCGGG 
GCOGAGGGGC 
ACAGGTCGCC 
AGGAGCAGGC 
CCGCCCAGCG 
GGACCCGGGC 
GGGCGGGAGC 
CGGCCGGGCT 
CTACGGGGCC 



11 
I 

GGCTGAAGAC 
CCTCCGCAGC 
CGGGGCGCGC 
TGAGCCGCCT 
GGGGGCGCCG 
CGGGGGCCGG 
CGGGGGGCGG 
GCGAGGAGGA 
GCCCCCTGCG 
ACCTCCACAC 



21 
I 

AACTTGGATT 
CCAACATGGG 
GGAGCGTCCG 
CCGCGGCCGC 
GGGCTTTTGC 
AGCTCCCGCT 
CCTCGAGCGG 
GCGCGGCGGT 
AGCCGCTGCG 
TGCTGGCCGG 



31 
I 

GCGAGGCTAG 
CCCCGGGTTC 
GGGGGGCCCG 
CGAGGTCGTG 
TGAGTTGGCG 
CCTGGGGCTG 
GAACGCGACC 
GACTGGCCCT 
CTACAACGTG 
AGACTCGGAC 



41 
I 

GGCTTGGGGA 
CAAAGTTTGC 
GGCCCGGATT 
CGTGTGGCCG 
GGGTTGGCCA 
CTGCTGCTGC 
GGGCCTGGGC 
CCGCCGCCGC 
TGCCTGGGCT 
TCCCAGGAGG 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

I960 

2040 

2100 

2160 

2192 



SI 
I 

GTCGTGCATC 
GAAGTTGGGC 
CTCTGGGCGC 
GGGGGCTCGG 
TGGCCGCTGC 
TGCTGCTGGG 
CTCGGAGCGC 
TGAGCCACTG 
CGGTGCTGCC 
AAGCGCACGG 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1128 
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CAAGCTCGTG CTCTGGTCGG GCCTCCGGAA TGCCCCCCGC TGCTGGGCAG TGATCCAGCC 
CCTGCTGTGT GCCGTATACA TGCCCAAGTG TGAGAATGAC CGGGTGGAGC TGCCCAGCCG 
TACCCTCTGC CAGGCCACCC GAGGCCCCTG TGCCATCGTG GAGAGGGAGC GGGGCTGGCC 
TGACTTCCTG CGCTGCACTC CTGACCGCTT CCCTGAAGGC TGCACGAATG AGGTGCAGAA 
CATCAAGTTC AACAGTTCAG GCCAGTGCGA AGTGCCCTTG GTTCGGACAG ACAACCCCAA 
GAGCTGGTAC GAGGACGTGG AGGGCTGCGG CATCCAGTGC CAGAACCCGC TCTTCACAGA 
GGCTGAGCAC CAGGACATGC ACAGCTACAT CGCGGCCTTC GGGGCCGTCA CGGGCCTCTG 
CACGCTCTTC ACCCTGGCCA CATTCGTGGC TGACTGGCGG AACTCGAATC GCTACCCTGC 
TGTTATTCTC TTCTACGTCA ATGCGTGCTT CTTTGTGGGC AGCATTGGCT GGCTGGCCCA 
GTTCATGGAT GGTGCCCGCC GAGAGATCGT CTGCCGTGCA GATGGCACCA TGAGGCTTGG 
GGAGCCCACC TCCAATGAGA CTCTGTCCTG CGTCATCATC TTTGTCATCG TGTACTACGC 
CCTGATGGCT GGTGTGGTTT GGTTTGTGGT CCTCACCTAT GCCTGGCACA CTTCCTTCAA 
AGCCCTGGGC ACCACCTACC AGCCTCTCTC GGGCAAGACC TCCTACTTCC ACCTGCTCAC 
CTGGTCACTC CCCTTTGTCC TCACTGTGGC AATCCTTGCT GTGGCGCAGG TGGATGGGGA 
CTCTGTGAGT GGCATTTGTT TTGTGGGCTA CAAGAACTAC CGATACCGTG CGGGCTTCGT 
GCTGGCCCCA ATCGGCCTGG TGCTCATCGT GGGAGGCTAC TTCCTCATCC GAGGAGTCAT 
GACTCTGTTC TCCATCAAGA GCAACCACCC CGGGCTGCTG AGTGAGAAGG CTGCCAGCAA 
GATCAACGAG ACCATGCTGC GCCTGGGCAT TTTTGGCTTC CTGGCCTTTG GCTTTGTGCT 
CATTACCTTC AGCTGCCACT TCTACGACTT CTTCAACCAG GCTGAGTGGG AGCGCAGCTT 
CCGGGACTAT GTGCTATGTC AGGCCAATGT GACCATCGGG CTGCCCACCA AGCAGCCCAT 
CCCTGACTGT GAGATCAAGA ATCGCCCGAG CCTTCTGGTG GAGAAGATCA ACCTGTTTGC 
CATGTTTGGA ACTGGCATCG CCATGAGCAC CTGGGTCTGG ACCAAGGCCA CGCTGCTCAT 
CTGGAGGCGT ACCTGGTGCA GGTTGACTGG GCAGAGTGAC GATGAGCCAA AGCGGATCAA 
GAAGAGCAAG ATGATTGCCA AGGCCTTCTC TAAGCGGCAC GAGCTCCTGC AGAACCCAGG 
CCAGGAGCTG TCCTTCAGCA TGCACACTGT GTCCCACGAC GGGCCCGTGG CGGGCTTGGC 
CTTTGACCTC AATGAGCCCT CAGCTGATGT CTCCTCTGCC TGGGCCCAGC ATGTCACCAA 
GATGGTGGCT CGGAGAGGAG CCATACTGCC CCAGGATATT TCTGTCACCC CTGTGGCAAC 
TCCAGTGCCC CCAGAGGAAC AAGCCAACCT GTGGCTGGTT GAGGCAGAGA TCTCCCCAGA 
GCTGCAGAAG CGCCTGGGCC GGAAGAAGAA GAGGAGGAAG AGGAAGAAGG AGGTGTGCCC 
GCTGGCGCCG CCCCCTGAGC TTCACCCCCC TGCCCCTGCC CCCAGTACCA TTCCTCGACT 
GCCTCAGCTG CCCCGGCAGA AATGCCTGGT GGCTGCAGGT GCCTGGGGAG CTGGGGACTC 
TTGCCGACAG GGAGCGTGGA CCCTGGTCTC CAACCCATTC TGCCCAGAGC CCAGTCCCCC 
TCAGGATCCA TTTCTGCCCA GTGCACCGGC CCCCGTGGCA TGGGCTCATG GCCGCCGACA 
GGGCCTGGGG CCTATTCACT CCCGCACCAA CCTGATGGAC ACAGAACTCA TGGATGCAGA 
CTCGGACTTC TGAGCCTGCA GAGCAGGACC TGGGACAGGA AAGAGAGGAA CCAATACCTT 
CAAGGCTCTT CTTCCTCACC GAGCATGCTT CCCTAGGATC CCGTCTTCCA GAGAACCTGT 
GGGCTGACTG CCCTCCGAAG AGAGTTCTGG ATGTCTGGCT CAAAGCAGCA GGACTGTGGG 
AAAGAGCCTA ACATCTCCAT GGGGAGGCCT CACCCCAGGG ACAGGGCCCT GGAGCTCAGG 
GTCCTTGTTT CTGCCCTGCC AGCTGCAGCC TGGTTGGCAG CATCTGCTCC ATCGGGGCAG 
GGGGTATGCA GAGCTTGTGG TGGGGCAGGA ACGGTGGAGG CAGAGGTGAC AGTTCCCAGA 
GTGGGCTTTG GTGGCCAGGG AGGCAGCCTA GCCTATGTCT GGCAGATGAG GGCTGGCTGC 
CGTTTTCTGG GCTGATGGGT GCCCTTTCCT GGCAGTCTCA GTCCAAAAGT GTTGACTGTG 
TCATTAGTCC TTTGTCTAAG TAGGGCCAGG GCACCGTATT CCTCTCCCAG GTGTTTGTGG 
GGCTGGAAGG ACCTGCTCCC ACAGGGGCCA TGTCCTCTCT TAATAGGTGG CACTACCCCA 
AACCCATCTT TTGTTCTCCT ATATCCTCCT TCTCCTGTTC CATTTCAGTT CAGTTTCAGC 
GGTGCCAACC TCTTTGCGTT TCCTTTTTGT TGATGAGGAC CCAGAGCTGC TGCACACACT 
CACCTCTAAC CCCCTCCCCT CGCTGCTGGG CCCCATCTCC ACAGGAGAGA CTGGTTCGGC 
TCTAGGGCCT CAGTCTGGAG TGGGATAGGA GCAGTGAGTG ACAAAGCCTC TGAAAGATGC 
ATCATCTCTT CCTCACACCC ATTTAGTGGG GGATGGGTCC TCTAGACTTG AGGGGCTACC 
CTGGGAAGCT GCCGTAGCTT CAGCCAGGCA AGAAAGCTTC CTTCAACCTG CATAGCCGGT 
GGGTGAGGAG ATTCCCACCT TCCATAGCCT CCAAACATGT TCCCAAGGCC CCACTTTCAA 
GAATCAGACA GCAGGAAGCC ATAGATGCTG GCTGGGTTCC AGGTTATGGG GAGAAGAAAT 
ACAGTCAATA AAAGGTTTTT GTATAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 
A 

seq ZD NO: CI 7 7 dna Sequence 
Nucleic Acid Accession #: AK094595 
Coding sequence : 1 . . 2 8 5 3 



ATGGGGGCCC 
CCGAGGCTGA 
TCAGCGCCAG 
AAGAACAAGC 
TGCAACGGCG 
ACCCTGGGGG 
CAGGTGGAGG 
GCGGGCACCA 
GATCAGGAGC 
CCGCCGGAGG 
CCCACCCAGG 
CGCCTGTCGG 
AGCACCACTG 
TCACCCTGCT 
CCCGCTCCAC 
ACCACCATCT 
ACTGAGTGTG 
CGTGACTGCA 
AATAAGAAAA 
CTGTATGCGG 
GTGGTGGTGT 
GCCCTQACTG 
CTCCTACACC 




ACACCAACTT 
ACACTGCCAA 
CCACCGTCAT 
CCAACCGCTG 
TCAACGGAGG 
GCCCAGTCGA 
CCCACTGGCG 
GCGGGACGCT 
CTCTAAGCGA 
GGCTCGTGGT 
ACCGCCGCAA 
GTGGTTTCCA 
CCTCTGTGCC 



21 
I 

TCGGGGCGCG 
TAGGAAGCGA 
GCCCTACTTC 
CCGCTGCCGC 
CCAGAACGAC 
CCTGCGGGTG 
GCTGGAGGAT 
CCGAGCCTAC 
GGAGGTGCCC 
GGCCGAGGTG 
CCTGCTCACC 
CTATACCTGC 
CGTCTACGTG 
TGGCCGAGGC 
GGCCTTCTGC 
TGGGGCGTGG 
TAGCCGCGAG 
GCTCGACTCT 
CCCCAACAGC 
GGCCATCTTC 
CTGCCGTGAC 
CCCCGTCAAC 
TCCTGACCTG 



31 
I 

CTGCTGCTGG 
TCGGGTGAGG 
CTGCAGGAGC 
GCCTTCCCCG 
CACGTCACAC 
CGCGAGGTGC 
TACTGGTGCC 
GTCOGCATCG 
CTGGACCATG 
GAATGGCTCA 
ATCGACCACA 
GTGGCCAAGA 
AATGGCGGCT 
TGGCAGAAGC 
GAGGGCCAGG 
ACGGAGTGGA 
TGCATGGCGC 
AAGAACTGCA 
CACCTGCTGG 
GTGGTCGTGG 
TTCGACACAG 
TTTAAGACGG 
ACAGCCAGCG 



41 
I 

CACTGCTGCT 
TGCTCCCTGA 
CACAGGACGC 
CCACACAGAT 
AGGAAGGCCT 
AGATCGAGGT 
AGTGCGTGGC 
CCTACCTGCG 
AGGTTCTCCT 
AGAATGAGGA 
ACCTCATCAT 
ACATCGTGGC 
GGTCCAGCTG 
GCACCCGGAC 
CATTCCAGAA 
GCAAGTGGTC 
CCCCACCCCA 
CAGATGGGCT 
AGGCCTCAGG 
CAATCCTCAT 
ACATCACTGA 
CAAGGCCCAG 
CCGGCATCTA 



660 

720 

780 

840 

900 

960 
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10B0 

1140 

1200 

1260 
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1380 

1440 
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1920 
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2160 
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2280 

2340 

2400 

2460 
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2580 
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2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3781 



51 
I 

CTGCTGGGAC 
CTCCTTCCCG 
CTACATTGTG 
CTACTTCAAG 
GGATGAGGCC 
GTCGCGGCAG 
CTGGAGCTCC 
CAAGAACTTC 
GCAGTGCCGC 
TGTCATCGAC 
CCGCCAGGCC 
CAAACGCCGG 
GGCAGAGTGG 
CTGCACCAAC 
GACCGCCTGC 
AGCCTGCAGC 
GAACGGAGGC 
GTGCATGCAA 
GGATGCGGCG 
GGCGGTGGGG 
CTCATCTGCT 
CAACCCGCAG 
CCGCGGACCC 
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GTGTATGCCC 
CCCTTACCCA 
CTGGCAGATG 
GCCCGGGACA 
GGCCTGCCCC 
CTCAGCATCC 
AAGTTCTACG 
GGGACCCAGA 
CGCCCCGTCA 
CAGCTCAAGA 
ACCCTGAACA 
CAGCTGGGCA 
CAGCTGGCCG 
TGCCTGGAGG 
GGATACTTGG 
CTCTCCCTCC 
GAGATCCCCT 
CTGGAGAGGC 
GAAGGGGAGG 
CTGGACACTC 
TTCAAGATCC 
CGGGGCAATG 
TTTGCCACCA 
GACGATGGGG 
CTGGTGGCTG 
ACTGGCAGGA 
CTGCTTCCTC 
ACCATGACCA 
CCACCCCCGC 
AGCCTCTGGA 
CCCTCAGCCC 
CGTTATTGAT 
GAAGAGTCAA 
GAAAGAAAGA 
ACAAGGGAAA 
TTGGATAAGT 



TGCAGGACTC 
GCCTTAAGGT 
GGGCTGACCT 
CCCACTTCCT 
GAGACCCAGG 
CCGGCACAGG 
AGATGTATCT 
CAGTATTGAG 
TCCTCACCAT 
CCCAGGCCCA 
CACCCTGCTA 
CCTACX3TGTT 
TCTTCGCCCC 
ACACGCCTGT 
TGGAGGAGCC 
ATGACCTCCC 
TCTATCACAT 
ACAGCTTGGC 
GCCAGATATT 
TCTGCTCTGC 
CACTGTCCAT 
ACTGGCGGAT 
AAGCGAGCCC 
ACCTCAACAG 
TGGCCACCGA 
GGCAGGTGCA 
CCAGTTCACA 
GCCTTAGAAA 
TCTCTCTCTC 
GGCAGTTGGT 
GGCAACTTCT 
TTCTCCTTTC 
GTACAATTCA 
AAGCTTCAGA 
AGAAAAACCC 
CCTTTTTAAA 



CACCGACAAA 
CAAGGTCTAC 
GCTGGGGGTC 
GCACCTGCGC 
GAGCAGCGTC 
GGTCAGCTTG 
ACTCATCAAC 
CCCCTCGGTG 
GCCCCACTGT 
CCAGGGCCAC 
CTGCCAGCTG 
CACGGGCGAG 



AGCACTGAAG 
GAAACCGCTA 
CCATGCCCAT 
TTGGAGTGGC 
CTCCACAGAG 
CCAGCTGCAT 
CCCTGGCAGC 
CCGCCAGAAG 
GTTAGCACAG 
CACGGGTGTG 
CCTGGCGAGT 
CGGGGACTGC 
GGGAGGCCTG 
GCCAGAGTTG 
ATCCATGTAC 
TTGGCCTGAG 
TGGGGGCGGG 
GGGTTCCATG 
TCCCTAAGCC 
GACAAACTGC 
CCGCTAGTAA 
AGTTTCTTAG 
A 



ATCCCCATGA 
AGCTCCAGCA 
TTGCCGCCTG 
AGCGCCAGCC 
AGCGGCACCT 
CTGGTGCCCA 
AAGGCAGAAA 
ACCTGTGGAC 
GCCGAAGTCA 
TGGGAGGAGG 
GAGCCCAGGG 
TCCTATTCCC 
ACCTCCCTGG 
GAGGTGCTGG 
ATGTTCAAGG 
TGGAGGAGCA 
AGCCAGAAGG 
CTCACCTGCA 
ACCACTCTGG 
ACTGTCACCA 
ATATGCAACA 
AAGCTCTCTA 
ATCCTGGACC 
GCCTTGGAGG 
TGAGCCTCCT 
GGGCAGCCTC 
CCTCTCCTCC 
TCTGTTGTTA 
ATCTCTGTGC 
CAGGCAGGAG 
GGTTTTAGTT 
CCCTTCTGCT 
TTTCTCCTGT 
GGCTCAAAGA 
GAAACGCAAA 



CCAACTCTCC 
CCACGGGCTC 
GCACATACCC 
TCGGTTCCCA 
TTGGCTGCCT 
ATGGAGCCAT 
GTACCCTCCC 
CCACAGGCCT 
GTGCCCGTGA 
TGGTGACCCT 
CCTGTCACAT 
GCTCAGCAGT 
AGTACAGCCT 
AGCTGGAGCG 
ACAGTTACCA 
AGCTGCTGGC 
CCCTCCACTG 
AGATCTGCGT 
CAGAGACACC 
CCCAGCTGGG 
GCCTAGATGC 
TGGACCGGTA 
TCTGGGAAGC 
AGATGGGCAA 
GGGACAGCGG 
CTGATGGGGA 

GAGGGCCCAG 
AGGAACCAAG 
GCCCTCCCTC 
CCGTTCTCGT 
TCCACGCCCT 
CCAAAAGCAA 
AGAAGAAAAA 
CGATTTATTA 



TCTGCTGGAC 
TGGGCCAGGC 
TAGCGATTTC 
GCAGCTCTTG 
GGGTGGGAGG 
TCCCCAGGGC 
GCTTTCAGAA 
CCTGCTGTGC 
CTGGATCTTT 
GGATGAGGAG 
CCTGCTGGAC 
CAAGCGGCTC 
CCGGGTCTAC 
GACTCTGGGC 
CAACCTGCGC 
CAAATACCAG 
CACTTTCACC 
GCGGCAAGTG 
TGCTGGCTCC 
ACCTTATGCC 
CCCCAACTCA 
CCTGAATTAC 
TCTGCAGCAG 
GAGTGAGATG 
GCTGGCAGGG 
TGTTTGGCCT 
CAACCCCCAG 
AGTTCCTTCT 
ATGGGGCTGA 
CACCCCCCCA 
TTTCTTCCTC 
TTTCCTCTTT 
AAAGGCAAAG 
CACCAAAACC 
TCCAGATTAT 



Seq ID NO: C178 DNA Sequence 
Nucleic Acid Accession ft: NM_004625 
Coding sequence : 310 . . 1359 



1 
I 

GAGGGAGGGG 
GCACCGGTCG 
CCTCCCGCTC 
CGCACCGACA 
TTGCCCCGCC 
ATCGGGACTA 
ATGGTCTACC 
TGTAACAAGA 
GCCATCATCG 
CGCAATGGCC 
AAAGTGGGGA 
GCCATCACAG 
CAAGGCCAGT 
TACGGCATCG 
ACTCTCATGA 
CTGGAATGTA 
CTGCCACAGT 
GTGGAGCCTG 
CTGTCGTACC 
TGCGAGGAGG 
GCTCCCCAGG 
TACGCCCGCG 
ACGTGCAGCG 
CCCGCTGCAA 
AGGATGCTGA 

CTCCACCCTC 
CAGGAACTCT 
ATAATATTAA 



11 

I 

CGGGGGCTGG 
CTCGCGCGCA 
CGTGGCTCCC 
GACGGCCCCG 
GGGGCCGGCC 
TGAACCGGAA 
TOCGGATCGG 
TCCCAGGCCT 
TCATAGGAGA 
GCTGGAACTG 
GCCGGGAGGC 
CTGCCTGTAC 
ACCACGGGGA 
GCTTCGCCAA 
ACTTGCACAA 
AGTGCCACGG 
TTCGGGAGCT 
TGCGTGCCAG 
GCAAGCCCAT 
ACCCGGTGAC 
CCAGCGGCTG 
TGTGGCAGTG 
AGCGCACGGA 
GTCAGATTGC 
GCTTGTCTTT 
AAAAAAAATC 
CCCCAGACAC 
GGACCCCTGG 
TTTATTTAAT 



Seq ID NO: C179 DNA Sequence OBR3 
Nucleic Acid Accession ft: NM_0037B6 
Coding sequence: 71.. 4654 

11 
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1 11 21 31 41 51 

I 1 1 I 1.1 

CTCCGGCGCC CGCTCTGCCC GCCGCTGGGT CCGACCGCGC TCGCCTTCCT TGCAGCCGCG 
CCTCGGCCCC ATGGACGCCC TGTGCGGTTC CGGGGAGCTC GGCTCCAAGT TCTGGGACTC 
CAACCTGTCT GTGCACACAG AAAACCCGGA CCTCACTCCC TGCTTCCAGA ACTCCCTGCT 
GGCCTGGGTG CCCTGCATCT ACCTGTGGGT CGCCCTGCCC TGCTACTTGC TCTACCTGCG 
GCACCATTGT CGTGGCTACA TCATCCTCTC CCACCTGTCC AAGCTCAAGA TGGTCCTGGG 

1311 



60 
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180 

240 

300 
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TGTCCTGCTG TGGTGOGTCT CCTGGGCGGA CCTTTTTTAC TCCTTCCATG GCCTGQTCCA 360 

TGGCCGGGCC CCTGCCCCTG TTTTCTTTGT CACCCCCTTG GTGGTGGGGG TCACCATGCT 420 

GCTGGCCACC CTGCTGATAC AGTATGAGCG GCTGCAGGGC GTACAGTCTT CGGGGGTCCT 4B0 

CATTATCTTC TGGTTCCTGT GTGTGGTCTG CGCCATCGTC CCATTCCGCT CCAAGATCCT 540 

TTTAGCCAAG GCAGAGGGTG AGATCTCAGA CCCCTTCCGC TTCACCACCT TCTACATCCA 600 

CTTTGCCCTG GTACTCTCTG CCCTCATCTT GGCCTGCTTC AGGGAGAAAC CTCCATTTTT 660 

CTCCGCAAAG AATGTCGACC CTAACCCCTA CCCTGAGACC AGCGCTGGCT TTCTCTCCCG 720 

CCTGTTTTTC TGGTGGTTCA CAAAGATGGC CATCTATGGC TACCGGCATC CCCTGGAGGA 780 

GAAGGACCTC TGGTCCCTAA AGGAAGAGGA CAGATCCCAG ATGGTGGTGC AGCAGCTGCT 840 

GGAGGCATGG AGGAAGCAGG AAAAGCAGAC GGCACGACAC AAGGCTTCAG CAGCACCTGG 900 

GAAAAATGCC TCCGGCGAGG ACGAGGTGCT GCTGGGTGCC CGGCCCAGGC CCCGGAAGCC 960 

CTCCTTCCTG AAGGCCCTGC TGGCCACCTT CGGCTCCAGC TTCCTCATCA GTGCCTGCTT 1020 

CAAGCTTATC CAGGACCTGC TCTCCTTCAT CAATCCACAG CTGCTCAGCA TCCTGATCAG 1080 

GTTTATCTCC AACCCCATGG CCCCCTCCTG GTGGGGCTTC CTGGTGGCTG GGCTGATGTT 1140 

CCTGTGCTCC ATGATGCAGT CGCTGATCTT ACAACACTAT TACCACTACA TCTTTGTGAC 1200 

TGGGGTGAAG TTTCGTACTG GGATCATGGG TGTCATCTAC AGGAAGGCTC TGGTTATCAC 1260 

CAACTCAGTC AAACGTGCGT CCACTGTGGG GGAAATTGTC AACCTCATGT CAGTGGATGC 1320 

CCAGCGCTTC ATGGACCTTG CCCCCTTCCT CAATCTGCTG TGGTCAGCAC CCCTGCAGAT 1380 

CATCCTGGCG ATCTACTTCC TCTGGCAGAA CCTAGGTCCC TCTGTCCTGG CTGGAGTCGC 1440 

TTTCATGGTC TTGCTGATTC CACTCAACGG AGCTGTGGCC GTGAAGATGC GCGCCTTCCA 1500 

GGTAAAGCAA ATGAAATTGA AGGACTCGCG CATCAAGCTG ATGAGTGAGA TCCTGAACGG 1560 

CATCAAGGTG CTGAAGCTGT ACGCCTGGGA GCCCAGCTTC CTGAAGCAGG TGGAGGGCAT 1620 

CAGGCAGGGT GAGCTCCAGC TGCTGCGCAC GGCGGCCTAC CTCCACACCA CAACCACCTT 1680 

CACCTGGATG TGCAGCCCCT TCCTGGTGAC CCTGATCACC CTCTGGGTGT ACGTGTACGT 1740 

GGACCCAAAC AATGTGCTGG ACGCCGAGAA GGCCTTTGTG TCTGTGTCCT TGTTTAATAT 1800 

CTTAAGACTT CCCCTCAACA TGCTGCCCCA GTTAATCAGC AACCTGACTC AGGCCAGTGT IB 60 

GTCTCTGAAA CGGATCCAGC AATTCCTGAG CCAAGAGGAA CTTGACCCCC AGAGTGTGGA 1920 

AAGAAAGACC ATCTCCCCAG GCTATGCCAT CACCATACAC AGTGGCACCT TCACCTGGGC 1980 

CCAGGACCTG CCCCCCACTC TGCACAGCCT AGACATCCAG GTCCCGAAAG GGGCACTGGT 2040 

GGCCGTGGTG GGGCCTGTGG GCTGTGGGAA GTCCTCCCTG GTGTCTGCCC TGCTGGGAGA 2100 

GATGGAGAAG CTAGAAGGCA AAGTGCACAT GAAGGGCTCC GTGGCCTATG TGCCCCAGCA 2160 

GGCATGGATC CAGAACTGCA CTCTTCAGGA AAACGTGCTT TTCGGCAAAG CCCTGAACCC 2220 

CAAGCGCTAC CAGCAGACTC TGGAGGCCTG TGCCTTGCTA GCTGACCTGG AGATGCTGCC 2280 

TGGTGGGGAT CAGACAGAGA TTGGAGAGAA GGGCATTAAC CTGTCTGGGG GCCAGCGGCA 2340 

GCGGGTCAGT CTGGCTCGAG CTGTTTACAG TGATGCCGAT ATTTTCTTGC TGGATGACCC 2400 

ACTGTCCGCG GTGGACTCTC ATGTGGCCAA GCACATCTTT GACCACGTCA TCGGGCCAGA 2460 

AGGCGTGCTG GCAGGCAAGA CGCGAGTGCT GGTGACGCAC GGCATTAGCT TCCTGCCCCA 2520 

GACAGACTTC ATCATTGTGC TAGCTGATGG ACAGGTGTCT GAGATGGGCC CGTACCCAGC 2580 

CCTGCTGCAG CGCAACGGCT CCTTTGCCAA CTTTCTCTGC AACTATGCCC CCGATGAGGA 2640 

CCAAGGGCAC CTGGAGGACA GCTGGACCGC GTTGGAAGGT GCAGAGGATA AGGAGGCACT 2700 

GCTGATTGAA GACACACTCA GCAACCACAC GGATCTGACA GACAATGATC CAGTCACCTA 2760 

TGTGGTCCAG AAGCAGTTTA TGAGACAGCT GAGTGCCCTG TCCTCAGATG GGGAGGGACA 2820 

GGGTCGGCCT GTACCCCGGA GGCACCTGGG TCCATCAGAG AAGGTGCAGG TGACAGAGGC 2880 

GAAGGCAGAT GGGGCACTGA CCCAGGAGGA GAAAGCAGCC ATTGGCACTG TGGAGCTCAG 2940 

TGTGTTCTGG GATTATGCCA AGGCCGTGGG GCTCTGTACC ACGCTGGCCA TCTGTCTCCT 3000 

GTATGTGGGT CAAAGTGCGG CTGCCATTGG AGCCAATGTG TGGCTCAGTG CCTGGACAAA 3060 

TGATGCCATG GCAGACAGTA GACAGAACAA CACTTCCCTG AGGCTGGGCG TCTATGCTGC 3120 

TTTAGGAATT CTGCAAGGGT TCTTGGTGAT GCTGGCAGCC ATGGCCATGG CAGCGGGTGG 3180 

CATCCAGGCT GCCCGTGTGT TGCACCAGGC ACTGCTGCAC AACAAGATAC GCTCGCCACA 3240 

GTCCTTCTTT GACACCACAC CATCAGGCCG CATCCTGAAC TGCTTCTCCA AGGACATCTA 3300 

TGTCGTTGAT GAGGTTCTGG CCCCTGTCAT CCTCATGCTG CTCAATTCCT TCTTCAAOGC 3360 

CATCTCCACT CTTGTGGTCA TCATGGCCAG CACGCCGCTC TTCACTGTGG TCATCCTGCC 3420 

CCTGGCTGTG CTCTACACCT TAGTGCAGCG CTTCTATGCA GCCACATCAC GGCAACTGAA 3480 

GCGGCTGGAA TCAGTCAGCC GCTCACCTAT CTACTCCCAC TTTTCGGAGA CAGTGACTGG 3540 

JJ TGCCAGTGTC ATCCGGGCCT ACAACCGCAG CCGGGATTTT GAGATCATCA GTGATACTAA 3600 

GGTGGATGCC AACCAGAGAA GCTGCTACCC CTACATCATC TCCAACCGGT GGCTGAGCAT 3660 

CGGAGTGGAG TTCGTGGGGA ACTGOGTGGT GCTCTTTGCT GCACTATTTG CCGTCATOGG 3720 

GAGGAGCAGC CTGAACCCGG GGCTGGTGGG CCTTTCTGTG TCCTACTCCT TGCAGGTGAC 3780 

ATTTGCTCTG AACTGGATGA TACGAATGAT GTCAGATTTG GAATCTAACA TCGTGGCTGT 3840 

GGAGAGGGTC AAGGAGTACT CCAAGACAGA GACAGAGGCG CCCTGGGTGG TGGAAGGCAG 3900 

CCGCCCTCCC GAAGGTTGGC CCCCACGTGG GGAGGTGGAG TTCCGGAATT ATTCTGTGOG 3960 

CTACCGGOCG GGCCTAGACC TGGTGCTGAG AGACCTGAGT CTGCATGTGC ACGGTGGCGA 4020 

GAAGGTGGGG ATCGTGGGCC GCACTGGGGC TGGCAAGTCT TCCATGACCC TTTGCCTGTT 4080 

CCGCATCCTG GAGGCGGCAA AGGGTGAAAT CCGCATTGAT GGCCTCAATG TGGCAGACAT 4140 

CGGCCTCCAT GACCTGCGCT CTCAGCTGAC CATCATCCCG CAGGACCCCA TCCTGTTCTC 4200 

GGGGACCCTG CGCATGAACC TGGACCCCTT CGGCAGCTAC TCAGAGGAGG ACATTTGGTG 4260 

GGCTTTGGAG CTGTCCCACC TGCACACGTT TGTGAGCTCC CAGCCGGCAG GCCTGGACTT 4320 

CCAGTGCTCA GAGGGCGGGG AGAATCTCAG CGTGGGCCAG AGGCAGCTCG TGTGCCTGGC 4380 

CCGAGCCCTG CTCCGCAAGA GCCGCATCCT GGTTTTAGAC GAGGCCACAG CTGCCATCGA 4440 

CCTGGAGACT GACAACCTCA TCCAGGCTAC CATCCGCACC CAGTTTGATA CCTGCACTGT 4500 

CCTGACCATC GCACAOCGGC TTAACACTAT CATGGACTAC ACCAGGGTCC TGGTCCTGGA 4560 
CAAAGGAGTA GTAGCTGAAT TTGATTCTCC AGCCAACCTC ATTGCAGCTA GAGGCATCTT 4620 
CTACGGGATG GCCAGAGATG CTGGACTTGC CTAAAATATA TTCCTGAGAT TTCCTCCTGG 4680 
CCTTTCCTGG TTTTCATCAG GAAGGAAATG ACACCAAATA TGTCCGCAGA ATGGACTTGA 4740 
TAGCAAACAC TGGGGGCACC TTAAGATTTT GCACCTGTAA AGTGCCTTAC AGGGTAACTG 4800 
TGCTGAATGC TTTAGATGAG GAAATGATCC CCAAGTGGTG AATGACACGC CTAAGGTCAC 4860 
AGCTAGTTTG AGCCAGTTAG ACTAGTCCCC CGGTCTCCCG ATTCCCAACT GAGTGTTATT 4920 
TGCACACTGC ACTGTTTTCA AATAACGATT TTATGAAATG ACCTCTGTCC TCCCTCTGAT 4980 
TTTTCATATT TTCCTAAAGT TTCGTTTCTG TTTTTTAATA AAAAGCTTTT TCCTCCTGGA 5040 
ACAGAAGACA GCTGCTGGGT CAGGCCACCC CTAGGAACTC AGTCCTGTAC TCTGGGGTGC 5100 
TGCCTGAATC CATTAAAAAT GGGAGTACTG ATGAAATAAA ACTACATGGT CAACAGTAAA 5160 
AAAAAAAAAA AAAAAA 5176 

Seq ID NO: CI 80 DNA Sequence 
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Nucleic Acid Accession ft: NM_004626 
Coding sequence: 124.. 1188 
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1 
I 

TAACCCGCCG 
CAGGCGQAGG 
ACGATGAGGG 
GGCGTGTGCT 
AACCAGACGC 
CGCAGCAACC 
TGTCGCCGGG 
TATTTGCTTG 
GCCGCCATCA 
TGCGGCCCCO 
AACCTCAGCT 
AAAACAGGAT 
GCTCTGCGGG 
ATCCGCACCT 
CGATACCTGT 
CCCAAGGACC 
CCTGACTTCT 
AACAAGACAT 
CCCTACACAG 
ACCTGCCGCA 
CGCCCCACGC 
GGAGACACTC 
TGCTTTGCCT 
CAGGACATCA 
GCCCACAGGC 
TTTCAGGGCC 
TGCGGGACCC 
TAGCTTGGGT 
GCTACAGAAA 
TTATATATTT 
AGCTTTTTAA 
CCCGGCCTOG 
CCAGGCA 



11 
I 

CCTCCGCTCT 
ACTTOGGCGC 
CGCGGCCGCA 
ATGGCATCAA 
AACACTGCAA 
TGGAGCTCAT 
CCTTTGCCGA 
ACCTGGAGAG 
GCCACGCCAT 
TCCCAGGTGA 
ACGGGCTCCT 
CCCAAGCCAA 
CCTCTCTGGA 
GCTGGAAGGG 
CGGCCACCAA 
TGGATATCOG 
GCATGAAGAA 
CCAACGGAAG 
ACCGCGTGGT 
GGTGTGAGCG 
AGGAGCGAGG 
CATGGAGCTC 
TCACTTGGAA 
AAGGAAACCG 
TGGTGTAAGG 
ACAGAGACCA 
CCTGGCCCAC 
TGTAAATGTT 
TATTTATAAA 
TATATATATA 
GCAGCTGTAT 
TCAAGTGAAC 



21 
I 

CCCCGGCTGC 
GGCTCCTCCT 
GGTCTGCGAG 
GTGGCTGGCG 
GCAGCTGGAG 
GCACACGGTG 
CATGOGCTGG 
AGGGACCCGG 
CGCCCGGGCC 
GCCACCCGGG 
CATGGGGGCC 
TAAACTGATG 
AATGAAGTGT 
GCTGCAGGAG 
GGTAGTGCAC 
GCCTGTGAAG 
TGAGAAGGTG 
CGACAGCTGC 
CGAGCGGTGC 
TACCGTGGAG 
ACTCTGCTCA 
TGCTTGTGAA 
GCCACCAGGA 
ACAAGATTAA 
AGCGGGGCTT 
GCCTCCGGGA 
CCTGGGGTCT 
TTCTTTTGTT 
ACATAGCTTT 
AATATATATG 
GAAATAAATG 
TCGGCAGACC 



31 
I 

AGGCGGCGTG 
GGGTGTGACC 
GCGCTGCTCT 
CTGTCCAAGA 
GGTCTGGTGT 
GTGCACGCCG 
AACTGCTCCT 
GAGTCGGCCT 
TGCACCTCCG 
CCCGGGAACC 
AAGTTTTCCG 
CGTCTACACA 
AAGTGCCATG 
CTGCAGGATG 
CGACCCATGG 
GACTCGGAAC 
GGCTCCCACG 
GACCTTATGT 
CACTGTAAGT 
CGCTATGTCT 
AGGACCCTCA 
TTCCAGATGC 
ACAGAAGGTC 
AAATAACTTG 
GGGATCGGTG 
AGGGGTCTGC 
GAGCCTGCTG 
TTTTGCTTTT 
TTCTTTGGGG 
TATATATATA 
CTGAGTGAGC 



41 
I 

CAGGACCAGC 
CCGGGCGCGC 
TCGCCCTGGC 
CACCATCGGC 
CTGCACAGGT 
CCCGCGAGGT 
CCATTGAGCT 
TCGTGTATGC 
GCGACCTGCC 
GCTGGGGAGG 
ATGCTCCTAT 
ACAGTGAAGT 
GGGTGTCTGG 
TGGCTGCTGA 
GCACCCGCAA 
TCGTCTATCT 
GGACACAAGA 
GCTGCGGGCG 
ACCACTGGTG 
GCAAGTGAGG 
GCAACTGGGG 
CAGGCATGGG 
TGGCCACCCT 
GCAGCCTGAG 
AGACTGATAC 
CCGCCTTCTT 
GGCCCACCAC 
TCTTCCTTTG 
TGGCACTTCT 
ATGATCTCTA 
CCCAGCCCGC 
CAGAGGGAGC 



51 
I 

GGCGGCCGTG 
CCGCCGCGCG 
GCTCCAGACC 
CCTGGCACTG 
GCAGCTGTGC 
CATGAAGGCC 
CGCCCCCAAC 
GCTGTCGGCC 
CGGCTGCTCC 
ATGTGCGGAC 
GAAGGTGAAA 
GGGGAGACAG 
CTCCTGCTCC 
CCTCAAGACC 
GCACCTGGTG 
GCAGAGCTCA 
CAGGCAGTGC 
TGGCTACAAC 
CTGCTACGTC 
CCCTGCCCTC 
CCAGGGGCCT 
AGGCGGCTTG 
GGAAGGAGGG 
GCTCTGGAGT 
AGACTTGACC 
CAGAATGTTC 
ATGGAATCAC 
GGATGTGGAA 
CAATTCCTCT 
TTTTAAAACT 
• CCCTGCAGTT 
TCTCCAGTTT 



Seq ID NO: C181 DNA Sequence 
Nucleic Acid Accession ft: NM_031866 
Coding sequence: 6.. 2090 



1 
I 

ACAGCATGGA 
TGCAGCGCTC 
CCGTGCCGCT 
ACGACACGCA 
AGTGCTCGCC 
ACTACAAGAA 
CGCCGCTCAT 
CCGAGCAAGG 
CCGCGCCCAG 
GCAGCGGCCA 
GTGGCGGCGG 
GGCCCCCTGG 
TGAGCGTGTC 
ACTGCGCGCT 
TCTGGATCGG 
TCCTTATCGA 
GCTACCTCTT 
CGTGCAGCGG 
GCGCGGGCGC 
TGGGCGCGGT 
TCTTCTTGCT 
TCACATGGTT 
AGTACTTCCA 
TCAGCTCGGT 
ACAACCTGCG 
TCCTGCTGGC 
GCCCCACCAA 
TCTACACCGT 
GGCGCTGGGA 
GCAGGCCCGA 
CCTCGGGCGT 
GCTGCTGCTG 
GGGGTGGCGG 
GCGGCGGCGG 
CGGCGAGCTC 
GGGGGCGCCC 
TTGATGGGCT 
TCTTAATGGT 
TGAAGCCTCC 



11 21 

I I 
GTGGGGTTAC CTGTTGGAAG 
TAGCGGCGCT GCGGCCGCCT 
GTGTAAGGGC ATCGGCTACA 
AGACGAGGCG GGCCTGGAGG 
CGATCTCAAG TTCPTCCTGT 
GCCGCTGCCG CCCTGCCGCT 
GCGCCAGTAC GGCTTCGCCT 
CAACCCTGAC ACGCTGTGCA 
CCOGCOGCGC CGCCTGCCGC 
CGGCCGCCCG CCGGGGGCCA 
GGACGCGGCG GCGCCCCCAG 
CGGCGGCGCG GCTCCCTGCG 
CAGOGAGCGC CACCCGCTCT 
GCCCTGCCAC AACCCCTTTT 
CCTGTGGTCG GTGCTCTGCT 
CATGGAGCGC TTCAAGTACC 
CGTGTCGGTG GGCTACCTAG 
TGGCGCGCCG GGCGCGGGGG 
GGCGGGCGCG GGCGCGGGCG 
GGAGCAGCAC GTGCGCTACG 
GGTCTACTTC TTCGGCATGG 
CCTGGCGGCC GGTATGAAGT 
CCTGGCCGCG TGGCTTGTGC 
GGACGGCGAC CCGGTGGCGG 
CGGCTTCGTG CTGGCGCCGC 
CGGCTTCGTG TCCCTGTTCC 
GACGCACAAG CTGGAGAAGC 
GCCCGCCGCG GTGGTGGTCG 
GGCCACGCAC AACTGCCCGT 
CTACGCCGTC TTCATGCTCA 
GTGGGTCTGG TCCGGCAAGA 
GGCCAGCAAG GGCGCCGCGG 
CGGGCCGGGG GGCGGCGGCG 
GGGCTCCCTC TACAGCGACG 
CGTGTCTTAT CCAAAGCAGA 
AGGAGGGGTG GGGAGGGGGG 
GAGGTTCCCA CCCCTTCACA 
ATCCATTAGC TGGGACTTAA 
CAGACCCAGC CCCTTTTCCT 



41 



GCCTGCGGGA 
AGTACTTCAT 
CGCTGGAGTC 
TGGGCGGGGG 
GCGGGGGACC 
TCAGCACTGG 
TGCCATTGTC 
CGAGGAGACC 
GTGTTGATTG 
ATGACTCACT 
CCATTGATGT 



GCTGGCCGCC 
GCTGGCATGC 
CATGCCCAAT 
CTGGCCGCTG 
CACGCCCATC 
GCGCGCCAAG 
CATGCGCTGC 
CCGCACCGAC 
CGGCGAGCAG 
CCGCGGAGGC 
CGGCGGTGGC 
CCAGTGCCGC 
CAAGACAGGC 
CGAGCGCGCC 
CTTCGCCACC 
CATTATCTTC 
GGCGGGCCAC 
CGCGGGCGGC 
GCGCGGCGAG 
CCCCGCGCTG 
CTGGTGGGTG 
AGCCATCGCC 
GTCCATCGCG 
CGTGGGCAAC 
CCTCTTCATC 
GGTCATCAAG 
CCTGGGCCTG 
CTACGAGCAG 
CCTGCAGCCC 
GTGCCTAGTG 
CTGGCGCTCC 
CGCGGGCGCC 
CGGCGGCGGC 
CCTGACGTGG 
CCAGGTCTGA 
CAAGTGCAGC 
CTATTAGCAT 
TAGAACAAAG 
GCGGGGAGCT 
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51 
I 

TTGGCGCTGC 
CAAGAGATCA 
CAGTTCAACC 
GTGGAGATCC 
TGCCTAGAGG 
GCCGGCTGCG 
GACCGGCTGC 
CTAACCACCG 
CCGCCTTCGG 
GGCAGGGGCG 
GGGAAGGCGC 
GCGCCTATGG 
CAGATCGCTA 
TTCACCGTCT 
GTCTCCACCT 
CTCTCGGCCT 
GAGAAGGTGG 
GCGGCGGCGG 
TACGAGGAGC 
TGCACCGTGG 
ATCTTGTCGC 
GGCTACTCGC 
GTGCTGGCGC 
CAGAGCCTGG 
GGCACCATGT 
CAACAGGACG 
TTCACCGTGC 
CACAACCGCC 
GACCAGGCAC 
GTGGGCATCA 
CTGTGCACCC 
ACGGCCGCGG 
GGGGGGCCGG 
CGGTCGGGCA 
GCGGAGGGGA 
GAAGGGACAC 
GATAATGAAC 
TACCTGGCAT 
CCTCCCGCCA 
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CGCGTTAATT TCTGTTGGCT GAGGAGGGTG GACTCTGCGG CGTTTCCAGA ACCCGAGATT 2400 

TGGAGCCCTC CCTGGCTGCA CTTGGCTGGG TTTGCAGTCA GATACACAGA TTTCACCTGG 2460 

GAGAACCTCT TTTTCTCCCT CGACTCTTCC TACGTAAACT CCCACCCCTG ACTTACCCTG 2520 

GAGGAGGGGT GACCGCCACC TGATGGGATT GCACGGTTTG GGTATTCTTA ATGACCAGGC 25B0 

AAATGCCTTA AGTAAACAAA CAAGAAATGT CTTAATTATA CACCCCACGT AAATACGGGT 2640 

TTCTTACATT AGAGGATGTA TTTATATAAT TATTTGTTAA ATTGTAAAAA AAAAAAGTGT 2700 

AAAATATGTA TATATCCAAA GATATAGTGT GTACATTTTT TTGTAAAAAG TTTAGAGGCT 2760 

TACCCCTGTA AGAACAGATA TAAGTATTCT ATTTTGTCAA TAAAATGACT TTTGATAAAT 2820 

GATTTAACCA TTGCCCTCTC CCCCGCCTCT TCTGAGCTGT CACCTTTAAA GTGCTTGCTA 2880 

AGGACGCATG GGGAAAATGG ACATTTTCTG GCTTGTCATT CTGTACACTG ACCTTAGGCA 2940 

TGGAGAAAAT TACTTGTTAA ACTCTAGTTC TTAAGTTGTT AGCCAAGTAA ATATCATTGT 3000 

TGAACTGAAA TCAAAATTGA GTTTTTGCAC CTTCCCCAAA GACGGTGTTT TTCATGGGAG 3060 

CTCTTTTCTG ATCCATGGAT AACAACTCTC ACTTTAGTGG ATGTAAATGG AACTTCTGCA 3120 

AGGCAGTAAT TCCCCTTAGG CCTTGTTATT TATCCTGCAT GGTATCACTA AAGGTTTCAA 3180 

AACCCTGAAA AAAAA 3195 

Seq ID NO: C182 DNA Sequence 
Nucleic Acid Accession #? XM_OS0625 
Coding sequence: 222.. 1109 



1 11 21 31 41 51 

I I I I I I 

CCGGGTCGGA GCCCCCCGGA GCTGCGCGCG GGCTTGCAGC GCCTCGCCCG CGCTGTCCTC 60 

CCGGTGTCCC GCTTCTCCGC GCCCCAGCCG CCGGCTGCCA GCTTTTCGGG GCCCCGAGTC 120 

Lj GCACCCAGCG AAGAGAGCGG GCCCGGGACA AGCTCGAACT CCGGCCGCCT CGCCCTTCCC 180 

CGGCTCCGCT CCCTCTGCCC CCTCGGGGTC GCGCGCCCAC GATGCTGCAG GGCCCTGGCT 240 

CGCTGCTGCT GCTCTTCCTC GCCTCGCACT GCTGCCTGGG CTCGGCGCGC GGGCTCTTCC 300 

TCTTTGGCCA GCCCGACTTC TCCTACAAGC GCAGCAATTG CAAGCCCATC CCTGCCAACC 3 GO 

^ TGCAGCTGTG CCACGGCATC GAATACCAGA ACATGCGGCT GCCCAACCTG CTGGGCCACG 420 

J\) AGACCATGAA GGAGGTGCTG GAGCAGGCCG GCGCTTGGAT CCCGCTGGTC ATGAAGCAGT 480 

GCCACCCGGA CACCAAGAAG TTCCTGTGCT CGCTCTTCGC CCCCGTCTGC CTCGATGACC 540 

TAGACGAGAC CATCCAGCCA TGCCACTCGC TCTGCGTGCA GGTGAAGGAC CGCTGCGCCC 600 

CGGTCATGTC CGCCTTCGGC TTCCCCTGGC CCGACATGCT TGAGTGCGAC CGTTTCCCCC 660 

~r AGGACAACGA CCTTTGCATC CCCCTCGCTA GCAGCGACCA CCTCCTGCCA GCCACCGAGG 720 

D J AAGCTCCAAA GGTATGTGAA GCCTGCAAAA ATAAAAATGA TGATGACAAC GACATAATGG 780 

AAACGCTTTG TAAAAATGAT TTTGCACTGA AAATAAAAGT GAAGGAGATA ACCTACATCA 840 

ACCGAGATAC CAAAATCATC CTGGAGACCA AGAGCAAGAC CATTTACAAG CTGAACGGTG 900 

TGTCCGAAAG GGACCTGAAG AAATCGGTGC TGTGGCTCAA AGACAGCTTG CAGTGCACCT 960 

Af . GTGAGGAGAT GAACGACATC AACGCGCCCT ATCTGGTCAT GGGACAGAAA CAGGGTGGGG 1020 

4U AGCTGGTGAT CACCTCGGTG AAGCGGTGGC AGAAGGGGCA GAGAGAGTTC AAGCGCATCT 1080 

CCCGCAGCAT CCGCAAGCTG CAGTGCTAGT CCCGGCATCC TGATGGCTCC GACAGGCCTG 1140 

CTCCAGAGCA CGGCTGACCA TTTCTGCTCC GGGATCTCAG CTCCCGTTCC CCAAGCACAC 1200 

TCCTAGCTGC TCCAGTCTCA GCCTGGGCAG CTTCCCCCTG CCTTTTGCAC GTTTGCATCC 1260 

CCAGCATTTC CTGAGTTATA AGGCCACAGG AGTGGATAGC TGTTTTCACC TAAAGGAAAA 1320 

4-> GCCCACCCGA ATCTTGTAGA AATATTCAAA CTAATAAAAT CATGAATATT TTTATGAAGT 1380 

TT 1382 

Seq ID NO: C183 DNA Sequence 
Nucleic Acid Accession *: NM_001306.1 
J\J Coding sequence: 19 9.. 8 61 

1 11 21 31 41 51 

I I I I I I 

AATTCGGCAC GAGGGCAGGT GCAGGCGCAC GOGGCGAGAG CGTATGGAGC CGAGCCGTTA 60 

33 GCGCGCGCCG TCGGTGAGTC AGTCCGTCCG TCCGTCCGTC CGTCGGGGCG CCGCAGCTCC 120 

CGCCAGGCCC AGCGGCCCCG GCCCCTCGTC TCCCCGCACC CGGAGCCACC CGGTGGAGCG 180 

GGCCTTGCCG CGGCAGCCAT GTCCATGGGC CTGGAGATCA CGGGCACCGC GCTGGCCGTG 240 

CTGGGCTGGC TGGGCACCAT CGTGTGCTGC GCGTTGCCCA TGTGGCGCGT GTCGGCCTTC 300 

^ n ATCGGCAGCA ACATCATCAC GTCGCAGAAC ATCTGGGAGG GCCTGTGGAT GAACTGCGTG 360 

OU GTGCAGAGCA CCGGCCAGAT GCAGTGCAAG GTGTAOGACT CGCTGCTGGC ACTGCCACAG 420 

GACCTTCAGG CGGCCCGCGC CCTCATCGTG GTGGCCATCC TGCTGGCCGC CTTCGGGCTG 480 

CTAGTGGCGC TGGTGGGCGC CCAGTGCACC AACTGCGTGC AGGACGACAC GGCCAAGGCC 540 

AAGATCACCA TCGTGGCAGG CGTGCTGTTC CTTCTCGCCG CCCTGCTCAC CCTCGTGCCG 600 

srr GTGTCCTGGT CGGCCAACAC CATTATCCGG GACTTCTACA ACCCCGTGGT GCCCGAGGCG 660 

UJ CAGAAGCGCG AGATGGGCGC GGGCCTGTAC GTGGGCTGGG CGGCCGCGGC GCTGCAGCTG 720 

CTGGGGGGCG CGCTGCTCTG CTGCTCGTGT CCCCCACGCG AGAAGAAGTA CACGGCCACC 780 

AAGGTCGTCT ACTCGGCGCC GCGCTCCACC GGCCCGGGAG CCAGCCTGGG CACAGGCTAC 840 

GACCGCAAGG ACTACGTCTA AGGGACAGAC GCAGGGAGAC CCCACCACCA CCACCACCAC 900 

„ CAACACCACC ACCACCACCG CGAGCTGGAG CGOGCACCAG GCCATCCAGC GTGCAGCCTT 960 

/U GCCTCGGAGG CCAGCCCACC CCCAGAAGCC AGGAAGCCCC CGCGCTGGAC TGGGGCAGCT 1020 » 

TCCCCAGCAG CCACGGCTTT GCGGGCCGGG CAGTCGACTT CGGGGCCCAG GGACCAACCT 1080 

GCATGGACTG TGAAACCTCA CCCTTCTGGA GCACGGGGCC TGGGTGACCG CCAATACTTG 1140 

ACCACCCOGT CGAGCCCCAT CGGGCCGCTG CCCCCATGTC GCGCTGGGCA GGGACCGGCA 1200 

GCCCTGGAAG GGGCACTTGA TATTTTTCAA TAAAAGCCTC TCGTTTTAGC 1250 



Seq ID NO: Cl 84 DNA Sequence 

Nucleic Acid Accession 8: NM_012449.l 

Coding sequence: 66-. 1085 

1 11 21 31 41 51 

L 1 1 1 1 i 

CCGAGACTCA CGGTCAAGCT AAGGCGAAGA GTGGGTGGCT GAAGCCATAC TATTTTATAG 60 
AATTAATGGA AAGCAGAAAA GACATCACAA ACCAAGAAGA ACTTTGGAAA ATGAAGCCTA 120 
GGAGAAATTT AGAAGAAGAC GATTATTTGC ATAAGGACAC GGGAGAGACC AGCATGCTAA 180 
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AAAGACCTGT GCTTTTGCAT TTGCACCAAA CAGCCCATGC TGATGAATTT GACTGCCCTT 240 

CAGAACTTCA GCACACACAG GAACTCTTTC CACAGTGGCA CTTGCCAATT AAAATAGCTG 300 

CTATTATAGC ATCTCTGACT TTTCTTTACA CTCTTCTGAG GGAAGTAATT CACCCTTTAG 360 

CAACTTCCCA TCAACAATAT TTTTATAAAA TTCCAATCCT GGTCATCAAC AAAGTCTTGC 420 

CAATGGTTTC CATCACTCTC TTGGCATTGG TTTACCTGCC AGGTGTGATA GCAGCAATTG 480 

TCCAACTTCA TAATGGAACC AAGTATAAGA AGTTTCCACA TTGGTTGGAT AAGTGGATGT 540 

TAACAAGAAA GCAGTTTGGG CTTCTCAGTT TCTTTTTTGC TGTACTGCAT GCAATTTATA 600 

GTCTGTCTTA CCCAATGAGG CGATCCTACA GATACAAGTT GCTAAACTGG GCATATCAAC 660 

AGGTCCAACA AAATAAAGAA GATGCCTGGA TTGAGCATGA TGTTTGGAGA ATGGAGATTT 720 

ATGTGTCTCT GGGAATTGTG GGATTGGCAA TACTGGCTCT GTTGGCTGTG ACATCTATTC 780 

CATCTGTGAG TGACTCTTTG ACATGGAGAG AATTTCACTA TATTCAGAGC AAGCTAGGAA 840 

TTGTTTCCCT TCTACTGGGC ACAATACACG CATTGATTTT TGCCTGGAAT AAGTGGATAG 900 

ATATAAAACA ATTTGTATGG TATACACCTC CAACTTTTAT GATAGCTGTT TTCCTTCCAA 960 

TTGTTGTCCT GATATTTAAA AGCATACTAT TCCTGCCATG CTTGAGGAAG AAGATACTGA 1020 

AGATTAGACA TGGTTGGGAA GACGTCACCA AAATTAACAA AACTGAGATA TGTTCCCAGT 1080 

TGTAGAATTA CTGTTTACAC ACATTTTTGT TCAATATTGA TATATTTTAT CACCAACATT 1140 

TCAAGTTTGT ATTTGTTAAT AAAATGATTA TTCAAGGAAA AAAAAAAAAA AAAAA 1195 

Seq ID NO: C185 DNA Sequence 

Nucleic Acid Accession fl: NM_001775.1 

Coding sequence : 70 . . 972 

1 11 21 31 41 51 

I I I I I I 

CTAAAGCTCT CTTGCTGCCT AGCCTCCTGC CGGCCTCATC TTCGCCCAGC CAACCCCGCC 60 

TGGAGCCCTA TGGCCAACTG CGAGTTCAGC CCGGTGTCCG GGGACAAACC CTGCTGCCGG 120 

CTCTCTAGGA GAGCCCAACT CTGTCTTGGC GTCAGTATCC TGGTCCTGAT CCTCGTCGTG 180 

GTGCTCGCGG TGGTCGTCCC GAGGTGGCGC CAGACGTGGA GCGGTCCGGG CACCACCAAG 240 

CGCTTTCCCG AGACCGTCCT GGCGCGATGC GTCAAGTACA CTGAAATTCA TCCTGAGATG 300 

AGACATGTAG ACTGCCAAAG TGTATGGGAT GCTTTCAAGG GTGCATTTAT TTCAAAACAT 360 

CCTTGCAACA TTACTGAAGA AGACTATCAG CCACTAATGA AGTTGGGAAC TCAGACCGTA 420 

CCTTGCAACA AGATTCTTCT TTGGAGCAGA ATAAAAGATC TGGCCCATCA GTTCACACAG 480 

GTCCAGCGGG ACATGTTCAC CCTGGAGGAC ACGCTGCTAG GCTACCTTGC TGATGACCTC 540 

ACATGGTGTG GTGAATTCAA CACTTCCAAA ATAAACTATC AATCTTGCCC AGACTGGAGA 60 0 

AAGGACTGCA GCAACAACCC TGTTTCAGTA TTCTGGAAAA CGGTTTCCCG CAGGTTTGCA 660 

GAAGCTGCCT GTGATGTGGT CCATGTGATG CTCAATGGAT CCGGCAGTAA AATCTTTGAC 720 

AAAAACAGCA CTTTTGGGAG TGTGGAAGTC CATAATTTGC AACCAGAGAA GGTTCAGACA 780 

CTAGAGGCCT GGGTGATACA TGGTGGAAGA GAAGATTCCA GAGACTTATG CCAGGATCCC 840 

ACCATAAAAG AGCTGGAATC GATTATAAGC AAAAGGAATA TTCAATTTTC CTGCAAGAAT 900 

ATCTACAGAC CTGACAAGTT TCTTCAGTGT GTGAAAAATC CTGAGGATTC ATCTTGCACA 960 

TCTGAGATCT GAGCCAGTCG CTGTGGTTGT TTTAGCTCCT TGACTCCTTG TGGTTTATGT 1020 

CATCATACAT GACTCAGCAT ACCTGCTGGT GCAGAGCTGA AGATTTTGGA GGGTCCTCCA 1080 

CAATAAGGTC AATGCCAGAG ACGGAAGCCT TTTTCCCCAA AGTCTTAAAA TAACTTATAT 1140 

CATCAGCATA CCTTTATTGT GATCTATCAA TAGTCAAGAA AAATTATTGT ATAAGATTAG 1200 

AATGAAAATT GTATGTTAAG TTACTTCCTT TAG 1233 

Seq ID NO: Cl 8 6 DNA Sequence 

Nucleic Acid Accession #: XM_120513.2 

Coding sequence: 1..2208 

1 11 21 31 41 51 

I I I I I I 

ATGGTGTCAT GCACGTTCTC GGGGCCCCTA CGGGAAACAA ATGAAAAOGT GAAAAAGTTC 60 

TACGCCTTGC GAGCTTTTAT GTTCCGCATG AGCTCAGAGG CCGCGATGCT CGGGGAAAGC 120 

AGGACCCCAA AGCCCCGTAA ACACCGCGCG ACCACCCGGG CCAAGATCTT CAAGAGGTTC 180 

TTTTCAGAAG GATCGGAGAG CAATTCCCGA TTGGTAGAAG AACTTGCTGT AATACACACG 240 

TACTCTGACG ACCCCGCCCC AACGACTAGC CCCTCCTCTG TGCAACCCCG AGAGTTTGGG 300 

GTCATGCAGG GGGCGCCACG AGCTCGTTTC GGAAGCCGGA CCCCGCCCGC AGCCGCAGAA 360 

GCCTCGAGTC CACATCTGGG CATTGGCGAG GCAGCCTGTC AATCAGGAGC TCGGGCGGCA 420 

GCCCCCCGCG CGGGGGCTCG GCGATGCCAG CCTCAGCGAC AGGCGGCGGC GGCGGCGGCC 480 

ACGGCACAGA CACACACCCT CCCACACGCG CGCACCAGGG CAGACCCGGC GGGCAGGCGG 540 

CGGAGGCACC CTCGGAGCCC GGCGCCCGGC GGGGAGGGGA CGTGCTCCGA GGGACCGGCC 600 

CCGAGGCGCC GGATGGAGGA AGAGATGCAG CCGGCAGAGG AGGGGCCCAG CGTCCCCAAA 660 

ATCTACAAGC AGCGCAGCCC CTACAGCGTC CTCAAGACGT TCCCCAGCAA GAGACCGGCG 720 

CTGGCCAAGC GCTACGAGCG ACCCACCCTG GTGGAGCTGC CGCACGGCCA CTTGAGGACT 780 

CCGGCGCAGC CGCCGCCCGC GTCCCCCGCC GCCTCCTCGT CGTCTTCGTT CGCCGCTGTC 840 

GTCAGACTCG GGGCTCCTCC GCGTCCGCCA CGCCGTGGAT TTCGGGCGCG GGGAACCATC 900 

CCGCCCCTCC TTCCTGCCCC GGGAGTCGCA GGCACTCTGC TCCCACCGCC CACGTCGTCG 960 

TCCCCGCCAT CCCCTCGTCC CCGTCCGTGG CACGCTGCCG CGCCGCGGGG AGGGACCTCA 1020 

CATACACATA TGTGGAGGTC CCAGTCCACA CTTCCAGGAT CTGACACCAT GGTCTCTGTC 1080 

TTTGGATTGA TGGCTCAGAG AAGATGGCAG CATAGATCTT TAAAGCAGTT TGAGTGGGGA 1140 

ATTCTTGGAT CTTGGGGTAC TTGGCCATGT GGACAGGATT GGCTGGAGAA GGAGGGTCAG 1200 

GTGGCGGTCC TGCTGCCAAG GTCTGAGGGT AATACTGCTC CTAAGAAGAG TCGAATGATC 1260 

TTGGATGCCT TTGCCCAGCA GTGCAGTCGA GTTCTTAGCC TCTTAAATTG TGGAGGAAAA 1320 

CTCCTGGACT CCAACCATTC TCAGTCCATG ATTTCTTGCG TAAAGCAGGA AGGCTCAAGT 13 B0 

TACAACGAAA GACAGGAGCA CTGTCACATT GGGAAAGGGG TCCACAGTCA GACCTCAGAC 1440 

AATGTAGACA TAGAGATGCA GTATATGCAA AGGAAACAAC AAACTTCTGC CTTTTTGAGG 1500 

GTTTTCACTG ACTCTCTACA AAATTACCTG CTCTCGGGAA GCTTTCCAAC TCCAAACCCC 1560 

TCGTCAGCCA GTGAATATGG CCATCTGGCC GACGTGGATC CTCTGTCAAC CTCTCCTGTG 1620 

CATACATTAG AAAATATTTC ACTTGATTCC ACAGCTTCCC TGTGTAAATC TAGGCATCTA 1680 

TCCAGAGAGC CCCCAGTCAA GAGTGATTTT CCAAATCCTT TGCAGCAGGC CTTGGCTGGG 1740 

GGTGCTTCAA GACCATTTTC AGGGGCACAG CAAAGCATCG CTTACAGGGT GAACTCTGAA 1800 

CTTGAGGATG GCATCCGCAG CCCCGTCCCT TTGAGTTGTG AGGCCTTGGA AATGGATTTG 1860 

ACCTCCTTGG GAAGCAAGCA GCTGTTGAAC AACTATCCTG TCTACATAAC GAGCAAACAG 1920 
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TGGGATGA6G CTGTAAATTC TTCAAAGAAA GATGGGAGAC GGCTCCTTCG ATACCTCATC 1980 

AQATTTGTTT TCACAACCGA TGAGCTTAAG TACTCATGCG GCCTTGGGAA AAGGAAAAGG 2040 

TCAGTGCAGT CAGGAGAGAC AGGTCCCGAA AGACGCCCTC TGGATCCAGT TAAAGTAACA 2100 

TGCCTCCGAG GTACTGCATC CTTCCGCTCA GTGTCACCAT CTGTGATCTC ATTICA CCGC 2160 

5 ATTGGCTGTG GCTCTCCCCG TACAAGTGTT CAGCCTTCTG TATTTTGA 2208 
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Seq ID NO: CI 87 DNA Sequence 
Nucleic Acid Accession #: AB037745.1 
Coding sequence: 2G..1744 

1 11 21 31 41 51 

I I I I I I 

ATGGTGGAAC ACGCTGCCCA CAAACATGGA AACGACCGTT CTCAGTGGGA TCAACTTCGA 60 

GTACAAGGGC ATGACAGGCT GGGAGGTGGC TGGTGATCAC ATTTACACAG CTGCTGGAGC 120 

CTCAGACAAT GACTTCATGA TTCTCACTCT GGTTGTGCCA GGATTTAGAC CTCCGCAGTC 180 

GGTGATGGCA GACACAGAGA ATAAAGAGGT GGCCAGAATC ACATTTGTCT TTGAGACCCT 240 

CTGTTCTGTG AACTGTGAGC TCTACTTCAT GGTGGGTGTG AATTCTAGGA CCAACACTCC 300 

TGTGGAGACG TGGAAAGGTT CCAAAGGCAA ACAGTCCTAT ACCTACATCA TTGAGGAGAA 3 SO 

CACTACCACG AGCTTCACCT GGGCCTTCCA GAGGACCACT TTTCATGAGG CAAGCAGGAA 420 

GTACACCAAT GACGTTGCCA AGATCTACTC CATCAATGTC ACCAATGTTA TGAATGGCGT 480 

GGCCTCCTAC TGCCGTCCCT GTGCCCTAGA AGCCTCTGAT GTGGGCTCCT CCTGCACCTC 540 

TTGTCCTGCT GGTTACTATA TTGACCGAGA TTCAGGAACC TGCCACTCCT GCCCCCCTAA 600 

CACAATTCTG AAAGCCCACC AGCCTTATGG TGTCCAGGCC TGTGTGCCCT GTGGTCCAGG 6G0 

GACCAAGAAC AACAAGATCC ACTCTCTGTG CTACAATGAT TGCACCTTCT CACGCAACAC 720 

TCCAACCAGG ACTTTCAACT ACAACTTCTC CGCTTTGGCA AACACOGTCA CTCTTGCTGG 780 

AGGGCCAAGC TTCACTTCCA AAGGGTTGAA ATACTTCCAT CACTTTACCC TCAGTCTCTG 840 

TGGAAACCAG GGTAGGAAAA TGTCTGTGTG CACCGACAAT GTCACTGACC TCCGGATTCC 900 

TGAGGGTGAG TCAGGGTTCT CCAAATCTAT CACAGCCTAC GTCTGCCAGG CAGTCATCAT 960 

CCCCCCAGAG GTGACAGGCT ACAAGGCCGG GGTTTCCTCA CAGCCTQTCA GCCTTGCTGA 1020 

TCGACTTATT GGGGTGACAA CAGATATGAC TCTGGATGGA ATCACCTCCC CAGCTGAACT 1080 

TTTCCACCTG GAGTCCTTGG GAATACCGGA CGTGATCTTC TTTTATAGGT CCAATGATGT 1140 

GACCCAGTCC TGCAGTTCTG GGAGATCAAC CACCATCCGC GTCAGGTGCA GTCCACAGAA 1200 

AACTGTCCCT GGAAGTTTGC TGCTGCCAGG AACGTGCTCA GATGGGACCT GTGATGGCTG 1260 

CAACTTCCAC TTCCTGTGGG AGAGCGCGGC TGCTTGCCCG CTCTGCTCAG TGGCTGACTA 1320 

CCATGCTATC GTCAGCAGCT GTGTGGCTGG GATCCAGAAG ACTACTTACG TGTGGCGAGA 1380 

ACCCAAGCTA TGCTCTGGTG GCATTTCTCT GCCTGAGCAG AGAGTCACCA TCTGCAAAAC 1440 

CATAGATTTC TGGCTGAAAG TGGGCATCTC TGCAGGCACC TGTACTGCCA TCCTGCTCAC 1500 

CGTCTTGACC TGCTACTTTT GGAAAAAGAA TCAAAAACTA GAGTACAAGT ACTCCAAGCT 1560 

GGTGATGAAT GCTACTCTCA AGGACTGTGA CCTGCCAGCA GCTGACAGCT GCGCCATCAT 1620 

GGAAGGCGAG GATGTAGAGG ACGACCTCAT CTTTACCAGC AAGAAGTCAC TCTTTGGGAA 1680 

GATCAAATCA TTTACCTCCA AGCAGCCAGC TCCTGTCACC ATCTCTCTTT CAGAGGACTC 1740 

CTGATGGATT TGACTCAGTG CCGCTGAAGA CATCCTCAGG AGGCCCAGAC ATGGACCTGT 1800 

GAGAGGCACT GCCTGCCTCA CCTGCCTCCT CACCTTGCAT AGCACCTTTG CAAGCCTGCG 1860 

GCGATTTGGG TGCCAGCATC CTGCAACACC CACTGCTGGA AATCTCTTCA TTGTGGCCTT 1920 

ATCAGATGTT TGAATTTCAG ATCTTTTTTT ATAGAGTACC CAAACCCTCC TTTCTGCTTG 1980 

CCTCAAACCT GCCAAATATA CCCACACTTT GTTTGTAAAT TATGCCCTTG CTTGTATCTT ' 2040 

GTTTCCCAAA ATGGCCCATC CGCCAGAGCC ATAGCTTCGT CTGCTCATAA TTCTTATAGC 2100 

TTTGGAATGA AAATATTTCT ATCTTCTTAA GTATAGAAAC TATTTCCTCT GTCCTCTAAC 2160 

TTAAGGGCAG AAACAGCTGG GAGTTTTCCT CGCATGCCCT CAGCTCATGA TCTCTTCAGG 2220 

AGAGAGGCTG GGTGAGGAGG GTGTCGGGGT TCCCTGGTGG ATAATCTTCA TAGCAGCCTG 2280 

GATCCATTTC CCCTGGATAA CCAGCTCAAA GGGAGTGAAA ATGGTAGTCT GAGGGCAAGG 2340 

GGAGCAAGGC CTGGGTAAGA AAAGCCTTGA AAAGCATAAA AAGAGGCCGG GCGCGGTGGC 2400 

TCACGCCTGT AATCCCAGCA CTTTGGGAGG CCGAGGCGGG CAGATCATGA GGTCGGGAGA 2460 

TTGAGACCAT CCTGGCTAAC ACGGTGAAGC CCCGTCTCTA CTGGAAATAC AAAAAATTAG 2520 

CCGGGCGTGG TGGCGGGTGC CTGTGGTCCC AGCTACTCGG GAGGCTGAGG CGGGAGAATA 2S80 

GCGTGGGCCT GGAAGGCGGA GCTTGCAGTG AGCOGAGATC GCGCCACTGC ACTCCATCCA 2640 

GCCTGGGTGA CAGAGTGAGA CTCTGCCTCA AAAAAAAAAA AAAAAAAGAA AAGCACAAAG 2700 

AGAGGCAACA AGGAATGTTT TTGTTTTTGA GACAGGCTCT CACTCTGTCA CCTAGGCTGG 2760 

AGTGCAGTGG CGTAATCACT GTTCAGTGCA GCCTCAAGCT CTTGGGCTCA GGCTATCCTC 2820 

CCATCTCAGC CTCTCAAGTA GCTGGGACTA CGAGTGTGCA CCACCAGGCT CACTAATTTT 2880 

TGTGTTTTTT GTAGACACGG GGTTTCACCG TGTTGCCCAG GCTGGTCTCC AACTCCTGGG 2940 

CTCAAGTGAT CTGTCCGCCT CGGCCTCCCA AACTGCTGGG ATTACAGGCA TAAGCCACTG 3000 

CACTCAGCCT TTTATTTGTT TTTTAAACCA CGTAGCTCAT TGCCTTCTCT TAAGTAAATG 3060 

ATAGATATTC TCACTGAAGC CAAAGGAATA AGTTCATCAA GAAAATGCCC AAAGCCCTGG 3120 

TGGATACATC CTCCCTATCT TTTTTTTAAA CCTTCCACTA TCACTCTATG ACACTGAAAA 3180 

GAACCAGGTA AGCCCCAAAC CCAGATGTTC CAGCCTTATC CTCTATTGGG TTTACCCACA 3240 

GACATAGCAA ACCCTGTCAG TGAGGAAAAT TCCCCATCCT TGAGTGCCCC CGTCCTAGAA 3300 

GTTTGGGCCA TATTATGGAA CAGGGGTCTC TTATTTGAAA AGAGCACAAG GAGGCCAAGA 3360 

TTTTAATGGG GCACTTTAGG GGATACAGCC CACAATGGCA TGGGCCTGAG GTGGCCGTGA 3420 

TGTCTGCTTC TAAGCTTAAC GCATCTGCTC AGGCACAGAA TAAACGTCTA GGCTGGCCAA 3480 

AAAAGGAACT GAATCCCAGG CCCATACGCC AGCACCAGAA TCAAACCAGT CTTCAAGGAA 3540 

GGAAGGCTAG GAGAGTTTAA CAAGATTTTC ACTGGGCCCA GCATGGTGGC TCACACCTGT 3600 

AATCCCAAGG CAGAATGGTG GCTTGAGCTC AGGAGTTCAA GACCAGCCTG GGCAACACAG 3660 

TGAGACCCTG TCTCTAAAAA ATTTAAAAAT AAACAAGGTG TTCACCAAGC TGGGATACTT 3720 

CTCACTATTA AGCCCCTATC TTTCTCTTTT TTTCATTCTC AATTGCTTTG TGTGATAAAA 3780 

AACTAAAGAG ACTTCTGGTC CAATTTCTGG CAACATCCCT TCTGAAAGGT GAGTAGAGTG 3840 

GGTGTCTTCT ATGCCCATTT TCCCCAATTT TACACAAACT ATTATCAATG AACTTTTAAG 3900 

TACCTAGAAT GGGTAAAACC AGAGCAAGAC TTTAAATTAC CTTCTTCTTT CTTCTACTGG 3960 

CAGTTCTGCC TCCATCACTA TCAGGCTAGG GTGACCTTCC CTTGGTCAAG CCCCAATTGC 4020 

CCATGATTTG TGCCTGTGCC CTTTCTCCAG TGACCATTTG GTGACCAGAT GGTAGATATA 4080 

GAAAGGGGAT GGCATTTGCA AGTGACTAGT CTGCCACAAA ATGCTCATCT GATTAGCCAC 4140 

TGCTGCCCTG GCAATGGCTT TGTAAGAGTC AATGAGAACT AGAGCCAGGC TGTGGTCCCT 4200 

GGCCATCAAC -AGTGTTGGTG ACGGCAGGGA GTCCCTTTGG TTTAATAAAT CCAGTTTTTC 4260 

TTTGGGTATC CAAATTCTCC CCTCCTTTTG TAGGAGTCAG GCTCTCAGAA CCTGTGTCCA 4320 
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TGTTGGAACT TCCCCCAGTG TGGATGCAGA TACGCAGCTC CTGAGCTCCA GCCTAAAGTC 4380 

TTCTGTAGCC TCAGCAATAC TTGGGCACCT GCTGTCTCAC TGAATAGCTT TCTTTTGTGA 4440 

CAAAGGCCAC AGACAGCCCT TAGACTATTC CGGAAACAGT AGGAAAAATT ACATATGTCT 4500 

TTGACTTCTT TATTCTGACT CCACTGATTT TAGCCATAAT ACTTTAAGGA GCTACTTTTT 4560 

ACTACCCCTT ACCGTGCTGA CTTCTGCAGG TCTGCCCTGT GACCTGTCAG GAACTCCTGA 4620 

GTTACGCTAC TGGGGTCACC TGTTGCTCCC CTAGCAAGTT AGGCATGTCA TATATTTTTA 4680 

ACAGCTTTAT TGAGATATAA TTCACATATT ATACAATTCA CCTTTAAAAC ATACGATTCA 4740 

ATGGTTTTCA GCAAACTCAC AGAGTTGTCC GCCCACTTGA GAGCAAACAC ATGTTCAATT 4800 

TTCTTTTCCT TTTTTTTTTT GAGACAGAGT CAGCTTTGTC GCCCAGGCTG GAGTGCAGTG 4860 

CCATGATCTT GGCTCACTGC AGCCTCCCCA TCCTGGGTTC AAGTGATCCT TCTGC TTCAG 4920 

CCTCCCCAGT AGCTGGGATT ACAGGCATGC GCCACCACGC CTAGCTAATT TTTGTGTTTT 4980 

TAGTAGAGAT GGGGTTTCAC CGTGTTGGCC AGGCTGGTCT CAAACTCCTG GACTCAAGTG 5040 

ATCCACCCAC CTCGGCCTCC CAAAGTGCTG GGATTGCAGG TGTGAGCCAC CGTGCCTGGC 5100 

CTACGTGTTC AATTTTCTAT GAACAAAGGC TTTAGTCCTT GACCCAGGGC TAAAGTGGTC 5160 

TGTCCAAGCT GTTGTTGGTA GAGGGAGTAT GATAAAATGT TTAAATCTCA TTTGGTTACC 5220 

TTGAGTCCTG GAACACGCAQ TAACTGTCAT GCTATAGTCA TCATCTGTAT TTGGCTGGGA 5280 

ATACAAATGA AGATTGTGGT GTATTCAAGC AGTAGGGTTT TTGCTTTTGT TTTTGTTTTA S340 

GTGCCAACAA AACTTTTTTT TGTCTGACTA CATTAAAGAT AAGACTGACT ATATTTATAC 5400 

AACAGAAACT TTGTAATAGA TTTTTTCAGC TTTGTGAAAT CGAATTTTTT TTCATCAGGG S460 

CTGGTTGGAT TTCCTTTTTA CCCTGTAATC CAAGCGTTAA TAGTTTGTTA GAAGATGGGT 5520 

TATTGCATGT CACTTTTTTT TTTTTGTAAA ATAAAAACAT ACCTTAC 5567 

Seq ID NO: CI 8 8 DNA Sequence 

Nucleic Acid Accession ft: NM_014324.1 

Coding sequence: 89.. 12 37 

1 11 21 31 41 51 

I I.I I I I 

GGCGCCGGGA TTGGGAGGGC TTCTTGCAGG CTGCTGGGCT GGGGCTAAGG GCTGCTCAGT 60 

TTCCTTCAGC GGGGCACTGG GAAGCGCCAT GGCACTGCAG GGCATCTCGG TCGTGGAGCT 120 

GTCCGGCCTG GCCCCGGGCC GTNTCTGTGC TATGGTCCTG GCTGACTTCG GGGCGCGTGT 180 

GGTACGCGTG GACCGGCCCG GCTCCCGCTA CGACGTGAGC CGCTTGGGCC GGGGCAAGCG 240 

CTCGCTAGTG CTGGACCTGA AGCAGCCGCG GGAGCCGCGT GCTGCGGCGT CTGTGCAAGC 300 

GGTCGGATGT GCTGCTGGAG CCCTTCCGCC GCGGTGTCAT GGAGAAACTC CAGCTGGGCC 360 

CAGAGATTCT GCAGCGGGAA AATCCAAGGC TTATTTATGC CAGGCTGAGT GGATTTGGCC 420 

AGTTCAGGAA AGCTTCTGCC GGTTAGCTGG CCACGATATC AACTATTTGG CTTTGTCAGG 480 

TGTTCTCTCA AAAATTGGCA GAAGTGGTGA GAATCCGTAT GCCCCGCTGA ATCTCGTGGC 540 

TGACTTTGCT GGTGGTGGCC TTATGTGTGC ACTGGGCATT ATAATGGCTC TTTTTGACCG 600 

CACACGCACT GACAAGGGTC AGGTCATTGA TGCAAATATG GTGGAAGGAA CAGCATATTT 660 

AAGTTCTTTT CTGTGGAAAA CTCAGAAATC GAGTCTGTGG GAAGCACCTC GAGGACAGAA 720 

CATGTTGGAT GGTGGAGCAC CTTTCTATAC GACTTACAGG ACAGCAGATG GGGAATTCAT 780 

GGCTGTTGGA GCAATAGAAC CCCAGTTCTA CGAGCTGCTG ATCAAAGGAC TTGGACTAAA 840 

GTCTGATGAA CTTCCCAATC AGATGAGCAC GGATGATTGG CCAGAAATGA AGAAGAAGTT 900 

TGCAGATGTA TTTGCAAAGA AGACGAAGGC AGAGTGGTGT CAAATCTTTG ACGGCACAGA 960 

TGCCTGTGTG ACTCCGGTTC TGACTTTTGA GGAGGTTGTT CATCATGATC ACAACAAGGA 1020 

ACGGGGCTCG TTTATCACCA GTGAGGAGCA GGACGTGAGC CCCCGCCTTG CACCTCTGCT 1080 

GTTAAACACC CCAGCCATCC CTTCTTCCAA AGGGGATCCT TTCATAGGAG AACACACTGA 1140 

GGAGATACTT GAAGAATTTG GATTCAGCCG AGAAGAGATT TATCAGCTTA ACTCAGATAA 1200 

AATCATTGAA AGTAATAAGG TAAAAGCTAG TCTCTAACTT CCAGGCCCAC GGCTCAAGTG 1260 

AATTTGAATA CTGCATTTAC AGTGTAGAGT AACACATAAC ATTGTATGCA TGGAAACATG 1320 

GAGGAACAGT ATTACAGTGT CCTACCACTC TAATCAAGAA AAGAATTACA GACTCTGATT 1380 

CTACAGTGAT GATTGAATTC TAAAAATGGT TATCATTAGG GCTTTTGATT TATAAAACTT 1440 

TGGGTACTTA TACTAAATTA TGGTAGTTAT TCTGCCTTCC AGTTTGCTTG ATATATTTGT 1500 

TGATATTAAG ATTCTTGACT TATATTTTGA ATGGGTTCTA GTGAAAAAGG AATGATATAT 1560 

TCTTGAAGAC ATCGATATAC ATTTATTTAC ACTCTTGATT CTACAATGTA GAAAATGAGG 1620 

AAATGCCACA AATTGTATGG TGATAAAAGT CACGTGAAAC AGAGTGATTG GTTGCATCCA 1680 

GGCCTTTTGT CTTGGTGTTC ATGATCTCCC TCTAAGCACA TTCCAAACTT TAGCAACAGT 1740 

TATCACACTT TGTAATTTGC AAAGAAAAGT TTCACCTGTA TTGAATCAGA ATGCCTTCAA 1800 

CTGAAAAAAA CATATCCAAA ATAATGAGGA AATGTGTTGG CTCACTACGT AGAGTCCAGA 1860 

GGGACAGTCA GTTTTAGGGT TGCCTGTATC CAGTAACTCG GGGCCTGTTT CCCCGTGGGT 1920 

CTCTGGGCTG TCAGCTTTCC TTTCTCCATG TGTTTGATTT CTCCTCAGGC TGGTAGCAAG 1980 

TTCTGGATCT TATACCCAAC ACACAGCAAC ATCCAGAAAT AAAGATCTCA GGACCCCCCA 2040 

AAAAAAAAAA AAAAAAAAAA AAAAAAAA 2068 

Seq ID NO: C189 DNA Sequence 

Nucleic Acid Accession ft: XM_091332.1 

Coding sequence: 1..1401 

1 11 21 31 41 51 

I I I I I I 

ATGCAAAGGT GGACACTGTG GGCTGCAGCC TTCCTGACCC TCCACTCTGC ACAGGCCTTT 60 

CCACAAACAG ACATCAGTAT CAGTCCAGCC CTGCCAGAGC TGCCCCTGCC TTCCCTGTGC 120 

CCCCTGTTCT GGATGGAGTT CAAAGGCCAC TGCTATCGAT TCTTCCCTCT CAATAAGACC 180 

TGGGCTGAGG CCGACCTCTA CTGTTCTGAG TTCTCTGTGG GCAGGAAGTC CGCCAAGCTG 240 

GCCTCCATCC ACAGCTGGGA GGAGAATGT C TTTGTATATG ACCTCGTGAA CAGCTGTGTT 300 

CCCGGCATCC CAGCTGACGT CTGGACAGGC CTTCATGATC ACAGACAGGA AGGGCAGTTT 360 

GAATGGACTG ATGGCTCATC CTATGACTAC AGCTACTGGG ATGGCAGCCA GCCAGATGAT 420 

GGCGTCCACG CGGACCCAGA AGAAGAGGAC TGCGTGCAGA TATGGTACAG GCCTACCAGT 480 

GAGCAGCTAC AGGCCCCAGA GCCCCAGTTA CCCTTATCAA TCTCAGAGGC CACAGATGTC 540 

TATCTCCCTG AGGATTTCCC AGCTGAGCCC AAGCTCATGG ACCAGTCCTG GGTGTCCAGG 600 

AAGAGCCTGA AACCATCCAA GAGTCATCTT ATGGAGCCAC CCACTCCAGT GGCCAAGCAC 660 

CAAAAGGCAA AGACCCGACA TAGGAGCCTG CGCGGCGTCT GGTGGCCATC AGGTAAGGCT 720 

GGGTCATGGA AAGAAAGAAT GAATGCAGAC TACGGGCGAA GAAAGCGATC GGCCCCGAGG 780 

CAGGAAGGCC GGCTCCGGTG CAGGGAGCGC CGCCTGCGGG CTGCTTCGGG CCAGGGTCGA 840 
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CCCGAGGGCC AGCGCAAGCA GCGGCAACAG GAGOGCCAGG AGAGAGGCTG GGAAGAACTG 90O 

GGAGGGGTGT CCCCAATGCG GGGCGCCCAA GCGTGGCAGC ACGGGCTGGG AGCGGGGAGC 960 

CAGCGGGGTG CGGCGCCGGA GTGCGGGGAG AACCACCAGG CGCCGGAATT GGGGAGCACG 1020 

TGGAGGGGGC AGCGGCTCCA GCCCCAGACC GCCGCGCTCT GTCACTTTGC ATTAAGAAAG 1080 

CTTCCGGGGA ATGCACACGG CCTGGCCGCC GCCTTCGTGC AGCCCGCCCT GCAGGTGCAG 1140 

GAAGAAAAGA ATAATCGCAC CCGTTTCTCA GGTGCTTACT TCACCATGTC CGATCCGACG 1200 

TGTGACCAAG ATAGCAAGGA GCAGTCTTTA AGGOGACACG GCAGAGAGGC AGAAAAAGAT 1260 

GGGCCTTACC GGTTAGTTAA GAAAAAAAGA GGACCTGTTG CCTGTCCCTC TAGCTTTGAA 1320 

CTACAAAGTG GAGGGGAAGT TTGTCTGGAT TTTCCTGTAG AACTGAGGGC AGGGACCTGG 1380 

ATTGCTCGAG AACCTCCATA A I 401 



Seq ID NO: CI 90 DNA Sequence 

Nucleic Acid Accession #: XM_0S4B69.2 

Coding sequence: 2 6.. 2 902 



1 11 21 31 41 51 

1 I I I I I 

TAGACGCGGA GCCCAAGGAG GTAAAATGCA CACTTGCTGC CCCCCAGTAA CTTTGGAACA 60 

GGACCTTCAC AGAAAAATGC ATAGCTGGAT GCTGCAGACT CTAGCGTTTG CTGTAACATC 120 

TCTCGTCCTT TOGTGTGCAG AAACCATCGA TTATTATGGG GAAATCTGTG ACAATGCATG 1B0 

TCCTTGTGAG GAAAAGGACG GCATTTTAAC TGTGAGCTGT GAAAACCGGG GGATCATCAG 240 

TCTCTCTGAA ATTAGCCCTC CCCGTTTCCC AATCTACCAC CTCTTGTTGT CCGGAAACCT 300 

TTTGAACCGT CTCTATCCCA ATGAGTTTGT CAATTACACT GGGGCTTCAA TTT TGCA TCT 360 

AGGTAGCAAT GTTATCCAGG ACATTGAGAC CGGGGCTTTC CATGGGCTAC GGGGTTTGAG 420 

GAGATTGCAT CTAAACAATA ATAAACTGGA ACTTCTGCGA GATGATACCT TCCTTGGCTT 480 

GGAGAACCTG GAGTACCTAC AGGTCGATTA CAACTACATC AGCGTCATTG AACCCAATGC 540 

TTTTGGGAAA CTGCATTTGT TGCAGGTGCT TATCCTCAAT GACAATCTTT TGTCCAGTTT 600 

ACCCAACAAT CTTTTCCGTT TTGTGCCCTT AACGCACTTG GACCTCCGGG GGAACCGGCT 660 

GAAACTTCTG CCCTACGTGG GGCTCTTGCA GCACATGGAT AAAGTTGTGG AGCTACAGCT 720 

GGAGGAAAAC CCTTGGAATT GTTCTTGTGA GCTGATCTCT CTAAAGGATT GGTTGGACAG 780 

CATCTCCTAT TCAGCCCTGG TGGGGGATGT AGTTTGTGAG ACCCCCTTCC GCTTACACGG 840 

AAGGGACTTG GACGAGGTAT CCAAGCAGGA ACTTTGCCCA AGGAGACTTA TTTCTGACTA 900 

CGAGATGAGG CCGCAGACGC CTTTGAGCAC CAOGGGGTAT TTACACACCA CCCCGGCGTC 960 

AGTGAATTCT GTGGCCACTT CTTCCTCTGC TGTTTACAAA CCCCCTTTGA AGCCCCCTAA 1020 

GGGGACTCGC CAACCCAACA AGCCCAGGGT GCGCCCCACC TCTCGGCAGC CCTCTA AGGA 1080 

CTTGGGCTAC AGCAACTATG GCCCCAGCAT CGCCTATCAG ACCAAATCCC CGGTGCCTTT 1140 

GGAGTGTCCC ACCGCGTGCT CTTGCAACCT GCAGATCTCT GATCTGGGCC TCAACGTAAA 1200 

CTGCCAGGAG CGAAAGATCG AGAGCATCGC TGAACTGCAG CCCAAGCCCT ACAATCCCAA 1260 

GAAAATGTAT CTGACAGAGA ACTACATCGC TGTCGTGCGC AGGACAGACT TCCTGGAGGC 1320 

CACGGGGCTG GACCTCCTGC ACCTGGGGAA TAACCGCATC TCGATGATCC AGGACCGCGC 1380 

TTTCGGGGAT CTCACCAACC TGAGGCGCCT CTACCTGAAT GGCAACAGGA TCGAGAGGCT 1440 

GAGCCCGGAG TTATTCTATG GCCTGCAGAG CCTGCAGTAT CTCTTCCTCC AGTACAATCT 1500 

CATCCGGGAG ATTCAGTCTG GAACTTTTGA CCCGGTCCCA AACCTCCAGC TGCTATTCTT 1560 

GAATAACAAC CTCCTGCAGG CCATGCCCTC AGGCGTCTTC TCTGGCTTGA CCCTCCTCAG 1620 

GCTAAACCTG AGGAGTAACC ACTTCACCTC CTTGCCAGTG AGTGGAGTTT TGGACCAGCT 16 B0 

GAAGTCACTC ATCCAAATOG ACCTGCATGA CAATCCTTGG GATTGTACCT GTGACATTGT 1740 

GGGCATGAAG CTGTGGGTGG AGCAGCTCAA AGTGGGOGTC CTAGTGGACG AGGTGATCTG 1800 

TAAGGCGCCC AAAAAATTCG CTGAGACCGA CATGCGCTCC ATTAAGTCGG AGCTGCTGTG 1860 

CCCTGACTAT TCAGATGTAG TAGTTTCCAC GCCCACACCC TCCTCTATCC AGGTCCCTGC 1920 

GAGGACCAGC GCCGTGACTC CTGCGGTCCG GTTGAATAGC ACCGGGGCCC CCGCGAGCTT 1980 

GGGCGCAGGC GGAGGGGCGT CGTCGGTGCC CTTGTCTGTG TTAATTCTCA GCCTCCTGCT 2040 

GGTTTTCATC ATGTCCGTCT TCGTGGCCGC CGGGCTCTTC GTGCTGGTCA TGAAGCGCAG 2100 

GAAGAAGAAC CAGAGCGACC ACACCAGCAC CAACAACTCC GACGTGAGCT CCTTTAACAT 2160 

GCAGTACAGC GTGTAOGGCG GCGGCGGCGG CACGGGCGGC CACCCACACG CGCACGTGCA 2220 
TCACCGCGGG CCCGCGCTGC CCAAGGTGAA GACGCCCGCG GGCCACGTGT ATGAATACAT 2280 
CCCCCACCCA CTGGGCCACA TGTGCAAAAA CCCCATCTAC CGCTCCCGAG AGGGCAACTC 2340 
CGTAGAGGAT TACAAAGACC TGCACGAGCT CAAGGTCACC TACAGCAGCA ACCACCACCT 2400 
GCAGCAGCAG CAGCAGCCGC CGCCGCCACC GCAGCAGCCA CAGCAGCAGC CCCOGOGGCA 2460 
GCTGCAGCTG CAGCCCGGGG AGGAGGAGAG GCGGGAAAGC CACCACTTGC GGAGCCCCGC 2520 
CTACAGCGTC AGCACCATCG AGCCCCGGGA GGACCTGCTG TCGCCGGTGC AGGACGCCGA 2580 
CCGCTTTTAC AGGGGCATTT TAGAACCAGA CAAACACTGC TCCACCACCC CCGCCGGCAA 2640 
TAGCCTCCCG GAATATCCCA AATTCCOGTG CAGCCCCGCT GCTTACACTT TCTCCCCCAA 2700 
CTATGACCTG AGACGCCCCC ATCAGTATTT GCACCCGGGG GCAGGGGACA GCAGGCTACG 2760 
GGAACCGGTG CTCTACAGCC CCCCGAGTGC TGTCTTTGTA GAACCCAACC GGAACGAATA 2820 
TCTGGAGTTA AAAGCAAAAC TAAACGTTGA GCCGGACTAC CTCG AAGTGC TGGAAAAACA 2880 
GACCACGTTT AGCCAGTTCT AAAAGCAAAG AAACTCTCTT GGAGCTTTTG CATTTAAAAC 2940 
AAACAAGCAA GCAGACACAC ACAGTGAACA CATTTGATTA ATTGTGTTGT TTCAACGTTT 3000 
AGGGTGAAGT GCCTTGGCAC GGGATTTCTC AGCTTCGGTG GAAGATACGA AAAGGGTGTG 3060 
CAATTTCCTT TAAAATTTAC ACGTGGGAAA CATTTGTGTA AACTGGGCAC ATCACTTTCT 3120 
CTTCTTGCGT GTGGGGCAGG TGTGGAGAAG GGCTTTAAGG AGGCCAATTT GCTGCGCGGG 3180 
TGACCTGTGA AAGGTCACAG TCATTTTTGT AGTGGTTGGA AGTGCTAAGA ATGGTGGATG 3240 
ATGGCAGAGC AT AG ATT CT A CTCTTCCTCT TTTGCTTCCT CCCCCTCCCC CGCCCCTGCC 3300 
CCACCTCTCT TTCTCCCCTT TTAAGCCATG GGTGGGTCTA ACTGGCTTTT GTGGAGAAAT 3360 
TAGCACACCC CAACTTTAAT AGGAAATTTG TTCTCTTTTT CCGC COCTCT CCTTCTCTCC 3420 
TCCCCTCCCC TCCCTTCTCA TTCCTTTTCT TTGTTTTTAA AGGATGTGTT TGTATGCATT 3480 
CTGGACATTT GAATTAAAAA AAAAGTATTG TGATCCTGTA AAGGATCACC ATAGATGTGG 3540 
ACAAATCATT AAAATTACAG AGCTATATGA TCCATAATTG ATTAGTCAAA ATAACTTATT 3600 
GATGAAATAT ACAAATATTT TATTGTAGCA CCTATTTTTA TATGCACATT TAGCATTCCT 3660 
CTTTCCTTCA CTATTTAGCC TATGATTTTG CAGAGGTGTC ACACTGTATT AGGATCTGCA 3720 
TTTCTAAAAC TGACGTGGTA TCAGGAAGGC ATTTTCAATC ATTCAAAATG TGGAGAATTT 3780 
AATGGCTAAA TCTTTAAAAG CCAATGCAAC CCACCCAATT GAATCTGCAT TTTCTTTTAA 3840 
GAAAACAGAG CTGATTGTAT CCCAATGTAT TTTAAAAAAT AGGGCAATTG ATTGGGCCAT 3900 
TCCGAGAGAA TTGTTTGCAA GTTTTGGGTT TTATTAGAAA ATATTTGAAA GTATTTTTAT 3960 
TAATGAACCA AAATGACATG TTCATTTGAC TACTATTGTA GCCGATTTTC GATTGTTTAA 4020 
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CCAAACCCAG TTGCATTTGT ACAGATCCAC GTGTACTGGC ACCTCAGAAG ACCAAATCAT 4060 

GGACTGTACA AGTCTCTATA CAATGTCTTT ATCCCTGTGG GCAGCAAGCA ATGATGATAA 4140 

TGACAAACAG GATATCTGTA AGATGGGGCT ACTGTTGTTA CAGTCTCATA TGTATCCCAG 4200 

CACATGTAAT TTTTTAAATA GTTTCTGAAT AAACACTTGA TAACTATGTC 4250 

Seq ID NO: CI 91 DNA Sequence 

Nucleic Acid Accession fl: NM_000793.2 

Coding sequence: 401.. 1222 

1 11 21 31 41 51 

I I I I I I 

GCCTGCAGAG AGAGGCACTT TGCACCACAG ACAGATAGCA AGAAGGGAAA GACAGAGAGT 60 

GAGAAAAAAG AGGAGTCAGT CGCTCCTGGG GAAGGGAGAG AGTGAGACTG GGAGAAAGAG 120 

AAGCACAGAA AGTGTGTGTA AAACGGAGTA AAGAAAGAAA AAAAAAAAAC TACCCTTAAA 180 

GCACATTTAA AAAAAAAAAA CTCTGGCAAT TCAAGAAAGA AACAGGCTAC GTTTAAAGAG 240 

CATAGAGACA ATGAAAGGCT AAAGAAAATT TTAAAATCTC TGCCACAGTC TCATAGGTGC 300 

TTGGAAATGA AAGTAGAACT GCCTGTCTTT AACGGACTCT GACAGAGAGG GTGAAGGGGA 360 

ACCAGAGCGC ACAAGGGAAC TGACTCAGGA GGCAGAGAAG ATGGGCATCC TCAGCGTAGA 420 

CTTGCTGATC ACACTGCAAA TTCTGCCAGT TTTTTTCTCC AACTGCCTCT TCCTGGCTCT 480 

CTATGACTCG GTCATTCTGC TCAAGCAOGT GGTGCTGCTG TTGAGCCGCT CCAAGTCCAC 540 

TCGCGGAGAG TGGCGGCGCA TGCTGACCTC AGAGGGACTG CGCTGCGTCT GGAAGAGCTT 600 

CCTCCTCGAT GCCTACAAAC AGGTGAAATT GGGTGAGGAT GCCCCCAATT CCAGTGTGGT 660 

GCATGTCTCC AGTACAGAAG GAGGTGACAA CAGTGGCAAT GGTACCCAGG AGAAGATAGC 720 

TGAGGGAGCC ACATGCCACC TTCTTGACTT TGCCAGCCCT GAGOGCCCAC TAGTGGTCAA 780 

CTTTGGCTCA GCCACTTGAC CTCCTTTCAC GAGCCAGCTG CCAGCCTTCC GCAAACTGGT 840 

GGAAGAGTTC TCCTCAGTGG CTGACTTCCT GCTGGTCTAC ATTGATGAGG CTCATCCATC 900 

AGATGGCTGG GCGATACCGG GGGACTCCTC TTTGTCTTTT GAGGTGAAGA AGCACCAGAA 960 

CCAGGAAGAT CGATGTGCAG CAGCCCAGCA GCTTCTGGAG CGTTTCTCCT TGCCGCCCCA 1020 

GTGCCGAGTT GTGGCTGACC GCATGGACAA TAACGCCAAC ATAGCTTACG GGGTAGCCTT 1080 

TGAACGTGTG TGCATTGTGC AGAGACAGAA AATTGCTTAT CTGGGAGGAA AGGGCCCCTT 1140 

CTCCTACAAC CTTCAAGAAG TCCGGCATTG GCTGGAGAAG AATTTCAGCA AGAGATGAAA 1200 

GAAAACTAGA TTAGCTGGTT AAAGGTATGA TTATAAGAGA GCTTATTGTT TTAAAAAGTT 1260 

ATATAAAGGC AAGGAAATTA AGAACTGAAT CCATATTTCA ACAGAGCCCT ATTGGCTTAC 1320 

TGAAAGACAG GAGTTTATCT ATCGGAAGAA CATGAATCTC TAACAGCTCC ATACTTCTTT 1380 

CACTACTCAA ATGGCATTGG GCTGAGTAAG TAACCATATC ACCTCTCTTC TTAGTAAAAA 1440 

GCCCTATGTG AAAAGATCCC AAGATGGAGA GGAAGAAACG CTAATTCAGC ATGTGTTCAT 1500 

TCTGCATTGA GAAGGAACTG ATACATCTGA TGCATGCTTT GAGACCAGAA GAAAAGACTT 1560 

ACCTGAATAA TTACTACATT AGGGAAGCTA CTGTCTACGT TAAGATAAAG GGTATTGCCT 1620 

TGGCTCTATT TGGCATGGAT GGAGCCCAGT TGGAAAATTC CCAAATATTA CAACAAGTCC 1680 

TTGAACCCAG GCCATGTGGT TAGACGTTGG TGTTAAGGTT AGACCTTATG TTAGAGTCAT 1740 

TTCTGATGTT CCAGCTTCTA GCCATGTAGT GCTCTCAGTC TTCATACCCC AGAAATTATT 1800 

GGTATATTTG TAGATACCGA GAATGATCCC TCAGTCTGAG AGGTTAGAAT GATCATCTGT 1860 

AATCTGAGGG TTAATTTCTA GGCAGGTGGA GAGAGTGGTA AAAAAGAAAT GAAATTGACA 1920 

AGCTAGGAAA GAGGAGGCAG AAAGATTTGG AAAATTCACA GAGTTTCACC CTTAAGCTGT 1980 

AGAGAGTGGG TCACATTTGT TAGCCACGGA AACATAGAAA CATACACAAG GCCAGAAAAA 2040 

GAAGAAGGAG CTCAACTAAA AGTGGCATAG AGAATACACA TATAAAAACA ATATATTTGT 2100 

CATATGCTCC TAGAGAGGAG AAAGGGGTGA TTGAAAGAAA AAAAAATACT TAAATATTTG 2160 

TAATTGTGAG GGGTTTCTTT TGGAAATAAT TACTTTTGAA CCATGTATGT GGTATGTATA 2220 

TTTTCAGTGG GTTAATTATA CCCCATGATA CCTATTAAAG GAAAACCAGT GGGTCTGGTG 2280 

GTGCTGGTCT TTTCCTCCCC ATTCCTACAA TTTCTATGTG GCCCAAGTCA TTCCTAATCT 2340 

TGGTCTCTAT AGCAGTGTTC TCTCTGAATG CTGAGCTGAA GAAATTATAC GTACATACAC 2400 

ACATACATAC ATACATACAA ATATATGTAT ATATATTCTC AGCTGCTGCG GGAGGTAGGT 2460 
ACCATGGCCA TTCAGCACAG CCTTGATTTC CTCCCAAAGT AGGTGAGCTA TAGTGAAGAA 2520 
TAGGTGCAAA CAAACAAGCT TACTTCCATT GCAAAATAGA AGAAGAGGAA GTTAGAGATA 2580 
ATTCTGATCA ATCATTTTGG AGGCTTTGTT ATAAGGCAAC CCCCGGTATA TCA TGGAA TT 2640 
TCCATTGACA TTTGAATTTG <3ACTTGGATC TTCCCTTGGT CCCATTAGCT GAGGTTTAGT 2700 
AATCTAAAGT CCCTATAGTA TATGATTATA ATGCTATTTT AAAAAATATA TATATAAAAT 2760 
ATTTTTTTCT TTTTAAAATA GACACTATAG TTTTACCCAT AAGTAATATT TAAAGATTAT 2820 
AGCTCCCAAA AGAATGGACC AACCACTTTC GTATCATAAT TTCTTTTTGG TA AATATG AG 2880 
ACTATTATGA AATCATAGTA TATGATTGTA TTTAAAGGTA CAATCAAAGG ATCTTTTGTC 2940 
CATTCCATTA ATAACTGAAT AAAAAATAAA TAAAATGGAT AGAAAAAAAC TAAAGTTGAA 3000 
AATACATTCT TAAACTAGTT GTCTGAAATG AGAAAAGAGT GAGAACTAGG TGTGCAAGAA 3060 
CCAAAOGTAT TTTATTTTAT TTTTTAAATG GGAGCAACAT ATCAGTCGTG TCACCAGCTG 3120 
GTATATTGTG TAAATATTAA AGCTCCATTG GGACTGATTT TTCATGGCAA CATCAGCTTT 3180 
CTAATGTTCT AAATTCTATA AAAACCACCC ACAAAGAAAC AAAGCAAATT TCATTATCTA 3240 
ATGAGTTGCT GGAAAATCAT ATTGAGAATA ATTATTTCAG ATTCCTCAGT TGTTAACTTC 3300 
TACATTCAAG GGCTTATCTC TGCCCCCATT GATTTTTAAC CTCAAAATGG TGTGAGATTT 3360 
ACTGTGGAAC CCTAAAGCAG TAAAATAAAA AACCTGGTTG CAGCACATTC ACACTGTTGT 3420 
CCTTAAAATT CCC CTTTTTT CTCTATGTAC GATAAAGTAA CAGTATGTCA GATAAGCCGG 3480 
TGGGGGGATG AGATTAGGCT GAGGCAGTGC TAGTCAACTG GGGGAAAAGG ATGATGGAAA 3540 
AATCACCCAG TTGTGCTATA TTTTTAAAGA AGGAGGTCGT TTATGTGTGC AGACAATTCT 3600 
CCCTGAGGTT AGCCCAATGG AGAAATGAAG CAGAGGAAGG AAACATAGAA AGACATGGGC 3660 
TATCAGGGAG GAAGATGTTC AATAGAACAT GCAAGAATTT CTGGAAGAAA GGCTGTGGAA 3720 
GGGCCAATGG AGAAAATGAA TGGACAAAGC TCAGGAATCC CTACGCTATG TAGAATGTTC 3780 
TTGGTGTTAT CAGGGTTAAG CCCTGTAATT ATGTAACCTA TTTATCGCAA CATGAATTTT 3840 
TATGATTTCT TGTGATGTAT TCTTTTATGA AATTAACAAG AACTCATTAT TTTGAGGTAG 3900 
AGGAAAATCA ATGCTTTATC TGATATGCTG AGAAATTATT AGATTGCCAA TACTCATGTG 3960 
CGTTTCATGT GTTTTATAAG GTTTGTTCCT TTGAAGAATT GTAGTTCTTA GTCCCACAGG 4020 
GAAATGTGTA TCTATTTATA TATCATAGTA TAAATCTATG ATATATTTAT ATCATATATA 4080 
AAAGTCTGAG TTCTCTTTCT TAGTCCCTAA TCATGTTTCT CCCATAGGCT GTGTTTACAT 4140 
GGAGCTATCG GTTTAGCCTT TTAAGCTTCA TTAGCTTGTC TATTATTGAA ATAGTTTCCA 4200 
AGAAATTTTA GATATTATCA TAACATCTGG GTCTACTCAA ACACTTATTG TTTGAAAGAC 4260 
TTATGTCTTG GACCTATCAA AAACTGACTT TATTTATTGC TTAGTGAAAA TACTAGTGGG 4320 
ATCAACAATG ATTTTCTTGA ATGGGCATGA ATGGAGATGC CCGCACAGTA ATGTAGAAAT 4380 
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GTTTCATACA 
ACTTTGTATA 
CTCTGGTCCT 
CTTCCCAGTG 
CATCACAGTG 
GGGCTCTGAA 
ATAAAATTCT 
AGGGTGGGAT 
AAGAATTGAC 
GGACATTTGT 
TATGACCATT 
AAAACCTTTA 
TGGAGACTGC 
TAAATGGTAT 
TCACTCTAAA 
TACCAATCTC 
AGAGAGGATC 
GGGTTGGTCT 
GAAGGCTGCT 
ATGCCCTATG 
CAAGGTGAGT 
TGCAAGTCGG 
CAATATTTAC 
AAACCCTCCA 
AACCCACAGA 
CACTGGTCAG 
AGTACTTTAT 
TTAGTTTTGC 
GAAGAGGGAG 
AAGAGTCATC 
TTTGTCTCCC 
GTGAGAATGC 
GCTTTGGGAG 
TTATACAATT 
AAAAAAAAA 



GCTATTAAAA 
GCTAAGTGAC 
GTGTCTTCAC 
ATCTGTTCAG 
GGAAGAATAG 
ATAAAAATTA 
AGGAGGGCAG 
TACAAGGGTG 
TGATTTACAQ 
TCCACCCGAC 
GAAAAAGGAA 
CTAGCATTTA 
AAGTAAGGCT 
GGCCAAAAGT 
AATAGAATCC 
TGCTTTGCAA 
TAGGATGGGA 
ACCAATCTGG 
GCAAGTCATT 
TGTATAGTAC 
CTGAACCAAT 
GGCTGGCTAC 
TGAGACTTCG 
CTTCTCCCCC 
AGGGGATGGG 
CGTGGTTTTT 
AACCAAAGCA 
CATCACATTG 
ACAGAGATGT 
GACCTCAGTT 
TGGCAAGGAG 
ACGCGTGCAT 
ATTATCAGTA 
TGTTCTTGTA 



TGTAACTGAC 
AGTCACTTAA 
CTCATTTATA 
TTAAGTTCTT 
CCTATTGTCT 
TGAAATATGG 
GTTAGGAGAC 
TTCCTCAGGC 
GACTTCTCTT 
CTCTGACTGA 
AAATGTAGAC 
GAGCTTTTCA 
TTTAATTTTA 
CAGAGTTAAA 
AAACCCACTC 
TGGGCACAAT 
GAGCTAGAAA 
GAAGATTTGA 
GAGTGACTTT 
CAGAAGCAAG 
AGAAAGCAAA 
CCGTCTTAGG 
AAGACCCAGC 
TCCCCTCAAA 
AAATAAAGAA 
ATGTGTATTA 
ATTAAATGAT 
TCACCCAGAC 
GCCAGAGTTG 
AGTGGTTGGA 
AATATGCGGG 
ATGCTACACA 
GAAAGAGTGT 
TTTTAATAAA 



CTCCTTAGAG 
CTTACATGAC 
GCACGTCTCC 
CTCCCGTTAA 
TTCATTTTGC 
TGAGGTCACA 
AGTTATGTAT 
ATGCCCCTAT 
TATGTCAATC 
TGGTTTGGAA 
TCTGACTTCC 
GAACATCCCC 
GGAGGTTTTT 
ATATATATAO 
TTCATATATG 
CTTGGTCATG 
GTTGCTAACT 
AAACAAACTT 
AGGATGAGCA 
GTCTCAGACT 
CATGTGCAGA 
CAGCAACAGC 
AGATGTTTAA 
AAGCCAACAG 
AATTCTCTCA 
GGATTGGGGG 
ATTGGGGTAG 
CTCACCTAGC 
ACCCAGTGTG 
TGTAGTGACA 
ACATGATGCT 
TATGTGCTTC 
TATCATATTG 
CTTTGAATAA 



GCAGATTAGT 
TTTCTTTTTT 
TTGATTTTTG 
CCAGGAAGTG 
CTGAGTGTAT 
TGTTGGTGCT 
GGCCTTTCGG 
GGGCCCTATG 
TTAAGAGGAT 
AATAACTTTA 
GTCCCACTGA 
ACTGTCATGT 
TTTTTTTTTT 
TTAGATTCCA 
CTTCCAGAAT 
TCCTGAGGCT 
GGGAAGAACA 
CTCGCAACTG 
AAACATTGGG 
TAACAGACCC 
TATCCAAACA 
AGAGCTCCAG 
TGAAGTCACT 
GTAAACACAT 
AGACTTCTCC 
ATGTGAAGAA 
GGAATGTTGG 
CCCAAGTAAT 
CGGATGATAA 
TTAGTTTGCC 
AAGAGCCCTG 
TCAGTTGCAG 
GTGCTGAGTG 
AAGAATAAAA 



AACTGTTCCT 
CACATTGGGT 
GTAGTATCAA 
CTTATTCTCT 
TTTACTATTT 
GCCTTGCTGC 
GAAAATTCAA 
TGGAAGCAAG 
GGATGAATCT 
ATTAGGATCA 
AGGATTAATG 
GTCTCAGCAG 
TTTTTTCCCC 
ACTTCCTCCT 
GGGGCTTAAG 
CTCTAAGAAA 
AGGCCCTGAG 
AAGGAAGGCT 
CCACTTCCTA 
AGCTCTGTTC 
AGACTGCTCA 
GGAGCTTATT 
ATTTTGGCTC 
AAATGAAAGA 
AGGCCCATGT 
ATAAGTATCC 
CCAGTTTTGT 



CTACTGACGA 
TCTCCCCATC 
GGTAAATGTG 
AAAATGAACT 
CTATGTGTGC 
AAAAAAAAAA 



Seq ID NO: CI 92 DNA Sequence 

Nucleic Acid Accession fl: NM_006549.2 

Coding sequence: 824.. 2590 



1 
I 

GAGCCTGGGG 
CAACACAGAG 
AAAAAAAAAT 
AGGCCAAGGC 
AACCCCATCT 
CCAGCTACTC 
GATCCGAGAT 
AAGAAAAAAA 
TGTGAGTGTC 
ACCCAGTTAA 
CTOGATTCCC 
CGTAGGACGG 
TTATCCAGCT 
ATGGGAGTGC 
TAGCCAGCCC 
CAGCAGCGAA 
CCTGGGCATG 
CGGCTTGGCG 
CGGGTCCCAG 
GGGTGGGCTG 
GCCCTACTCA 
GGAGTCTCAC 
CCTGAAGGAT 
TGACAATACC 
CTTTCCACGT 
CAGGGGCCCC 
CAATGTGGTG 
GTTCGAACTG 
AGACCAGGCC 
GAAGATCATC 
CAAGATCGCT 
CACCGTGGGC 
CTCTGGGAAG 
CCAGTGCCCA 
CCTGGAATTT 
GCTGGACAAG 
CACGAGGCAT 
GACTGAAGAG 
GGTGAAGACC 
GGAACGCTCA 
GTCCCTGTCT 
CCCCCGTGGG 



11 

I 

AGGTCGAGGG 
AGACCCTGTC 
GGGAGTGGGC 
GGGTGGATCA 
CTACGAAAAA 
GGGAGACTGA 
CGCGTCACTG 
AAAAAAAAAA 
CCTACGGCCT 
CTAGAGGGCT 
TTTTGGCTGT 
TTCAGGGAGC 
GGGGGATCCA 
CCCAGTGTGC 
AGCAGCAACC 
AGCCAGAAGC 
GAGTCCTTCA 
CGGGACCGGC 

GCAGCCGGTG 
CCCGTCAGCT 
CACGTCTCCA 
GAAATTGGAA 
TACTATGCAA 
CGCCCTCCAC 
ATTGAGCAGG 
AAGCTGGTGG 
GTCAACCAAG 
CGTTTCTACT 
CACCGTGACA 
GACTTTGGTG 
ACGCCOGCCT 
GCCTTGGATG 
TTCATGGACG 
CCAGACCAGC 
AACCCCGAGT 
GGGGCGGAGC 
GAGGTCGAGA 
ATGATACGTA 
CTGTCAGCGC 
GAGCTCAAGG 
GGAGGAGGAA 



21 
I 

TGCAGOGAGC 
TCAAAACAAA 
CGGGCGCGGT 
CGAGGTCAGG 
TACAAAAATT 
GGCAGAGAAC 
CACTCCAGCG 
AAAAAGGGAG 
AGAAATACCA 
TTGAACCTTT 
TTGGCAGGGT 
CCTCCAGGTC 
ACGCACACTT 
TGGATGAAGC 
GGGCCGCCCC 
CCTGTGAGGC 
TTGTGGTCAC 
CCCTGGAGGC 
ACCTCTCCGG 
GCAGCCTGGA 
CCCCGCAGTC 
TCACGGGTAT 
AGGGCTCCTA 
TGAAGGTGCT 
CCCGAGGCAC 
TGTACCAGGA 
AGGTCCTGGA 
GGCCCGTGAT 
TCCAGGATCT 
TCAAACCTTC 
TGAGCAATGA 
TCATGGCACC 
TTTGGGCCAT 
AGCGGATCAT 
CCGACATAGC 
CGAGGATCGT 
CGTTGCCGTC 
ACTCAGTCAA 
AACGCTCCTT 
CTGGAAACTT 
AAGCAAGGCA 
GTGCTCTTGT 



31 
I 

CGTGATCGTG 
CAAACAAACA 
GACTCACACC 
AATTCAAGAT 
AGCCAAGTAT 
TGCTTGAACC 
TGGGCGACAG 
TCGGGGTGGA 
GAGAAGCACA 
TATTAACTTG 
GAGTGAGACA 
TTCGTTTCTC 
AAGGCTCCAG 
TGGCGCATGC 
CCAGGATGAG 
CCTGCGGGGC 
CGAGTGTGAG 
CGATGGCCAA 
TCGCAAGCTG 
CATGAACGGA 
CTCGCCTCGG 
GCAGGACTGT 
TGGTGTCGTC 
GTCCAAAAAG 
CCGGCCAGCT 
AATTGCCATC 
TGACCCCAAT 
GGAAGTGCCC 
GATCAAAGGC 
CAACCTCCTG 
ATTCAAGGGC 
CGAGTCGCTC 
GGGTGTGACA 
GTGTTTACAC 
TGAGGACTTG 
GGTGCCGGAA 
GGAGGATGAG 
ACACATTCCC 
TGGGAACCCA 
GCTCACCAAA 
GCGAAGACAA 
GAGAGGCAGT 



41 
I 

CTACTGCACT 
AACAAACAAA 
TGTAATCCCA 
TAGCCTGGAC 
GGTGGCCGGC 
TGGGAGGCAG 
AGCGAGACTC 
GCTCTCATTG 
TCGGAACGGG 
GAGGTTGACT 
TCCCCTGGGT 
CTCTTCCCCG 
CAAAGTGGCT 
ACCATGTCAT 
CTGGGGGGCA 
CTCTCATCCT 
CCGGGCTGTG 
GAGGTCCCCC 
TCTCTGCAAG 
CGCTGCATCT 
CTGCCCCGGC 
GTGCAGCTGA 
AAGTTGGCCT 
AAGCTGATCC 
CCTGGAGGCT 
CTCAAGAAGC 
GAGGACCATC 
ACCCTCAAAC 
ATCGAGTACT 
GTCGGAGAAG 
AGTGACGCGC 
TCTGAGACCC 
CTATACTGCT 
AGTAAGATCA 
AAGGACCTGA 
ATCAAGCTGC 
AACTGCACGC 
AGCTTGGCAA 
TTCGAGGGCA 
AAACCAACCA 
CCTCCAGGGC 
CCCTGCGTGG 




GGCCGACAGT 
ATCAGTATAC 
ACAATGAAAA 
GGCAGGCCGG 
GCATCCAGCC 
TGGACCACCC 
TGTACATGGT 
CACTCTCTGA 
TACACTACCA 
ATGGGCACAT 
TCCTCTCCAA 
GCAAGATCTT 
TTGTCTTTGG 
AGAGTCAGGC 
TCACCCGTAT 
ACCCCTGGGT 
TGGTCGAAGT 
CCGTGATCCT 
GCCGGCGGGA 
GGGAATGTGA 
ACCGACCCGC 
AAAGTTGCTG 



4440 
4500 
4560 
'4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6429 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
'720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
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GGCCCCCGCC CCCGGCTCCC CCGCACGCAT GCATCCACTQ CGGCCGGAGQ AGGCCATGGA 2580 

GCCCGAGTAG CTGCCTGGAT CGCTCGACCT GGCATGCGCG CCGCGTCGCC TCTGGGGGGC 2640 

TGCTGCACCG CGTTTCCATA GCAGCATGTC CTACGGAAAC CCAGCACGTG TGTAGAGCCT 2700 

CGATCGTCAT CTCTGGTTAT TTGTTTTTTC CTTTGTTGTT TTAAAGGGGA CAAAAAAAAA 2760 

AAAAAAAGGA CTTGACTCCA TGACGTOGAC CGTGGCCGCT GGCTGGCTGG ACAGGCGGGT 2820 

GTGAGGAGTT GCAGACCCAA ACCCACGTGC ATTTTGGGAA ATTGCTTTTT AAAAGATTTT 28B0 

TATGCCAAAA ATCCTTCATT GTGATTTTCA GAACCACGTC AGATATACCA AGTGACTGTG 2940 

TGTGGGGTTT GACAACTGTG GAAAGGCGAG CAGAAAACTC CGGGGGTCTG AGGCCATGGA 3000 

GGTGGTTGCT GCATTTGAGA GGGAGTAGGG GGCTAGATGT GGCTCCTAGT GCAAACCGGA 3060 

AACCATGGCA CCTTCCAGAG CCGTGGTCTC AAGGAGTCAG AGCAGGGCTG GCCCTCAGTA 3120 

GCTGCAGGGA GCTTTGATGC AACTTATTTG TAAGAAGGAT TTTTAAATTT TTTATGGGTA 3180 

GAATTGTAGT CAGGAAAACA GAAAGGGCTT GAAATTTAAT AAGTGCTGCT GGAAGGGGAT 3240 

TTTCCAAGCC TGGAAGGGTA TTCAGCAGCT GTGGTGGGGA AACATTTCTC CTGAAAGACT 3300 

GAACGTGTTT CTTCATGACA GCTGCTCAAA GCAGGTTTCT GAGATAGCTG ACCGAGCTCT 3360 

GGTAAATCTC TTTGTCAAAT TACGAAAACT TCAGGGTGAA ATCCTATGCT TTCATGTACA 3420 

TTACATGGCT TAAGATTAAA CAAAAACATT TTTCAAGTCT CTAACTAGAG TGAACTCTAG 3480 

AGCACAGTAG TTCAGAAACT ATTTAGAGCT TCCAGGATAT ATTTCACAGC TTCAGGCATG 3540 

TGATCAGTTA GAGCCGATGA AACCTATGCC CGCCTGTATA TATATTAGCA GCTTAGCTAG 3600 

TTCATAACCT GTATATTCTA AAGACTGCTA AGGTTTTGTT TTCATTTTAA A TCCT AGCTG 3660 

ATTGTTGTGG TCAATGAAAT ACCCAGTTTC TGAAGGGCCA GGTGGGAAAT GCTTTCACTG 3720 

GACCAACACA CAAATGATCA TCCTGAGGAT CTGAGCTTCC CTAGACTCCA CACAATAACC 3780 

TTGGGGCACC CTTTTAGAGA AGACTGTTGA AACCCACAGC ACTCGTTGGG GTATGAGGAA 3840 

ACCAGGGCTT GGCACAGGAA GTTCCCCTTT GTAGCTAAAA GTCCAGAAAG AAAGGGTTCA 3900 

TCTTTTTGAC TTCCAACTGA TATTGGGAAG TTTGGTTGAG GTTCAAGTGT GACTCCTTCC 3960 

AGAGCCACAG GTAGGGGAGT GTGAAGTTGA GGGGGAGGAA AGCTGGAAGG ACTCTGCCTT 4020 

GGGAGATTCC CAGCTCTGCT TTCCAGCGCT TGGTGGAATC TGGGCTGGGG AAAGACGGCA 4080 

CCGGGAAACT CTGCTTCCCC ATTGTTTCCA TCTGATCAGC TGTGGTGTGA GGACTTCTCA 4140 

GACAAAGGCA AGGCCTCGTG CCCCTGCCCA GCCCATTCAT GGAGCCCTGG GCCTTCTTGG 4200 

CTTCCATAGA TCCTAAGCTC TTGACTGTAG TTTAGCCAGA CTTGTTTTGC TATCTTATAA 4260 

GCAGTTCAGA ATTAGGGAAT GCTGGTTTTG AAGAGCAAAG GACAGGTAGT CTAGAGAGGG 4320 

TCGTCTGGCC TGCTTGCTGG GTCTTTGTAA CCCAGCACTT CCTCTTGOCC TCCTGGCTTT 43 BO 

ATGTTTATGG GAGAGGACTC AATAGCTCCA CCCCTTCTGG CACCAGATGG GGCTTGGTTA 4440 

GTTTGCAATA AGCACCTTGC AGAGGTTAAA GCCAGCGGGT CCCTAGTCTT AGGCCCAGCC 4500 

TGCTTGTGTG GGCTCTGGCC TGGCCTGGTG GCTGGCCCAG GGGGCAGCAG TGCTTAGAGC 4560 

TTCTGCAGGG CTTCTCTTGT TTACACAGCT GCATCAGACA ATGCCATTTC TCCCCACCAC 4620 

GGAACCTTCC ATCTAAGATT TCTTCCAGGG AATGCCAGCA ATCAGGCAGC ACCCAGCTGT 4680 

GGGGGCAGTG GGGTGGGGGA GACCCACATT GATGACTTTT TTTTTTTCTT TTAATGAAGA 4740 

AACACCAAAG AAAGCTGTGG AAAGGACCTG CCCCACATGA AAAGGATAAG CCAAGATGGC 4800 

TGTAAACACA GAGCATTTGA GCTGCCACTC TTGGAGCACA TTGATTTTTC AAAAGCCAGC 4860 

TCTGTCAGGA AAGGAGGTGC TGTTATGACA GCTCTTCCAG TGGGCAAAGA GGACGCCCAT 4920 

AATTTCTTCC ATTGCTAGCT CATCTGTGGG ACCAATTTGG TGTAAGCAAC CTGTGGCCTG 4980 

CACTTGTGGC CTCGAAGGAA GCACAAACCC TCCATCCACT TCCCATTTCC TCTGCCCTTT 5040 

TCCACCTCCC CCTTCCATCC CACCAGCTGC CAGTGGCTCC CAGAAAGCCT TATTGAGCCC 5100 

CTTGTTGACA CTTGGGGCTG CGGAGGCCTT TCCCTACTGG TCTGGCCTTT CCTGAGAGGC 5160 

AGGTCTTCCG TCCTCAGAGC CTTTCTGGAA CAAGGAGAAT GCCTGTGCAG GTGGACACAC 5220 

AGGCCTGGCC TGTCGCTCTC ACTTGTCTTC CAGCGGGGAG CTTCACGTTG CCGAGTGGAA 5280 

GAACCATGAC CTCCACTTGC TTCCAAGGTG CTAGGGAAGT TTCAGGGTAC GCTGGTTCCC 5340 

CTCTCCAGCT GGAGGCCGAG TTTCTGGGGA CTGCAGATTT TTCTACTCTG TGATCGATTC 5400 

AATGCCCGAT GCTTCTGTTT CATTCCCGAC CCTTTCTACT ATGCATTTTC CTTTTATCAG 5460 

GTGTATAAAG TTAAATACTG TGTATTTATC ACTAAAAAGT ACATGAACTT AAGAGACAAC 5520 

TAAGCCTTTC GTGTTTTTCC ACAGGTGTTT AAGCTTCTCT GTACAGTTGA AATAAACAGA 5580 

CAGCAAAATG GTAAAAAAAA AAAAAAAAAA A 5611 

Seq ID NOj C193 DNA Sequence 
Nucleic Acid Accession ftt NM_018646 
Coding sequence: 217.. 2394 

1 11 21 31 41 51 

I I I I I I 

GCTCTGCCAA GTGTAACAAA CTCACAGCCC TCTCCAAACT GGCTGGGGCT GCTGGGAGAC 60 

TCCCAAGGAA CTCGTCAGGA AGGCAGGAGA CAGGAGACGG GACCTCTACA GGGAGACGGT 120 

GGGCCGGCCC TTGGGGGGGC TGATGTGGCC CCAAGGCTGA GTCCCGTCAG GGTCTGGCCT 180 

CGGCCTCAGG CCCCCAAGGA GCCGGCCCTA CACCCCATGG GTTTGTCACT GCCCAAGGAG 240 

AAAGGGCTAA TTCTCTGCCT ATGGAGCAAG TTCTGCAGAT GGTTCCAGAG ACGGGAGTCC 300 

TGGGCCCAGA GCCGAGATGA GCAGAACCTG CTGCAGCAGA AGAGGATCTG GGAGTCTCCT 360 

CTCCTTCTAG CTGCCAAAGA TAATGATGTC CAGGCCCTGA ACAAGTTGCT CAAGTATGAG 420 

GATTGCAAGG TGCACCAGAG AGGAGCCATG GGGGAAACAG CGCTACACAT AGCAGCCCTC 480 

TATGACAACC TGGAGGCCGC CATGGTGCTG ATGGAGGCTG CCCCGGAGCT GGTCTTTGAG 540 

CCCATGACAT CTGAGCTCTA TGAGGGTCAG ACTGCACTGC ACATCGCTGT TGTGAACCAG 600 

AACATGAACC TGGTGCGAGC CCTGCTTGCC CGCAGGGCCA GTGTCTCTGC CAGAGCCACA 660 

GGCACTGCCT TCCGCCGTAG TCCCTGCAAC CTCATCTACT TTGGGGAGCA CCCTTTGTCC 720 
TTTGCTGCCT GTGTGAACAG TGAGGAGATC GTGCGGCTGC TCATTGAGCA TGGAGCTGAC 780 
ATCCGGGCCC AGGACTCCCT GGGAAACACA GTGTTACACA TCCTCATCCT CCAGCCCAAC 840 
AAAACCTTTG CCTGCCAGAT GTACAACCTG TTGCTGTCCT ACGACAGACA TGGGGACCAC 900 
CTGCAGCCCC TGGACCTCGT GCCCAATCAC CAGGGTCTCA CCCCTTTCAA GCTGGCTGGA 960 
GTGGAGGGTA ACACTGTGAT GTTTCAGCAC CTGATGCAGA AGCGGAAGCA CACCCAGTGG 1020 
ACGTATGGAC CACTGACCTC GACTCTCTAT GACCTCACAG AGATCGACTC CTCAGGGGAT 1080 
GAGCAGTCCC TGCTGGAACT TATCATCACC ACCAAGAAGC GGGAGGCTCG CCAGATCCTG 1140 
GACCAGACGC CGGTGAAGGA GCTGGTGAGC CTCAAGTGGA AGCGGTACGG GCGGCCGTAC 1200 
TTCTGCATGC TGGGTGCCAT ATATCTGCTG TACATCATCT GCTTCACCAT GTGCTGCATC 1260 
TACCGCCCCC TCAAGCCCAG GACCAATAAC CGCACGAGCC CCCGGGACAA CACCCTCTTA 1320 
CAGCAGAAGC TACTTCAGGA AGCCTACATG ACCCCTAAGG AOGATATCCG GCTGGTCGGG 1380 
GAGCTGGTGA CTGTCATTGG GGCTATCATC ATCCTGCTGG TAGAGGTTCC AGACATCTTC 1440 
AGAATGGGGG TCACTCGCTT CTTTGGACAG ACCATCCTTG GGGGCCCATT CCATGTCCTC 1500 
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ATCATCACCT 
GGGGAGGTGG 
GCCCQAGQAT 
GACCTGATGC 
TATATCATCT 
GCCCTGTTCA 
GTGGACCTGC 
CTCATGCTCA 
CGGGATGAGC 
CCTCGCTGCC 
TGGTTCCTGC 
CAGGCCTTCC 
CTGGGCTGTC 
TCCCGCAGCA 
GGGATAATCA 
GTTCTCACTT 
AACCAAACAC 
CGCCAAGGAA 
GGCAGAGGAA 
TGCTCATCAC 
TGCTTGGGCG 
ACCCAGGCAG 
AACGGCTCTG 
TTTTCGGATC 



ATGCCTTCAT 
TACCCATGTC 
TCCAGATGCT 
GATTCTGCTG 
TCCAGACAGA 
GCACCTTCGA 
CCTTCATGTA 
ACCTCCTCAT 
TGTGGAGGGC 
TGTGGCCTCG 
GGGTGGAAGA 
ACACCCGGGG 
CCTTCAGCCC 
GTGCCAATTG 
ACAGGGGTCT 
CGCTTCCTGG 
CCAGAGGTCT 
TGTACGTTGA 
GCCCAGCCCA 
CCTTCCGACA 
CTCGGTCTCC 
GTGCAGGGAA 
GGGGTGGGAA 
CCTGAAAAAA 



GGTGCTGGTG 
CTTTGCACTC 
AGGCCCCTTC 
GCTGATGGCT 
GGACCCCGAG 
GCTGTTCCTT 
CAGCATCACC 
TGCCATGATG 
CCAGATTGTG 
CTCCGGGATC 
CAGGCAAGAT 
CTCTGAGGAT 
CCACCTGTCC 
GGAAAGGCTT 
GGAGGACGGG 
AACTTGCTCT 
CATCTCCCAG 
GAATCACTGC 
AGCACGGGGC 
GGAGCACTGC 
ACCCCAGGGT 
GTGCAGAGCT 
GTGGGGCTAG 
AAAAAAAAAA 



ACCATGGTGA 
GTGCTGGGCT 
ACCATCATGA 
GTGGTCATCC 
GAGCTAGGCC 
ACCATCATCG 
TATGCTGCCT 
GGCGACACTC 
GCCACCACGG 
TGCGGACGGG 
CTCAACCGGC 
TTGGACAAAG 
CTTCCTATGC 
CGGCAAGGGA 
GAGAGCTGGG 
CATTTTCCTG 
GCCCCAGGGG 
TCCAGGCCTG 
TGGCAGGGCG 
ATGTCAGAGC 
CATAAGTGGG 
TGTGGAAAGC 
GTCTTGCCAA 



TGCGGCTCAT 
GGTGCAACGT 
TTCAGAAGAT 
TGGGCTTTGC 
ACTTCTACGA 
ATGGCCCAGC 
TTGCCATCAT 
ACTGGCGAGT 
TGATGCTGGA 
AGTATGGCCT 
AGCGGATCCA 
ACTCAGTGGA 
CCTCAGTGTC 
CCCTGAGGAG 
AATATCAGAT 
GGTGCATCAA 
AGAAAGAGGA 
CATTACTCCT 
TGAGGAACTC 
ACTTTAAAAA 
GAGAGAGCCC 
GTGTGAGTGA 
CTCCATCTTC 



CAGTGCCAGC 
CATGTACTTC 
GATTTTTGGC 
TTCAGCCTTC 
CTACCCCATG 
CAACTACAAC 
CGCCACACTG 
GGCCCATGAG 
GCGGAAGCTG 
GGGAGACCGC 
ACGCTACGCA 
AAAACTAGAG 
TCGAAGTACC 
AGACCTGCGT 
CTGACTGCGT 
ACAAAACAAA 
GTAGCATGAA 
TCAGCTCTGG 
TCCTGTGGCC 
CAGGCCAGCC 
TTCCCAGGGC 
GGGAGACAGG 
AATAAAGTCG 



Seg ID NO: CI 94 DNA Sequence 

Nucleic Acid Accession Jh NM_021910.1 

Coding sequence: 260.. 601 



1 
I 

GTTCTCCACA 
GTGAGGTTTG 
ATTTCTCCCG 
CCCACTCCAC 
CCCGGGCCAG 
CTTTCCTGTC 
GCACAGCCTC 
CATCGTCATG 
TCCCCTGACC 
GTCCGGTCAC 
ATGAGGACAG 
TGCACAGAAA 
GGCAGGGCCT 
AATGATTGTG 
TTGGGGCACC 
CTGCTTGTTG 
CGCCCTCTGC 
AAGGAATGAA 
CATTCCAGGA 
ATAATAGTAA 
TTTCCAAATA 
AACATACAAT 
CCCAATACCT 
CTCAAAACAC 
ACAATAAACT 



11 

I 

ACTGCCAGCA 
CTTAGGGCGT 
GAACCTCTGC 
GGTGCAGCTG 
CGCTCTGACA 
CTGGACGCCA 
CAGGTTGGCG 
AGTGAGTGGA 
ACTCAGCTCT 
CATCCAGGGG 
ACCAGCTGAA 
CTTGAACTCC 
CATCTCACCT 
CCTCTGCCCA 
ATGAGAAGGT 
GGAAAAGCCC 
CTCCTCAGCC 
AGAGAGCTCT 
AAACTGGGAC 
GCCCCTGTGA 
CCCAAATATC 
GACCACATAG 
AATAAGGCAT 
CCAAAAATAT 
CGTGACTTGT 



21 
I 

ATCCTTCCAC 
GGCAGCTTCG 
TCAGCCTGGT 
CGGCTTATCT 
TGCAGAAGGT 
ATGACCTAGA 
GGCTCATCTG 
GGAGCTCGGG 
CCCCAACAGG 
AGACTCCACC 
ATTGGGTGGA 
AGGATGGAAT 
CTCGCAAGAG 
AGCAGCCTGG 
TGGCGTGCCC 
ACAGGCCTGT 
ATGGGACCTC 
AACCAGATGG 
ATAGGATCGT 
CCCTCTCACT 
CCTGCAAGCC 
TGTGAGAACT 
GCGAAATGTT 
CTCCTCCAAT 
CCCCTC 



31 
I 

CAGGCAAAAC 
GATAAACGCA 
GAACCACACA 
CTCAGCCCAG 
GACCCTGGGC 
AGATAAAAAC 
CGCTGGGGTT 
GGAGCAGGCG 
TGCAAAATGC 
TCTCATCACC 
GGACCGTTCT 
TCTTCCTCCT 
GGTCTCTTTG 
AGACTTCCTA 
TGGAGGCTGA 
TCCCTTGTGG 
ATATGCAATT 
AACACTGGAA 
CCCGCTATGA 
TACCCCGAGA 
CGTTAAATAA 
TCCAACAAGC 
CTCATGAACT 
GTCCTGAGAC 



41 
I 

ACATCATCTA 
GGACTCCGCC 
GGCCCGAGTT 
CGAGATGCCA 
CTGCTTGTGT 
AGTCCTTTCT 
CTGTGCGCCA 
GGCCGGGGCT 
AAATGCAAGT 
CCCGGCTCAG 
CTGTCCCCAG 
CTGCTGGGAC 
TTCAATTTTT 
TGTGTGCATT 
CACAGAGGCT 
CTTGGGACAT 
TGGGATTTAC 
CATTCCAGTG 
TGGAAGTGTT 
CCTCACTTTA 
TTCCCTATGC 
CTCAAAGTCC 
ACCCCACAAC 
ATGAACCCAA 



51 
I 

AGGAAAAGAA 
TGGCAGCCCG 
TCACCCAGTC 
GCCTTCCTGT 
TCCTGGCAGG 
ACTATGACTG 
TGGGCATCAT 
GGGGCTCCCC 
TTGGCCAGAA 
CCCAAAGCTG 
GTCCTGTCTC 
TCCTTTGCAT 
TTTAATCTAA 

GGCACTGAGC 
GGCACAGGCC 
TAGTAGCCAA 
GACCCTGGAC 
CAGACAGTTT 
TTACAAGATC 
TACCCTTAAT 
CTTGAGACTC 
AGGCCTAAAA 
AAAGAGACCC 



Seq ID NO: CI 95 DNA Sequence 

Nucleic Acid Accession #: NM_005971.2 

Coding sequence: 176.. 43 9 



1560 
1620 
1680 
1740 
1800 
1660 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2918 



1 
1 

GTTCTCCACA 
GTGAGGTTTG 
ATTTCTCCCG 
GAAGGTGACC 
CCTAGAAGAT 
CATCTGCGCT 
ATGCAAGTTT 
AGGCTCAGCC 
GTCCCCAGGT 
GCTGGGACTC 
CAATTTTTTT 
TGTGCATTGG 
CAGAGGCTGG 
TGGGACATGG 
GGATTTACTA 
TTCCAGTGGA 
GAAGTGTTCA 
TCACTTTATT 
CCCTATGCTA 
CAAAGTCCCT 
CCCACAACAC 



11 
I 

ACTGCCAGCA 
CTTAGGGCGT 
GAACCTCTGC 
CTGGGCCTGC 
AAAAACAGTC 
GGGGTTCTGT 
GGCCAGAAGT 
CAAAGCTGAT 
CCTGTCTCTG 
CTTTGCATGG 
TAATCTAAAA 
GGTGGGGCTT 
CACTGAGCCT 
CACAGGCCCG 
GTAGCCAAAA 
CCCTGGACCA 
GACAGTTTAT 
ACAAGATCTT 
CCCTTAATAA 
TGAGACTCCC 
GCCTAAAACT 



21 
I 

ATCCTTCCAC 
GGCAGCTTCG 
TCAGCCTGGT 
TTGTGTTCCT 
CTTTCTACTA 
GCGCCATGGG 
CCGGTCACCA 
GAGGACAGAC 
CACAGAAACT 
CAGGGCCTCA 
TGATTGTGCC 
GGGGCACCAT 
GCTTGTTGGG 
CCCTCTGCCT 
GGAATGAAAG 
TTCCAGGAAA 
AATAGTAAGC 
TCCAAATACC 
CATACAATGA 
CAATACCTAA 
CAAAACACCC 



31 
I 

CAGGCAAAAC 
GATAAACGCA 
GAACCACACA 
GGCAGGCTTT 
TGACTGGCAC 
CATCATCATC 
TCCAGGGGAG 
CAGCTGAAAT 
TGAACTCCAG 
TCTCACCTCT 
TCTGCCCAAG 
GAGAAGGTTG 
AAAAGCCCAC 
CCTCAGCCAT 
AGAGCTCTAA 
ACTGGGACAT 
CCCTGTGACC 
CAAATATCCC 
CCACATAGTG 
TAAGGCATGC 
AAAAATATCT 



41 
I 

ACATCATCTA 
GGACTCCGCC 
GGCCAGOGCT 
CCTGTCCTGG 
AGCCTCCAGG 
GTCATGAGTG 
ACTCCACCTC 
TGGGTGGAGG 
GATGGAATTC 
CGCAAGAGGG 
CAGCCTGGAG 
GCGTGCCCTG 
AGGCCTGTTC 
GGGACCTCAT 
CCAGATGGAA 
AGGATCGTCC 
CTCTCACTTA 
TGCAAGCCCG 
TGAGAACTTC 
GAAATGTTCT 
CCTCCAATGT 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1466 



51 
I 

AGGAAAAGAA 
TGGCAGCCCG 
CTGACATGCA 
ACGCCAATGA 
TTGGCGGGCT 
CAAAATGCAA 
TCATCACCCC 
ACCGTTCTCT 
TTCCTCCTCT 
TCTCTTTGTT 
ACTTCCTATG 
GAGGCTGACA 
CCTTGTGGCT 
ATGCAATTTG 
CACTGGAACA 
CGCTATGATG 
CCCCGAGACC 
TTAAATAATT 
CAACAAGCCT 
CATGAACTAC 
CCTGAGACAT 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 



1322 



WO 03/042661 



GAACCCAAAA AGAGACCCAC AATAAACTCG TGACTTGTCC CCTC 1304 

Seq ID NO: C196 DNA Sequence 

Nucleic Acid Accession ft: NM_O0496l.2 

Coding sequence: S5..157S 

1 11 21 31 41 51 

I I 1 I I I 

GCCAGAGCGT GAGCCGCGAC CTCCGCGCAG GTGGTCGCGC CGGTCTCCGC GGAAATGTTO 60 

TCCAAAGTTC TTCCAGTCCT CCTAGGCATC TTATTGATCC TCCAGTCGAG GGTCGAGGGA 120 

CCTCAGACTG AATCAAAGAA TGAAGCCTCT TCCCGTGATG TTGTCTATGG CCCCCAGCCC 180 

CAGCCTCTGG AAAATCAGCT CCTCTCTGAG GAAACAAAGT CAACTGAGAC TGAGACTGGG 240 

AGCAGAGTTG GCAAACTGCC AGAAGCCTCT CGCATCCTGA ACACTATCCT GAGTAATTAT 30 0 

GACCACAAAC TGCGCCCTGG CATTGGAGAG AAGCCCACTG TGGTCACTGT TGAGATCGCC 360 

GTCAACAGCC TTGGTCCTCT CTCTATCCTA GACATGGAAT ACACCATTGA CATCATCTTC 420 

TCCCAGACCT GGTACGACGA ACGCCTCTGT TACAACGACA CCTTTGAGTC TCTTGTTCTG 480 

AATGGCAATG TGGTGAGCCA GCTATGGATC CCGGACACCT TTTTTAGGAA TTCTAAGAGG 540 

ACCCACGAGC ATGAGAT CAC CATGCCCAAC CAGATGGTCC GCATCTACAA GGATGGCAAG 600 

GTGTTGTACA CAATTAGGAT GACCATTGAT GCCGGATGCT CACTCCACAT GCTCAGATTT 660 

CCAATGGATT CTCACTCTTG CCCTCTATCT TTCTCTAGCT TTTCCTATCC TGAGAATGAG 720 

ATGATCTACA AGTGGGAAAA TTTCAAGCTT GAAATCAATG AGAAGAACTC CTGGAAGCTC 780 

TTCCAGTTTG ATTTTACAGG AGTGAGCAAC AAAACTGAAA TAATCACAAC CCCAGTTGGT 840 

GACTTCATGG TCATGACGAT TTTCTTCAAT GTGAGCAGGC GGTTTGGCTA TGTTGCCTTT 900 

CAAAACTATG TCCCTTCTTC CGTGACCACG ATGCTCTCCT GGGTTTCCTT TTGGATCAAG 960 

ACAGAGTCTG CTCCAGCCCG GACCTCTCTA GGGATCACCT CTGTTCTGAC CATGACCACG 1020 

TTGGGCACCT TTTCTCGTAA GAATTTCCCG CGTGTCTCCT ATATCACAGC CTTGGATTTC 1080 

TATATCGCCA TCTGCTTCGT CTTCTGCTTC TGCGCTCTGT TGGAGTTTGC TGTGCTCAAC 1140 

TTCCTGATCT ACAACCAGAC AAAAGCCCAT GCTTCTCCTA AACTCCGCCA TCCTCGTATC 12 Q0 

AATAGCCGTG CCCATGCCCG TACCOGTGCA CGTTCCCGAG CCTGTGCCCQ CCAACATCAG 1260 

GAAGCTTTTG TGTGCCAGAT TGTCACCACT GAGGGAAGTG ATGGAGAGGA GCGCCCGTCT 1320 

TGCTCAGCCC AGCAGCCCCC TAGCCCAGGT AGCCCTGAGG GTCCCCGCAG CCTCTGCTCC 1380 

AAGCTGGCCT GCTGTGAGTG GTGCAAGCGT TTTAAGAAGT ACTTCTGCAT GGTCCCCGAT 1440 

TGTGAGGGCA GTACCTGGCA GCAGGGCCGC CTCTGCATCC ATGTCTACCG CCTGGATAAC 1500 

TACTCGAGAG TTGTTTTCCC AGTGACTTTC TTCTTCTTCA ATQTGCTCTA CTGGCTTGTT 1560 

TGCCTTAACT TGTAGGTACC AGCTGGTACC CTGTGGGGCA ACCTCTCCAG TTCCCCAGGA 1620 

GGTCCAAGCC CCTTGCCAAG GGAGTTGGGG GAAAGCAGCA GCAGCAGCAG GAGCGACTAG 1680 

AGTTTTTCCT GCCCCATTCC CCAAACAGAA GCTTGCAGAG GGTTTGTCTT TGCTGCCCCT 1740 

CTCCCCTACC TGGCCCATTC ACTGAGTCTT CTCAGCAGAC CATTTCAAAT TATTAATAAA 1800 

TGGGCCACCT CCCTCTTCTT CAAGGAGCAT CCGTGATGCT CAGTGTTCAA AACCACAGCC 1860 

ACTTAGTGAT CAGCTCCCTA AAACCATGCC TAAGTACAGG CGGATTAGCT ATCTTCCAAC 1920 

AATGCTGACC ACCAGACAAT TACTGCATTT TTCCAGAAGC CCACTATTGC CTTTGTAGTG 1980 

CTTTCGGCCC AGTTCTGGCC TCAGCCTCAA AGTGCACCGA CTAGTTGCTT GCCTATACCT 2040 

GGCACCTCAT TAAGATGCTG GGCAGCAGTA TAACAGGAGG AAGAGATCCC TCTCCTTTGG 2100 

TCAGATTATT ATGTTCTCAG TTCTCTCTCC CTGCTACCCC TTTCTCTGCA GATAGATAGA 2160 

CACTGGCATT ATCCCTTTAG GAAGAGGGGG GGGCAGCAAG AGAGCCTATT TGGGACAGCA 2220 

TTCCTCTCTC TCTGCTGCTG TGACATCTCC CTCTCCTTGC TGGCTCCATC TTTCGTCTGC 2280 

ACTACCAATT CAATGCCCTT CATCCAATGG GTATCTATTT TTGTGTGTGA TTATAGTAAC 2340 

TACTCCCTGC TTTATATGCC ACCCTCTTCC TTCTCTTTGA CCCCTGTGAC TCTTTCTGTA 2400 

ACTTTCCCAG TGACTTCCCC TAGCCCTGAC CCAGGCACTA GGCCTTGGTG ACTTCCTGGG 2460 

GCCAAGAAAC TAAGGAAACT CGGCTTTGCA ACAGGCATTA CTCGCCATTG ATTGGTGCCC 2520 

ACCCAGGGCA CACTGTCGGA GTTCTATCAC TTGCTTGACC CCTGGACCCA TAAACCAGTC 2580 

CACTGTTATA CCCGGGGCAC TCTAACCATC ACAATCAATC AATCAAATTC CCTTAAATTT 2640 

GTATGGCACT GGAACTTTGG CAAAGCACTT TTGACAAGTT GTGTCTGATT GGAGCTTCAT 2700 

GATAGCCTTG TGACATCTTT AGGGCAGGAT TCTTATCCCC ATTTTGCAGA TGAAAACCCT 2760 

GAGTCACAGA TTTCTGTGGG ACTGTGGATC TCACTGGAAG CTATCCAAGA GCCCACTGTC 2820 

ACCTTCTAGA CCACATGATA GGGCTAGACA GCTCAGTTCA CCATGATTCT CTTCTGTCAC 2880 

CTCTGCTGGC ACACCAGTGG CAAGGCCCAG AATGGCGACC TCTCTTTAGC TCAATTTCTG 2940 

GGCCTGAGGT GCTCAGACTG CCCCCAAGAT CAAATCTCTC CTGGCTGTAG TAACCCAGTG 3000 

GAATGAATTT GGACATGCCC CAATGCTTCT ATATGCTAAG TGAAATCTGT GTCTGTAATT 3060 

TGTTGGGGGG TGGATAGGGT GGGGTCTCCA TCTACTTTTT GTCACCATCA TCTGAAATGG 3120 

GGAAATATGT AAATAAATAT ATCAGCAAAG CAAAAAGAAA AAAAAAAA 3168 

Seq ID NO: C197 DNA Sequence 

Nucleic Acid Accession ft: NM_021984.l 

Coding sequence: S72..1753 

1 11 21 31 41 51 

I 1 1 I I I 

GCCAGAGCGT GAGCCGCGAC CTCCGCGCAG GTGGTCGCGC CGGTCTCCGC GGAAATGTTG 60 

TCCAAAGTTC TTCCAGTCCT CCTAGGCATC TTATTGATCC TCCAGTCGAG AACATGTATA 120 

CAGAGAAGTG CTCAAATCAT AAGTGTACAG CTGATGAGTT GTCAAAAAAT GACCACAGCG 180 

GTGTAAAGAA AGCCAAATCA AGGACCCGAA TGTGAGCAGG ACCTCAGAAG CCCCCTTTGT 240 

CACTGCCTCC CAGCAAAGGC AGCACTATCC GGACTTCTAA CACCATCGGG TCGAGGGACC 300 

TCAGACTGAA TCAAAGAATG AAGCCTCTTC CCGTGATGTT GTCTATGGCC CCCAGCCCCA 360 

GCCTCTGGAA AATCAGCTCC TCTCTGAGGA AACAAAGTCA ACTGAGACTG AGACTGGGAG 420 

CAGAGTTGGC AAACTGCCAG AAGCCTCTCG CATCCTGAAC ACTATCCTGA GTAATTATGA 480 

CCACAAACTG CGCCCTGGCA TTGGAGAGAA GCCCACTGTG GTCACTGTTG AGATCTCCGT 540 

CAACAGCCTT GGTCCTCTCT CTATCCTAGA CATGGAATAC ACCATTGACA TCATCTTCTC 600 

CCAGACCTGG TACGACGAAC GCCTCTGTTA CAACGACACC TTTGAGTCTC TTGTTCTGAA 660 

TGGCAATGTG GTGAGCCAGC TATGGATCCC GGACACCTTT TTTAGGAATT CTAAGAGGAC 720 

CCACGAGCAT GAGATCACCA TGCCCAACCA GATGGTCCGC ATCTACAAGG ATGGCAAGGT 780 

GTTGTACACA ATTAGGATGA CCATTGATGC CGGATGCTCA CTCCACATGC TCAGATTTCC 840 

AATGGATTCT CACTCTTGCC CTCTATCTTT CTCTAGCTTT TCCTATCCTG AGAATGAGAT 900 
GATCTACAAG TGGGAAAATT TCAAGCTTGA AATCAATGAG AAGAACTCCT GGAAGCTCTT 960 
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CCAGTTGGAT TTTACAGGAG TGAGCAACAA AACTGAAATA ATCACAACCC CAGTTGGTGA 1020 

CTTCATGGTC ATGACGATTT TCTTCAATGT GAGCAGGCGG TTTGGCTATG TTGCCTTTCA 1080 

AAACTATGTC CCTTCTTCCG TGACCACGAT GCTCTCCTGG GTTTCCTTTT GGATCAAGAC 1140 

AGAGTCTGCT CCAGCCCGGA CCTCTCTAGG GATCACCTCT GTTCTGACCA TGACCACGTT 1200 

5 GGGCACCTTT TCTCGTAAGA ATTTCCCGCG TGTCTCCTAT ATCACAGCCT TGGATTTCTA 1260 

TATCGCCATC TGCTTCGTCT TCTGCTTCTG CGCTCTGTTG GAGTTTGCTG TGCTCAACTT 1320 

CCTGATCTAC AACCAGACAA AAGCCCATGC TTCTCCTAAA CTCCGCCATC CTCGTATCAA 1380 

TAGCCGTGCC CATGCCCGTA CCCGTGCACG TTCCCGAGCC TGTGCCCGCC AACATCAGGA 1440 

AGCTTTTGTG TGCCAGATTG TCACCACTGA GGGAAGTGAT GGAGAGGAGC GCCCGTCTTG 1500 

10 CTCAGCCCAG CAGCCCCCTA GCCCAGGTAG CCCTGAGGGT CCCCGCAGCC TCTGCTCCAA 1560 

GCTGGCCTGC TGTGAGTGGT GCAAGCGTTT TAAGAAGTAC TTCTGCATGG TCCCCGATTG 1620 

TGAGGGCAGT ACCTGGCAGC AGGCCCGCCT CTGCATCCAT GTCTACCGCC TGGATAACTA 16 BO 

CTCGAGAGTT GTTTTCCCAG TGACTTTCTT CTTCTTCAAT GTGCTCTACT GGCTTGTTTG 174 0 

CCTTAACTTG TAGGTACCAG CTGGTACCCT GTGGGGCAAC CTCTCCAGTT CCCCAGGAGG 1800 

15 TCCAAGCCCC TTGCCAAGGG AGTTGGGGGA AAGCAGCAGC AGCAGCAGGA GCGACTAGAG I860 

TTTTTCCTGC CCCATTCCCC AAACAGAAGC TTGCAGAGGG TTTGTCTTTG CTGCCCCTCT 1920 

CCCCTACCTG GCCCATTCAC TGAGTTTTCT CAGCAGACCA TTTCAAATTA TTAATAAATG 1980 

GGCCACCTCC CTCTTCTTCA AGGAGCATCC GTGATGCTCA GTGTTCAAAA CCACAGCCAC 2040 

TTAGTGATCA GCTCCCTAAA ACCATGCCTA AGTACAGGCG GATTAGCTAT CTTCCAACAA 2100 

20 TGCTGACCAC CAGACAATTA CTGCATTTTT CCAGAAGCCC ACTATTGCCT TTGCAGTGCT 2160 

TTCGGCCCAG TTCTGGCCTC AGCCTCAAAG TGCACCGACT AGTTGCTTGC CTATACCTGG 2220 

CACCTCATTA AGATGCTGGG CAGCAGTATA ACAGGAGGAA GAGATCCCTC TCCTTTGGTC 2280 

AGATTATTAT GTTCTCAGTT CTCTCTCCCT GCTACCCCTT TCTCTGCAGA TAGATAGACA 2340 

CTGGCATTAT CCCTTTAGGA AGAGGGGGGG GCAGCAAGAG AGCCTATTTG GGACAGCATT 2400 

25 CCTCTCTCTC TGCTGCTGTG ACATCTCCCT CTCCTTGCTG GCTCCATCTT TCGTCTGCAC 2460 

TACCAATTCA ATGCCCTTCA TCCAATGGGT ATCTATTTTT GTGTGTGATT ATAGTAACTA 2520 

CTCCCTGCTT TATATGCCAC CCTCTTCCTT CTCTTTGACC CCTGTGACTC TTTCTGTAAC 2580 

TTTCCCAGTG ACTTCCCCTA GCCCTGACCC AGGCACTAGG CCTTGGTGAC TTCCTGGGGC 2640 

CAAGAAACTA AGGAAACTCG GCTTTGCAAC AGGCATTACT CGCCATTGAT TGGTGCCCAC 2700 

30 CCAGGGCACA CTGTCGGAGT TCTATCACTT GCTTGACCCC TGGACCCATA AACCAGTCCA 2760 

CTGTTATACC CGGGGCACTC TAACCATCAC AATCAATCAA TCAAATTCCC TTAAATTTGT 2820 

ATGGCACTGG AACTTTGGCA AAGCACTTTT GACAAGTTGT GTCTGATTGG AGCTTCATGA 2880 

TAGCCTTGTG ACATCTTTAG GGCAGGATTC TTATCCCCAT TTTGCAGATG AAAACCCTGA 2940 

GTCACAGATT TCTGTGGGAC TGTGGATCTC ACTGGAAGCT ATCCAAGAGC CCACTGTCAC 3000 

35 CTTCTAGACC ACATGATAGG GCTAGACAGC TCAGTTCACC ATGATTCTCT TCTGTCACCT 3060 

CTGCTGGCAC ACCAGTGGCA AGGCCCAGAA TGGCGACCTC TCTTTAGCTC AATTTCTGGG 3120 

CCTGAGGTGC TCAGACTGCC CCCAAGATCA AATCTCTCCT GGCTGTAGTA ACCCAGTGGA 3180 

ATGAATTTGG ACATGCCCCA ATGCTTCTAT ATGCTAAGTG AAATCTGTGT CTGTAATTTG 3240 

TTGGGGGGTG GATAGGGTGG GGTCTCCATC TACTTTTTGT CACCATCATC TGAAATGGGG 3300 

40 AAATATGTAA ATAAATATAT CAGCAAAGC 3329 
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Seq ID NO: CI 98 DNA Sequence 
Nucleic Acid Accession #: NM_021987. 
Coding sequence: 572.. 1657 



11 



21 



31 



41 



1 

I I 

GCCAGAGCGT GAGCCGCGAC CTCCGCGCAG GTGGTCGCGC CGGTCTCCGC 
TCCAAAGTTC TTCCAGTCCT CCTAGGCATC TTATTGATCC TCCAGTCGAG 
CAGAGAAGTG CTCAAATCAT AAGTGTACAG CTGATGAGTT GTCAAAAAAT 
GTGTAAAGAA AGCCAAATCA AGGACCCGAA TGTGAGCAGG ACCTCAGAAG 
CACTGCCTCC CAGCAAAGGC AGCACTATCC GGACTTCTAA CACCATCGGG 
TCAGACTGAA TCAAAGAATG AAGCCTCTTC CCGTGATGTT GTCTATGGCC 
GCCTCTGGAA AATCAGCTCC TCTCTGAGGA AACAAAGTCA ACTGAGACTG 
CAGAGTTGGC AAACTGCCAG AAGCCTCTCG CATCCTGAAC ACTATCCTGA 
CCACAAACTG CGCCCTGGCA TTGGAGAGAA GCCCACTGTG GTCACTGTTG 
CAACAGCCTT GGTCCTCTCT CTATCCTAGA CATGGAATAC ACCATTGACA 
CCAGACCTGG AATTCTAAGA GGACCCACGA GCATGAGATC ACCATGCCCA 
CCGCATCTAC AAGGATGGCA AGGTGTTGTA CACAATTAGG ATGACCATTG 
CTCACTCCAC ATGCTCAGAT TTCCAATGGA TTCTCACTCT TGCCCTCTAT 
CTTTTCCTAT CCTGAGAATG AGATGATCTA CAAGTGGGAA AATTTCAAGC 
TGAGAAGAAC TCCTGGAAGC TCTTCCAGTT TGATTTTACA GGAGTGAGCA 
AATAATCACA ACCCCAGTTG GTGACTTCAT GGTCATGACG ATTTTCTTCA 
GCGGTTTGGC TATGTTGCCT TTCAAAACTA TGTCCCTTCT TCCGTGACCA 
CTGGGTTTCC TTTTGGATCA AGACAGAGTC TGCTCCAGCC CGGACCTCTC 
CTCTGTTCTG ACCATGACCA CGTTGGGCAC CTTTTCTCGT AAGAATTTCC 
CTATATCACA GCCTTGGATT TCTATATCGC CATCTGCTTC GTCTTCTGCT 
GTTGGAGTTT GCTGTGCTCA ACTTCCTGAT CTACAACCAG ACAAAAGCCC 
TAAACTCCGC CATCCTCGTA TCAATAGCCG TGCCCATGCC CGTACCCGTG 
AGCCTGTGCC CGCCAACATC AGGAAGCTTT TGTGTGCCAG ATTGTCACCA 
TGATGGAGAG GAGCGCCCGT CTTGCTCAGC CCAGCAGCCC CCTAGCCCAG 
GGGTCCCCGC AGCCTCTGCT CCAAGCTGGC CTGCTGTGAG TGGTGCAAGC 
GTACTTCTGC ATGGTCCCCG ATTGTGAGGG CAGTACCTGG CAGCAGGGCC 
CCATGTCTAC CGCCTGGATA ACTACTCGAG AGTTGTTTTC CCAGTGACTT 
CAATGTGCTC TACTGGCTTG TTTGCCTTAA CTTGTAGGTA CCAGCTGGTA 
CAACCTCTCC AGTTCCCCAG GAGGTCCAAG CCCCTTGCCA AGGGAGTTGG 
CAGCAGCAGC AGGAGCGACT AGAGTTTTTC CTGCCCCATT CCCCAAACAG 
AGGGTTTGTC TTTGCTGCCC CTCTCCCCTA CCTGGCCCAT TCACTGAGTT 
ACCATTTCAA ATTATTAATA AATGGGCCAC CTCCCTCTTC TTCAAGGAGC 
CTCAGTGTTC AAAACCACAG CCACTTAGTG ATCAGCTCCC TAAAACCATG 
GGCGGATTAG CTATCTTCCA ACAATGCTGA CCACCAGACA ATTACTGCAT 
GCCCACTATT GCCTTTGCAG TGCTTTCGGC CCAGTTCTGG CCTCAGCCTC 
GACTAGTTGC TTGCCTATAC CTGGCACCTC ATTAAGATGC TGGGCAGCAG 
GGAAGAGATC CCTCTCCTTT GGTCAGATTA TTATGTTCTC AGTTCTCTCT 



CCCTGTGGGG 
GGGAAAGCAG 
AAGCTTGCAG 
TTCTCAGCAG 
ATCCGTGATG 
CCTAAQTACA 
TTTTCCAGAA 
AAAGTGCACC 
TATAACAGGA 
CCCTGCTACC 
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CCTTTCTCTG CAGATAGATA GACACTGGCA TTATCCCTTT AGGAAGAGGG GGGGGCAGCA 2280 

AGAGAGCCTA TTTGGGACAG CATTCCTCTC TCTCTGCTGC TGTGACATCT CCCTCTCCTT 2340 

GCTGGCTCCA TCTTTCGTCT GCACTACCAA TTCAATGCCC TTCATCCAAT GGGTATCTAT 2400 

TTTTGTGTGT GATTATAGTA ACTACTCCCT GCTTTATATG CCACCCTCTT CCTTCTCTTT 2460 

5 GACCCCTGTG ACTCTTTCTG TAACTTTCCC AGTGACTTCC CCTAGCCCTG ACCAGGCACT 2S20 

AGGCCTTGGT GACTTCCTGG GGCCAAGAAA CTAAGGAAAC TCGGCTTTGC AACAGGCATT 2580 

ACTCGCCATT GATTGGTGCC CACCCAGGGC ACACTGTCGG AGTTCTATCA CTTGCTTGAC 2640 

CCCTGGACCC ATAAACCAGT CCACTGTTAT ACCCGGGGCA CTCTAAOCAT CACAATCAAT 2700 

CAATCAAATT CCCTTAAATT TGTATGGCAC TGGAACTTTG GCAAAGCACT TTTGACAAGT 2760 

10 TGTGTCTGAT TGGAGCTTCA TGATAGCCTT GTGACATCTT TAGGGCAGGA TTCTTATCCC 2820 

CATTTTGCAG ATGAAAACCC TGAGTCACAG ATTTCTGTGG GACTGTGGAT CTCACTGGAA 2880 

GCTATCCAAG AGCCCACTGT CACCTTCTAG ACCACATGAT AGGGCTAGAC AGCTCAGTTC 294 0 

ACCATGATTC TCTTCTGTCA CCTCTGCTGG CACACCAGTG GCAAGGCCCA GAATGGCGAC 3000 

CTCTCTTTAG CTCAATTTCT GGGCCTGAGG TGCTCAGACT GCCCCCAAGA TCAAATCTCT 3060 

15 CCTGGCTGTA GTAACCCAGT GGAATGAATT TGGACATGCC CCAATGCTTC TATATGCTAA 3120 

GTGAAATCTG TGTCTGTAAT TTGTTGGGGG GTGGATAGGG TGGGGTCTCC ATCTACTTTT 3X80 

TGTCACCATC ATCTGAAATG GGGAAATATG TAAATAAATA TATCAGCAAA GC 3232 

Seq ID NO: C199 DNA Sequence 
20 Nucleic Acid Accession #: NM_021990.1 
Coding sequence: 1309.. 2490 

1 11 21 31 41 51 

0 r I I I I I I 

ZD GCCAGAGOGT GAGCCGCGAC CTCCGCGCAG GTGGTCGCGC CGGTCTCCGC GGAAATGTTG 60 

TCCAAAGTTC TTCCAGTCCT CCTAGGCATC TTATTGATOC TCCAGTCGAG AACATGTATA 120 

CAGAGAAGTG CTCAAATCAT AAGTGTACAG CTGATGAGTT GTCAAAAAAT GACCACAGCG 180 

GTGTAAAGAA AGCCAAATCA AGGACCCGAA TGTGAGCAGG ACCTCAGAAG CCCCCTTTGT 240 

CACTGCCTCC CAGCAAAGGC AGCACTATCC GGACTTCTAA CACCATCGGT GAGTTTCATA 300 

30 CCTTGGCAGA TGGCCTTTAA CATTTTTGTT TAATTCAATT ATTCTTACTA ATCTTCTTCT 360 

TTTTCTTGGC TGTGGTGCAT GGCTGTGGAG CTCAGGGTGG ACTCCTGTTG GGCAGCCAGT 420 

TCCTGGATGG CTGTCTGTGG GTGGAGGACT CCTGCCTTTC CTGTTTAGAC ACCCACAAAG 480 

GCTGCTCTTT AGCCTCCTTC CCTTCATCCC CTTCCCCTGC CCCCAGTGCA ACGAGTATTA 540 

CACAACCAAC AAAACCGCAA AATATTCCCA CAATTTTCTG GTCCTCTCTG GGAGAGGCCG 600 

35 CTCTGGCTTT TCCTCTCAGC CCTGGCCCTC TGCCTGCTCC TCACTCCTGG TTGGTGCTGG 660 

TCAGGCTGAC TAGAGGCCAA GGCGACCAAC ACTAGGCAAA CGCGGCCAGC GCTCAGACAT 720 

AAATGCCCTC TTCATTTCAC GTGTAACATT CTTTTAAAAT CTAGGTCTTG GTTTTGTTGA 780 

TTTTTTCTTA AATAAAAGAG TGATCATAAA AGAGGGACAG CATAGAAAGT CCCCAAAGAG 840 

CAGCAAGGTT TTAAAGAAAT TCACAAGCCT AATCTGTCAC TGTCTTATAA TTTGCTATTA 900 

40 CCAGTCACAA TTTAACTAGG TTTTGTGTTG AAAACTTGTT TTGGTTTGCT TCTGTCCCAA 960 

GAGGCACTAG CTGGGGCCCC TACAGAGTGC AGGGCAGAGC TTCATTTTTC GTTTGAATGT 1020 

TCTAGGGTCG AGGGACCTCA GACTGAATCA AAGAATGAAG CCTCTTCCCG TGATGTTGTC 1080 

TATGGCCCCC AGCCCCAGCC TCTGGAAAAT CAGCTCCTCT CTGAGGAAAC AAAGTCAACT 1140 

GAGACTGAGA CTGGGAGCAG AGTTGGCAAA CTGCCAGAAG CCTCTCGCAT CCTGAACACT 1200 

45 ATCCTGAGTA ATTATGACCA CAAACTGCGC CCTGGCATTG GAGAGAAGCC CACTGTGGTC 1260 

ACTGTTGAGA TCTCCGTCAA CAGCCTTGGT CCTCTCTCTA TCCTAGACAT GGAATACACC 1320 

ATTGACATCA TCTTCTCCCA GACCTGGTAC GACGAACGCC TCTGTTACAA CGACACCTTT 1380 

GAGTCTCTTG TTCTGAATGG CAATGTGGTG AGCCAGCTAT GGATCCCGGA CACCTTTTTT 1440 

AGGAATTCTA AGAGGACCCA CGAGCATGAG ATCACCATGC CCAACCAGAT GGTCCGCATC 1500 

50 TACAAGGATG GCAAGGTGTT GTACACAATT AGGATGACCA TTGATGCCGG ATGCTCACTC 1560 

CACATGCTCA GATTTCCAAT GGATTCTCAC TCTTGCCCTC TATCTTTCTC TAGCTTTTCC 1620 

TATCCTGAGA ATGAGATGAT CTACAAGTGG GAAAATTTCA AGCTTGAAAT CAATGAGAAG 1680 

AACTCCTGGA AGCTCTTCCA GTTTGATTTT ACAGGAGTGA GCAACAAAAC TGAAATAATC 1740 

ACAACCCCAG TTGGTGACTT CATGGTCATG ACGATTTTCT TCAATGTGAG CAGGCGGTTT 1800 

55 GGCTATGTTG CCTTTCAAAA CTATGTCCCT TCTTCCGTGA CCACGATGCT CTCCTGGGTT 1860 

TCCTTTTGGA TCAAGACAGA GTCTGCTCCA GCOCGGACCT CTCTAGGGAT CACCTCTGTT 1920 

CTGACCATGA CCACGTTGGG CACCTTTTCT CGTAAGAATT TCCCGCGTGT CTCCTATATC 1980 

ACAGCCTTGG ATTTCTATAT CGCCATCTGC TTCGTCTTCT GCTTCTGCGC TCTGTTGGAG 2040 

TTTGCTGTGC TCAACTTCCT GATCTACAAC CAGACAAAAG CCCATGCTTC TCCTAAACTC 2100 

60 CGCCATCCTC GTATCAATAG CCGTGCCCAT GCCCGTACCC GTGCACGTTC CCGAGCCTGT 2160 

GCCOGCCAAC ATCAGGAAGC TTTTGTGTGC CAGATTGTCA CCACTGAGGG AAGTGATGGA 2220 

GAGGAGCGCC CGTCTTGCTC AGCCCAGCAG CCCCCTAGCC CAGGTAGCCC TGAGGGTCCC 2280 

CGCAGCCTCT GCTCCAAGCT GGCCTGCTGT GAGTGGTGCA AGCGTTTTAA GAAGTACTTC 2340 

TGCATGGTCC CCGATTGTGA GGGCAGTACC TGGCAGCAGG GCCGCCTCTG CATCCATGTC 2400 

65 TACCGCCTGG ATAACTACTC GAGAGTTGTT TTCCCAGTGA CTTTCTTCTT CTTCAATGTG 2460 

CTCTACTGGC TTGTTTGCCT TAACTTGTAG GTACCAGCTG GTACCCTGTG GGGCAACCTC 2520 

TCCAGTTCCC CAGGAGGTCC AAGCCCCTTG CCAAGGGAGT TGGGGGAAAG CAGCAGCAGC 2580 

AGCAGGAGCG ACTAGAQTTT TTCCTGCCCC ATTCCCCAAA CAGAAGCTTG CAGAGGGTTT 2640 

GTCTTTGCTG CCCCTCTCCC CTACCTGGCC CATTCACTGA GTTTTCTCAG CAGACCATTT 2700 

70 CAAATTATTA ATAAATGGGC CACCTCCCTC TTCTTCAAGG AGCATCCGTG ATGCTCAGTG 2760 

TTCAAAACCA CAGCCACTTA GTGATCAGCT CCCTAAAACC ATGCCTAAGT ACAGGCGGAT 2820 

TAGCTATCTT CCAACAATGC TGACCACCAG ACAATTACTG CATTTTTCCA GAAGCCCACT 2880 

ATTGCCTTTG CAGTGCTTTC GGCCCAGTTC TGGCCTCAGC CTCAAAGTGC ACCGACTAGT 2940 

_ _ TGCTTGCCTA TACCTGGCAC CTCATTAAGA TGCTGGGCAG CAGTATAACA GGAG GAAG AG 3000 

75 ATCCCTCTCC TTTGGTCAGA TTATTATGTT CTCAGTTCTC TCTOCCTGCT ACCCCTTTCT 3060 
CTGCAGATAG ATAGACACTG GCATTATCCC TTTAGGAAGA GGGGGGGGCA GCAAGAGAGC 3120 
CTATTTGGGA CAGCATTCCT CTCTCTCTGC TGCTGTGACA TCTCCCTCTC CTTGCTGGCT 3180 
CCATCTTTCG TCTGCACTAC CAATTCAATG CCCTTCATCC AATGGGTATC TATTTTTGTG 3240 
TGTGATTATA GTAACTACTC CCTGCTTTAT ATGCCACCCT CTTCCTTCTC TTTGACCCCT 3300 
80 GTGACTCTTT CTGTAACTTT CCCAGTGACT TCCCCTAGCC CTGACCAGGC ACTAGGCCTT 3360 
GGTGACTTCC TGGGGCCAAG AAACTAAGGA AACTCGGCTT TGCAACAGGC ATTACTCGCC 3420 
ATTGATTGGT GCCCACCCAG GGCACACTGT CGGAGTTCTA TCACTTGCTT GACCCCTGGA 3480 
CCCATAAACC AGTCCACTGT TATACCCGGG GCACTCTAAC CATCACAATC AATCAATCAA 3540 
ATTCCCTTAA ATTTGTATGG CACTGGAACT TTGGCAAAGC ACTTTTGACA AGTTGTGTCT 3600 
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GATTCGAGCT TCATGATAGC CTTGTGACAT CTTTAGGGCA GGATTCTTAT CCCCATTTTG 3660 

CAGATGAAAA CCCTGAGTCA CAGATTTCTG TGGGACTGTG GATCTCACTG GAAGCTATCC 3720 

AAGAGCCCAC TGTCACCTTC TAGACCACAT GATAGGGCTA GACAGCTCAG TTCACCATGA 3780 

TTCTCTT CT G TCACCTCTGC TGGCACACCA GTGGCAAGGC CCAGAATGGC GACCTCTCTT 3840 

TAGCTCAATT TCTGGGCCTG AGGTGCTCAG ACTGCCCCCA AGATCAAATC TCTCCTGGCT 3900 

GTAGTAACCC AGTGGAATGA ATTTGGACAT GCCCCAATGC TTCTATATGC TAAGTGAAAT 3960 

CTGTGTCTGT AATTTGTTGG GGGGTGGATA GGGTGGGGTC TCCATCTACT TTTTGTCACC 4020 

ATCATCTGAA ATGGGGAAAT ATGTAAATAA ATATATCAGC AAAGC 4065 

Seq ID NO: C200 DNA Sequence 

Nucleic Acid Accession ft: NM_021819.1 

Coding sequence: 3 9.. 1619 



1 11 21 31 41 51 

15 1 | | | | | 

TAGGGCGGTC GCGTAATTCG GCACGAGGGC GCTCACGTAT GCCGGCGGTC AGTGGTCCAG 60 

GTCCCTTATT CTGCCTTCTC CTCCTGCTCC TGGACCCCCA CAGCCCTGAG ACGGGGTGTC 120 

CTCCTCTACG CAGGTTTGAG TACAAGCTCA GCTTCAAAGG CCCAAGGCTG GCATTGCCTG 180 

GGGCTGGAAT ACCCTTCTGG AGCCATCATG GAGACGCCAT CCTGGGCCTG GAGGAAGTGC 240 

20 GGCTGACGCC ATCCATGAGG AACCGGAGTG GCGCCGTGTG GAGCAGGGCC TCTGTCCCCT 300 

t TCTCTGCCTG GGAAGTAGAG GTGCAAATGA GGGTGACGGG ACTGGGGCGC CGGGGAGCCC 360 

ACGGCATGGC CGTGTGGTAC ACCCGGGGCA GGGGCCATGT AGGCTCTGTC CTTGGGGGGC 420 

TGGCTTCGTG GGACGGCATC GGGATCTTCT TTGACTCTCC GGCAGAGGAT ACTCAGGACA 480 

GTCCTGCCAT CCGTGTGCTG GCCAGGGACG GGCACATCCC CTCTGAGCAG CCTGGGGATG 540 

25 GAGCTAGCCA AGGGCTGGGC TCCTGTCATT GGGACTTCCG GAACCGGCCA CACCCCTTCA 60 0 

GAGCACGGAT CACCTACTGG GGGCAGAGGC TGCGCATGTC CTTGAACAGT GGCCTCACTC 660 

CCAGTGATCC AGGTGAGTTC TGTGTGGATG TGGGGCCCCT GCTTTTGGTC CCTGGAGGTT 720 

TCTTTGGGGT CTCAGCAGCC ACCGGCACCC TGGCAGATGA TCATGATGTC CTGTCCTTCC 780 

TGACCTTCAG CCTGAGTGAG CCCAGCCCAG AGGTTCCCCC TCAGCCCTTC CTGGAGATGC 840 

30 AGCAGCTCCG CCTGGCGAGG CAGCTGGAAG GGCTGTGGGC AAGGCTGGGC TTGGGCACCA 900 

GGGAGGATGT AACTCCAAAA TCAGACTCTG AAGCTCAAGG AGAAGGGGAA AGGCTCTTTG 960 

ACCTGGAGGA GACGCTGGGC AGACACCGCC GGATCCTGCA GGCTCTGCGG GGTCTCTCCA 1020 

AGCAGCTGGC CCAGGCTGAG AGACAATGGA AGAAGCAGCT GGGGCCCCCA GGCCAAGCCA 1080 

GGCCTGACGG AGGCTGGGCC CTGGATGCTT CCTGCCAGAT TCCATCCACC CCAGGGAGGG 1140 

35 GTGGCCACCT CTCCATGTCA CTCAATAAGG ACTCTGCCAA GGTCGGTGCC CTGCTCCATG 1200 

GACAGTGGAC TCTGCTCCAG GCCCTGCAAG AGATGAGGGA TGCAGCTGTC CGCATGGCTG 1260 

CAGAAGCCCA GGTCTCCTAC CTGCCTGTGG GCATTGAGCA TCATTTCTTA GAGCTGGACC 1320 

ACATCCTGGG CCTCCTGCAG GAGGAGCTTC GGGGCCCGGC GAAGGCAGCA GCCAAGGCCC 1380 

. CCCGCCCACC TGGCCAGCCC CCAAGGGCCT CCTCGTGCCT GCAGCCTGGC ATCTTCCTGT 1440 

40 TCTACCTCCT CATTCAGACT GTAGGCTTCT TCGGCTACGT GCACTTCAGG CAGGAGCTGA 1500 

ACAAGAGCCT TCAGGAGTGT CTGTCCACAG GCAGCCTTCC TCTGGGTCCT GCACCACACA 1560 

CCCCCAGGGC CCTGGGGATT CTGAGGAGGC AGCCTCTCCC TGCCAGCATG CCTGCCTGAC 1620 

CCACCTCAGA GCCTGCTTTG CATCACTGGG AAGCAGGCAG TGTCTTGGGT GGGGGCTTGG 1680 

TCAGTATCCT CTCCGTCTGG GTGCCCAGCT CCCACGCACA CCTGAGCTTT CGGCATGCTC 1740 

45 CCACCTCGTT AAAGGTGATT TCCCTCTCAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1800 

A 1801 

Seq ID NOi C201 DNA Sequence 
Nucleic Acid Accession ft: XM_117036.1 
50 Coding sequence: 25.. 495 

1* 11 21 31 41 51 

I I I I I I 

AGCCAAGAGA GGGGGGACAG ACAGATGGAA AGACGGACAC GGGGAGCTCT GGGGAGCAGG 60 

CGGCCCCCAC CCCCACTTCC AGCTCTGAGA CATTTGTGCA CAGGCCTGCA GGCGGCTGGG 120 

ATGGCCTGGC CGGGCACGCT CTGGAGACAC ACATGCCAAG GCAGGGCCNG GGCAGCCGAG 180 

GGCCCGTGGG GGCTGTTCCG ACCACACAGG TGCCCGAGGG AGGCAGGGCA GGCCCCGGTG 240 

GOACCCAGCC CCGAAACGCA GGGTGTCGCC CACGTTTGCA GCCGCGCTCG CGTGTCGGTG 300 

GATGAGCGGG AACCGGGGGG CGGCGCCTAC GCAATGCACG TTACGCCGCG GTGGAAGGGG 360 

TGTCACCGCC ACAGCGGCOG CACCGTGCGT GGCTCTGTTT CCTGGAAACG TCCAGAACAG 420 

GCAGCCCCCG AGACAGGGAG GGGCCCAGCG GTCGCCAGGG GAAGTGGGGA TGGGAATGAG 480 

TGCGGATGGG GCTGAGGTTC TTCCTGAAGG ATGAATGCTC TGAAACTGTG AATAGACACA 540 

CGGAACTGTG CACGTTCACN GGTGAACCGT GTGGGGAGTG AATCCATCTC AACAGAGCTG 600 

TTACACAGTG AACAAGAGCA CGANGGGTGG AGGCCNCTTG TCTGCTCCTG CAGATOGAGC 660 

CGGCTGCTCA GAGCCGGGAG ATGGCCACTC AGCTCCTTTA ACTG 704 



Seq ID NO: C202 DNA Sequence 

Nucleic Acid Accession ft: XM_167803.2 

Coding sequence; 1162.. 1488 

1 11 21 31 41 51 

I I I I I I 

AACATCATAC ATAGTAGGTG AATCGTTTTG TAGAGTGAAG AATGCTAATG TAAAGCAAAT 60 

AGTCACCCAC GTTCCTTGTA AATCCAAATG TTTCTATATT GTAGCTTTGC TTAAAATGGG 120 

GTCGGCCCCA ACTG CAT CCT CCTCTTTGGC GGGCTGGGGA GCGGCCCCCA GCCGGGACGG 180 

GAGGGCAGCG ACCCCGAGGC CTCGTGGACG TGGGAGAGAG TGTGGTGGGA AGTCTTGAGC 240 

GGAGGAGGGG ATCTGCCCTT CTCCACTCCT CTCTTGGATC CGCCTCGGTT TCCTGTCCCC 300 

CCACCACCCG CCTGCCCCGC GGAAGACCGC CCAGTGAGCC AGCCCCCACC TTCCAGGCGC 360 

CTTCGCCCTG GGGATCCAAC CAACTTGTAT CGAGTGGGCG GGGCAACGGC TCCCCATTTT 420 

TCCCGAGCCC CGCCCACAGA GCTCTTAGCC AATCCTATGC AGAGAGCATC TCCTGGCAGG 480 

GGTCTCTTCC CAACCAGACC CCACCCAGGC ACATTAGCGA CCAGGCTTGG GCTTCCCCAG 540 

CGCCCCACCA OCACCACGTG CAGGTGGAGC TCTGGGATGC TATGTTGGGG CGGCAAGCGG 600 

TGGGCCGAGG GCCGGGTAGG CTAGCACGGG AGGTAAGGGT GGTATGGGAT GGGGCGGGGG 660 

CGGTCTAGGG CAATAGGAGA GCAGAGAATG GGGGAACTTG AGGGTGGGGG GAGGGCACCG 720 

1326 
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GAGCCTTGCC 
CCCCATCTGT 
CACCATCACC 
AGTGTAATTA 
AQCTAGCTGA 
GGCTTCGTCT 
TTTCTTGTGA 
GCACCTGCCC 
CCAGGGGGGC 
GAGTGCAAGT 
GCAGCGAAGA 
ACAAAATTTT 
TATTTTTGTG 
CGTGAAAAAG 



ACCATCCCAG 
AAAATGATGG 
TGACCTGGGG 
CCGAATCCTG 
GGCCAACGCA 
GCCATCTTTG 
CCATCTCCCC 
TCTGGCACCA 
TCCCCGGAGC 
GTTTTATAAA 
GAAATGAAGA 
TCCACCACAC 
GGAATGGGCC 
T 



GACTTTGGGC 
TAATAATACT 
GTCAAGGCAG 
ACTGCAAGGC 
GGCCCATCCG 
TGGGTGCCCT 
ATGAAAGTGC 
GATGCCAGGG 
CCCAGGGGTG 
AAATAAAACA 
AAAACTGAAG 
ATACCCTCTA 
CTGCTTTTTG 



AAGTCACCCG 
TCACCTACCT 
GAGGACTCCG 
CCACCTGCCC 
TCTCTTCACT 
AGACTTAGTC 
TGTACAAATT 
AAGGGACAGA 
GGGATTGGTG 
AAAACCCACC 
AAAAAAAAAA 
AGCCAGCAAG 
TGGCAAGGCC 



CACTCCCTGG 
CATAGGGGAG 
AAGGTGCTAC 
CTCCCCCACA 
CTGTCGCAGG 
CTTATCTTGT 
CCACCCGCCC 
GGAAAACAGC 
GCCACTGTTT 
ATCACAAAAA 
AACAGGAAAA 
ATTTCCTCTT 
TGTTCTGATT 



GCCTOGGTTT 
GTTGTGAGGC 
CCGTGAGCAA 
GAGCCTCCAG 
CCCTTTCATG 
CCTGGTTTCC 
CAGGACCCCC 
CACAAACAAG 
GTATGTTCTT 
AAAAAATTTT 
AAAGAACCAT 
TGCAAAATCA 
AATAAAGGAT 



Seq ID NO: C203 DNA Sequence 

Nucleic Acid Accession fl: NMJ>24780.1 

Coding sequence; 31.. 1023 



ATTAATCTGG 
CCACGGCACG 
GGCATTCTTT 
ACCCTCATTG 
AATTCCTTCC 
GGCCTTCAGG 
GTGTGGATTG 
ATCATGTTCT 
TGGCGGGCCT 
GGGGTCTTGT 
CCTTTTCGAG 
ACACGGCCTG 
TTCTTTTTCA 
GAGGGAAGGA 
AAAATGTTCC 
AGCTCACTTG 
CATGATGGCA 
TGATGACTCT 
ATTTCTTCCA 
CAGGAGCGGA 
TTTAATACCT 
GTTGTAATAT 
ATTATCTTCA 
TCCCAGACCT 
CATGTAATAC 
ACTTTGAACT 
CTGAACTTTC 
CCCCTGTGAG 
AGTACCGTAG 
ATACCCCACC 
GAGACTGAAT 



11 
I 

CCGTGCCATG 
AAGTCTACGT 
GTTACTATTG 
GCCAGGACAT 
TGGGGGAGTT 
AGTTTGACAT 
GCATCTTCTT 
ACTCCAAAAA 
CACAGATGAT 
GCACCCTGGC 
GTCTGCCTCT 
GCTACCTGTG 

tcxttcaccct 

AGATTATGAT 
TGATAGAAAA 
TTCTGGAAAG 
GTCTTGACTT 
TTTGGTAACC 
TGCCACCTGT 
AACTGACTAC 
GGATTGCTGA 
TGAAATGAGC 
AGAAGTGTGT 
GGCAAAGGTT 
TCCAGCAGGG 
AAAGACATCC 
TTGCGTACTG 
GTGTTATACA 
CCAATGAGAT 
GTGATGACTT 



21 
I 

CATCTACTCC 
TCTCCTGATC 
GCTCAACACC 
CTACCGGCTC 
TCTGAGGAGA 
TGCCAGGAAC 
CTGCCCCCTG 
TATCAGCCTG 
GACTTTCTTC 
CATCACCATC 
CTTCATTCAC 
GGTTGTTTGG 
CATTGTGCTA 
AAGGCTGCTC 
ATTGATCAAG 
GAGAGAGGTG 
GCGATCTAGA 
AGACACCAAT 
GCCTTTAGGA 
CATGTAATTA 
TTTTTCAAGA 
CTACAAAAAC 
GCAGGAATGA 
TAGAAACTGT 
ATATGAAGCC 
CCCATGTCCC 
ATTACATGAG 
TGACCATCAA 
TTATCCGAGA 
GAAATATAAT 



31 
I 

ATGTTCAGGC 
CGAAACATCT 
GTGGCCCTGT 
CTTCTGATGG 
ATCATTGGGA 
GTTCTAGAAC 
CTGCCCTTTA 
ATGATGAATT 
ATCTTCTTGC 
TGGAGATTGA 
TCCATCTACA 
ATCTATCGGA 
ATCATCACCT 
CATGAGCAGA 
CTGCAGGATA 
GAGCAACAAG 
AGAT CAGTTC 
CAAATAAGGG 
ACTGCCCAGA 
TCAAAGTAAA 
CAAAATACTT 
CTAGGAAGAG 
TTGGTTCTTA 
TGCTAAGAAA 
TCTGAATTGT 
AAAGGTGGAA 
TCTTTGGAGT 
AGTCCTACGT 
CGCGATTATT 
CAGCGCTGGC 



41 

I 

TTGTGGAGAG 
TTTTGAAAAT 
CTGGTGAAGA 
ATTTTGTGTT 
TGCAACTGAT 
TGATCTATGC 
TCCAAATGAT 
TCCAGCCTCC 
TCTTTTTCCC 
AGCCTTCAGC 
GCTGGATCGA 
ACCTCATTGG 
ATCTTTACTG 
TCATTAATGA 
TGGAGAAGAA 
GCTTTTTGCA 
AAGAAGGTAA 
GAGGAGACGA 
AGAAAATCCA 
ATTGGGCATT 
GGGGTTTTCC 
ATAACTAGGG 
GAAATCTCTC 
AGTGGTCCAT 
AGAACCTGCA 
TACAACCAGA 
CGGGGATGGA 
CAAGCTAGCT 
GCTAATTGGA 
AATTTTTGAC 



Seq ID NO: C204 Protein Sequence 
Protein Accession #: Eos sequence 



1 
I 

AGCGGGGTTA 
ATCCTGGCAT 
CCCTGCGAGT 
CAGTGGAGAA 
TCATCATACA 
TTTTTCAAGA 
TTGATGTGAT 
TGCGGTTTTC 
TGGGCATAGA 
CCCTTATACC 
TGGAAGAGAA 
GTATCCTTCA 
GATCCAAGAA 
AGATCATTGG 
TTTTGAAGTT 
TTACCGTGGC 
GTTATTTTAG 
GGAACAGCGG 
TGACCACCTG 
TTAATCCCCA 
TCACTCATGA 
ACCTGTCAGA 
TCTCTGCCTA 
CCGTTTATTA 
CGGTGCTGTT 
CCATGTTCAG 



11 
I 

GATGCAGGGA 
TTTAAAAATA 
CCCAATCAAT 
GAAGGCTTGC 
GGTTGAGAAT 
AAAGGTGCTG 
AGACTCTCAG 
AACATCTTTG 
CACTCCTGGT 
CATGACATCC 
AAGGAACCTT 
GCTCAATTGC 
CAGCCTGTCG 
AGGCAAGTTT 
CAACATTTTC 
CAAAAAGAAC 
GGACACAGTG 
GGCATCCTAC 
CTTCTTCAGT 
CATTTACTCC 
AAAAGCTGTG 
GCTCCGTCAG 
CATGGTAGCC 
CCTGGCTGAG 
ACTGCCTTTC 
GCTTGTGGAG 



21 
I 

GTTATGTTGT 
ATGTGAATGG 
GCGGGTGTGA 
AGGAGGCAGG 
GTTTCCTCTG 
CTAGACTCAA 
ACAGTCAGCA 
AATGAGTCGA 
TCTTCACATG 
AGAGACAGAA 
AGGAAAATAG 
TGTATTCAGT 
GAAATTCTGA 
GGAACCAGCG 
TCATTCATCC 
ACCCTCCAGT 
ATGTACTATG 
AACATGCAGC 
TTGCTGTTCA 
GGAGGGATCA 
AAGCTAAAAC 
GAGAATTCCA 
TGGGTTGTCT 
TACAACTTAG 
GTTGTGTCCT 
AGGTACGAGA 



31 
I 

GATTTGTGTA 
TGTATCCCTT 
CTGCTGTATG 
AGATGCTGTC 
GGGTCCAAAG 
GCATCAACAT 
AAAGGAATGA 
TGTCTCAGAC 
AAACTGTTCA 
TTAAAGCCAT 
TTGACAAAGA 
GTCTGAACTC 
ATTCCATCAG 
TCCTCTCCTA 
TGAACTTCAG 
TCACTGGGCT 
GCTTTTACAC 
TGGCCTACAT 
GCATGGCCAA 
CCAAGCTGAT 
AGAAGAATCT 
AGTTGACGTT 
CTACAGGAGT 
AGTTCCTGAA 
GCATTAATCT 
TGCCACGGCA 




780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1511 



TAGCACTGAG 
CAATCAGCTG 
GGCCATAGCC 
GACACACAGT 
GGCCGTGCCA 
CQAAGTCTAC 



51 
I 

GTAOGAGATG 
ATCAATCATT 
GTGTTGGGAA 
CTCTTTAGTC 
CACAAGTCTT 
ACAAACTCTG 
TATGCTTTTC 
GAGCAAAGCC 
ATCCTTCACC 
TGACTGTGGC 
CACCCTAAGT 
AAGTGTGCAC 
GCAGATCACA 
GGGCAAAGAT 
AGCAAACCCC 
TTTGGGGGAA 
TCCAAGGGCC 
AAATGGAATG 
AGGCTTTAGC 
CCATGCTATT 
AATAAAGATT 
AATAATGTAT 
CTGCCAGACT 
CCTGAATAAA 
TTTATTTGTG 
GGTCTCATCT 
GGAGGTTCTG 
TTGCAGTGGC 
AATTTTCCCA 
AGTCTCTACG 



51 
I 

TGGTGAGCTC 
ATCACTTTTT 
AAGCCCACCC 
GTGAATGAAA 
TCAAATCAGA 
ATATCTGACA 
AACCAGGTGC 
CTAGAATGCA 
TTAATCGCAT 
CCAAGGACCA 
CAGACCCATC 
GCTTATCGGA 
AAGACGCTGA 
CTGAGATGGC 
ATCCCTCAGT 
ACTGGGGTGG 
ATCCAGCACG 
GGAGCATGCT 
AACAACTTCA 
GACTTCACTG 
ATAAGGGAGA 
CTGACCCGCT 
TGCTGTGCAG 
AACCCTGGGG 
TGCATCTACT 
GTTCTCCTGA 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1813 



60 

120 

180 

240 

300 

360 

420 

480 
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600 

660 

720 

780 
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900 
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1140 

1200 
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1440 

1500 

1560 
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TCCGAAACAT 
CCGTGGCCCT 
TCCTTCTGAT 
GAATCATTGG 
ACGTTCTAGA 
TGCTGCCCTT 
TGATGATGAA 
TCATCTTCTT 
TCTGGAGATT 
ACTCCATCTA 
GGATCTATCG 
TAATCATCAC 
TCCATGAGCA 
AGCTGCAGGA 
TGGAGCAACA 
GAAGATCAGT 
ATCAAATAAG 
GAACTGCCCA 
TATCAAAGTA 
GACAAAATAC 
ACCTAGGAAG 
GATTGGTTCT 
GTTGCTAAGA 
CCTCTGAATT 
CCAAAGGTGG 
AGTCTTTGGA 
AAAGTCCTAC 
GACGCGATTA 
ATCAGCGCTG 



CTTTTTGAAA 
GTCTGGTGAA 
GGATTTTGTG 
GATGCAACTG 
ACTGATCTAT 
TATCCAAATG 
TTTCCAGCCT 
GCTCTTTTTC 
GAAGCCTTCA 
CAGCTGGATC 
GAACCTCATT 
CTATCTTTAC 
GATCATTAAT 
TATGGAGAAG 
AGGCTTTTTG 
TCAAGAAGGT 
GGGAGGAGAY 
GAAGAAAATC 
AAATTGGGCA 
TTGGGGTTTT 
AGATAACTAG 
TAGAAATCTC 
AAAGTGGTCC 
GTAGAACCTG 
AATACAACCA 
GTCGGGGATG 
GTCAAGCTAG 
TTGCTAATTG 
GCAATTTTTG 



ATATCAATCA 
GAGTGTTGGG 
TTCTCTTTAG 
ATCACAAGTC 
GCACAAACTC 
ATTATGCTTT 
CCGAGCAAAG 
CCATCCTTCA 
GCTGACTGTG 
GACACCCTAA 
GGAAGTGTGC 
TGGCAGATCA 
GAGGGCAAAG 
AAAGCAAACC 
CATTTGGGGG 
AATCCAAGGG 
GAAAATGGAA 
CAAGGCTTTA 
TTCCATGCTA 
CCAATAAAGA 
GGAATAATGT 
TCCTGCCAGA 
ATCCTGAATA 
CATTTATTTG 
GAGGTCTCAT 
GAGGAGGTTC 
CTTTGCAGTG 
GAAATTTTCC 
ACAGTCTCTA 



TTGGCATTCT 
AAACCCTCAT 
TCAATTCCTT 
TTGGCCTTCA 
TGGTGTGGAT 
TCATCATGTT 
CCTGGCGGGC 
CCGGGGTCTT 
GCCCTTTTCG 
GTACACGGCC 
ACTTCTTTTT 
CAGAGGGAAG 
ATAAAATGTT 
CCAGCTCACT 
AACATGATGG 
CCTGATGACT 
TGATTTCTTC 
GCCAGGAGCG 
TTTTTAATAC 
TTGTTGTAAT 
ATATTATCTT 
CTTCCCAGAC 
AACATGTAAT 
TGACTTTGAA 
CTCTGAACTT 
TGCCCCTGTG 
GCAGTACCGT 
CAATACCCCA 
CGGAGACTGA 



TTGTTACTAT 
TGGCCAGGAC 
CCTGGGGGAG 
GGAGTTTGAC 
TGGCATCTTC 
CTACTCCAAA 
CTCACAGATG 
GTGCACCCTG 
AGGTCTGCCT 
TGGCTACCTG 
CATCCTCACC 
GAAGATTATG 
CCTGATAGAA 
TGTTCTGGAA 
CAGTCTTGAC 
CTTTTGGTAA 
CATGCCACCT 
GAAACTGACT 
CTGGATTGCT 
ATTGAAAWRA 
CAAGAAGTGT 
CTGGCAAAGG 
ACTCCAGCAG 
CTAAAGACAT 
TCTTGCGTAC 
AGGTGTTATA 
AGCCAATGAG 
CCGTGATGAC 
ATAAG 



TGGCTCAACA 
ATCTACCGGC 
TTTCTGAGGA 
ATTGCCAGGA 
TTCTGCCCCC 
AATATCAGCC 
ATGACTTTCT 
GCCATCACCA 
CTCTTCATTC 
TGGGTTGTTT 
CTCATTGTGC 
ATAAGGCTGC 
AAATTGATCA 
AGGAGAGAGG 
TTGCGATCTA 
CCAGACACCA 
GTGCCTTTAG 
ACCATGTAAT 
GATTTTTCAA 
RMMWAMAAAA 
GTGCAGGAAT 
TTTAGAAACT 
GGATATGAAG 
CCCCCATGTC 
TGATTACATG 
CATGACCATC 
ATTTATCCGA 
TTGAAATATA 



1620 
1600 
1740 
1B00 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3285 



Seq ID NOi C205 DNA Sequence 

Nucleic Acid Accession #: NM_002250.1 

Coding sequence: 3 97.. 1680 



1 11 

I I 

GTCCTTOGGT GTCTGGGTGT 
TGTGACAAAG CCCATCGTGT 
CTTGCTCTGC CCCAAGACAC 
GCCTCCCCCT ACCCTGGGGC 
CCCAGAGGGC ACAGGGCTCT 
GCTGACGACC TGCAAGCCAC 
GCAGGAAGCT GGCTGAGCCC 
CTGGGGGCCT TGAGACGCCG 
GCACTGGTGC TGGCAGGAAC 
TTCGGGGGGT GCTCGTGGGC 
ACCTTCTTAC TOCTCTGCCT 
ACCGACAACG GGCTGCGGGA 
GTGCTGGAGC TGGTGGTGTG 
CAGGATTTAG GGGCGCCGCT 
GAAGCGCTGC TGTCCCTGGC 
CTGCGCAGCG GCGTCCTGCT 
CGCTTCCGCC ACTGGTTQGT 
CTCGGCCTCA CGCTTGGCCT 
CAGGCTGTTA ATGCCACTGG 
CTGACCATCG GCTATGGTGA 
TGCACTGGAG TCATGGGTGT 
CTGGAGTTTA ACAAGGCAGA 
AAAGAGATGA AGGAGTCCGC 
ACTCGCAGGA AGGAGTCTCA 
AAOGCGTTCC GCCAGGTGCG 
GTGGACATCT CCAAGATGCA 
CACCGGGCCC TGGAGAAACA 
CTGCTTAGCA CTGCCCTGGG 
CTGGACCCAC GAGGAGGAAC 
CTGCCACTCC TGACCCAGCC 
TGGCTTGGAG TGGGTTGGCT 
GGCCTTGGCC CACCTGAGGC 
ATCAAAAACA CTCTCACTAT 
GCGACTGGAG GCAGGACTCC 
GATTCTGGGA GGCTTCAGTT 
GGGGCTGGAG GTGGCGCCCC 
CAGAGAGCAC CTGGTGGGGA 
AATAAATGTT AAAGCCAG 




TGGGCTGCAC 
GACCTCCCOG 
CATGCTGCTG 
CAACGCTTCC 
GGCCAAGCTT 
CTGGCTGACC 
GCACCTTTCA 
CGTGGTGCCG 
CTGCTGCACA 
GAAGCACGTG 
TGCCCGAGTG 
TGCTGCCCGC 
GCTGAAACAC 
CATGATCCTG 
GATTGACACG 
GCCGAGGCAG 
CAGGCTACTT 
CTGAACAAAG 
TGCTGATGGC 
CCCAGGTGGG 
GCTGCTATGG 
TGGGTCCCTG 
ACCGCTGGCC 
CTGGTGGGAC 
GGAAGAAGTG 



31 41 51 

I I I 

GGTGTGTGTC ACAAAGTACA GACCATTGTG 60 

GTCTTTATCC ACGTGGATGG ACGTCTCTTT 120 

CTCCTTATTC TCAAAAGGGG GAGCTGGGGA 180 

CCTCCCCGCC CTGCCTGGCC GTCACCACTC 240 

AGAGCAAAAG TCCCAGAGCC AGCAGAGCAG 300 

CTGTGCGTGC TGCGAGGTGG GGGACCCTGG 360 

GGGGCCATGG GCGGGGATCT GGTGCTTGGC 420 

CTGGAGCAGG AGAAGTCTCT GGCCGGCTGG 480 

CTCATGGTGC TGCATGCAGA GATGCTGTGG 540 

TTCCTGGTTA AATGCACGAT CAGCATTTCC 600 

TTTCATGCCA AAGAGGTCCA GCTGTTCATG 660 

GCGCTGACCG GGCGGCAGGC GGCGCAGATC 720 

CCGGCGCCCG TGCGGGGCCC GCCGTGCGTG 780 

CAGCCCTGGC CGGGATTCCT GGGCCAAGGG 840 

CGTCTCTACC TGGTGCCCCG CGCCGTGCTC 900 

TACCGCAGCA TCGGCGCTCT CAATCAAGTC 960 

TACATGAACA CGCACCCTGG CCGCCTGCTG 1020 

ACCGCCTGGG TGCTGTCCGT GGCCGAGAGG 1080 

GACACACTTT GGCTGATCCC CATCACATTC 1140 

GGCACCATGT GGGGCAAGAT CGTCTGCCTG 1200 

GCCCTGCTGG TGGCCGTGGT GGCCOGGAAO 1260 

CACAACTTCA TGATGGATAT CCAGTATACC 1320 

CTACAAGAAG CCTGGATGTT CTACAAACAT 1380 

AGGGATCAGC GCAAGCTGCT GGCCGCCATC 1440 

CGGAAGCTCC GGGAACAAGT GAACTOCATG 1500 

TATGACCTGC AGCAGAATCT GAGCAGCTCA 1560 

CTGGCGGGGA AGCTGGATGC CCTGACTGAG 1620 

CTTCCAGAAC CCAGCCAGCA GTCCAAGTAG 1680 

TCCCCAGTAC TGAGGTGGTG GACATCGTCT 1740 

CACCTCAAGT GCAAGGACCA AAGGGGGCCC 1800 

TGCTGGAGGG GACGCTGGCT AAAGTGGGTA 1860 

AACATGGTCA CCCCCACTCT GCATACCCTC 1920 

ACGACCTCCA GCTCTCAGTT ACAAGTGCAG 1980 

GGAAAGAGGG TACTAGGGGC CCGGATCCAG 2040 

GAGCTGAAGA ACTGGGTATG AGGCTGGGGC 2100 

AACAAAGAGG ACACCATTTT TCCAGAGCTG 2160 

TAACTCACCA GCCTCTGCTC TTATCTTTGT 2220 

2238 



Seq ID NO: C206 DNA Sequence 

Nucleic Acid Accession ft: NM_0 2 5257.1 

Coding sequence: 1..2139 



1 11 21 31 41 51 

I ! I I I I 

ATGGGGGGAA AGCAGCGGGA CGAGGATGAC GAGGCCTACG GGAAGCCAGT CAAATACGAC 
CCCTCCTTTC GAGGCCCCAT CAAGAACAGA AGCTGCACAG ATGTCATCTG CTGCGTCCTC 
TTCCTGCTCT TCATTCTAGG TTACATCGTG GTGGGGATTG TGGCCTGGTT GTATGGAGAC 

1328 
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CCCCGGCAAG 
AAAGATAAGC 
ATCTCAGTTG 
CCGGAGGACC 
TATACAAAAA 
AGCCTGCAAC 
TGCTTTCCAT 
ATACAGCAGG 
ATCTTTGAAG 
GTCTTGAGCC 
• CTGATCCTGG 
CGAGTGCTGC 
GCCTACCAGA 
GCCATCCTGC 
CTGAAGGAGG 
ACCTTTGTCC 
CCCACGCAGC 
TGTGAGAAAG 
TGCCCAGGGC 
GTCTTCAATC 
GCCCTGGGCC 
CCCCAGGACA 
ACTGGQTCAT 
TTGGAGTATA 
TGCTGTTTCA 
GCATACATCA 
ATGCTACTCA 
CTGTTCTTTG 
TCCGGTCGCA 
CTGCCCATCA 
TTCGGCATGT 
GGCTCCCTGG 
AACGAGGCGC 
ACTGCACCCC 
TTGTGGTAAA 
GAGAGGCTGA 
GTGAAAC 



TCCTCTACCC 
CGTATCTCCT 
CTGAGAACGG 
CATGGACTGT 
ACAGGAACTT 
AGGAACTCTG 
GGACCAACAT 
GGATCAGCGG 
ATTTTGCCCA 
TACTGTTTAT 
GAGTGCTGGG 
GGGACAAGGG 
GCGTGCAGGA 
TGCTGGTGCT 
CCAGCAAGGC 
TCCTCCTCAT 
CAGCCACTCT 
TGCCAATAAA 
TGATGTGCGT 
TGCAAATCTA 
AATGCGTCCT 
TCCCTACCTT 
TGGCATTTGG 
TTGACCACAA 
AGTGCTGCCT 
TGATCGCCAT 
TGCGAAACAT 
GGAAGCTGCT 
TCCCGGGGCT 
TGACCTCCAT 
GTGTGGACAC 
ACOGGCCCTA 
CCCCGGACAA 
ACCCCCACCG 
AAAAGGTTTT 
GGCGGGCGGA 




CGCTGGAGCC 
CCCCTTAATC 
AGCCCTCATC 
GCTCAGAGGA 
CTGGTGTCTG 
CTACGGGAAG 
TGTCAGGGTG 
GGTGGTCGGA 
GGGTAAAGAC 
CCTGGGGGCC 
GCTCTTCCTC 
CTACATGTCC 
CAAGAAGAGG 
TCCAGCCATC 
AGGCCAGGCG 
TCACCTGAGT 



ACTGGGGCCT 
ATCTTCAGCT 
CCCACACCCC 
GAGTTCTCAC 
GGGGTACCCT 
CTCCTCCCCT 
GCGCTCCCAG 
AGCCTCAATG 
TGGATTCTTG 
CGCCTGGTGG 
TATGGCATCT 
TCCCAGCTGG 
GCCGCCCTGA 
OGGCAGOGGA 
ATGATGTCTA 
TACTGGGCCA 
CTCTGGGCAT 
AACCCCACGG 
TACTCATCCA 
GGGCTCTTCT 
TTTGCCTCCT 
TCTGCCTTCA 
CTGACCCTTG 
GTGCAGAACC 
GAAAAATTTA 
AATTTCTGTG 
GTCGTCCTGG 
GGCGTGGGGG 
TTTAAGAGCC 
TATGTCATCG 
TGCTTCCTGG 
AAGAGCCTTC 
AAGAAGTGAC 
CAACCTCACT 
CCGTGGCTCA 
CAGGAGTTCG 



ACTGTGGCAT 
GCATCCTGTC 
AGGTGTGTGT 
AGACTGTTGG 
GGAATATGAC 
CTGCTCCAGC 
GGATCACCAA 
CCCGAGACAT 
TTGCCCTGGG 
CTGGGCCCCT 
ACTACTGCTG 
GTTTCACCAC 
TCGTGTTGGC 
TTCGTATTGC 
CCATGTTCTA 
TGACTGCTCT 
CCAACATCAG 
CCCACCTTGT 
AAGGCCTAAT 
GGACCCTTAA 
TCTACTGGGC 
TCCGCACACT 
TGCAGATAGC 
CTGTAGCCCG 
TCAAGTTCCT 
TCTCAGCCAA 
ACAAAGTCAC 
TCCTGTCCTT 
CCCACCTCAA 
CCAGCGGCTT 
AAGACCTGGA 
TAAAGATTCT 
AGCTCCGGCC 
TCGCCTTACA 
CGCCTGTAAT 
AGACCAGCCT 



GGGGGAGAAC 
CAGCAACATC 
GTCCTCCTGC 
GGAAGTCTTC 
GGTGATCACA 
TCTGGGACGC 
TGACACCACC 
CAGTGTTAAG 
GGTGGCTCTG 
GGTGCTGGTG 
GGAGGAGTAC 
CAACCTCAGT 
GGTGCTTGAA 
CATCGCCCTC 
CCCACTGGTC 
GTATCCTCTG 
CTCCCCCGGC 
GAACTCCTCG 
CCAACGTTCT 
CTGGGTACTG 
CTTCCACAAG 
CCGTTACCAC 
CCGGGTCATC 
CTGCATCATG 
AAACCGCAAT 
AAATGCGTTC 
AGACCTGCTG 
CTTTTTTTTC 
CTATTACTGG 
CTTCAGCGTT 
GCGGAACAAC 
GGGCAAGAAG 
CTGATCCAGG 
GGTCTCCATT 
CCAACACTTT 
GGCCAACATG 



Seq ID NO: C207 DNA Sequence 

Nucleic Acid Accession #: NMJ>161B0.1 

Coding sequence! 26.. 1618 



1 
I 

CAGGAAGGTT 
CTATAAATCC 
CAGCAGACTC 
GGCAGCGTAT 
TGTGTGGTTC 
CGACCACTGC 
GATGCTCGTG 
TAACCCAAGG 
TGATTTTGCT 
CCATCAGGAC 
CCTGGGTTAC 
TACAGAATTC 
TCATCTGTGC 
GCAAACCCCT 
GAAAGTTAAA 
TCATGCTGAA 
CATGCCTCCT 
GTCCAACATG 
TAGTGCACAC 
GGGCTTCTGC 
ATCCTACATT 
GGGATTTATT 
TGTAATGTCC 
GGAAGAAAAG 
GGGCATGGAC 
TGGCCTGGGC 
TGGGGTGGCA 
CAATAAAGAG 



11 
I 

CCTCTCCCAG 
CTAGCTGATG 
ATCATGCACA 
GTGACCCCAG 
CTCAGCCCCA 
CGGTCCAGGT 
GGCATGGCTC 
AGGAAGCTGG 
GCCGACTTCA 
AAGGAGAAGG 
CTTTTGGGTG 
CAGGTCATGT 
AGTATCTCTG 
CAGGACCCTC 
AATGGTTACG 
CAGACTCGCA 
CACTACCGCT 
CTGTTCTTCA 
AACTCCACAG 
ATCAACTCCG 
GGATTAAAGG 
GGGCTCTTCC 
AGCACCCTGT 
GAGAGGCAGC 
TGCGCCACCC 
TTTCTGGTCA 
CTGATAGGCT 
ACAATGACCC 



21 
i 

TGGCCATGGG 
ATGGCCCCTT 
GCATGGCCAT 
TCCTGCTCAG 
TCCTGGGATT 




31 
I 

TAGCAACAGT 
TGACTCTGTG 
GTTCGGAAGA 
CGTAGGTCTG 
CCTGCTGCAG 
GAGACCCTAC 
TGGGGCTACT 
AAGTGTCACC 
CATCAAAGCC 
CCATGCCCTC 
GGCCCATCTG 
TGCATTGGTG 
TACAGAGGTT 
AGATGGAATG 
GCTGGCAATG 
ATTAAAGTCA 
CAGCCACCTC 
GGGCCAGATT 
CTACGAAAGA 
ACTTTATTCT 
CACGGGATAT 
CTCCACCCTG 
CTTTAACCTC 
AGGGGACCCA 
GGTGCAGCTG 
GACCGTTGTC 
CGCTCTCTTT 
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41 51 

I I 
GGGCAGGCTG GCCGCCACAT 
GAGCCGCCTA AAAGACCCAC 
GAGTTCTGCT ACGCGGTGGA 
CCCAGCAGCC TGTACAGCAT 
CGCGTGGTCG GATCGGCCAG 
ATCCTCACCC TGGGAGTCAT 
GTTGTAGCAG CTTTGATTGC 
ATGATAGGTG TCGTTCTCTT 
TACTTATTTG ATGTCTGCTC 
TTCACAGGTT TTGGAGGTGC 
GAGCTGGGAA GACTGTTGGG 
CTCACTTTGT GTTTTACTGT 
GCAAAGGGCA TTCCCCCACA 
TACGAGTATG GTTCTATCGA 
CAGGGAGCAA AAAACAAAAA 
CTGCTGAGAG CACTGGTGAA 
ATTGGATGGA CGGCCTTCCT 
GTGTACCGCG GGGATCCCTA 
GGAGTCGAGG TTGGATGTTG 
TACTTTCAGA AAGTTTTGGT 
TTGCTGTTTG GCCTGGGGAC 
GTCCTGTGCA GCCTGTTTGG 
ATTACTGAGT ACCACCGCGA 
GACAACAGCG TGAGAGGGAA 
GCTCAGATCC TGGTCGGAGG 
GTCGTGGTGA TCACAGCGTC 
GTTAGATATG TGGATTAGGT 



60 

120 

180 
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300 
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420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 
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1380 

1440 
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1560 

1620 

1650 



Seq ID NO: C208 DNA Sequence 

Nucleic Acid Accession #: NM_003273.1 

Coding sequence: 255.. 2024 

1 ll 21 31 41 SI 

I I I I I I 

CGCCGCGGGG CCGGATCCTC CGCGOGGCCG AGTCCATCTC CTGGGAAATG GGGCGGACAG 
TGTTTCCTTG ACTGACTATT GTGAGCGCCC TCTCTCTCCG GCGGAGCGGA GACCATGGCC 
CCCACTCAGG CCCCCGGGCC CCGCTGGAAT TCGGAGGGCC CCTGGGTAAT GGGGCAGAGA 
GATGGGACCT GGGGCAAAGG CTAAGCGAAG GAGAGCTGGA GCGGGTGAAC TAAGAGCGGG 
GGCGAGATCT GAGGATGGAA GGCTTTGGGG GTGTCGGAGG CAGAGGGACC 

1329 
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CAGCQAAGGG 
GGGGATTTAT 
GACTAAGATG 
CTGAGGACGQ 
GGGCGCCGTG 
CTTCCCGCAG 
CTGGCGGCCC 
GAGGTGCTGT 
GCGCTCTACC 
GAAGCGAATG 
CCGCAGTGGC 
GCTTCCAGGC 
CTCTGGGGGC 
TCATCTTCAG 
CTGGGGGGAA 
GTATCTGTTT 
TCCTCATCAA 
CCATGTGGCT 
AGGCCGTCCT 
GGGACATGGC 
CGCAGCCCCT 
ACATCTTCCG 
GAGTGGCTGG 
GGTGGGGTAT 
CCTTGCCCTG 
TGCTGGTGCA 
GCAGGAGTAC 
CACCACCCCA 
CGACACACTT 
AGAAGAGGTG 




GGGAGAGCTG AGGAGGGGCC GGTTCTGGGG GCTGCAGAAC 
GGGAGCAGGA GGAGGGTCTT CGAGGGGCCT GGGGGCGGGG 
AAGGGAACTG GG AGGCAGCG GGGTGCCTGG GGGCCGAGGG 
CGCACTCTGG GAATGCCGAG AGGGTCCCGC AGAGACGTCA 
GGGAGCTGGG GGGCTAGGGG CGGACGCCGA CGTGATGGCC 
TCTGCTACTG CTGCTGCCCG CCACCATGTT CCACCTGCTC 
CGCGCGCCTG CTGGGTCCAC CCGCGTCCCT GCCCGGGCTG 
GGCGCTGCTG CTGTGGCTCG CCTGGCTCGG CCTGCAGGCG 
GCGCAAGGTG CGGGCCCCGC TCGCGGACGC TCGGGGGAGG 
GGGAAAGGAC GCCCCGGGCC TTATCAGAGC CCCCTTGGAC 
GAATTGAAGG ACAAGAGTCG CCTGCGCTAT CCTATTAACG 
ACAGCCCTGT TGGTGGGGCT GGGGATGTCA GCGGGGCTGC 
GCTCCCGGAA ATGCTCCTGC CCTTGGCGTT TGTCGCCACC CTCACCGCTT 
CCTCTTTCTC TACATGAAGG CGCAGGTAGC CCCAGTTTCG GCCCTGGCAC 
CTCAGGCAAT CCGATTTACG ACTTTTTTCT GGGACGAGAG CTCAACCCTC 
CTTCGACTTC AAATATTTCT GTGAACTGCG ACCCGGCCTC ATCGGCTGGG 
CCTGGCCCTG TTGATGAAGG AGGCAGAGCT TCGAGGCAGT CCCTCACTGG 
GGTCAATGGC TTCCAGTTGC TCTACGTGGG TGATGCCCTC TGGCACGAGG 
CACCACCATG GATATCACAC ATGACGGGTT TGGCTTCATG CTGGCGTTTG 
CTGGGTGCCC TTCACCTACA GCCTGCAGGC CCAGTTCCTG CTGCACCACC 
GGGGTTGCCC ATGGCCTCTG TCATCTGCCT CATCAATGCT ACTGGTTACT 
TGGGGCGAAT TCCCAGAAAA ACACTTTCCG AAAGAATCCT TCTGACCCCA 
GCTTGAGACC ATCTCTACAG CCACAGGGCG GAAACTGCTG GTGTCTGGGT 
GGTCCGCCAT CCCAACTATC TTGGAGACCT CATCATGGCT CTGGCTTGGT 
CGGGGTGTCA CACCTGCTGC CCTACTTCTA CCTCCTCTAC TTCACCGCGC 
CCGTGAGGCC CGGGATGAGC GGAGTGCCTG CAGAAGTACG GCCTGGCCTG 
TGCCGGCGTG TGCCTTACCG CATCATGCCC TACATCTACT GAAGCGGCTC 
GGTGGGGCAT GTGCCCACTC ATCCACCAGC ACACCCAGGA CCAGGAGCCT 
GGGACTCAAG GGCTTGCACC CCACCCAGCC CTGAGGATGA ACAACCTCAG 
GTTTAGAGCA AGGAAAAAAA TGAAACCAGT GACCAAAAAA AAAAAAAAAA 



Seg ID NO: C209 DNA Sequence 

Nucleic Acid Accession #: NM_01S720.1 

Coding sequence: 21..183B 
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1 11 21 31 41 51 

I I I I I I 

CCAGTTCCGG CACGAGGACC ATGGGCCGGC TGCTGOGGGC CGCCCGGCTG CCGCCGCTGC 
TTTCGCCGCT GCTGCTTCTG CTGGTTGGGG GAGCGTTCCT GGGTGCCTGT GTGGCTGGGT 
CTGATGAGCC TGGCCCAGAG GGCCTCACCT CCACCTCCCT GCTAGACCTC CTGCTGCCCA 
CTGGCTTGGA GCCACTGGAC TCAGAGGAGC CTAGTGAGAC CATGGGCCTG GGAGCTGGGC 
TGGGAGCCCC TGGCTCAGGC TTCCCCAGCG AAGAGAATGA AGAGTCTCGG ATTCTGCAGC 
CACCACAGTA CTTCTGGGAA GAGGAGGAAG AGCTGAATGA CTCAAGTCTG GACCTGGGAC 
CCACTGCAGA TTATGTTTTT CCTGACTTAA CTGAGAAGGC AGGTTCCATT GAAGACACTA 
GCCAGGCTCA AGAGCTGCCA AACCTCCCCT CTCCCTTGCC CAAGATGAAT CTGGTTGAGC 
CTCCCTGGCA TATGCCTCCC AGAGAGGAGG AAGAAGAGGA AGAGGAAGAG GAGGAGAGGG 
AGAAGGAAGA GGTAGAGAAA CAAGAGGAGG AGGAAGAGGA GGAGCTGCTC CCTGTGAATG 
GATCCCAAGA AGAAGCCAAG CCTCAGGTCC GTGACTTTTC TCTCACCAGC AGCAGCCAGA 
CCCCAGGGGC CACCAAAAGC AGGCATGAAG ACTCCGGGGA CCAGGCCTCA TCAGGTGTGG 
AGGTGGAGAG CAGCRTGGGG CCCAGCTTGC TGCTGCCTTC AGTCACCCCA ACTACAGTGA 
CTCCGGGGGA CCAGGACTCC ACCAGCCAAG AGGCAGAGGC CACAGTGCTG CCAGCTGCAG 
GGCTTGGGGT AGAGTTCGAG GCTCCTCAGG AAGCAAGCGA GGAAGCCACT GCAGGAGCAG 
CTGGTTTGTC TGGCCAGCAC GAGGAGGTGC CGGCCTTGCC TTCATTCCCT CAAACCACAG 
CTCCCAGTGG GGCCGAGCAC CCAGATGAAG ATCCCCTTGG CTCTAGAACC TCAGCCTCTT 
CCCCACTGGC CCCTGGAGAC ATGGAACTGA CACCTTCCTC TGCTACCTTG GGACAAGAAG 
ATCTCAACCA GCAGCTCCTA GAAGGGCAGG CAGCTGAAGC TCAATCCAGG ATACCCTGGG 
ATTCTACGCA GGTGATCTGC AAGGACTGGA GCAATCTGGC TGGGAAAAAC TACATCATTC 
TGAACATGAC AGAGAACATA GACTGTGAGG TGTTCCGGCA GCACCGGGGG CCACAGCTCC 
TGGCCCTGGT GGAAGAGGTG CTGCCCCGCC ATGGCAGTGG CCACCATGGG GCCTGGCACA 
TCTCTCTGAG CAAGCCCAGC GAGAAGGAGC AGCACCTTCT CATGACACTG GTGGGCGAGC 
AGGGGGTGGT GCCCACTCAA GATGTCCTTT CCATGCTGGG TGACATCCGC AGGAGCCTGG 
AGGAGATTGG CATCCAGAAC TATTCCACAA CCAGCAGCTG CCAGGCGCGG GCCAGCCAGG 
TGCGCAGCGA CTACGGCACG CTCTTCGTGG TGCTGGTGGT CATTGGGGCC ATCTGCATCA 
TCATCATTGC GCTTGGCCTG CTCTACAACT GCTGGCAGCG CCGGCTGCCC AAGCTCAAGC 
ACGTGTCGCA CGGGGAGGAG CTGCGCTTCG TGGAGAACGG CTGCCACGAC AACCCCACGC 
TGGACGTGGC CAGCGACAGC CAGTCGGAGA TGCAGGAGAA GCACCCCAGC CTGAACGGCG 
GCGGGGCCCT CAACGGCCCG GGGAGCTGGG GGGCGCTCAT GGGGGGCAAG CGGGACCCCG 
AGGACTCGGA CGTGTTCGAG GAGGACACGC ACCTGTGAGC GCAGCGAGGC GCAGGCCGAG 
TGGGCCGCCA GGACCAAGCG AGGTGGACCC CGAAACGGAC GGCCCGGAGC CCGCACCAGC 
CCCGCGCCTA CCCGGCCGCC CCCGCGCCTG GCCCTCGGCG CGGGCTCCTT CCCGCTTCCC 
CCGACTTCAC ACGGCGGCTT CGGACCAACT CCCTCACTCC CGCCCGAGGG GCAGGCCTCA 
AAGCCCGCCT TGGCCCCGCT TTCCCGCCCC TGAACCCCGG CCCCGCGGGC GGCGGGCGCG 
CTTCCTGCGC CCCGGGACTC AATTAAACCC GCCCGGAGAC CACGCGGGCC CAGCGAAAAA 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAA 

Seq ID NO: C210 DNA Sequence 

Nucleic Acid Accession {*: NMJ>01197.3 

Coding sequence : 61 . . 543 
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31 
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51 



1 11 21 

I I I I I t 

GACACGAAGC CTCCCGGGTG GCTTACAGAC GCTGCCAGCA TCGCCGCCGC CAGAGGAGAA 60 
ATGTCTGAAG TAAGACCCCT CTCCAGAGAC ATCTTGATGG AGACCCTCCT GTATGAGCAG 120 

1330 



WO 03/042661 



CTCCTGGAAC 
CCTATGGAGG 
CTGGCCTGCA 
TCCGAGGTGG 
AGGGATGTTC 
TTCTGGAGAT 
CTGCTGCTGC 
TGAGGCCCCG 
GGCGGCCTGC 
CCCQAGATAG 
TTCCAGTTTT 
ACTGTGGCCT 
AGGGGGAGTG 
GAATAGATTC 
AAA 



CCCCGACCAT 
ACTTCGATTC 
TCGGGGACGA 
CCATGCACAG 
TTAGAAGTTT 
CCCCGAACCC 

GCGGCTCAGG 
TGCTGTTATC 
TGCTGGAACA 
CGTTTTTTCT 
GTGCCCAGGA 
CTGGTCACAC 
CGAGGAGCAG 



GGAGGTTCTT 
TTTGGAATGC 
GATGGACGTG 
CCTGGGTCTG 
CATGGACGGT 
CGGGTCCTGG 
GCTGCCGCTG 
GCGGGGCTGG 
TTTTTAACTG 
CTGCTGAGGT 
AAAAGATGAA 
AGAGCCATTC 
CCCTGTGTGA 
GAGTGCTCAA 



GGCATGACTG 
ATGGAGGGCA 
AGCCTCAGGG 
GCTTTCATCT 
TTCACCACAC 
GTGTCCTGGG 
CTCAGCGGGG 
CCCCACCCCC 
TTTTCTCATG 
TTTATACTCA 
TTCCTATGGC 
ACTCCTGCCC 
TATGTGATGC 
TAAAATGTTG 



ACTCTGAAGA 
GTGACGCATT 
CCCCGCGCCT 
ACGACGAGAC 
TTAAGGAGAA 
AACAGGTGCT 
GCCTGCACCT 
ATGACCACTG 
ATGCCTTTTT 
GGTTTTTTGT 
TCTGCAATTG 
CTGCCCACAC 
CCTCGGCAAA 
GTTTCCAGCA 



GGACCTGGAC 
GGCCCTGCGG 
GGCCCAGCTC 
TGAGGACATC 
CATAATGAGG 
GCTGGCGCTG 
GCTGCTCAAG 
CCCTGGAGGT 
ATATTTAAAC 
TTTTTTTTTA 
TCACCGGTTA 
GGCAGGTAGC 
GAATCTACTG 
AAAAAAAAAA 
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Seq ID NO: C21X DNA Sequence 
Nucleic Acid Accession #: AF2723S7 
Coding sequence: 83.. 1060 



GCTGCTCCCG 
GCCCCGGCGG 
GCGGCTGCTG 
CGCCGGCCAC 
GGCAAGGTGT 
CCAGCAAGGG 
CAGACTGGAA 
ATCAACTCCG 
CTCGGCACGG 
CACGCCCCAC 
GGAGCCCCGG 
GGCCGGTGCA 
CCGCCTGACT 
CCGGATCTCG 
CCAACGGCAA 
GGCCTCCTCG 
GGCCCCGACC 
GCCCCTGCCG 
CACCTGCTCC 
AACATGTTTT 
TGCCAGGGAG 
GTTCGGTTTA 
GGGTAATTCT 



11 
I 

ACGCGGAGCC 
CGGCGCTGAA 
CGGCTGCTGC 
CCGGATGTAG 
CCTCCTGGTG 
CTCTGTGTGC 
GATGAGATTG 
CCCCTACCCA 
GGGCAGGGGC 
ACCTCCCTGG 
GGAGGGCAAG 
GCCGCCCTCT 
CAGAAGGCCG 
CCTGGGGACC 
CAGATGCTGT 
GATGAGGAGA 
GGGGAAATGG 
GCCCCCAGCT 
CCGACCTCGA 
GATGCTGTGT 
ACCCCCGAAC 
GACCCCCAAA 
AGAGACAAAA 




AGGACCGACA 
TGGAGCTGGG 
GCTCCCCTGT 
GCGACGGCCT 
COGTAGCCTC 
ACTACGCCAC 
AGCGGCTGGC 
GCCTGGAGCG 
ATGAGGACGG 
AGGTGCGCAA 
CACCGCCTGC 
GGCCCCCGGG 
GCTTTTGGCT 
CTTTGTGCCA 
CTGGAGGGGG 
GCCAATTAAA 



CCAGGAGCTT 
GCGGCTCCCG 
CCTCCCCTCC 
GTCATCCGAC 
OGCCCTTGTG 

TGCGAAGGCC 
ACAGAGCGCG 
GCATAAAGAG 
AGACTTCACG 
CCCTCTGTTC 
ACTGCCATGA 
GAGGGGCAGG 
GGGCCTCGGG 
GGACACCTCC 
CATGGAGAAC 
GTCCATTTCA 



GCCCGGAAGG 
GAGCCTGCCA 
ACTCCAGGAA 
CCGGTGCACA 
CTGATCCTGG 
TGCAGGCTGC 
CCTGGCTCAC 
GAGATGTACC 
CCACCCAAGG 
GTGTACGAGT 
GACCACGCCG 
CCTGGAGGCA 
GCCTGGAGCT 
CTCCAGGCCC 
TGGTCCCCTG 
CGTAGAGCGC 
GAAAAAAAAA 



51 
I 

GTGCCATGCT 
CGCGGCACCT 
GTGGAGCCGC 
TGAAGAGGCG 
TCCAGGAGGA 
GGCCCCAGCC 
AGTCTGGACA 
CCCTGGGCTT 
CCCCCACGGC 
TGTCGCCCCT 
CGTTCTGTGT 
AGCGTGAGAT 
CTGCAGCTCC 
ACTACCAGCA 
AGCTGGACAC 
GCCCGGGCCT 
CACTGTCCGC 
GACAGACGCC 
TCCCACTAAA 
TGGGACCCCT 
CACCTCTCCT 
AGGAACGGGT 
A 
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Seq ID NO: C212 DNA Sequence 

Nucleic Acid Accession #: NM_004445.1 

Coding sequence: 799.. 3819 

1 11 21 31 41 51 

I I I I I I 

CGGAGGGGGC GGGCCGGGCT GCGTTCGCTC CAGCCGOGGC TCTACAGCAG CGGGCGGCGG 60 

GACCCGGGAC CCAGCTTGGC GACGGCGATT CTCGACGCGG GCCCCCAGGA TTCTCCCGGC 120 

GCCCCACCTC TGGAGCAGCC CCTGCCGCCA GCGTCAGGTC CACCCCGGAA TCCCAGGGAC 180 

TCTCGGCGCC GAACGGACCC GGGCCGGTGC AACGGGGTCC CCGGACTGGA GAAGACGCGG 240 

GTGGCACCGT GCGAGCTCCA GGAGCCCCGG GTCCACTGCG AGGCCTCGGG GGGCGCAGAC 300 

CTGCAGAGAC TGCGGCCAAC GGGAAGAAAT AAAGGGATTA TAGTCCACCC AATTCACAGA 360 

CTTCTGAGAC TCAGACACGA GGAGAGATAG AGAACCGCCA ATCTCTAGAT CAACAAGCAA 420 

AGGAGGTGCC AAGCCTGTTT GTCTTCATTG TGACACTGGA GTCTAGATGC TGGGAAGTOC 480 

AAGATCAGGG TGCCGGCATG GTCAGTTCCT GGCGAAGCCT CTCTTCTAGG TTTCAGACTG 540 

CCCTCTTCTT TGTTGTGTCC TCGAATGGCA GAAAAAGGGG TGGCTGTTGG AGGAAGGGAG 600 

GAGAGTAAAT GAAGAGAAAG AACTGGAATA ACCCCTTGCA GAAAAAAAAA AAAAGGGAAG 660 

CTTAGCTGTA CACCCTGAGT CTTGCAAAAG CTGCAGCCCC ACCCAGGAGC AGGGTGGTGG 720 

CTGGGGCGAT GGTGGACGCC CTGAAGATGT CCCATGGCTA CTGAAGGGGC TGCCCAGTTA 780 

GGGAACAGAG TGGCGGGCAT GGTGTGTAGC CTATGGGTGC TGCTCCTGGT GTCTTCAGTT 840 

CTGGCTCTGG AAGAGGTATT GCTGGACACC ACOGGAGAGA CATCTGAGAT TGGCTGGCTC 900 

ACCTACCCAC CAGGGGGGTG GGACGAGGTG AGTGTTCTGG ACGACCAGCG AOGCCTGACT 960 

CGGACCTTTG AGGCATGTCA TGTGGCAGGG GCCCCTCCAG GCACCGGGCA GGACAATTGG 1020 

TTGCAGACAC ACTTTGTGGA GCGGCGCGGG GCCCAGAGGG CGCACATTCG ACTCCACTTC 1080 

TCTGTGCGGG CATGCTCCAG CCTGGGTCTG AGCGGCGGCA CCTGCCGGGA GACCTTCACC 1140 

CTTTACTACC GTCAGGCTGA GGAGCCCGAC AGCCCTGACA GCGTTTCCTC CTGGCACCTC 1200 

AAACGCTGGA CCAAGGTGGA CACAATTGCA GCAGACGAGA GCTTTCCCTC CTCCTCCTCC 1260 

TCCTCCTCCT CTTCTTCCTC TGCAGCGTGG GCTGTGGGAC CCCACGGGGC TGGGCAGCGG 1320 

GCTGGACTGC AACTGAACGT CAAAGAGCGG AGCTTTGGGC CTCTCACCCA ACGCGGCTTC 1380 

TACGTGGCCT TCCAGGACAC GGGGGCCTGC CTGGCCCTGG TCGCTGTCAG GCTCTTCTCC 1440 

TACACCTGCC CTGCCGTGCT CCGATCCTTT GCTTCCTTTC CAGAGACGCA GGCCAGTGGG 1500 

GCTGGGGGGG CCTCCCTGGT GGCAGCTGTG GGCACCTGTG TGGCTCATGC AGAGCCAGAG 1560 

GAGGATGGAG TAGGGGGCCA GGCAGGAGGC AGCCCCCCCA GGCTGCACTG CAACGGGGAG 1620 

GGCAAGTGGA TGGTAGCTGT CGGGGGCTGC CGCTGCCAGC CTGGATACCA ACCAGCACGA 1680 

GGAGACAAGG CCTGCCAAGC CTGCCCACGG GGGCTCTATA AGTCTTCTGC TGGGAATGCT 1740 

CCCTGCTCAC CATGCCCTGC CCGCAGTCAC GCTCCCAACC CAGCAGCCCC CGTTTGCCCC 1800 

TGCCTGGAGG GCTTCTACCG GGCCAGTTCC GACCCACCAG AGGCCCCCTG CACTGGTCCT 1860 

CCATCGGCTC CCCAGGAGCT TTGGTTTGAG GTGCAAGGCT CAGCACTCAT GCTACACTGG 1920 
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CGCCTGCCTC 
TGTGAAGGCC 
GAGGTCCACT 
CTCCGGGCAC 
AGCCCTQACC 
GCTGTCCCTG 
CAGCCCGACC 
GAAGACGAAT 
CTGAGCCCTG 
CCCTACGGGG 
CCGGAAAGAC 
GCAGCCATCA 
GAGCAGCTGC 
ACCTACGAGG 
ATCAAGATTG 
CAGCCACGGG 
GAAAGCCTGC 
AACATCCTGC 
TTCATGGAGC 
CTGCAGCTGG 
GCCTTCGTCC 
AAGGTGGCCC 
CCAGAGGTCA 
CTCATGTGGG 
GTACTAAATG 
TTACATCTAC 
CAGCTGGTGG 
GACCCAGGGG 
CTGGACTCAC 
TCCAAGTTTG 
GCCCTGGGCA 
CAGCAACACC 
AGCCCTGGAC 
TGTGAGAGAT 
CCTCTCCACC 



GGGAGCTGGG 
GCCAGGAACC 
TCGACCCTCG 
ACGTACCCTA 
CTCCTCAGGC 
TGGTGCACCA 
AGACCAATGG 
CCCACTCCTT 
GCCACATCTA 
GCAAAGTCTA 

CCGTGCTGGC 
AGCAATACAG 
ACCCCTGTCA 
AGGAGGTCAT 
GACGGAGGGA 
AGATGACCTT 
GGCTGGAGGG 
TTGGCCCCCT 
TGGCCATGCA 
ATCGCTCGCT 
GTCTTGGCCA 
TTGCACATGG 
AAGTGATGAG 
CAATAGAGCA 
TTATGTTGGA 
CTGCATTTGA 
AAAGGCCTTC 
CCCAGGCCTG 
GCCTCTGTAC 
TCACCCTGGC 
TGAGGCAGCA 
ACTGGTCCGA 
GCCCCACACC 
AGCCCCCTCC 



GGGTCGAGGG 
TGCCAGCGGT 
CCAGAGAGGC 
CATCTTAGAG 
TGCAGCCATC 
GGTGAGCCGG 
GAACATCCTG 
CACCCTGACC 
TGGTTTCCAG 
TTTCCAGACA 
GATCGGCTCC 
GGTCGTCTTC 
CAGCCCAGGA 
GGCCATCCGA 
TGGGACAGGC 
GCAGACTGTG 



CGTGGTCACC 
GGACAGCTTC 
GCGGGGAGTG 
GTCTGCCCAC 
CAGTCCTCAG 
AAAGCATACA 
TTATGGAGAA 
GGAGTTCCGG 
CACTTGGCAG 
CAAGATGATC 
CCAGGCCCTT 
GCTTTCAGCC 
CTTCAGTGAT 
TGGCCACCAG 
GGGCTCAGTG 
GAAGGGACAT 
AAACCCAACC 
TCATTAAAGG 



GACCTGCTCT 
GGTGGGGGCA 
CTGACTGAGA 
GTGCAGGCTG 
AATGTCAGCA 
GCATCCAACA 
GACTATCAGC 
AGCGAGACCA 
GTGCGGGCCC 
CTTCCTCAAG 
ATCCTGGGGG 
CAGCGGAAGC 
CTCGGGGTGA 
GAACTTGCCC 
TCTTTTGGAG 
GCCATCCAGG 
GCCGCAGTGC 
AAGAGCCGAC 
CTCAGGCAGC 
GCTGCTGCCA 
AGCGTGCTGG 
GGCCCAAGTT 
ACATCCAGTG 
CGGCCTTACT 
CTGCCCCCGC 
AAGGACCGTG 
CGCAAGCCAG 
CTGACCCCTG 
ATTGGACTGG 
GTGGCTCAGC 
AAGAAGCTGC 
GAGGTCTGAG 
GTGGGACGTG 
CTCCCGATGG 
GAAAGAAGGG 



TCAATGTCGT 
CTTGTCACCG 
GCCGAGTGTT 
TTAATGGGGT 
CCAGCCATGA 
GCATCACGGT 
TCCGCTACTA 
ACACTGCCAC 
GGACTGCTGC 
GGGAGCTGTC 
CTTTGGCCTT 
GGCGTGGGAC 
AGTATTACAT 
GGGAAGTCGA 
AAGTGCGCCA 
CCCTGTGGGC 
TGGGTCAGTT 
CCCTCATGGT 
GGGAGGGCCA 
TGCAGTACCT 
TGAATAGCCA 
GTTTGCTTCG 
ATGTCTGGAG 
GGGACATGAG 
CTCCAGGCTG 
CCCGGCGGCC 
ATACCCTGCA 
TGGCCCTGGA 
AGTGCTACCA 
TCAGCCTAGA 
TGCACCACAT 
AATGACGATA 
AGCCGGGCTC 
CTGCATTCCC 
AATTTGCAAA 



GTGCAAGGAG 
CTGCAGGGAT 
AGTGGGGGGA 
GTCTGAGCTC 
AGTGCCCTCT 
GTCCTGGCCG 
TGACCAGGCA 
CGTGACACAG 
CGGCCACGGC 
TTCCCAGCTT 
CCTCCTGCTG 
TGGCTACACG 
CGACCCCTCC 
TCCTGCTTAT 
GGGCCGCCTG 
CGGGGGCGCC 
CCAGCACCCC 
GCTGACGGAG 
GTTCAGCAGC 
GTCCAGCTTT 



CTGGGCAGCC 
CTTTGGGATA 
TGAGCAGGAG 
TCCTCCTGGA 
TCATTTTGAC 
GGCTGGOGGG 
CTTTCCTTGT 
GGACAACTTC 
AGACCTGCCT 
CCAGCTCCTT 
CCCGTGACTC 
CAACAGCCTC 
TGGTCCTCCG 



1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4010 



Seq ID NO: C213 UNA 
Nucleic Acid Accession #: XM_043340.4 
Coding sequence: 195.. 1067 



GGGCGGCGCC 
GTGGCGAGTG 
TGGCAGCGGG 
CCGGCGGCGG 
AGGACCTTAC 
TCCTCTTCCT 
TCTATGTCGA 
GTTTACCCAA 
ACGAAGTGGG 
GCTTCGAGGG 
GTGCCACAGC 
GGGACACGCT 
GACTCACCTC 
ATGAGGACAA 
ACGTCGCCTA 
GCCCCATCAC 
TCTGTGCCAT 
CAGCCTCTGA 
CTAATGGCCG 
CCTCAATCTG 
TAGAGGATGG 
TACACTGCCT 
AGGGGTGGGG 
TCGACAATCT 
AGCCAACCTG 
AGGTACTCCT 
TATGATTTGT 
CATTTTTTTG 
TGGAAGAGAC 
TTAGTTCTTT 



11 
I 

CAATGGGCTG 
GCGAGTGTCA 
CTCGGACCCA 
CACGATGCCC 
GCAGCCAACG 
CTTCCTCTCG 
TGACCCAGAC 
TCTGCACTGC 
CCACATCGAC 
GCAGTTCAGC 
CCAGCCACAG 
ACAGGTCCAG 
CAACCCCCTG 
GAGTGGCAAG 
CAGCCACACG 
GGTCAAGTAC 
CATTGGCGGG 
GGCCTGGAAG 
AGGACCCTGG 
GTCCCAAATC 
CTCGATGTTT 
GAAGTTGCAG 
AGTCCAGGGG 
CGCAGCCCCC 
CCCCAGAGGC 
TTATACACCC 
CACTATAGAT 
AAGAAAAAAA 
ACCTCTGACC 
CCCCACTCGT 



21 

t 

CGCGGAGCGT 
GGGGGGCGGC 
CGCGGCGCCG 
TTTGACTTCA 
TACACCGGGG 
GAGCTCACCG 
AAGGACAGCG 
GAGTTGGTTG 
AACTCCATGA 
ATCAACAAGG 
AACCCAGACA 
AACATCCACG 
GCCTCCCACG 
CAGCGGTACT 
GGCCGCATCA 
ACAGAGAGAC 
ACCTTCACCG 
AAGATCCAGC 
GCATCGCCAG 
TGGCTGTGTC 
TGCAGCTACC 
TTCCCCTTTC 
AACATCCCAG 
ACTAGGTGGA 
ACCAGACCTG 
GGCACCTTCC 
GGAAATGTGT 
AAAAAACAAC 
CGTTATTCTT 
TGGGTTCAAC 




GGAGGTTTGA 
CCATTATCTC 
GATTTATAAC 
GTGGCAAGAT 
GGCTTGACAT 
AGATCCCGCT 
TCCCCGGCAA 
TGACGCATGT 
GAGCTTTCAA 
ACTACATCCT 
CCTACCAGTA 
TCCCTGCAAT 
GGCAGCGGCT 
TCGCCGGCAT 
TGGGCAAGAT 
CCTTGCCTCC 
CCAAAGGGTG 
TCTTTTCCCC 
CCTGGGGAGC 
AATGCATATC 
CACATTAATG 
GGCTTTCAGC 
ACGAAAGATG 
GGCATGCACA 
TCTGAGGACA 
ATAATCAAAA 
TAGATTAAAA 



41 
I 

CAGCGGGAGG 
GGGGCGGCCG 
GGCCTGCAGC 
CATCTACAGG 
CATCTGCTGC 
GACAGAAGTT 
CGACGTCAGT 
TCAGGATGAG 
GAACAATGGG 
CTTCCACGTG 
CATCCACAAG 
TGCTCTCGGG 
GAAGATTGTG 
CACGGTGGCC 
CTGGTTCCGC 
GTACAGATTC 
CCTGGACTCA 
GCATTGACGC 
AGTGCCCTGT 
TGTGGGAAGT 
GTGTTTCTTT 
CCCAAGAACA 
GATCAGCTCT 
ATTTGGTTTC 
TTTGGGACCA 
GTACTTCCCA 
TGAGTTGAAA 
TAGGGGATGT 
TCTGAAGGGA 
GGCTGATTTT 



51 
I 

CGAGTGGCGA 
GAGGAGGCGT 
GCTCCCACCC 
AAGGTGCCCA 
TGCCTCTTCA 
GTGAACGAGC 
CTGAACATCA 
ATGGGCAGGC 
GCAGGCTGCC 
TCCACACACA 
CTCTCCTTTG 
GGAGCAGACA 
CCCACGGTTT 
AACAAGGAAT 
TACGACCTCA 
ATCACCACGA 
TGCATCTTCA 
CACACCCAGC 
CTCCTTTGGC 
GGGGGGAAAG 
TTAGACAAAT 
GAGTCAGGCA 
CAGCCAGGCT 
TCCCCTGGGC 
GGCTGCCCAA 
AGCAAGCCCC 
TTCTTTTATG 
CAGTTTCCTA 
AAAAAATGTT 
CAG 



Seq ID NO: C214 DNA Sequence 

Nucleic Acid Accession #: NM_002151.1 

Coding sequence: 246.. 1499 



51 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1793 



1 11 21 31 41 

I I I I I 

TCGAGCCCGC TTTCCAGGGA CCCTACCTGA GGGCCCACAG GTGAGGCAGC CTGGCCTAGC 
AGGCCCCACG CCACCGCCTC TGCCTCCAGG CCGCCCGCTG CTGCGGGGCC ACCATGCTCC 
TGCCCAGGCC TGGAGACTGA CCCGACCCCG GCACTACCTC GAGGCTCCGC CCCCACCTGC 
TGGACCCCAG GGTCCCACCC TGGCCCAGGA GGTCAGCCAG GGAATCATTA ACAAGAGGCA 
GTGACATGGC GCAGAAGGAG GGTGGCCGGA CTGTGCCATG CTGCTCCAGA CCCAAGGTGG 



60 

120 

180 

240 

300 
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CAGCTCTCAC TGCGGGGACC CTGCTACTTC TGACAGCCAT CGGGGCGGCA TCCTGGGCCA 360 

TTGTGGCTGT TCTCCTCAGG AGTGACCAGG AGCOGCTGTA CCCAGTGCAG GTCAGCTCTG 420 

CGGACGCTCG GCTCATGGTC TTTGACAAGA CGGAAGGGAC GTGGCGGCTG CTGTGCTCCT 480 

_ CGCGCTCCAA CGCCAGGGTA GCCGGACTCA GCTGCGAGGA GATGGGCTTC CTCAGGGCAC 540 

J TGACCCACTC CGAGCTGGAC GTGCGAACGG CGGGCGCCAA TGGCACGTCG GGCTTCTTCT 600 

GTGTGGACGA GGGGAGGCTG CCCCACACCC AGAGGCTGCT GGAGGTCATC TCOGTGTGTG 660 

ATTGCCCCAG AGGCCGTTTC TTGGCCGCCA TCTGCCAAGA CTGTGGCCGC AGGAAGCTGC 720 

CCGTGGACCG CATCGTGGGA GGCCGGGACA CCAGCTTGGG CCGGTGGCCG TGGCAAGTCA 780 

GCCTTCGCTA TGATGGAGCA CACCTCTGTG GGGGATCCCT GCTCTCCGGG GACTGGGTGC 840 

1U TGACAGCCGC CCACTGCTTC CCGGAGCGGA ACCGGGTCCT GTCCCGATGG CGAGTGTTTG 900 

CCGGTGCCGT GGCCCAGGCC TCTCCCCACG GTCTGCAGCT GGGGGTGCAG GCTGTGGTCT 960 

ACCACGGGGG CTATCTTCCC TTTCGGGACC CCAACAGCGA GGAGAACAGC AACGATATTG 1020 

CCCTGGTCCA CCTCTCCAGT CCCCTGCCCC TCACAGAATA CATCCAGCCT GTGTGCCTCC 1080 

1 _ CAGCTGCCGG CCAGGCCCTG GTGGATGGCA AGATCTGTAC CGTGACGGGC TGGGGCAACA 1140 

1J CGCAGTACTA TGGCCAACAG GCCGGGGTAC TCCAGGAGGC TCGAGTCCCC ATAATCAGCA 1200 

ATGATGTCTG CAATGGCGCT GACTTCTATG GAAACCAGAT CAAGCCCAAG ATGTTCTGTG 1260 

CTGGCTACCC CGAGGGTGGC ATTGATGCCT GCCAGGGCGA CAGCGGTGGT CCCTTTGTGT 1320 

GTGAGGACAG CATCTCTCGG ACGCCACGTT GGCGGCTGTG TGGCATTGTG AGTTGGGGCA 1380 

CTGGCTGTGC CCTGGCCCAG AAGCCAGGCG TCTACACCAA AGTCAGTGAC TTCCGGGAGT 1440 

GGATCTTCCA GGCCATAAAG ACTCACTCCG AAGCCAGCGG CATGGTGACC CAGCTCTGAC 1500 

CGGTGGCTTC TCGCTGCGCA GCCTCCAGGG CCCGAGGTGA TCCCGGTGGT GGGATCCACG 1560 

CTGGGCCGAG GATGGGACGT TTTTCTTCTT GGGCCCGGTC CACAGGTCCA AGGACACCCT 1620 

CCCTCCAGGG TCCTCTCTTC CACAGTGGCG GGCCCACTCA GCCCGGAGAC CACCCAACCT 1680 

CACCCTCCTG ACCCCCATGT AAATATTGTT CTGCTGTCTG GGACTCCTGT CTAGGTGCCC 1740 

CTGATGATGG GATGCTCTTT AAATAATAAA GATGGTTTTG ATT 1783 
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Seq ID MO: C215 DNA Sequence 
Nucleic Acid Accession #: AB037745.1 
Coding sequence: 26-. 1744 

1 11 21 31 41 51 

1 I I I I I 

ATGGTGGAAC ACGCTGCCCA CAAACATGGA AACGACCGTT CTCAGTGGGA TCAACTTCGA 60 

GTACAAGGGC ATGACAGGCT GGGAGGTGGC TGGTGATCAC ATTTACACAG CTGCTGGAGC 120 

CTCAGACAAT GACTTCATGA TTCTCACTCT GGTTGTGCCA GGATTTAGAC CTCCGCAGTC 180 

GGTGATGGCA GACACAGAGA ATAAAGAGGT GGCCAGAATC ACATTTGTCT TTGAGACCCT 240 

CTGTTCTGTG AACTGTGAGC TCTACTTCAT GGTGGGTGTG AATTCTAGGA CCAACACTCC 300 

TGTGGAGAOG TGGAAAGGTT CCAAAGGCAA ACAGTCCTAT ACCTACATCA TTGAGGAGAA 360 

CACTACCACG AGCTTCACCT GGGCCTTCCA GAGGACCACT TTTCATGAGG CAAGCAGGAA 420 

GTACACCAAT GACGTTGCCA AGATCTACTC CATCAATGTC ACCAATGTTA TGAATGGCGT 480 

GGCCTCCTAC TGCCGTCCCT GTGCCCTAGA AGCCTCTGAT GTGGGCTCCT CCTGCACCTC 540 

TTGTCCTGCT GGTTACTATA TTGACCGAGA TTCAGGAACC TGCCACTCCT GCCCCCCTAA 600 

CACAATTCTG AAAGCCCACC AGCCTTATGG TGTCCAGGCC TGTGTGCCCT GTGGTCCAGG 660 

GACCAAGAAC AACAAGATCC ACTCTCTGTG CTACAATGAT TGCACCTTCT CACGCAACAC 720 

TCCAACCAGG ACTTTCAACT ACAACTTCTC CGCTTTGGCA AACACCGTCA CTCTTGCTGG 780 

AGGGCCAAGC TTCACTTCCA AAGGGTTGAA ATACTTCCAT CACTTTACCC TCAGTCTCTG 840 

TGGAAACCAG GGTAGGAAAA TGTCTGTGTG CACCGACAAT GTCACTGACC TCCGGATTCC 900 

TGAGGGTGAG TCAGGGTTCT CCAAATCTAT CACAGCCTAC GTCTGCCAGG CAGTCATCAT 960 

CCCCCCAGAG GTGACAGGCT ACAAGGCCGG GGTTTCCTCA GAGCCTGTCA GCCTTGCTGA 1020 

TCGACTTATT GGGGTGACAA CAGATATGAC TCTGGATGGA ATCACCTCCC CAGCTGAACT 1080 

TTTCCACCTG GAGTCCTTGG GAATACCGGA CGTGATCTTC TTTTATAGGT CCAATGATGT 1140 

GACCCAGTCC TGCAGTTCTG GGAGATCAAC CACCATCCGC GTCAGGTGCA GTCCACAGAA 1200 

AACTGTCCCT GGAAGTTTGC TGCTGCCAGG AACGTGCTCA GATGGGACCT GTGATGGCTG 1260 

CAACTTCCAC TTCCTGTGGG AGAGCGCGGC TGCTTGCCCG CTCTGCTCAG TGGCTGACTA 1320 

CCATGCTATC GTCAGCAGCT GTGTGGCTGG GATCCAGAAG ACTACTTACG TGTGGCGAGA 1380 

AOCCAAGCTA TGCTCTGGTG GCATTTCTCT GCCTGAGCAG AGAGTCACCA TCTGCAAAAC 1440 

CATAGATTTC TGGCTGAAAG TGGGCATCTC TGCAGGCACC TGTACTGCCA TCCTGCTCAC 1500 

CGTCTTGACC TGCTACTTTT GGAAAAAGAA TCAAAAACTA GAGTACAAGT ACTCCAAGCT 1560 

GGTGATGAAT GCTACTCTCA AGGACTGTGA CCTGCCAGCA GCTGACAGCT GCGCCATCAT 1620 

GGAAGGCGAG GATGTAGAGG ACGACCTCAT CTTTACCAGC AAGAAGTCAC TCTTTGGGAA 1680 

GATCAAATCA TTTACCTCCA AGCAGCCAGC TCCTGTCACC ATCTCTCTTT CAGAGGACTC 1740 

CTGATGGATT TGACTCAGTG CCGCTGAAGA CATCCTCAGG AGGCCCAGAC ATGGACCTGT 1800 

GAGAGGCACT GCCTGCCTCA CCTGCCTCCT CACCTTGCAT AGCACCTTTG CAAGCCTGCG 1860 

GCGATTTGGG TGCCAGCATC CTGCAACACC CACTGCTGGA AATCTCTTCA TTGTGGCCTT 1920 

ATCAGATGTT TGAATTTCAG ATCTTTTTTT ATAGAGTACC CAAACCCTCC TTTCTGCTTG 1980 

CCTCAAACCT GCCAAATATA CCCACACTTT GTTTGTAAAT TATGCCCTTG CTTGTATCTT 2040 

GTTTCCCAAA ATGGCCCATC CGCCAGAGCC ATAGCTTCGT CTGCTCATAA TTCTTATAGC 2100 

TTTGGAATGA AAATATTTCT ATCTTCTTAA GTATAGAAAC TATTTCCTCT GTCCTCTAAC 2160 

TTAAGGGCAG AAACAGCTGG GAGTTTTCCT CGCATGCCCT CAGCTCATGA TCTCTTCAGG 2220 

AGAGAGGCTG GGTGAGGAGG GTGTCGGGGT TCCCTGGTGG ATAATCTTCA TAGCAGCCTG 2280 

GATCCATTTC CCCTGGATAA CCAGCTCAAA GGGAGTGAAA ATGGTAGTCT GAGGGCAAGG 2340 

GGAGCAAGGC CTGGGTAAGA AAAGCCTTGA AAAGCATAAA AAGAGGCCGG GCGCGGTGGC 2400 

TCACGCCTGT AATCCCAGCA CTTTGGGAGG CGGAGGCGGG CAGATCATGA GGTCGGGAGA 2460 

TTGAGACCAT CCTGGCTAAC ACGGTGAAGC CCOGTCTCTA CTGGAAATAC AAAAAATTAG 2520 

CCGGGCGTGG TGGCGGGTGC CTGTGGTCCC AGCTACTCGG GAGGCTGAGG CGGGAGAATA 2580 

GCGTGGGCCT GGAAGGCGGA GCTTGCAGTG AGCCGAGATC GCGCCACTGC ACTCCATCCA 2640 

GCCTGGGTGA CAGAGTGAGA CTCTGCCTCA AAAAAAAAAA AAAAAAAGAA AAGCACAAAG 2700 

AGAGGCAACA AGGAATGTTT TTGTTTTTGA GACAGGCTCT CACTCTGTCA CCTAGGCTGG 2760 

AGTGCAGTGG CGTAATCACT GTTCAGTGCA GCCTCAAGCT CTTGGGCTCA GGCTATCCTC 2820 

CCATCTCAGC CTCTCAAGTA GCTGGGACTA CGAGTGTGCA CCACCAGGCT CACTAATTTT 2880 

TGTGTTTTTT GTAGACACGG GGTTTCACCG TGTTGCCCAG GCTGGTCTCC AACTCCTGGG 2940 

CTCAAGTGAT CTGTCCGCCT CGGC CTCCCA AACTGCTGGG ATTACAGGCA TAAGCCACTG 3000 

CACTCAGCCT TTTATTTGTT TTTTAAACCA CGTAGCTCAT TGCCTTCTCT TAAGTAAATG 3060 

ATAGATATTC TCACTGAAGC CAAAGGAATA AGTTCATCAA GAAAATGCCC AAAGCCCTGG 3120 
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TGGATACATC CTCCCTATCT TTTTTTTAAA CCTTCCACTA TCACTCTATG ACACTGAAAA 3180 

GAACCAGGTA AGCCCCAAAC CCAGATGTTC CAGCCTTATC CTCTATTGGG TTTACCCACA 3240 

GACATAGCAA ACCCTGTCAG TGAGGAAAAT TCCCCATCCT TGAGTGCCCC CGTCCTAGAA 3300 

GTTTGGGCCA TATTATGGAA CAGGGGTCTC TTATTTGAAA AGAGCACAAG GAGGCCAAGA 3360 

TTTTAATGGG GCACTTTAGG GGATACAGCC CACAATGGCA TGGGCCTGAG GTGGCCGTGA 3420 

TGTCTGCTTC TAAGCTTAAC GCATCTGCTC AGGCACAGAA TAAACGTCTA GGCTGGCCAA 3480 

AAAAGGAACT GAATCCCAGG CCCATACGCC AGCACCAGAA TCAAACCAGT CTTCAAGGAA 3540 

GGAAGGCTAG GAGAGTTTAA CAAGATTTTC ACTGGGCCCA GCATGGTGGC TCACACCTGT 3600 

AATCCCAAGG CAGAATGGTG GCTTGAGCTC AGGAGTTCAA GACCAGCCTG GGCAACACAG 3660 

TGAGACCCTG TCTCTAAAAA ATTTAAAAAT AAACAAGGTG TTCACCAAGC TGGGATACTT 3720 

CTCACTATTA AGCCCCTATC TTTCTCTTTT TTTCATTCTC AATTGCTTTG TGTGATAAAA 3780 

AACTAAAGAG ACTTCTGGTC CAATTTCTGG CAACATCCCT TCTGAAAGGT GAGTAGAGTG 3840 

GGTGTCTTCT ATGCCCATTT TCCCCAATTT TACACAAACT ATTATCAATG AACTTTTAAG 3900 

TACCTAGAAT GGGTAAAACC AGAGCAAGAC TTTAAATTAC CTTCTTCTTT CTTCTACTGG 3960 

CAGTTCTGCC TCCATCACTA TCAGGCTAGG GTGACCTTCC CTTGGTCAAG CCCCAATTGC 4020 

CCATGATTTG TGCCTGTGCC CTTTCTCCAG TGACCATTTG GTGACCAGAT GGTAGATATA 4080 

GAAAGGGGAT GGCATTTGCA AGTGACTAGT CTGCCACAAA ATGCTCATCT GATTAGCCAC 4140 

TGCTGCCCTG GCAATGGCTT TGTAAGAGTC AATGAGAACT AGAGCCAGGC TGTGGTCCCT 4200 

GGCCATCAAC AGTGTTGGTG ACGGCAGGGA GTCCCTTTGG TTTAATAAAT CCAGTTTTTC 4260 

TTTGGGTATC CAAATTCTCC CCTCCTTTTG TAGGAGTCAG GCTCTCAGAA CCTGTGTCCA 4320 

TGTTGGAACT TCCCCCAGTG TGGATGCAGA TACGCAGCTC CTGAQCTCCA GCCTAAAGTC 4380 

TTCTGTAGCC TCAGCAATAC TTGGGCACCT GCTGTCTCAC TGAATAGCTT TCTTTTGTGA 4440 

CAAAGGCCAC AGACAGCCCT TAGACTATTC CGGAAACAGT AGGAAAAATT ACATATGTCT 4500 

TTGACTTCTT TATTCTGACT CCACTGATTT TAGCCATAAT ACTTTAAGGA GCTACTTTTT 4560 

ACTACCCCTT ACCGTGCTGA CTTCTGCAGG TCTGCCCTGT GACCTGTCAG GAACTCCTGA 4620 

GTTACGCTAC TGGGGTCACC TGTTGCTCCC CTAGCAAGTT AGGCATGTCA TATATTTTTA 4660 

ACAGCTTTAT TGAGATATAA TTCACATATT ATACAATTCA CCTTTAAAAC ATACGATTCA 4740 

ATGGTTTTCA GCAAACTCAC AGAGTTGTCC GCCCACTTGA GAGCAAACAC ATGTTCAATT 4800 

TTCTTTTCCT TTTTTTTTTT GAGACAGAGT CAGCTTTGTC GCCCAGGCTG GAGTGCAGTG 4860 

CCATGATCTT GGCTCACTGC AGCCTCCCCA TCCTGGGTTC AAGTGATCCT TCTGCTTCAG 4920 

CCTCCCCAGT AGCTGGGATT ACAGGCATGC GCCACCACGC CTAGCTAATT TTTGTGTTTT 4980 

TAGTAGAGAT GGGGTTTCAC CGTGTTGGCC AGGCTGGTCT CAAACTCCTG GACTCAAGTG 5040 

ATCCACCCAC CTCGGCCTCC CAAAGTGCTG GGATTGCAGG TGTGAGCCAC CGTGCCTGGC 5100 

CTACGTGTTC AATTTTCTAT GAACAAAGGC TTTAGTCCTT GACCCAGGGC TAAAGTGGTC 5160 

TGTCCAAGCT GTTGTTGGTA GAGGGAGTAT GATAAAATGT TTAAATCTCA TTTGGTTACC 5220 

TTGAGTCCTG GAACACGCAG TAACTGTCAT GCTATAGTCA TCATCTGTAT TTGGCTGGGA 5280 

ATACAAATGA AGATTGTGGT GTATTCAAGC AGTAGGGTTT TTGCTTTTGT TTTTGTTTTA 5340 

GTGCCAACAA AACTTTTTTT TGTCTGACTA CATTAAAGAT AAGACTGACT ATATTTATAC 5400 

AACAGAAACT TTGTAATAGA TTTTTTCAGC TTTGTGAAAT OGAATTTTTT TTCATCAGGG 5460 

CTGGTTGGAT TTCCTTTTTA CCCTGTAATC CAAGCGTTAA TAGTTTGTTA GAAGATGGGT 5520 

TATTGCATGT CACTTTTTTT TTTTTGTAAA ATAAAAACAT ACCTTAC 5567 



Seq ID NO: C216 DNA Sequence 
Nucleic Acid Accession #: NM_004864.1 
45 Coding sequence: 26-952 



1 11 21 31 41 51 

I I I I I I 

CGGAACGAGG GCAACCTGCA CAGCCATGCC CGGGCAAGAA CTCAGGACGG TGAATGGCTC 60 

TCAGATGCTC CTGGTGTTGC TGGTGCTCTC GTGGCTGCCG CATGGGGGCG CCCTGTCTCT 120 

GGCCGAGGCG AGCOGCGCAA GTTTCCCGGG ACCCTCAGAG TTGCACTCCG AAGACTCCAG 180 

ATTCCGAGAG TTGCGGAAAC GCTACGAGGA CCTGCTAACC AGGCTGCGGG CCAACCAGAG 240 

CTGGGAAGAT TCGAACACCG ACCTCGTCCC GGCCCCTGCA GTCCGGATAC TCACGCCAGA 300 

AGTGCGGCTG GGATCCGGCG GCCACCTGCA CCTGCGTATC TCTCGGGCCG CCCTTCCCGA 360 

GGGGCTCCCC GAGGCCTCCC GCCTTCACCG GGCTCTGTTC CGGCTGTCCC CGACGGCGTC 420 

AAGGTCGTGG GACGTGACAC GACCGCTGCG GCGTCAGCTC AGCCTTGCAA GACCCCAAGC 480 

GCCCGCGCTG CACCTGCGAC TGTCGCCGCC GCCGTCGCAG TCGGACCAAC TGCTGGCAGA 540 

ATCTTCGTCC GCACGGCCCC AGCTGGAGTT GCACTTGCGG CCGCAAGCCG CCAGGGGGCG 600 

CCGCAGAGCG CGTGCGCGCA ACGGGGACGA CTGTCCGCTC GGGCCCGGGC GTTGCTGCCG 660 

TCTGCACACG GTCCGCGCGT CGCTGGAAGA CCTGGGCTGG GCCGATTGGG TGCTGTCGCC 720 

ACGGGAGGTG CAAGTGACCA TGTGCATCGG CGCGTGCCCG AGCCAGTTCC GGGCGGCAAA 780 

CATGCACGCG CAGATCAAGA CGAGCCTGCA CCGCCTGAAQ CCCGACACGG AGCCAGCGCC 840 

CTGCTGCGTG CCCGCCAGCT ACAATCCCAT GGTGCTCATT CAAAAGACCG ACACCGGGGT 900 

GTCGCTCCAG ACCTATGATG ACTTGTTAGC CAAAGACTGC CACTGCATAT GAGCAGTCCT 960 

GGTCCTTCCA CTGTGCACCT GCGCGGGGGA GGCGACCTCA GTTGTCCTGC CCTGTGGAAT 1020 

GGGCTCAAGG TTCCTGAGAC ACCCGATTCC TGCCCAAACA GCTGTATTTA TATAAGTCTG 1080 

TTATTTATTA TTAATTTATT GGGGTGACCT TCTTGGGGAC TCGGGGGCTG GTCTGATGGA 1140 

ACTGTGTATT TATTTAAAAC TCTGGTGATA AAAATAAAGC TGTCTGAACT GTTAAAAAAA 1200 

AAAA 1204 

Seq ID NO: C432 DNA Sequence 

Nucleic Acid Accession #: NMJJ52858.1 

Coding sequence: 54.. 1259 

1 11 21 31 41 51 

I I I I I I 

GGCACGAGGT GTTGCCCTCA GGTCGCTCCC GGGCGCGGAC ACGGAACCCG GCCATGGAAG 60 

ATCCGTOGGG GGCTCGCGAG CCCCGGGCCC GGCCGAGAGA GCGGGACCCG GGACGGCGCC 120 

CCCACCCAGA CCAAGGCCGC ACCCACGATC GACCGCGGGA CCGACCCGGG GACCCGCGCA 180 

GGAAGCGAAG CAGCGACGGG AACCGGCGAA GGGACGGGGA CCGGGACCCG AAGAGAGACC 240 

AGGAGAGGGA CGGGAACCGC GACCGGAACC GGGACGGGGA GAGGGAGAGA GAGAGGGAAA 300 

GAGACCCGGA CCGAGGCCCC CGCCGGGACA CACACAGGGA CGCGGGCCCT CGCGCAGGTG 360 

AACACGGAGT TTGGGAAAAA CCGCGCCAAA GCCGGACGCG GGACGGAGCC CGGGGACTGA 420 

CCTGGGACGC AGCCGCGCCT CCTGGGCCCG CGCCCTGGGA AGCCCCGGAG CCGCCGCAGC 480 
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CGCAGAGGAA 
ATCTGCCCTC 
AAGGACTCCT 
TGCTGGAGGT 
CAGGGGGCTA 
CTTACAGTGG 
ACCAGCTAAA 



TGGTGACCGA 
TCTACTTCCA 
TGTACCAAAG 
CTGGAATCTT 
AGGGCTACCG 
GGTTTCTAAA 
TTCCCAAGAA 
GTGTGGTGGG 
CTTCTGGAGT 
TGGCCGGCCT 
ATCACTTGAG 
TAAAATACAA 
AGGCTGAGGC 
CGCCACTGCA 



GGGAGACCCC 
GACCCCCAGG 
GGAATGCCAC 
TCTCCTGAAC 
CACGGGCATC 
CTTTGATGGT 
GCTGCCCATG 
CTTCGTTGCC 
AGGCTTGTTG 
CTACCTCTCT 
CAAAGGCTAC 
TGCTGCCCTG 
AAAAGTTAGG 
ACGCTCTGAC 
TCCCTTGTTG 
CGGAGCTCCC 
CCTCTGTGAG 
CGGCGGCTCA 
GTCAGGAGCT 
AAAAATTAGC 
AGGAGAATCG 
CTCCAGCCCA 



GGGCGCCGCA 
CCTGGACGAG 
AAATGCAAAT 
TTGCTGATCC 
ACCAGCTTGG 
GCTGACGGGG 
GTCACTGTGG 
ATGGGTGTCC 
GACATGCTCA 
GCTGCCTATG 
AGCGGTTTCG 
GGCATTGTGG 
AAGCTAAAAG 
AGATGCAAGT 
TGGAAGTTTC 
AGTCGCATGG 
TGAGGGACCA 
CACCTGTAAC 
CGAGACCAGC 
CAGGCGTGGT 
CTTGAATCTG 
GGTGACAGAG 



GACCCGAAAG 
AGGAGGTGGA 
ACTTGTGCAC 
TGGCCTGCAG 
GGGGCATTTA 
AGAAGGCCCA 
CAATGGCCTG 
TGCGGGTCCC 
TCGCGGGGGG 
GCTCTCCTGT 
GCTGCAGTTT 
TCTTTGCCCT 
AGAAGCCAGC 
GGTGGTGGAA 
CAATGCTGGA 
AGCGGTGTTC 
ATCAAAATTA 
CCCAGCACTT 
TTGGCCAACA 
GGCGGGCGCC 
GGAGGCGGAG 
CGAGACTCCA 



TGAACCCCCT 
ATATTACCAG 
TGGGAGAGCC 
CTCTGTGTCT 
CTACTATCAG 
GCAACTGGAT 
TAGTGGAGCC 
GTGGCATTGT 
GTACATCCOG 
GTGTAAAGAG 
CCACGGAGCA 
GGGGGCGGTC 
AGAAATGTTT 
GGTAGTCTGA 
AAAGCAGCGA 
ATGGATGCAA 
TTTTTCAAAA 
TGGGAGGCTG 
TGGTGAGCCC 
TGTAATCCCA 
ATTGCAGTGA 
TCTCAAAAAA 



TCGGAGAGAT 
TCAGAGGCGG 
TGCTGCCAAA 
TACAGTTCCA 
TTCGGAGGGG 
GTCCAGTTCT 
CTCACAGCCC 
CCACTGTTGC 
GCCTTGTACT 
AGGCAGGCGC 
GATATAGGAG 
CTGGCCATAA 
GAATTTTAAG 
GCCACTGCCT 
GCCAGCGTTG 
CAGACCCTGG 
AGCAAAAAAA 
AGGTGGGTGG 
CCGTCTCTAC 
GCTACTTGGG 
GCCGAGATCC 
AAAAAAAAAA 



Seq ID NO: C434 DNA Sequence 

Nucleic Acid Accession (M Eos sequence 

Coding sequence: 261.. 2861 



SI 



540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1S00 

1560 

1620 

1680 

1740 

1800 



1 11 21 31 41 

I I I I I I 

GAGCTAGCGC TCAAGCAGAG CCCAGCGCGG TGCTATCGGA CAGAGCCTGG CGAGCGCAAG 60 

CGGCGCGGGG AGCCAGCGGG GCTGAGCGCG GCCAGGGTCT GAACCCAGAT TTCCCAGACT 120 

AGCTACCACT CCGCTTGCCC ACGCCCCGGG AGCTCGCGGC GCCTGGCGGT CAGCGACCAG 180 

ACGTCCGGGG CCGCTGCGCT CCTGGCCCGC GAGGCGTGAC ACTGTCTGGQ CTACAGACCC 240 

AGAGGGAGCA CACTGCCAGG ATGGGAGCTG CTGGGAGGCA GGACTTCCTC TTCAAGGCCA 300 

TGCTGACCAT CAGCTGGCTC ACTCTGACCT GCTTCCCTGG GGCCACATCC ACAGTGGCTG 360 

CTGGGTGCCC TGACCAGAGC CCTGAGTTGC AACCCTGGAA CCCTGGCCAT GACCAAGACC 420 

ACCATGTGCA TATCGGCCAG GGCAAGACAC TGCTGCTCAC CTCTTCTGCC ACGGTCTATT 480 

CCATCCACAT CTCAGAGGGA GGCAAGCTGG TCATTAAAGA CCACGACGAG CCGATTGTTT 540 

TGCGAACCCG GCACATCCTG ATTGACAACG GAGGAGAGCT GCATGCTGGG AGTGCCCTCT 600 

GCCCTTTCCA GGGCAATTTC ACCATCATTT TGTATGGAAG GGCTGATGAA GGTATTCAGC 660 

CGGATCCTTA CTATGGTCTG AAGTACATTG GGGTTGGTAA AGGAGGCGCT CTTGAGTTGC 720 

ATGGACAGAA AAAGCTCTCC TGGACATTTC TGAACAAGAC CCTTCACCCA GGTGGCATGG 780 

CAGAAGGAGG CTATTTTTTT GAAAGGAGCT GGGGCCACCG TGGAGTTATT GTTCATGTCA 840 

TCGACCCCAA ATCAGGCACA GTCATCCATT CTGACCGGTT TGACACCTAT AGATCCAAGA 900 

AAGAGAGTGA ACGTCTGGTC CAGTATTTGA ACGCGGTGCC CGATGGCAGG ATCCTTTCTG 960 

TTGCAGTGAA TGATGAAGGT TCTCGAAATC TGGATGACAT GGCCAGGAAG GCGATGACCA 1020 

AATTGGGAAG CAAACACTTC CTGCACCTTG GATTTAGACA CCCTTGGAGT TTTCTAACTG 1080 

TGAAAGGAAA TCCATCATCT TCAGTGGAAG ACCATATTGA ATATCATGGA CATCGAGGCT 1140 

CTGCTGCTGC CCGGGTATTC AAATTGTTCC AGACAGAGCA TGGCGAATAT TTCAATGTTT 1200 

CTTTGTCCAG TGAGTGGGTT CAAGACGTGG AGTGGACGGA GTGGTTCGAT CATGATAAAG 1260 

TATCTCAGAC TAAAGGTGGG GAGAAAATTT CAGACCTCTG GAAAGCTCAC CCAGGAAAAA 1320 

TATGCAATCG TCCCATTGAT ATACAGGCCA CTACAATGGA TGGAGTTAAC CTCAGCACCG 1380 

AGGTTGTCTA CAAAAAAGGC CAGGATTATA GGTTTGCTTG CTACGACCGG GGCAGAGCCT 1440 

GCCGGAGCTA CCGTGTACGG TTCCTCTGTG GGAAGCCTGT GAGGCCCAAA CTCACAGTCA 1500 

CCATTGACAC CAATGTGAAC AGCACCATTC TGAACTTGGA GGATAATGTA CAGTCATGGA 1560 

AACCTGGAGA TACCCTGGTC ATTGCCAGTA CTGATTACTC CATGTACCAG GCAGAAGAGT 1620 

TCCAGGTGCT TCCCTGCAGA TCCTGCGCCC CCAACCAGGT CAAAGTGGCA GGGAAACCAA 1680 

TGTACCTGCA CATCGGGGAG GAGATAGACG GCGTGGACAT GCGGGGGGAG GTTGGGCTTC 1740 

TGAGCCGGAA CATCATAGTG ATGGGGGAGA TGGAGGACAA ATGCTACCCC TACAGAAACC 1800 

ACATCTGCAA TTTCTTTGAC TTCGATACCT TTGGGGGCCA CATCAAGTTT GCTCTGGGAT 1860 

TTAAGGCAGC ACACTTGGAG GGCACGGAGC TGAAGCATAT GGGACAGCAG CTGGTGGGTC 1920 

AGTACCCGAT TCACTTCCAC CTGGCCGGTG ATGTAGACGA AAGGGGAGGT TATGACCCAC 1980 

CCACATACAT CAGGGACCTC TCCATCCATC ATACATTCTC TCGCTGCGTC ACAGTCCATG 2040 

GCTCCAATGG CTTGTTGATC AAGGACGTTG TGGGCTATAA CTCTTTGGGC CACTGCTTCT 2100 

TCACGGAAGA TGGGCCGGAG GAACGGAACA CTTTTGACCA CTGTCTTGGC CTCCTTGTCA 2160 

. AGTCTGGAAC CCTCCTCCCC TCGGACCGTG ACAGCAAGAT GTGCAAGATG ATCACAGAGG 2220 

ACTCCTACCC AGGGTACATC CCCAAGCCCA GGCAAGACTG CAATGCTGTG TCCACCTTCT 2280 

GGATGGCCAA TCCCAACAAC AACCTCATCA ACTGTGCCGC TGCAGGATCT GAGGAAACTG 2340 

GATTTTGGTT TATTTTTCAC CACGTACCAA CGGGCCCCTC CGTGGGAATG TACTCCCCAG 2400 

GTTATTCAGA GCACATTCCA CTGGGAAAAT TCTATAACAA CCGAGCACAT TCCAACTACC 2460 

GGGCTGGCAT GATCATAGAC AACGGAGTCA AAACCACCGA GGCCTCTGCC AAGGACAAGC 2520 

GGCCGTTCCT CTCAATCATC TCTGCCAGAT ACAGCCCTCA CCAGGAOGCC GACCCGCTGA 2580 

AGCCCCGGGA GCCGGCCATC ATCAGACACT TCATTGCCTA CAAGAACCAG GACCACGGGG 2640 

CCTGGCTGCG CGGCGGGGAT GTGTGGCTGG ACAGCTGCCA TTTCAGAGGG GAGGCTCAGG 2700 

AAGGCTTCTT GCTTACAGGA ATGAAGGCTG GGGGCATTTT GCTGGGGGGA GATGAGGCAG 2760 

CCTCTGGAAT GGCTCAGGGA TTCAGCCCTC CCTGCCGCTG CCTGCTGAAG CTGGTGACTA 2820 

CGGGGTCGCC CTTTGCTCAC GTCTCTCTGG CCCACTCATG ATGGAGAAGT GTGGTCAGAG 2880 
GGGAGCAATG GGCTTTGCTG CTTATGAGCA CAGAGGAATT CAGTCCCCAG GCAGCCCTGC 2940 
CTCTGACTCC AAGAGGGTGA AGTCCACAGA AGTGAGCTCC TGCCTTAGGG CCTCATTTGC 3000 
TCTTCATCCA GGGAACTGAG CACAGGGGGC CTCCAGGAGA CCCTAGATGT GCTCGTACTC 3060 
CCTCGGCCTG GGATTTCAGA GCTGGAAATA TAGAAAATAT CTAGCCCAAA GCCTTCATTT 3120 
TAACAGATGG GGAAAGTGAG CCCCCAAGAT GGGAAAGAAC CACACAGCTA AGGGAGGGCC 3180 
TGGGGAGCCC CACCCTAGCC CTTGCTGCCA CACCACATTG CCTCAACAAC CGGCCCCAGA 3240 
GTGCCCAGGC ACTCCTGAGG TAGCTTCTGG AAATGGGGAC AAGTCCCCTC GAAGGAAAGG 33 00 
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AAATGACTAG AGTAGAATGA CAGCTAGCAG ATCTCTTCCC TCCTGCTCCC AGCGCACACA 3360 

AACCCGCCCT CCCCTTGGTG TTGGCGGTCC CTGTGGCCTT CACTTTGTTC ACTACCTGTC 3420 

AGCCCAGCCT GGGTGCACAG TAGCTGCAAC TCCCCATTGG TGCTACCTGG CTCTCCTGTC 34 BO 

TCTGCAGCTC TACAGGTGAG GCCCAGCAGA GGGAGTAGGG CTCGCCATGT TTCTGGTGAG 3540 

5 CCAATTTGGC TGATCTTGGG TGTCTGAACA GCTATTGGGT CCACCCCAGT CCCTTTCAGC 3600 

TGCTGCTTAA TGCCCTGCTC TCTCCCTGGC CCACCTTATA GAGAGCCCAA AGAGCTCCTG 3660 

TAAGAGGGAG AACTCTATCT GTGGTTTATA ATCTTGCACG AGGCACCAGA GTCTCCCTGG 3720 

GTCTTGTGAT GAACTACATT TATCCCCTTT CCTGCCCCAA CCACAAACTC TTTCCTTCAA 3 7 BO 

AGAGGGCCTG CCTGGCTCCC TCCACCCAAC TGCACCCATG AGACTCGGTC CAAGAGTCCA 3840 

10 TTCCCCAGGT GGGAGCCAAC TGTCAGGGAG GTCTTTCCCA CCAAACATCT TTCAGCTGCT 3900 

GGGAGGTGAC CATAGGGCTC TGCTTTTAAA GATATGGCTG CTTCAAAGGC CAGAGTCACA 3960 

GGAAGGACTT CTTCCAGGGA GATTAGTGGT GATGGAGAGG AGAGTTAAAA TGACCTCATG 4020 

TCCTTCTTGT CCACGGTTTT GTTGAGTTTT CACTCTTCTA ATGCAAGGGT CTCACACTGT 4080 

GAACCACTTA GGATGTGATC ACTTTCAGGT GGCCAGGAAT GTTGAATGTC TTTGGCTCAG 4140 

15 TTCATTTAAA AAAGATATCT ATTTGAAAGT TCTCAGAGTT GTACATATGT TTCACAGTAC 4200 

AGGATCTGTA CATAAAAGTT TCTTTCCTAA ACCATTCACC AAGAGCCAAT ATCTAGGCAT 4260 

TTTCTTGGTA GCACAAATTT TCTTATTGCT TAGAAAATTG TCCTCCTTGT TATTTCTGTT 4320 

TGTAAGACTT AAGTGAGTTA GGTCTTTAAG GAAAGCAACG CTCCTCTGAA ATGCTTGTCT 4380 

nTrit ' f g l'l' GCCGAAATAG CTGGTCCTTT TTCGGGAGTT AGATGTATAG AGTGTTTGTA 44 4 0 

20 TGTAAACATT TCTTGTAGGC ATCACCATGA ACAAAGATAT ATTTTCTATT TATTTATTAT 4500 

ATGTGCACTT CAAGAAGTCA CTGTCAGAGA AATAAAGAAT TGTCTTAAAT GTCATGATTG 4560 

GAGATGTCCT TTGCATTGCT TGGAAGGGGT GTACCTAGAG CCAAGGAAAT TGGCTCTGGT 4620 

TTGGAAAAAT TTTGCTGTTA TTATAGTAAA CATACAAAGG ATGTCAAAAA AAAAAAAAAA 4680 

AAAAAAAAAA AAAAAAAAAA AA 4702 
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Seq ID NO: C217 Protein Sequence 
Protein Accession #: NP 005805.1 



„ 1 11 21 31 41 51 

30 | | | | | | 

MVGKMWPVLW TLCAVRVTVD AISVETPQDV LRASQGKSVT LPCTYHTSTS SREGLIQWDK 60 

LLLTHTERW IWPFSNKNYI HGELYKNRVS ISNNAEQSDA SITIDQLTMA DNGTYECSVS 120 

LMSDLBGNTK SRVRLLVLVP PSKPECGIEG ETIIGNNIQL TCQSKEGSPT PQYSWKRYNI 180 

LNQEQPLAQP ASGQPVSLKN ISTDTSGYYI CTSSKEEGTQ FCNITVAVRS PSMNVALYVG 240 

35 IAVGWAALI IIGIIIYCCC CRGKDDNTED KEDARPNREA YEEPPEQLRE LSREREEEDD 300 

YRQEEQRSTG RESPDHLDQ 319 



Seq ID NO: C218 Protein Sequence 
Protein Accession ft: Eos sequence 



1 11 21 31 41 51 

I I I I I I 

MGSRTPESPL HAVQLRWGPR RRPPLLPLLL LLLPPPPRVG GFNLDAEAPA VLSGPPGSFF 60 

GPSVEFYRPG TDGVSVLVGA PKANTSQPGV LQGGAVYLCP WGASPTQCTP IEFDSKGSRL 120 

LESSLSSSEG EEPVEYKSLQ WPGATVRAHG SSILACAPLY SWRTEKEPLS DPVGTCYLST 160 

DNPTRILEYA PCRSDPSWAA GQGYCQGGFS AEFTKTGRW LGGPGSYFHQ GQILSATQEQ 240 

IAESYYPEYL INLVQGQLQT RQASSIYDDS YLGYSVAVGE FSGDDTEDFV AGVPKGNLTY 300 

GYVTILNGSD IRSLYNFSGB QMASYFGYAV AATDVNGDGL DDLLVGAPLL MDRTPDGRPQ 360 

CA EVGRVYVYLQ HPAGIEPTPT LTLTGHDEFG RFGSSLTPLG DLDQDGYNDV AIGAPFGGET 420 

50 QQGWFVFPG GPGGLGSKPS QVLQPLWAAS HTPDFFGSAL RGGRDLDGNG YPDLIVGSFG 480 

VDKAWYRGR PIVSASASLT IFPAMFNPEE RSCSLEGNPV ACINLSFCLN ASGKHVADSI 540 

GFTVELQLDW QRQKGGVRRA LFLASRQATL TQTLLIQNGA REDCREMKIY LRNESEFRDK 600 

LSPIHIALNF SLDPQAPVDS HGLRPALHYQ SKSRIEDKAQ ILLDCGEDNI CVPDLQLEVF 660 

_ _ GEQNHVYLGD KNALNLTFHA QNVGEGGAYE AELRVTAPPE AEYSGLVRHP GNFSSLSCDY 720 

55 FAVNQSRLLV CDLGNPMKAG ASLWGGLRFT VPHLRDTKKT IQFDFQILSK NLNNSQSDW 780 

SFRLSVEAQA QVTLKGVSKP EAVLFPVSDW HPRDQPQKEE DLGPAVHHVY ELINQGPSSI 840 

SQGVLELSCP QALEGQQLLY VTRVTGLNCT TNHPINPKGL ELDPEGSLHH QQKREAPSRS 900 

SASSGPQIliK CPEAECFRLR CELGPLHQQE SQSLQLHFRV WAKTFLQREH QPFSLQCEAV 960 

YKALKMPYRI LPRQLPQKER QVATAVQWTK AEGSYGVPLW IIILAILFGL LLLGLLIYIL 1020 

YKLGPPKRSL PYGTAMEKAQ LKPPATSDA 1049 



Seq ID NO: C219 Protein Sequence 
Protein Accession ft: NP_002412.1 

1 11 21 31 41 51 

I I I I I I 

MHSFPPLLLL LFWGWSHSF PATLETQEQD VDLVQKYLEK YYNLKNDGRQ VEKRRNSGPV 60 

VEKLKQMQEF FGLKVTGKPD AETLKVMKQP RCGVPDVAQP VLTEGNPRWE QTHLTYRIEN 120 

YTPDLPRADV DHAIEKAFQL WSNVTPLTPT KVSEGQADIM ISFVRGDHRD NSPFDGPGGN 180 

IiAHAFQPGPG IGGDAHFDED ERWTNNFRBY NLHRVAAHEL GHSLGLSHST DIGALMYPSY 240 

TFSGDVQLAQ DDIDGIQAIY GRSQNPVQPI GPQTPKACDS KLTFDAITTI RGEVMFFKDR 300 

FYMRTNPFYP EVELNPISVF WPQLPNGLEA AYEFADRDEV RPFKGNKYWA VQGQNVLHGY 360 

PKDIYSSFGF PRTVKHIDAA LSEENTGKTY FFVANKYWRY DEYKRSMDPG YPKMIAHDFP 420 

GIGHKVDAVF MKDGFFYFFH GTRQYKFDPK TKRILTLQKA NSWFNCRKN 469 

Seq ID NO: C220 Protein Sequence 
Protein Accession ft: Eos sequence 



„ 1 11 21 31 41 51 

80 | | | | | | 

MHSFPPLLLL LFWGWSHSF PATLETQEQD VDLVQKYLEK YYNLKNDGRQ VEKRRNSGPV 60 

VEKLKQMQEF FGLKVTGKPD AETLKVMKQP RCGVPDVAQF VLTEGNPRWE QTHLTYRIEN 120 

YTPDLPRADV DHAIEKAFQL WSNVTPLTPT KVSEGQADIM ISFVRGDHRD NSPFDGPGGN 180 

LAHAFQPGPG IGGDAHFDED ERWTNNFRBY NLHRVAAHAL GHSLGLSHST DIGALMYPSY 240 
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TFSGDVQLAQ DDIDQIQAIY GRSQNPVQPI GPQTPKACDS KLTFDAITTI RGEVMPFKDR 300 

FYMRTNPFYP EVELNPISVF WPQLPNGLBA AYEFADRDEV RFFKGNKYWA VQGQNVLHGY 360 

PKDIYSSFGF PRTVKHIDAA LSEENTGKTY FFVANKYWRY DEYKRSMDPG YPKMIAHDFP 420 

^ GIGHKVDAVF MKDGFFYFFH GTRQYKFDPK TKRILTLQKA NSWFNCRKN 469 

Seq ID NO: C221 Protein Sequence 
Protein Accession ft: NP_055146.1 

1 11 21 31 41 51 

10 | | | | | | 

MVRKPWSTI SKGGYLQGNV NGRLPSLGNK EPPGQEKVQL KRKVTLLRGV SIIIGTIIGA 60 

GIFISPKGVL QNTGSVGMSL TIWTVCGVLS LFGALSYAEL GTTIKKSGGH YTYILEVFGP 120 

LPAFVRVWVE LLIIRPAATA VISLAFGRYI fcEPFFIQCEI PELAIKLITA VGITWMVLN 1B0 

SMSVSWSARI QIFLTFCKLT AIIiIIIVPGV MQLIKGQTQN FKDAFSGRDS SITRLPLAFY 240 

15 YGMYAYAGWF YLNFVTEEVE NPEKTIPLAI CISMAITIGV YVLTNVAYFT TINAEELLLS 300 

NAVAVTFSER LLGNFSLAVP IFVALSCFGS MKGGVFAVSR LFYVASREGH LPEILSMIHV 360 

RKHTPLPAVI VLHPLTMIML FSGDLDSLLN FLSFARWLFI GLAVAGLIYL RYKCPDMHRP 420 

FKVPLFIPAL FSFTCLFMVA LSLYSDPFST GIGFVITLTG VPAYYLFIIW DKKPRWFR1M 480 

SEKITRTLQI ILEWPEEDK L 501 
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Seq ID NO: C222 Protein Sequence 
Protein Accession ft: NP 003237.1 



1 11 21 31 41 51 

25 | | | | | | 

MGLAWGLGVL FLMHVCGTNR IPESGGDNSV FDIFELTGAA RKGSGRRLVK GPDPSSPAFR 60 

IEDANLIPPV PDDKFQDLVD AVRAEKG FLL LASLRQMKKT RGTLLALERK DHSGQVFSW 120 

SNGKAGTLDIi SLTVQGKQHV VSVEEALLAT GQWKSITLFV QEDRAQLYID CEKMENAELD 180 

. VPIQSVFTRD LASIARLRIA KGGVNDNFQG VLQNVRFVFG TTPEDILRNK GCSSSTSVLL 240 

30 TLDNNWNGS SPAIRTNYIG HKTKDLQAIC GISCDELSSM VLBLRGLRTI VTTLQDSIRK 300 

VTEENKELAN ELRRPPLCYH NGVQYRNNEE WTVDSCTECH CQNSVTICKK VSCPIMPCSN 360 

ATVPDGECCP RCWPSDSADD GWSPWSEWTS CSTSCGNGIQ QRGRSCDSLN NRCEGSSVQT 420 

RTCHIQECDK RFKQDGGWSH WSPWSSCSVT CGDGVITRIR LCNSPSPQMN GKPCEGEARE 480 

TKACKKDACP INGGWGPWSP WDICSVTCGG GVQKRSRLCN NPAPQFGGKD CVGDVTENQI 540 

35 CNKQDCPIDG CLSNPCFAGV KCTSYPDGSW KCGACPPGYS GNGIQCTDVD ECKEVPDACF 600 

NHNGEHRCEN TDPGYNCLPC PPRFTGSQPF GQGVEHATAN KQVCKPRNPC TDGTHDCNKN 660 

AKCNYLGHYS DPMYRCECKP GYAGNGIICG EDTDLDGWPN ENLVCVANAT YHCKKDNCPN 720 

LPNSGQEDYD KDGIGDACDD DDDNDKIFDD RDNCPFHYNP AQYDYDRDDV GDRCDNCPYN 780 

HNPDQADTDN NGEGDACAAD IDGDGILNER DNCQYVYNVD QRDTDMDGVG DQCDNCPLEH 840 

40 NPDQLDSDSD RIGDTCDNNQ DIDEDGHQNN LDNCPYVPNA NQADKDKDGK GDACDHDDDN 900 

DGIPDDKDNC RLVPNPDQKD SDGDGRGDAC KDDFDHDSVP DIDDICPENV DISETDFRRF 960 

QMIPLDPKGT SQNDPNWWR HQGKEIiVQTV NCDPGLAVGY DEFNAVDFSG TFFINTERDD 1020 

DYAGFVFGYQ SSSRFYWMW KQVTQSYWDT NPTRAQGYSG LSVKWNSTT GPGEHLRNAL 1080 

WHTGNTPGQV RTLWHDPRHI GWKDFTAYRW RLSHRPKTGF IRWMYEGKK IMADSGPIYD 1140 

45 KTYAGGRLGL FVFSQEMVFP SDLKYECRDP 1170 



Seq ID NO: C223 Protein Sequence 
Protein Accession ft: NP_002183.1 

1 11 21 31 41 51 

I I I i I I 

MPLLWLRGFL LASCWIIVRS SPTPGSEGHS AAPDCPSCAL AALPKDVPNS QPEMVEAVKK 60 

HILNMLHLKK RPDVTQPVPK AALLNAIRKL HVGKVGENGY VEIEDDIGRR AEMNELMEQT 120 

SEIITFAESG TARKTLHFEI SKEGSDLSW ERAEVWLFLK VPKANRTRTK VTIRLFQQQK 180 

HPQGSLDTGE EAEEVGLKGE RSELLLSEKV VDARK9TWHV FPVSSSIQRL LDQGKSSLDV 240 

RIACEQCQES GASLVLLGKK KKKEEEGEGK KKGGGEGGAG ADEEKEQSHR PFLMLQARQS 300 

EDHPHRRRRR GLECDGKVNI CCKKQFFVSF KDIGWNDWII APSGYHANYC EGECPSHIAG 360 

TSGSSLSFHS TVINHYRMRG HSPFANLKSC CVPTKLRPMS MLYYDDGQNI IKKDIQNMIV 420 

426 



Seq ID NO: C224 Protein Sequence 
Protein Accession #t NP 000086.1 



1 11 21 31 41 51 

65 | | } | | 1 

. MVPDTACVLL LTLAALGASG QGQSPLGSDL GPQMLRELQE TNAALQDVRD WLRQQVREIT 60 

FLKNTVMECD ACGMQQSVRT GLPSVRPLLH CAPGFCFPGV ACIQTESGGR CGPCPAGFTG 120 

NGSHCTDVNE CNAHPCFPRV RCINTSPGFR CEACPPGYSG PTHOGVGLAF AKANKQVCTD 180 

INECETGQHN CVPNSVCINT RGSFQCGPCQ PGFVGDQASG CQRGAQRFCP DGSPSECHEH 240 

70 ADCVLERDGS RSCVCRVGWA GNGILCGRDT DLDGFPDBKL RCPEPQCRKD NCVTVPNSGQ 300 

BDVDRDGIGD ACDFDADGDG VPfoEKDNCPL VRNPDQRNTD EDKWGDACDN CRSQKNDDQK 360 

DTDQDGRGDA CDDDIDGDRI RNQADNCPRV PNSDQKDSDG DGIGDACDNC PQKSNPDQAD 420 

VDHDFVGDAC DSDQDQDGDG HQDSRDNCPT VPNSAQEDSD HDGQGDACDD DDDNDGVPDS 480 

_ _ RDNCRLVPNP GQEDADRDGV GDVCQDDFDA DKWDKIDVC PENAEVTLTD FRAFQTWLD 540 

75 PEGDAQIDPN WWLNQGREI VQTMNSDPGL AVGYTAFNGV DFEGTFHVNT VTDDDYAGFI 600 

FGYQDSSSFY WMWKQMEQT YWQANPFRAV AEPGIQLKAV KSSTGPGEQL RNALWHTGDT 660 

ESQVRLLWKD PRNVGWKDKK SYRWFLQHRP QVGYIRVRFY EGPELVADSN WLDTTMRGG 720 

RLGVFCFSQE NIIWANLRYR CNDTIPEDYE THQLRQA 757 

80 Seq ID NO: C225 Protein Sequence 
Protein Accession ft: NP_612464 

1 11 21 31 41 51 

I I 1 I I I 
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MRPQGPAASP QRLRGLLLLL LLQLPAPSSA SBIPKGKQKA QLRQRBWDL YNGMCLQGPA 60 

GVPGRDGSPG ANQIPGTPGI PGROGFKGEK GBCLRBSFEE SWTPNYKQCS WSSLNYGIDL 120 

GKIAECTFTK MRSNSALRVL FSGSLRLKCR NACCQRWYFT FNGAECSGPL PIEAIIYLDQ 180 

GSPEMNSTIN IHRTSSVEGL CEGIGAGLVD VAIWVGTCSD YPKGDASTGW NSVSRIIIEE 240 

LPK 243 

Seq ID NO: C226 Protein Sequence 
Protein Accession ft: NP_0 03216.1 

1 11 21 31 41 51 

1 I 1 I I I 

MATMENKVIC ALVLVSMLAL GTLAEAQTET CTVAPRERQN CGFPGVTPSQ CANKGCCFDD 60 

TVRGVPWCFY PNTIDVPPEE ECEF 84 

Seq ID NO: C227 Protein Sequence 
Protein Accession #: NP_056234.1 

1 11 21 31 41 51 

I I I I I I 

MPKRAHWGAL SWLILLWGH PRVALACPHP CACYVPSEVH CTFRSLASVP AGIARHVERI 
NLGFNSIQAIj SETSFAGLTK LELLMIHGNE IPSIPDGALR DLSSLQVFKF SYNKLRVITG 
QTLQGLSNLM RLHIDHNKIE FIHPQAFNGL TSLRItLHLEG NLLHQLHPST FSTFTFLDYF 
RLSTIRHLYL AENNVRTLPA SMLRNMPLLE NLYLQGNPWT CDCEMRWFLE WDAKSRGILK 
CKKDKAYEGG QLCAMCFSPK KLYKHEIHKL KDMTCLKPSI ESPLRQNRSR SIEEEQEQEE 
DGGSQLILEK FQLPQWSISL NMTDEHGNMV NLVCDIKKPM DVYKIHLNQT DPPDIDINAT 
VALDFECPMT RENYEKLWKI* IAYYSEVPVK LHRELMLSKD PRVSYQYRQD ADEEALYYTG 
VRAQILAEPE WVMQPSIDIQ LNRRQSTAKK VLLSYYTQYS QTISTKDTRQ ARGRSWVMIE 
PSGAVQRDQT VLEGGPCQLS CNVKASESPS IFWVLPDGSI LKAPMDDPDS KFSILSSGWL 
RIKSMEPSDS GLYQCIAQVR DEMDRMVYRV LVQSPSTQPA EKDTVTIGKN PGESVTLPCN 
AIiAIPEAHLS WILPNRRIIN DLANTSHVYM LPNGTLSIPK VQVSDSGYYR CVAVNQQGAD 
HFTVGITVTK KGSGLPSKRG RRPGAKALSR VREDIVEDEG GSGMGDEENT SRRLLHPKDQ 
EVFLKTKDDA INGDKKAKKG RRKLKLWKHS EKEPETNVAE GRRVFESRRR INMANKQINP 
ERWADILAKV RGKNLPKGTE VPPLIKTTSP PSI»SLEVTPP FPAVSPPSAS PVGTVTSAEE 
SSADVPLLGE EEHVLGTISS ASMGLEHNHN GVILVEPEVT STPLEEWDD LSEKTEEITS 
TEGDLKGTAA PTLISEPYEP SPTLHTLDTV YEKPTHEETA TEGWSAADVG SSPEPTSSEY 
EPPLDAVSLA ESEPMQYFDP DItETKSQPDB DKMKEDTFAH LTPTPTIWVN DSSTSQLFED 
STIGEPGVPG QSHLQGLTDN IHLVKSSLST QDTLLIKKGM KEMSQTLQGG NMLEGDPTHS 
RSSESEGQES KSITLPDSTL GIMSSMSPVK KPAETTVGTL LDKDTTTVTT TPRQKVAPSS 
TMSTHPSRRR PNGRRRLRPN KFRHRHKQTP PTTFAPSBTF STQPTQAPDI KISSQVESSL 
VPTAWVDNTV NTPKQLEMEK NAEPTSKGTP RRKHGKRPNK HRYTPSTVSS RASGSKPSPS 
PENKHRNIVT PSSETILLPR TVSLKTEGPY DSLDYMTTTR KIYSSYPKVQ ETLPVTYKPT 
SDGKEIKDDV ATNVDKHKSD ILVTGESITN AIPTSRSLVS TMGEFKEESS PVGFPGTPTW 
NPSRTAQPGR LQTDIPVTTS GENLTDPPLL KELEDVDFTS EFLSSLTVST PFHQEBAGSS 
TTLSSIKVEV ASSQAETTTL DQDHLETTVA ILLSETRPQN HTPTAARMKE PASSSPSTIL 
MSLGQTTTTK PALPSPRISQ ASRDSKENVF LNYVGNPETE ATPVNNEGTQ HMSGPNELST 
PSSDRDAFNI* STKLELEKQV FGSRSLPRGP DSQRQDGRVH ASHQLTRVPA KPILPTATVR 
LPEMSTQSAS RYFVTSQSPR HWTNKPEITT YPSGAIiPENK QFTTPRLSST TIPLPLHMSK 
PSIPSKFTDR RTDQFNGYSK VFGNNNIPEA RNPVGKPPSP RIPHYSNGRL PFFTNKTLSF 
PQLGVTRRPQ IPTSPAPVMR ERKVIPGSYN RIHSHSTFHL DFGPPAPPLL HTPQTTGSPS 
TNLQNIPMVS STQSSISFIT SSVQSSGSFH QSSSKFFAGG PPASKFWSL3 EKPQIIjTKSP 
QTVSVTAETD TVFPCEATGK PKPFVTWTKV STGALMTPNT RIQRFEVLKN GTLVIRKVQV 
QDRGQYMCTA SNLHGLDRMV VLLSVTVQQP QILASHYQDV TVYIjGDTIAM ECLAKGTPAP 
QISWIFPDRR VWQTVSPVES RITLHENRTL SIKEASFSDR GVYKCVASNA AGADSLAIRI* 
HVAALPPVIH QEKLENISLP PGLSIHIHCT AKAAPLPSVR WVLGDGTQIR PSQFLHGNLF 
VFPNGTLYIR NLAPKDSGRY ECVAANLVGS ARRTVQLNVQ RAAANARITG TSPRRTDVRY 
GGTLKLDCSA SGDPWPRILW RLPSKRMIDA LFSFDSRIKV FANGTLWKS VTDKDAGDYL 
CVARNKVGDD YWLKVDWM KPAKIEHKBE NDHKVFYGGD LKVDCVATGL PNPEISWSLP 
DGSLVNSFMQ SDDSGGRTKR YWFNNGTLY FNEVGMREBG DYTCFAENQV GKDEMRVRVK 
WTAPATIRN KTYLAVQVPY GDWTVACEA KGEPMPKVTW LSPTNKVIPT SSEKYQIYQD 
GTLLIQKAQR SDSGNYTCLV RNSAGEDRKT VWIHVNVQPP KINGNPNPIT TVREIAAGGS 
RKLIDCKAEG IPTPRVLHAF PEGWLPAPY YGNRITVHGN GSLDIRSLRK SDSVQLVCMA 
RNEGGEARLI VQLTVLEPME KPIFHDPISE KITAMAGHTI SLNCSAAGTP TPSLVWVLPN 
GTDLQSGQQL QRFYBKADGM LHISGLSSVD AGAYRCVARN AAGHTERLVS LKVGLKPEAN 
KQYHNLVSII NGETLKLPCT PPGAGQGRFS WTLPNGMHLE GPQTLGRVSL LDNGTLTVRE 
ASVFDRGTYV CRMETEYGP9 VTSIPVIVIA YPPRITSEPT PVIYTRPGNT VKLNCMAMGI 
PKADITWELP DKSHLKAGVQ ARLYGNRFLH PQGSLTIQHA TQRDAGFYKC MAKNILGSDS 
KTTYIHVF 

Seq ID NO: C228 Protein Sequence 
Protein Accession ft: Eos sequence 



MPGTKLTRTG 
SRQYTVRYRE 
TPESAPTTAP 
FQNTFFHTPR 
SLCQQECSCT 
MCYEDP7VSS 
PQGRNAKDLL 
SDTQDQKRTJj 
SASPABHAST 
PSSVLRDRSS 



11 
I 

APADYRVILK 
KGELARWDYK 
ENLNVWPVNG 
LSNHLEQSPS 
QKDPSCLAYL 
LTGNSLKSVA 
LDLKNKILAN 
RPPSRHGHSV 
QGTSHRPSLP 
VHPGAKPASP 
GEDAPATNSN 



21 
I 

TSQEDELDVP 
QIANRRVLIB 
KPTWAASWD 
PILETLLLPW 
IDIQTKQVNK 
ASKADVQQNT 
GGAPRKPQLR 
VAPGRTAVRA 
ASLNDNDLVD 
ARRTPHSGAA 



31 
I 

DDISVRVMSS 
NLIPDTVYEF 
ALPETEGKVK 
WMVCSLGNAI 
DPQLEGSVFG 
EDNGKPBKPE 
AKKAEELDLQ 
RMPALPRREG 
SDEDERAVGS 



41 
I 

QSVLVSWVDP 
AVRISQGBRD 
VCLLDTGLPS 
FSKSGPQTGB 
PCFLFYFLTF 
PSSPSPRAPA 
STEITGEEEL 
VDKPGFSIAT 
LHPKGAFAQP 
SRLSPPHGGS 
VSEGAEASDG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2828 



51 
I 

VLEKQKKWA 
GKWSTSVFQR 
VSSFQPSAKS 
AWDLTPKPSL 
MLDIGGFSFI 
SSQHPSVPAS 
GSREDSPMSP 
QPRPGAPPSA 
RPALSPSRQS 
SRLLPTQPHI* 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 
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GGRQAEATAQ TLRARPASGH FHLLRHKPFA ANGRSPSRPS IGRGPRLQPS SSPQSTVPSR 720 

AHPRVPSHSD SHPKLSSGIH GDEEDEKPLP ATWNDHVPS SSRQPISRGW EDLRRSPQRG 780 

ASLHRKEPIP ENPKSTGADT HPQGKYSSLA SKAQDVQQST DADTEGHSPK AQPGSTDRHA 840 

SPARPPAARS QQHPSVPRRM TPGRAPEQQP PPPVATSQHH PGPQSRDAGR SPSQPRLSI/T 900 

5 QAGRPRPTSQ GRSHSSSDPY TASSRGMLPT ALQNQDBDAQ GSYDDDSTEV EAQDVRAPAH 960 

AARAKEAAAS LPKHQQVESP TGAGAGGDHR SQRGHAASPA RPSRPGGPQS RARVPSRAAP 1020 

GKSEPPSKRP LSSKSQQSVS AEDEEEEDAG PFKGGKEDLL SSSVPKWPSS STPRGGKDAD 1080 

GSLAKEEREP AIALAPRGGS LAPVKRPLPP PPGSSPRASH VPSRPPPRSA ATVSPVAGTH 1140 

PWPRYTTRAP PGHFSTTPML SLRQRMMHAR FRNPLSRQPA RPSYRQGYNG RPNVEGKVLP 1200 

10 GSNGKPNGQR IINGPQGTKW WDLDRGLVL NAEGRYLQDS HGNPLRIKLG GDGRTIVDLE 1260 

GTPWSPDGL PLFGQGRHGT PLANAQDKPI LSLGGKPLVG LEVIKKTTHP PT TTMQ PTTT 1320 

TTPIiPTTTTP RPTTATTMQP TTTTTPLPTT TPRPTTATTR RTTTRRPTTT VRTTTRTTTT 1380 

TTPKPTTPIP TCPPGTLERH DDDGNLIMSS NGIPECYAEE DEFSGLETDT AVPTEEAYVI 1440 

YDEDYEFETS RPPTTTEPST TATTPRVIPE EGAISSFPEE EFDLAGRKRF VAPYVTYLNK 1S00 

15 DPSAPCSLTD ALDHFQVDSL DEIIPNDLKK SDLPPQHAPR NITWAVEGC HSFVIVDWDK 1560 

ATPGDLVTGY LVYSASYEDF IRNKFSTQAS SVTHLPIENL KPNTRYYFKV QAQNPHGYGP 1620 

ISPSVSFVTE SDNPLLWRP PGGELSGSHS LSNMIPATRT AMDGNM 1666 

Seq ID NO: C229 Protein Sequence 
20 Protein Accession #: NPJ)0300S.l 

1 11 21 31 41 SI 

I I I I I I 

MFLSILVALC LWLH1ALGVR GAPCEAVRIP MCRHMPWNIT RMPNHLHHST QENAIIAIEQ 60 

25 YEELVDVNCS AVLRFFFCAM YAPICTLEFL HDPIKPCKSV CQRARDDCEP LMKMYNHSWP 120 

ESLACDELPV YDRGVCISPE AIVTDLPEDV KW1DITPDMM VQERPLDVDC KRLSPDRCKC 180 

KKVKPTLATY LSKNYSYVIH AKIKAVQRSG CNEVTTWDV KEIFKSSSPI PRTQVPLITN 240 

SSCQCPHIIiP HQDVLIMCYE WRSRMMLLEN CLVEKWRDQL SKRSIQWEER LQEQRRTVQD 300 

KKKTAGRTSR SNPPKPKGKP PAPKPASPKK NIKTRSAQKR TNPKRV 346 



30 



60 



Seq ID NO: C230 Protein Sequence 
Protein Accession #: NP_005931.l 



35 1 11 21 31 41 51 

I I I I I I 

MAPAAWLRSA AARALLPPML LLLLQPPPLL ARALPPDVHH LHAERRGPQP WHAALPSSPA 60 

PAPATQEAPR PASSLRPPRC GVPDPSDGLS ARNRQKRFVL SGGRWEKTDL TYRILRFPWQ 120 

LVQEQVRQTM AEALKVWSDV TPLTFTEVHE GRADIMIDFA RYWHGDDLPF DGPGGILAHA 180 

40 FFPKTHREGD VHFDYDETWT IGDDOGTDLL QVAAHEFGHV LGLQHTTAAK ALMSAFYTFR 240 

YPLSLSPDDC RGVQHLYGQP WPTVTSRTPA LGPQAGIDTN EIAPLHPDAP PDACEASFDA 300 

VSTIRGELFF FKAGFVWRLR GGQLQPGYPA LASRHWQGLP SPVDAAFEDA QGHIWFFQGA 360 

QYWVYDGEKP VLGPAPLTEL. GLVRFPVHAA LVWGPEKNKI YFFRGRDYVfR PHPSTRRVDS 420 

PVPRRATDWR GVPSEIDAAF QDADGYAYFL RGRLYWKFDP VKVKALEGFP RLVGPDFFGC 480 

45 AEPANTFL 488 

Seq ID NO: C231 Protein Sequence 
Protein Accession #: NP_076927 

50 1 11 21 31 41 51 

I I I I I I 

MGENDPPAVE APFSFRSLFG LDDLKISPVA PDADAVAAQI LSLLPLKFFP IIVIGIIALI 60 

LALAIGLGIH FDCSGKYRCR SSFKCIELIA RCDGVSDCKD GEDEYRCVRV GGQNAVLQVF 120 

TAASWKTMCS DDWKGHYANV ACAQLGFPSY VSSDNLRVSS LEGQFREEFV SIDHLLPDDK 180 

55 VTALHHSVYV REGCASGHW TLQCTACGHR RGYSSRIVGG NMSLLSQWPW QASLQFQGYH 240 

LCGGSVITPL WIITAAHCVY DLYLPKSWTI QVGLVSLLDN PAPSHLVEKI VYHSKYKPKR 300 

LGNDIALMKL AGPLTFNEMI QPVCLPNSEE NFPDGKVCWT SGWGATEDGG DASFVLNHAA 360 

VPLISNKICN HRDVYGGIIS PSMLCAGYLT GGVDSCQGDS GGPLVCQERR LWKLVGATSF 420 

GIGCAEVNKP GVYTRVTSFL DWIHEQMERD LKT . 453 



Seq ID NO: C232 Protein Sequence 
Protein Accession ft: NP_003211 



1 11 21 31 41 51 

65 | | | | | | 

MLWKLTDNIK YEDCEDRHDG TSNGTARLPQ LGTVGQSPYT SAPPLSHTPN ADFQPPYFPP 60 

PYQPIYPQSQ DPYSHVNDPY SLNPLHAQPQ PQHPGWPGQR QSQESGLLHT HRGLPHQLSG 120 

LDPRRDYRRH EDLLHGPHAL SSGLGDLSIH SLPHAIEEVP HVEDPGINIP DQTVIKKGPV 180 

SLSKSNSNAV SAIPINKDNL FGGWNPNEV FCSVPGRLSL LSSTSKYKVT VAEVQRRLSP 240 

70 PECLNASLLG GVLRRAKSKN GGRSLREKLD KIGLNLPAGR RKAANVTLLT SLVEGEAVHL 300 

ARDFGYVCET EFPAKAVAEF LNRQHSDPNE QVTRKNMLLA TKQICKEFTD LLAQDRSPLG 360 

NSRPNPILEP GIQSCLTHFN LISHGFGSPA VCAAVTADQN YLTEALKAMD KMYLSNNPNS 420 

HTDNNAKSSD KEEKHRK 437 

75 Seq ID NO: C233 Protein Sequence 
Protein Accession #: NP_002979-1 

1 11 21 31 41 51 

on I I I I II 

OU MKGLAAALLV LVCTMALCSC AQVGTNKEIiC CLVYTSWQIP QKFIVDYSET SPQCPKPGVI 60 
LLTKRGRQIC ADPNKKWVQK YISDLKUIA 89 

Seq ID NO: C234 Protein Sequence 
Protein Accession {J: NP_0040S4.1 

1339 
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1 11 21 31 41 51 

I I I I I I 

MILQAHLHSL CLLMLYLATG YGQEGKFSGP LKPMTFSIYE GQEPSQIIPQ FKANPPAVTP 60 

ELTGBTDNIF VIEREGLIiYY NRALDRETRS THNLQVAALD ANGIIVEGPV PITIEVKDIN 120 

DNRPTFLQSK YEGSVRQNSR PGKPFLYVNA TDLDDPATPN GQLYYQIVIQ LPMINNVMYF 180 

QINNKTGAIS LTREGSQELN PAKNPSYNLV ISVKDMGGQS ENSFSDTTSV DIIVTENIWK 240 

APKPVEMVEN STDPHPIKIT QVRWNDPGAQ YSLVDKEKLP RFPFS1DQEG DIYVTQPLDR 300 

EEKDAYVFYA VAKDEYGKPL SYPLEIHVKV KDINDNPPTC PSPVTVFEVQ ENERLGNSIG 360 

TLTAHDRDEE NTANSFLNYR IVEQTPKLPM DGLFLIQTYA GMLQLAKQSL KKQDTPQYNL 420 

TIEVSDKDFK TLCFVQINVI DINDQIPIFE KSDYGNLTLA EDTNIGSTIL TIQATDADEP 480 

FTGSSKILYH IIKGDSEGRL GVDTDPHTNT GYVIIKKPLD FETAAVSNIV FKAENPEPLV 540 

FGVKYNASSF AKFTLIVTDV NEAPQFSQHV FQAKVSEDVA IGTKVGNVTA KDPEGLDISY 600 

SLRGDTRGWL KIDHVTGEIF SVAPLDREAG SPYRVQWAT EVGGSSLSSV SEFHLILMDV 660 

NDNPPRLAKD YTGLFFCHPL SAPGSLIFEA TDDDQHLFRG PHFTFSLGSG SLQNDWEVSK 720 

INGTHARLST RHTEFEEREY WLIRINDGG RPPLEGIVSL PVTFCSCVEG SCFRPAGHQT 780 

GIPTVGMAVG ILLTTLLVIG IILAWFIRI KKDKGKDNVE SAQASEVKPL RS 832 

Seq ID NO: C235 Protein Sequence 
Protein Accession #: np_004434.1 

1 11 21 31 41 51 

I I I I I I 

MARARPPPPP SPPPGLLPLL PPLLLLPLUi LPAGCRALEE TLMDTKWVTS ELAWTSHPES 60 

GWEEVSGYDB AMNPIRTYQV CNVRESSQNN WLRTGFIWRR DVQRVYVELK FTVRDCNS1P 120 

NIPGSCKETP NLFYYEADSD VASASSPFWM ENPYVKVDTI APDESFSRLD AGRVNTKVRS 180 

FGPLSKAGFY LAFQDQGACM SLISVRAFYK KCASTTAGFA LFPETLTGAE PTSLVIAPGT 240 

CIPNAVEVSV PLKLYCNGDG EWMVPVGACT CATGHEPAAK ESQCRPCPPG SYKAKQGEGP 300 

CLPCPPNSRT TSPAASICTC HNNFYRADSD SADSACTTVP SPPRGVISNV NET SL I LEWS 360 

EPRDLGGRDD LLYNVICKKC HGAGGASACS RCDDNVEFVP RQLGLTERRV HISHLLAHTR 420 

YTFEVGAVNG VSGKSPLPPR YAAVNITTNQ AAPSEVPTLR LHSSSGSSLT LSWAPPERPN 480 

GVILDYEMKY FEKSEGIAST VTSQMNSVQL DGLRPDARYV VQVRARTVAG YGQYSRPAEF 540 

ETTSERGSGA QQLQEQLPLI VGSATAGLVF WAWVIAIV CLRKQRHGSD SEYTEKLQQY 600 

IAPGMKVYID PFTYEDPNEA VREFAKEIDV SCVKIEEVIG AGEFGEVCRG RLKQPGRREV 660 

PVAIKTLKVG YTERQRRDFL SEASIMGQFD HPNIIRLEGV VTKSRPVMIL TEFMENCALD 720 

SFLRLNDGQF TVIQLVGMLR GIAAGMKYLS EMNYVHRDLA ARNILVNSNL VCKVSDFGLS 780 

RFLEDDPSDP TYTSSLGGKI PIRWTAPEAI AYRKFTSASD VWSYGIVMHE VMSYGERPYW 840 

DMSNQDVINA VEQDYRLPPP MDCPTALHQL MLDCWVRDRN LRPKFSQIVN TLDKLIRNAA 900 

SLKVIASAQS GMSQPLLDRT VPDYTTFTTV GDWLDAIKMG RYKESFVSAG FASFDLVAQM 960 

TAEDLLRIGV TLAGHQKKIL SSIQDMRLQM NQTLPVQV 998 

Seq ID NO: C236 Protein Sequence 
Protein Accession #: NP_001795.1 

1 11 21 31 41 51 

III!)! 

MYVGYVLDKD SPVYPGPARP ASLGLGPANY GPPAPPPAPP QYPDFSSYSH VEPAPAPPTA 60 

WGAPFPAPKD DWAAAYGPGP AAPAASPASL AFGPPPDFSP VPAPPGPGPG LLAQPLGGPG 120 

TPSSPGAQRP TPYEWMRRSV AAGGGGGSGK TRTKDKYRW YTDHQRLELE KEFHYSRYIT 180 

IRRKSBLAAN LGLTERQVKI WFQNRRAKER KVNKKKQQQQ QPPQPPMAHD ITATPAGPSL 240 

GGLCPSNTSL LATSSPMPVK EEFLP 265 

Seq ID NO: C237 Protein Sequence 
Protein Accession #: NP_068813.1 

1 11 21 31 41 51 

I I I I I I 

MGSDRARKGG GGPKDFGAGL KYNSRHEKVN GLEEGVEPLP VNNVKKVEKH GPGRWWLAA 60 

VLIGIiliVLL GIGFLVWHLQ YRDVRVQKVF NGYMRITNEN FVDAYENSNS TEFVSIASKV 120 

KDALKLLYSG VPFLGPYHKE SAVTAFSEGS VIAYYWSEFS IPQHLVEEAE RVMAEERWM 180 

LPPRARSLKS FWTSWAFP TDSKTVQRTQ DNSCSFGLHA RGVELMRFTT PGFPDSPYPA 240 

HARCQWALRG DADSVLSLTF RSFDLASCDE RGSDLVTVYN TLSPMEPHAL VQLCGTYPPS 300 

YNLTFHSSQN VLLITLITNT ERRHPGFEAT FFQLPRMSSC GGRLRKAQGT FNSPYYPGHY 360 

PPNIDCTWNI EVPNNQHVKV RFKFFYLLEP GVPAGTCPKD YVEINGEKYC GERSQFWTS 420 

NSNKITVRFH SDQSYTDTGF LAEYLSYDSS DPCPGQFTCR TGRCIRKELR CDGWADCTDH 480 

. SDELNCSCDA GHQFTCKNKF CKPLFWVCDS VNDCGDNSDE QGCSCPAQTF RCSNGKCLSK 540 

SQQCNGKDDC GDGSDEASCP KVNWTCTKH TYRCLNGLCL SKGNPECDGK EDCSDGSDEK 600 

DCDCGLRSFT RQARWGGTD ADEGEWPWQV SLHALGQGHI OGASLISPNW LVSAAHCYID 660 

DRGFRYSDPT QWTAFlJGLHD QSQRSAPGVQ ERRLKRIISH PFFNDFTFDY DIALLELEKP 720 
AEYSSMVRPI CLPDASHVFP AGKAIWVTGW GHTQYGGTGA LILQKGEIRV INQTTCENLL 780 
PQQITPRMMC VGFLSGGVDS CQGDSGGPLS SVEADGRIFQ AGWSWGDGC AQRNKPGVYT 840 
RLPLFRDWIK ENTGV 855 

Seq ID NO: C238 Protein Sequence 
Protein Accession #; Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MPPFLLLEAV CVFLFSRVPP SLPLQEVHVS KETIGKISAA SKMMWCSAAV DIMFLLDGSN 60 
SVGKGSFERS KHFAITVCDG LDISPERVRV GAFQFSSTPH LEFPLDSFST QQEVRARIKR 120 
MVFKGGRTET ELALKYLLHR GLPGGRNASV PQILIIVTDG KSQGDVALPS KQLKERGVTV 180 
FAVGVRFPRW EELHAIASEP RGQHVLLAEQ VEDATNGLFS TLSSSAICSS ATPDCRVEAH 240 
PCEHRTLEMV REFAGNAPCW RGSRRTLAVL AAHCPFYSWK RVFLTHPATC YRTTCPGPCD 300 
SQPCQNGGTC VPEGLDGYQC LCPLAFGGEA NCALKLSLEC RVDLLFLLDS SAGTTLDGFL 360 
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RAKVFVKRFV RAVLSEDSRA RVGVATYSRE LLVAVPVGEY QDVPDLVWSL DGIPFRGGPT 420 

LTGSALRQAA ERGFGSATRT GQDRPRRWV LLTESHSEDE VAGPARHARA RELLLLGVGS 480 

EAVRABLEBI TGSPKHVMVY SDPQDLFNQI PELQGKLCSR" QRPGCRTQAL DLVFMLDTSA 540 

SVGPENPAQM QSFVRSCALQ PEVNPDVTQV GLWYGSQVQ TAFGLDTKPT RAAMLRAISQ 600 

5 APYLGGVGSA GTALLHIYDK VMTVQRGARP GVPKAVWLT GGRGAEDAAV PAQKLRNNGI 660 

SVLWGVGPV LSEGLRRLAG PRDSLIHVAA YADLRYHQDV LIEWLCGEAK RPVNLCKPSP 720 

CMNEGSCVLQ NGSYRCKCRD GWEGPHCENR FLRRP 755 

Seq ID NO: C239 Protein Sequence 
10 Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MPPFLLLEAV CVFLPSRVPP SLPLQBVHVS" KETIGKISAA SKMMWCSAAV DIMFLLDGSN 60 

15 SVGKGSFERS KHFAITVCDG LDISPERVRV GAFQFSSTPH LEFPLDSFST QQEVKARIKR 120 

MVFKGGRTET ELALKYLLHR GLPGGRNASV PQILIIVTDG KSQGDVALPS KQLKERGVTV 180 

FAVGVRFPRW EELHALASEP RGQHVLLAEQ VEDATNGLFS TLSSSAICSS ATPDCRVEAH 240 

PCEHRTLEMV REFAGNAPCW RGSRRTLAVL AAHCPFYSWK RVFLTHPATC YRTTCPGPCD 300 

SQPCQNGGTC VPEGLDGYQC LCPLAFGGEA NCALKLSLEC RVDLLFLLDS SAGTTLDGFL 360 

20 RAKVFVKRFV RAVLSEDSRA RVGVATYSRE LLVAVPVGEY QDVPDLVWSL DGIPFRGGPT 420 

LTGSALRQAA ERGFGSATRT GQDRPRRWV LLTESHSEDE VAGPARHARA RELLLLGVGS 480 

EAVRAELEEI TGSPKHVMVY SDPQDLFNQI PELQGKLCSR QRPGCRTQAL DLVFMLDTSA 540 

SVGPENFAQM QSFVRSCALQ PEVNPDVTQV GLWYGSQVQ TAFGLDTKPT RAAMLRAISQ 600 

APYLGGVGSA GTALLHIYDK VMTVQRGARP GVPKAVWLT GGRGAEDAAV PAQKLRNNGI 660 

25 SVLWGVGPV LSEGLRRLAG PRDSLIHVAA YADLRYHQDV LIEWLCGEAK QPVNLCKPSP 720 

CMNEGSCVLQ NGSYRCKCRD GWEGPHCENR EWSSCSVCVS QGWILETPLR HMAPVQEGSS 780 

RTPPSNYREG LGTEMVPTFW NVCAPGP 807 

Seq ID NO: C240 Protein Sequence 
30 Protein Accession &i XPJ>97386.1 

1 11 21 31 41 51 

I I I I I I' 

MPKSEPLGCL SPASRAPGSA AATGAWLPAA SGGPGPLGPP CTCPPRSLGR GRAGSRAGSS 60 

PSGCVCVSGI LRWSVGDPA SRRWVDLDSN SEDLSLLLTP MIVGTGGVGG GWARGWVPAQ 120 

EKEVAEGSGH AGRGNGRRLQ RVYGARSWIL GRKPCLQRLL PASGGPVQPQ PCPSPATACR 180 

WGFKFGVAFW GAAQHPPLCR LGGGRAVPSA TRTLDGF 217 

Seq ID NO: C241 Protein Sequence 
Protein Accession #: CAC03433 



35 
40 
45 
50 



1 11 21 31 41 51 

I I I I I I 

MLSSTDFTFA SWELWRVDH PNEEQQKDVT LRVSGDLHVG GVMLKLVEQI NISQDWSDFA 60 

LWWEQKHCWL LKTHWTLDKY GVQADAKLLF TPQHKMLRLR LPNLKMVRLR VSFSAWFKA 120 

VSDICKILNI RRSEELSLLK PSGDYFKKKK KKDKNNKEPI IEDILNLESS PTASGSSVSP 180 

GLYSKTMTPI YDPINGTPAS STMTWFSDSP LTEQNCSILA FSQPPQSPEA LADMYQPRSL 240 

VDKAKLNAGW LDSSRSLMEQ GIQEDEQLLL RFKYYSFFDL NPKYDAVRIN QLYEQARWAX 300 

LLEEIDCTEB EMLIPAALQY HISKLSLSAE TQDFAGESEV DEIEAALSNL EVTLEGGKAD 360 

SLLEDITDIP KLADNLKLFR PKKLLPKAFK QYWFIFKDTS IAYFKNKELE QGEPLEKLNL 420 

RGCEWPDVN VAGRKFGIKL LIPVADGMNE MYLRCDHENQ YAQWMAACML AS KGKTMAD S 480 

SYQPEVLNIL SFLRMKNRNS ASQVASSLEN MDMNPECFVS PRCAKKHKSK QLAARILEAH 540 

QNVAQMPLVE AKLRFIQAWQ SLPEFGLTYY LVRFKGSKKD DILGVSYNRL IKIDAATGIP 600 

VTTWRFTNIK QWNVNWETRQ WIEFDQNVF TAFTCLSADC KIVHEYIGGY IFLSTRSKDQ 660 

55 NETLDEDLFH KLTGGQD 677 

Seq ID NO: C242 DNA Sequence 
Nucleic Acid Accession #i NM_005170 
60 Coding sequence: 337.. 918 

1 11 21 31 41 51 

I I I I I I 

GGGCGTGAGA AAGGCGACGG CGGCGGCGCG GAGGAGGGTT ATCTATACAT TTAAAAACCA 60 

65 GCCGCCTGCG CCGCGCCTGC GGAGACCTGG GAGAGTCCGG CCGCACGCGC GGGACACGAG 120 

CGTCCCACGC TCCCTGGCGC GTACGGCCTG CCACCACTAG GCCTCCTATC CCCGGGCTCC 180 

AGACGACCTA GGACGCGTGC CCTGGGGAGT TGCCTGGCGG CGCCGTGCCA GAAGCCCCCT 240 

TGGGGCGCCA CAGTTTTCCC CGTCGCCTCC GGTTCCTCTG CCTGCACCTT CCTGCGGCGC 300 

GCCGGGACCT GGAGCGGGCG GGTGGATGCA GGCGCGATGG ACGGCGGCAC ACTGCCCAGG 360 

70 TCCGCGCCCC CTGCGCCCCC CGTCCCTGTC GGCTGCGCTG CCCGGCGGAG ACCCGCGTCC 420 

CCGGAACTGT TGCGCTGCAG CCGGCGGCGG CGACCGGCCA CCGCAGAGAC CGGAGGCGGC 480 

GCAGCGGCOG TAGCGCGGCG CAATGAGCGC GAGCGCAACC GCGTGAAGCT GGTGAACTTG 540 

GGCTTCCAGG CGCTGCGGCA GCACGTGCCG CACGGCGGCG CCAGCAAGAA GCTGAGCAAG 600 

GTGGAGACGC TGCGCTCAGC CGTGGAGTAC ATCCGCGCGC TGCAGCGCCT GCTGGCCGAG 660 

CACGACGCCG TGCGCAACGC GCTGGCGGGA GGGCTGAGGC CGCAGGCCGT GCGGCCGTCT 720 

GCGCCCCGCG GGCCGCCAGG GACCACCCCG GTCGCCGCCT CGCCCTCCOG CGCTTCTTCG 780 

TCCCCGGGCC GCGGGGGCAG CTCGGAGCCC GGCTCCCCGC GTTCCGCCTA CTCGTCGGAC 840 

GACAGCGGCT GCGAAGGCGC GCTGAGTCCT GCGGAGCGCG AGCTACTCGA CTTCTCCAGC 900 

TGGTTAGGGG GCTACTGAGC GCCCTCGACC TA 932 



75 
80 



Seq ID NO: C243 Protein Sequence 
Protein Accession #: NP_060233.1 



21 31 41 51 
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I 

HSGGHQLQLA 
LNLTLEGVFA 
SLASKARMAG 
KAHVRIELKE 
SQLATRRYQA 
PWLHQHRTCP 
PSRSAVARPP 
LQSTSQHPAA 
NCTDISLQGV 
ETQVSSHVHY 
SNSAAPSGRL 
SEPTPGSRPQ 
NSQPVWLCLT 
QAV 



ALWPWLLMAT 
GVAEITPAEG 
ERGASAVLPD 
PPAWPDYDVW 
SCRQARGEWP 
LCVFNITEGD 
RPGPFLPSQE 
CPVPLRRARP 
HGSSSTFCSS 
HRHRHHHYKK 
SNPQCPRALP 
DATVHPACQI 
PRQPLEPHPP 



LQAGFGRTGL 
KLMQSHPLYL 
ITEDRAAAEQ 
IIHTWGTIP 
DSGSSCSSAP 
SFSQSLGPSR 
PGMGPRHHRF 
PDSSGSGESY 
LSSDFDPLVY 
RFQWHGRKPG 
EPAPGPVDAS 
FPHYTPSVAY 
GEGPSEWSSD 



I 

VLAAAVESER 
CNASDDDNLE 
LQQPLGLTWP 
VIILASVLRI 
VCAICLEEFS 
SYQEPGRRLH 
PRAAHPRAPG 
CTERSGYLAD 
CSPKGDPQRV 
PETGVPQSRP 
SICPSTSSLF 
PWSPEAHPLI 
TAEGRPCPYP 



I 

SAEQKAVIRV 
PGFISIVKLE 
WLIWGNDAE 
RCRPRHSRPD 
EGQELRVISC 
LIRQHPGHAH 
EQQRLAGAQH 
GPASDSSSGP 
DMQPSVTSRP 
PIPRTQPQPE 
NLQKSSLSAR 
CGPPGLDKRL 
HCQVLSAQPG 



I 

IPLKMDPTGK 
SPRRAPRPCL 
KLMEFVYKNQ 
PLQQRTAWAI 
LHEFHRNCVD 
YKLPAAYLLG 
PYAQGWGMSH 
CHGSSSDSW 
RSLDSWPTG 
PPSPDQQVTG 



LPETPGPCYS 



60 

120 
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240 

300 

360 

420 
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720 

780 

783 



Seq ID NO: C244 DNA Sequence 
Nucleic Acid Accession #: NM_0042B9 
Coding sequence i 493.. 1695 



51 



1 11 21 31 41 

I 1 I I I I 

GCCGCCGCCT CGTCCACCGG AGGAGCCGGC GCCAGCGTGG ACGGCGGCAG CCAGGCTGTG 
CAGGGGGGCG GCGGGGACCC CCGAGCGGCT CGGAGTGGCC CCTTGGACGC CGGGGAAGAG 
GAGAAGGCAC CCGCGGAACC GACGGCTCAG GTGCCGGACG CTGGCGGATG TGCGAGCGAG 
GAGAATGGGG TACTAAGAGA AAAGCACGAA GCTGTGGATC ATAGTTCCCA GCATGAGGAA 
AATGAAGAAA GGGTGTCAGC CCAGAAGGAG AACTCACTTC AGCAGAATGA TGATGATGAA 
AACAAAATAG CAGAGAAACC TGACTGGGAG GCAGAAAAGA CCACTGAATC TAGAAATGAG 
AGACATCTGA ATGGGACAGA TACTTCTTTC TCTCTGGAAG ACTTATTCCA GTTGCTTTCA 
TCACAGCCTG AAAATTCACT GGAGGGCATC TCATTGGGAG ATATTCCTCT TCCAGGCAGT 
ATCAGTGATG GCATGAATTC TTCAGCACAT TATCATGTAA ACTTCAGCCA GGCTATAAGT 
CAGGATGTGA ATCTTCATGA GGCCATCTTG CTTTGTCCCA ACAATACATT TAGAAGAGAT 
CCAACAGCAA GGACTTCACA GTCACAAGAA CCATTTCTGC AGTTAAATTC TCATACCACC 
AATCCTGAGC AAACCCTTCC TGGAACTAAT TTGACAGGAT TTCTTTCACC GGTTGACAAT 
CATATGAGGA ATCTAACAAG CCAAGACCTA CTGTATGACC TTGACATAAA TATATTTGAT 
GAGATAAACT TAATGTCATT GGCCACAGAA GACAACTTTG ATCCAATCGA TGTTTCTCAG 
CTTTTTGATG AACCAGATTC TGATTCTGGC CTTTCTTTAG ATTCAAGTCA CAATAATACC 
TCTGTCATCA AGTCTAATTC CTCTCACTCT GTGTGTGATG AAGGTGCTAT AGGTTATTGC 
ACTGACCATG AATCTAGTTC CCATCATGAC TTAGAAGGTG CTGTAGGTGG CTACTACCCA 
GAACCCAGTA AGCTTTGTCA CTTGGATCAA AGTGATTCTG ATTTCCATGG AGATCTTACA 
TTTCAACACG TATTTCATAA CCACACTTAC CACTTACAGC CAACTGCACC AGAATCTACT 
TCTGAACCTT TTCCGTGGCC TGGGAAGTCA CAGAAGATAA GGAGTAGATA CCTTGAAGAC 
ACAGATAGAA ACTTGAGCCG TGATGAACAG CGTGCTAAAG CTTTGCATAT CCCTTTTTCT 
GTAGATGAAA TTGTCGGCAT GCCTGTTGAT TCTTTCAATA GCATGTTAAG TAGATATTAT 
CTGACAGACC TACAAGTCTC ACTTATCCGT GACATCAGAC GAAGAGGGAA AAATAAAGTT 
GCTGCGCAGA ACTGTCGTAA ACGCAAATTG GACATAATTT TGAATTTAGA AGATGATGTA 
TGTAACTTGC AAGCAAAGAA GGAAACTCTT AAGAGAGAGC AAGCACAATG TAACAAAGCT 
ATTAACATAA TGAAACAGAA ACTGCATGAC CTTTATCATG ATATTTTTAG TAGATTAAGA 
GATGACCAAG GTAGGCCAGT CAATCCCAAC CACTATGCTC TCCAGTGTAC CCATGATGGA 
AGTATCTTGA TAGTACCCAA AGAACTGGTG GCCTCAGGCC ACAAAAAGGA AACCCAAAAG 
GGAAAGAGAA AGTGAGAAGA AACTGAAGAT GGACTCTATT ATGTGAAGTA GTAATGTTCA 
GAAACTGATT ATTTGGATCA GAAACCATTG AAACTGCTTC AAGAATTGTA TCTTTAAGTA 
CTGCTACTTG AATAACTCAG TTAACGCTGT TTTGAAGCTT ACATGGACAA ATGTTTAGGA 
CTTCAAGATC ACACTTGTGG GCAATCTGGG GGAGCCACAA CTTTTCATGA AGTGCATTGT 
ATACAAAATT CATAGTTATG TCCAAAGAAT AGGTTAACAT GAAAACCCAG TAAGACTTTC 
CATCTTGGCA GCCATCCTTT TTAAGAGTAA GTTGGTTACT TCAAAAAGAG CAAACACTGG 
GGATCAAATT ATTTTAAGAG GTATTTCAGT TTTAAATGCA AAATAGCCTT ATTTTCATTT 
AGTTTGTTAG CACTATAGTG AGCTTTTCAA ACACTATTTT AATCTTTATA TTTAACTTAT 
AAATTTTGCT TTCT 

Seq ID NO: C245 Protein Sequence 
Protein Accession #: NPJJ04433 



1 
I 

MALRRLGAAL 
VCNVFESSQN 
DSATKTFPNW 
YGGCMSIilAV 
CNGDGEWLVP 
TNCVCRNGYY 
ICKSCGSGRG 
SPQFASVNIT 
YNATAIKSPT 
LPLIIGSSAA 
FTYEDPNEAV 
TEKQRRDPLS 
VIQLVGMLRG 
YTSALGGKIP 
EQDYRLPPPM 
INIiPLLDRTI 
LAGHQKKILN 



11 

I 

LLLPLLAAVE 
NWLRTKFIRR 
MENPWVKVDT 
RVFYRKCPRI 
IGRCMCKAGF 
RADLDPLDMP 
ACTRCGDNVQ 
TNQAAPSAVS 
NTVTVQGLKA 
GLVFLIAVW 
RETAKE ID IS 
EASIMGQFDH 
IAAGMKYLAD 
IRWTAPEAIQ 
DCPSALHQLM 
PDYTSFNTVD 
SIQVMRAQMN 



21 
I 

ETLMDSTTAT 
RGAHRIHVEM 
IAADESFSQV 
IQNGAIPQET 
EAVENGTVCR 
CTTIPSAPQA 
YAPRQLGIiTE 
IMHQVSRTVD 
GAIYVFQVRA 
IAIVCNRRRG 
CVKIEQVIGA 
PNVIHLEGW 
KNYVHRDLAA 
YRKFTSASDV 
LDCWQKDRNH 
EWLEAIKMGQ 
QIQSVEV 



31 
I 

AELGWMVHPP 
KFSVRDCSSI 
DXiGGRVMKIN 
LSGAESTSLV 
GCPSGTFKAN 
VISSVNETSL 
PRIYISDLLA 
SITLSWSQPD 
RTVAGYGRYS 
FERADSEYTD 
GEFGEVCSGH 
TKSTPVMIIT 
RNILVNSNLV 
WSYGIVMWEV 
RPKFGQIVNT 
YKESFANAGF 



41 
I 

SGWEEVSGYD 
PSVPGSCKET 
TEVRSFGPVS 
AARGSCIANA 
QGDBACTRCP 
MLEWTPPRDS 
HTQYTFEIQA 
QPNGVILDYE 
GKMYFQTMTE 
KLQHYTSGHM 
LKLPGKREIF 
EFMENGSLDS 
CKVSDFGLSR 
MSYGERPYWD 
LDKMIRNPNS 
TSFDWSQMM 



51 
I 

ENMNTIRTYQ 
FNLYYYEADF 
RSGFYLAFQD 
EEVDVPIKLY 
INSRTTSEGA 
GGREDLVYNI 
VNGVTDQSPF 
LQYYEKELSE 
AEYQTSIQEK 
TPGMKIYIDP 
VAIKTIiKSGY 
FLRQNDGQFT 
FLEDDTSDPT 
MTNQDVINAI 
LKAMAPLSSG 
MEDILRVGLT 
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Seq ID NO: C246 Protein Sequence 
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Protein Accession fh NP_114148.1 

1 11 21 31 41 51 

-I 1 I I I I 

J MDARRVPQKD LRVKKNLKKF RYVKLISMBT SSSSDDSCDS FASDNFANTR UJSVREGCRT SO 

RSQCRHSGPL RVAMKFPARS TRGATNKKAE SRQPSENSVT DSNSDSEDES GMNFLEKRAL 120 

NIXQNKAMLA KLMSELESFP GSFRGRHPLP 6SDSQSRRPR RRTFPGVASR RNPERRARPL 180 

TRSRSRILGS LDALPMEEEE EEDKYMLVRK RKTVDGYMNE DDLPRSRRSR SSVTLPHIIR 240 

PVEEITEGGV GERLQQFSKR RYITVHWALL VINAVRRLLI PKQTAETQTA GAFEASSVAP 300 

10 AFETVMVKRS GMLCWIRTGI ARLVEESATA VSAGSEMDGV RLGSLCI 347 

Seq ID NO: C247 Protein Sequence 
Protein Accession ft: NP_036 577.1 

15 1 11 21 31 41 51 

I I I I I I 

MENPSPAAAL GKALCALLLA TLGAAGQPLG GESICSARAP AKYSITFTGK WSQTAFPKQY 60 

PLFRPPAQWS SliLGAAHSSD YSMWRKNQYV SNGLRDFAER GEAWALMKEI EAAGEALQSV 120 

HAVFSAPAVP SGTGOTSAEL EVQRRHSLVS FWRIVPSPD WFVGVDSLDL CDGDRWREQA 180 

20 ALDLYPYDAG TDSGFTFSSP NFATIPQDTV TEITSSSPSH PANSFYYPRL KALPPIARVT 240 

LVRLRQSPRA FIPPAPVLPS RDNEIVDSAS VPETPLDCEV SLWSSWGLCG GHCGRLGTKS 300 

RTRYVRVQPA NNGSPCPELE EEAECVPDNC V 331 

Seq ID NO: C248 Protein Sequence 
25 Protein Accession #: NP_063 947.1 

1 11 21 31 41 51 

1 I I I I I 

MLQDPDSDQP LNSLDVKPLR KPRIPMETFR KVGIPIIIAL LSLASIIIW VLIKVILDKY 60 

30 YFLCGQPLHF IPRKQLCDGE LDCPLGEDEE HCVKSPPEGP AVAVRLSKDR STLQVLDSAT 120 

GNWFSACFDN FTEALAETAC RQMGYSSKPT FRAVEIGPDQ DLDWEITEN SQELRMRNSS 180 

GPCLSGSLVS LHCLACGKSIi KTPRWGGEE ASVDSWPWQV SIQYDKQHVC GGSILDPHWV 240 

LTAAHCFRKH TDVFNWKVRA GSDKLGSFPS LAVAKIIIIE FNPMYPKDND IALMKLQFPL 300 

TFSGTVRPIC LPFFDEELTP ATPLWIIGWG FTKQNGGKMS DILLQASVQV IDSTRCNADD 360 

35 AYQGEVTEKM MCAGIPEGGV DTCQGDSGGP LMYQSDQWHV VGIVSWGYGC GGPSTPGVYT 420 

KVSAYLNWIY NVWKAEL 437 



40 



65 



Seq ID NO: C249 Protein Sequence 
Protein Accession #: NP_003036.1 



1 11 21 31 41 51 

1 I I I I i 

MGCKVLLNIG QQMLRRKWD CSREETRLSR CLNTFDLVAL GVGSTLGAGV YVLAGAVARE 60 

NAGPAIVISF LIAALASVLA GLCYGEFGAR VPKTGSAYLY SYVTVGELWA FITGWNLILS 120 

45 YIIGTSSVAR AWSATFDELI GRPIGEFSRT HMTLNAPGVL AENPDIFAVI I ILILTGLLT 180 

LGVKESAMVN KIFTCINVLV LGPIMVSGFV KGSVKNWQIiT EEDFGNTSGR LCLNNDTKEG 240 

KPGVGGFMPF GFSGVLSGAA TCFYAFVGFD CIATTGEEVK NPQKAIFVGI VASLLICFIA 300 

YFGVSAALTL MMPYFCLDNN SPLPDAFKHV GHEGAKYAVA VGSLCALSAS LLGSMFPMPR 360 

VIYAMAEDGL LFKFLANVND RTKTPIIATL ASGAVAAVMA FLFDLKDLVD LMSIGTLIAY 420 

50 SLVAACVLVL RYQPEQPNLV YQMASTSDEL DPADQNELAS TNDSQLGFLP EAEMFSLKTI 480 

LSPKNMEPSK ISGLIVNIST SLIAVLIITF CIVTVLGREA LTKGALWAVF LLAGSALLCA 540 

WTGVIWRQP ESKTKLSFKV PFLPVLPILS IFVNVYLMMQ LDQGTWVRFA VWMIilGFIIY 600 

FGYGLWHSEE ASLDADQART PDGNLDQCK 629 

55 Seq ID NO: C250 Protein Sequence 
Protein Accession #: NP_0 02 767.1 

1 11 21 31 41 51 

£ A I ! I II I 

OU MRAPHLHLSA ASGARAIAKL LPLLMAQLWA AEAALLPQND TRLDPEAYGA PCARGSQPWQ 60 

VSLFNGLSFH CAGVLVDQSW VLTAAHOGNK PLWARVGDDH LLLLQGEQLR RTTRSWHPK 120 

YHQGSGPILP RRTDEHDLML LKLARPWPG PRVRALQLPY RCAQPGDQCQ VAGWGTTAAR 180 

RVKYNKGLTC SSITILSPKE CEVFYPGWT NNMICAGLDR GQDPCQSDSG GPLVCDETLQ 240 

GILSWGVYPC GSAQHPAVYT QICKYMSWIN KVIRSN 276 



Seq ID NO: C251 Protein Sequence 
Protein Accession #: XP_095088.3 



„^ 1 11 21 31 41 51 

70 | | | | | 1 

MTRAATAEPG RVSPASPARS TAGLPRAFLQ SLRTLLDILD DWQRGCVHLR EIQSLWVEAR 60 

ELPSGVLEGL SQRRGPQPGA AVRSRRGGAV PRGARAVPER CAGTETRRGR RCSGLQRLGG 120 

GFRGCPADPC ARGEHRRHTI TSGVDCGLLK QMKELEQEKE VLUQGLEMMA QGRDWYQQQL 180 

QQVQERQCRL GQSRASADFG AVGSPRPLGR LLPKVQEVAR WLGELLAEAC AGRALPTSSS 240 

75 GPPCSALTST SSPGWQQQII LMLKEQNRLL TQEVTEKSER ITQLEQKSAL IKQLFEARAL 300 

SQQDGGLSPA GPHIEPLTRF RLPVLTWAGA LLSPHSPQLL LPLSADSGGP LHELPDTWFP 360 

AVLLWVPSPG KRTAHARLHF HQRPAEGAWQ LGCGAEAAPE TCGTLPHFES HKTTCEPDSL 420 

GGPCPQEGDR SWSHLGAAFD VAPAVAKVTP NREDAAGSRH GDICPLCPKG LLTFRDIAIE 480 

OA FSLAEWQCLD HAQQNLYRDV KLENYRNLFS LGMTVSKPDL IACLEQNKEP QNIKRNEMAA 540 

80 KHPVTCSHFN QDLQPEQSIK DSLQKVIPRT YGKCGHENLQ LKKCCKRVDE CEVHKGGYND 600 

LNQCLSNTQN KIFQTHKCVK VFSKFSNSNR HNARYTGKKH LKCKKYGKSF CMFSHLNQHQ 660 

IIHTKEKSYK CEECGKSPNH SSSGTTHKRI LTGEKPYRCB ECGKAFRWPS NLTRHKRIHT 720 

GEKPYACEEC GQAFRRSSTL TNHKRIHTGE RPYKCEECGK AFSVSSALIY HKRIHTGEKP 780 

YTCEECGKAF NCSSTLKTHK IIHTGEKPYT CEECGRTFNC SSTVKAHKRI HTGEKPYKCE 840 

1343 
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ECDKAFKVJHS SLAKHKIIHT GEKPYKCSDS KALAKSSBVQ KVYSGDGENG IRVHKKKETQ 900 

GWLVRNKNEN RTGLFQIRAA VRPNRDPSWG QQEGSLTDPI QRKEEPDLQN HYDHQNALED 960 

QRNTGVGGLL TFRDWIBFS LEEWQCLDHA QQNLYRDVML ENYRNLVSLG IAVSKPDLIT 1020 

CLEQNKEPWN IKRNEMVTKH PDLPPELGIK DSLQKVIPRR YGKSGHDNIiQ VKTCKSMGEC 1080 

5 EVQKGGCNEV NQCLSTTQNK IFQTHKCVKV FGKFSNSNRH KTRHTGKKHP KCKKYGKSFC 1140 

MVSQLHQHQI IHTRENSYQC EECGKPFKCS STLSKHKRIH TGEKPYRCEE CGKAFTWSST 1200 

LTKHRRIRTG EKPYTCEECG QAFSRSSTLA NHKRIHTGEK PYTCEECGKA FSLSSSLTYH 1260 

KRIHTGEKPY TCEECGKAFN CSSTLKKHKI IHTGBKPYKC KECGKAFAFS STLNTHKRIH 1320 

TGEEPYKCEE CDKAFKWSSS IiANHKSMHTG EKPYKCE 13S7 



10 



45 



75 



Seq ID NO: C252 Protein Sequence 
Protein Accession fl: NP_U4433.1 



1 11 21 31 41 51 

15 | | | | I I 

MASRSMRLLL LLSCLAKTGV LGDIIMRPSC APGWFYHKSN CYGYFRKLRN WSDAELECQS 60 

YGNGAHIiASI LSLKEASTIA EYISGYQRSQ PIWIGLHDPQ KRQQWQWrDG AMYLYRSWSG 120 

KSMGGNKHCA EMSSNNNFLT WSSNECNKRQ HFLCKYRP 158 

20 Seq ID NO: C253 Protein Sequence 
Protein Accession #: XP_051860.2 

1 11 21 31 41 51 

25 MDGVNLSTEV VYKKGQDYRF ACYDRGRACR SYRVRFLCGK PVRPKLTVTI DTNVNSTILN 60 

LEDNVQSWKP GDTLVIASTD YSMYQAEEFQ VLPCRSCAPN QVKVAGKPMY LHIGEEIDGV 120 

DMRAEVGLLS RNI IVMQEME DKCYPYRNHI CNFFDFDTFG GHIKFALGFK AAHLEGTELK 180 

HMGQQLVGQY PIHFHLAGDV DERGGYDPPT YIRDLSIHHT FSRCVTVHGS NGLLIKDWG 240 

YNSLGHCFFT EDGPEERNTF DHCLGLLVKS GTLLPSDRDS KMCKMITGDS YPGYIPKPRQ 300 

30 DCNAVSTFWK ANPNNNLINC AAAGSEETGF WFIFHHVPTG PSVGMYSPGY SEHIPLGKFY 360 

NNRAHSNYRA GMIIDNGVKT TEASAKDKRP FLSIISARYS PHQDADPLKP REPAI IRHFI 420 

AYKNQDHGAW LRGGDVWLDS CRFADNGIGL TLASGGTFPY DDGSKQEIKN SLFVGESGNV 480 

GTEMMDNRIW GPGGLDHSGR TLPIGQNFPI RGIQLYDGPI NIQNCTFRKF VALEGRHTSA 540 

LAFRLNNAWQ SCPHNNVTGI AFEDVPITSR VFFGEPGPWF NQLDMDGDKT SVFHDVDGSV 600 

35 SEYPGSYLTK NDNWLVRHPD CINVPDWRGA ICSGCYAQMY IQAYKTSNLR MKIIKNDFPS 660 

HPLYIiEGALT RSTHYQQYQP WTLQKGYTI HWDQTAPAEL AIWLINFNKG DWIRVGLCYP 720 

RGTTFSILSD VHNRLLKQTS KTGVFVRTLQ MDKVEQSYPG RSHYYWDEDS GLLFLKLKAQ 780 

NEREKFAFCS MKGCERIKIK ALIPKNAGVS DCTATAYPKF TERAWDVPM PKKLFGSQLK 840 

TKDHFLEVKM ESSKQHFFHL WNDFAYIEVD GKKYPSSEDG IQVWIDGNQ GRWSHTSFR 900 

40 NSILQGIPWQ LFNYVATIPD NSIVLMASKG RYVSRGPWTR VLEKLGADRG LKLKEQMAFV 960 

GFKGSFRPIW VTLDTEDHKA KIFQWPIPV VKKKKL 996 



Seq ID NO: C254 Protein Sequence 
Protein Accession #: NP_055188.1 



1 11 21 31 41 51 

1 I I I I 1 

MTALSSENCS FQYQLRQTNQ PLDVNYLLFL IILGKILLNI LTLGMRRKNT OQNFMEYFCI 60 

SLAFVDLLLL VNISIILYFR DFVLLSIRFT KYHICLFTQI ISFTYGFLHY PVFLTACIDY 120 

50 CLNFSKTTKL SFKCQKLFYF FTVILIWISV IaAYVLGDPAI YQSLKAQNAY SRHCPFYVSI 180 

QSYWLSFFMV KILFVAFITC WEEVTTLVQA IRITSYMNET ILYFPFSSHS SYTVRSKKIF 240 

LSKLIVCFLS TWLPFVLIiQV IIVLLKVQIP AYIEMNIPV1L YFVNSFLIAT VYWFNCHKLN 300 

UCDIGLPLDP FVNWKCCFIP LTIPNLEQIE KPISIMIC 338 

55 Seq ID NO: C255 Protein Sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

*A I I I • 1 1 

OU MALVLGSLLL LGLCGNSFSG GQPSSTDAPK AWNYELPATN YETQDSHKAG PIGILFELVH 60 

IFIiYWQPRD FPEDTLRKFIi QKAYESKIDY DKTVYYEAGI ILCCVLGLLF IILMPLVGYF 120 

FCMCRCCNKC GGEMHQRQKE NGPFLRKCFA ISLLVICIII SIGIFYGFVA NHQVRTRIKR 180 

SRKLADSNFK DLRTLLNETP EQIKYILAQY NTTKDKAFTD LNSINSVLGG GILDRLRPNI 240 

IPVLDEIKSM ATAIKETKEA LENMNSTLKS LEQQSTQLSS SLTSVKTSLR SSLNDPLCLV 300 

65 HPSSETCNSI RLSLSQLNSN PELRQLPPVD AELDNVNNVL RTDLDGLVQQ GYQSLNDIPD 360 

RVQRQTTTW AGIKRVLNSI GSDIDNVTQR LPIQDILSAF SVYVNNTESY IHRNLPTLEE 420 

YDSYWWLGGL VICSLLTLIV IFYYLGLLCG VCGYDRHATP TTRGCVSNTG GVFLMVGVGL 480 

SFLFCWILMI IWLTFVFGA NVEKLICEPY TSKELFRVLD TPYLLNEDWE YYLSGKLFNK 540 

SKMKLTFEQV YSDCKKNRGT YGTLHLQNSF NISEHLNINE HTGSISSELE SLKVNLNIFL 600 

70 LGAAGRKNLQ DFAACGIDRM NYDSYLAQTG KSPAGVNIjLS FAYDLEAKAN SLPPGNLRNS 660 

LKRDAQTIKT IHQQRVLPIE QSLSTLYQSV KILQRTGNGL LERVTRILAS LDFAQNFITN 720 

NTS SV 1 1 BET KKYGRTIIGY FEHYLQWIEF SISEKVASCK PVATALDTAV DVFLCSYIID 780 

PLNLFWFGIG KATVFLLPAL IFAVKLAKYY RRMDSEDVYD DVETIPMKNM ENGNNGYHKD 840 

HVYGIHNPVM TSPSQH 856 



Seq ID NO: C256 Protein Sequence 
Protein Accession #: NP_149038.1 



1 11 21 31 41 51 

80 | | | | I I 

MKAI IHLTLL ALLSVNTATN QGNSADAVTT TETATSGPTV AAADTTETNF PETASTTANT 60 

PSFPTATSPA PPIISTHSSS TIPTPAPPII STHSSSTIPI PTAADSESTT NVNSLATSDI 120 

ITASSPNDGL ITMVPSETQS NNEMSPTTED NQSSGPPTGT ALLETSTLNS TGPSNPCQDD 180 

PCADNSLCVK LHNTSFCLCL EGYYYNSSTC KKGKVFPGKI SVTVSETFDP EEKHSMAYQD 240 
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LHSBITSLFK DVFGTSVYGQ TVILTVSTSL SPRSEMRADD KFVNVTIVTI LAETTSDNEK 300 

TVTEKINKAI RSSSSNPLNY DLTLRCDYYG CNQTADDCLN GLACDCKSDL QRPNPQSPFC 360 

VASSLKCPDA CNAQHKQCLI KKSGQAPECA CVPGYQEDAN GNCQKCAFGY SGLDCKDKPQ 420 

LILTIVGTIA GIVILSMIIA LIVTARSNNK TKHIEEENLI DEDFQNLKLR STGFTNLGAE 480 

J GSVFPKVRIT ASRDSQMQNP YSRHSSMPRP DY 512 

Seq ID NO: C257 Protein Sequence 
Protein Accession ft: NP_001423.1 

10 i 11 21 31 41 51 

I I I I I I 

MTAGRRMEML CAGRVPALLL CLGFHLLQAV LSTTVIPSCI PGESSDNCTA LVQTEDNPRV 60 

AQVSITKCSS DMNGYCLHGQ CIYLVDMSQN YCRCEVGYTG VRCEHFFLTV HQPLSKEYVA 120 

LTVILIILFL ITWGSTYYF CRWYRNRKSK EPKKEYERVT SGDPELPQV 169 
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Seq ID NO: C258 Protein Sequence 
Protein Accession ft: AAC63902.1 



-~ 1 11 21 31 41 51 

20 | | 



MDRSKENCIS GPVKATAPVG GPKRVLVTQQ IPCQNPLPVN SGQAQRVLCP SNSSQRVPLQ 60 

AQKLVSSHKP VQNQKQKQLQ ATSVPHPVSR PLNNTQKSKQ PLPSAPENNP EEELASKQKN 120 

EESKkRQWAL EDFEIGRPLG KGKFGNVYLA REKQSKFILA LKVLFKAQLE KAGVEHQLRR 180 

BVEIQSHLRH PNILRLYGYF HDATRVYLIL EYAPLGTVYR ELQKLSKFDE QRTATYITEL 240 

ANALSYCHSK RVIHRDIKPE NLLLGSAGEL KIADFGWSVH APSSRRTTLC GTLDYLPPEM 300 

IEGRMHDEKV DLWSLGVLCY EFLVGKPPFE ANTYQETYKR ISRVEFTFPD FVTEGARDLI 360 

SRLLKHNPSQ RPMLREVLEH PWITANSSKP SNCQNKESAS KQS 403 



Seq ID NO: C259 Protein Sequence 
3U Protein Accession ft: NP_037504.1 



1 11 21 31 41 51 

I I I I I I 

MSRTAYTVGA IiLLLLGTLLP AAEGKKKGSQ GAIPPPDKAQ HNDSEQTQSP QQPGSRNRGR 60 

GQGRGTAMPG EEVLESSQEA LHVTERKYLK RDWCKTQPLK QTIHEEGCNS RTIINRFCYG 120 

QCNSFYIPRH IRKEEGSFQS CSFCKPKKFT TMMVTLNCPE LQPPTKKKRV TRVKQCRCIS 180 

IDLD 184 



Seq ID NO: C260 Protein Sequence 
41) Protein Accession ft: Eos sequence 



1 11 21 31 41 51 

I I I I I I 

MKVGVLWLIS FFTFTDGHGG FLGKNDGIKT KKELIVNKKK HLGPVEBYQL LLQVTYRDSK 60 

EKRDLRNFLK LLKPPLLWSH GLIRIIRAKA TTDCNSLNGV LQCTCEDSYT WFPPSCLDPQ 120 

NCYLHTAGAL PSCECHLNNL SQSVNFCERT KIWGTFKINE RPTNDLLNSS SAIYSKYANG 180 

IEIQLKKAYE RIQGFESVQV TQFRNGSIVA GYEWGSSSA SELLSAIEHV AEKAKTALHK 240 

LFPLEDGSFR VFGKAQCNDI VFGFGSKDDE YTLPCSSGYR GNITAKCESS GWQVIRETCV 300 

LSLLBELNKN FSMIVGNATE AAVSSFVQNL SVIIRQNPST TVGNIASWS ILSNISSLSL 360 

ASHFRVSNST MEDVISIADN ILNSASVTNW TVLLREEKYA SSRLLETLEN ISTLVPPTAL 420 

PLNFSRKFID WKGIPVNKSQ LKRGYSYQIK MCPQNTSIPI RGRVLIGSDQ FQRSLPETII 480 

SMASLTLGNI LPVSKNGNAQ VNGPVISTVI QNYSINEVFL FFSKIESNLS QPHCVPWDFS 540 

HLQWNDAGCH LVNETQDIVT CQCTHLTSFS ILMSPPVPST IFPWKWITY VGLGISIGSL 600 

ILCLIIEALF WKQIKKSQTS HTRRICMVNI ALSLLIADVW FIVGATVDTT VNPSGVCTAA 660 

VFFTHFFYLS LFPWMLMLGI LLAYRIILVF HHMAQHLMMA VGFCLGYGCP LIISVITIAV 720 

TQPSNTYKRK DVCWLNWSNG SKPLLAPWP ALAIVAVNFV WLLVLTKLW RPTVGERLSR 780 

DDKATIIRVG KSLLILTPLL GLTWGFGIGT IVDSQNLAWH VIFALLNAFQ GFFILCFGIL 840 

LDSKLRQLLP NKLSALSSWK QTEKQNSSDL SAKPKPSKPP NPLQNKGHYA FSHTGDSSDN 900 

IMLTQFVSNE 910 

Seq ID NO: C261 Protein Sequence 
Protein Accession ft: NP_000575.1 

1 11 21 31 41 51 

I I 1 I I I 

MTSKLAVALL AAFLISAALC EGAVLPRSAK ELRCQCIKTY SKPFHPKFIK ELRVIESGPH 60 

CANTEIIVKL SDGRELCLDP KENWVQRWE KFLKRAENS 99 



Seq ID NO: C262 Protein Sequence 
/U Protein Accession #: NP 005594.1 



1 11 21 31 41 51 

I I I I I 1 

MSTERDSETT FDEDSQPNDE WPYSDDETE DELDDQGSAV EPEQNRVNRE AEENREPFRK 60 

ECTWQVKAND RKYHEQPHFM NTKFLCIKES KYANNAIKTY KYNAFTFIPM NLFEQFKRAA 120 

NLYFLALLIL QAVPQISTLA WYTTLVPLLV VLGVTAIKDL VDDVARHKMD KEINNRTCEV 180 

IKDGRFKVAK WKEIQVGDVI RLKKNDFVPA DILLLSSSEP NSLCYVETAE LDGETNLKFK 240 

MSLEITDQYL QREDTLATFD GFIBCEEPNN RLDKFTGTLF WRNTSFPLDA DKILLRGCVI 300 

RNTDFCHGLV IFAGADTKIM KNSGKTRFKR TKIDYLMNYM VYTIFWLIL LSAGLAIGHA 360 

YWEAQVGNSS WYLYDGEDDT PSYRGFLIFW GYI1VLNTMV PISLYVSVEV IRLGQSHFIN 420 

WDLQMYYAEK DTPAKARTTT LNEQLGQIHY IFSDKTGTLT QNIMTFKKCC INGQIYGDHR 480 

DASQHNHNKI EOVDFSWNTY ADGKLAFYDH YLIEQIQSGK EPEVRQFFFL LAVCHTVMVD 540 

RTDGQLNYQA ASPDEGALVN AARNFGPAFL ARTQNTITIS ELGTERTYNV LAILDFNSDR 600 

KRMSIIVRTP EGNIKLYCKG ADTVIYERLH RMNPTKQETQ DALDIFANET LRTLCLCYKE 660 
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IBEKEFTEWN KKFMAASVAS TNRDEALDKV YEEIEKDLIL LGATAIEDKL QDGVPETISK 720 

LAKADIKIWV LTGDKKETAE NIGFACELLT EDTTICYGED INSLLHARME NQRNRGGVYA 780 

KFAPPVQESF PPPGGNRALI ITQSWLNEIL LEKKTKRNKI LKLKFPRTEE ERRMRTQSKR 840 

RLEAKKEQRQ KNFVDLACEC SAVICCRVTP KQKAMWDLV KRYKKAITLA IGDGANDVNM 900 

5 IKTAHIGVGI SGQEGMQAVM SSDYSFAQFR YLQRLLLVHG RWSYIRMCKF LRYFFYKNFA 960 

FTLVHFWYSF FNGYSAQTAY EDWFITLYNV LYTSLPVLLM GLLDQDVSDK LSLRFPGLYI 1020 

VGQRDLLFNY KRFFVSLLHG VLTSMILFFI PLGAYLQTVG QDGBAPSDYQ SFAVTIASAL 10 SO 

VITVNFQIGL DTSYWTFVNA FSIFGSIALY FGIMFDFHSA GIHVLFPSAF QFTGTASNAL 1140 

RQPYIWLTII LTVAVCLLPV VAIRFLSMTI WPSESDKIQK HRKRLKAEEQ WQRRQQVFRR 1200 

10 GVSTRRSAYA FSHQRGYADL ISSGRSIRKK RSPLDAIVAD GTAEYRRTGD S 1251 
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Seq ID NO: C263 Protein Sequence 
Protein Accession #: XM_044533 

1 11 21 31 41 51 

I I I I I I 

MLRTAMGLRS WLAAPWGALP PRPPLLLLLL LLLLLQPPPP TWALSPRISL PLGSEERPFL 60 

RFEAEHISNY TALLLSRDGR TLYVGAREAL FALSSNLSFL PGGEYQELLW GADAEKKQQC 120 

SFKGKDPQRD CQNYIKILLP LSGSHLFTCG TAAFSPMCTY INMENFTLAR DEKGNVLLED 180 

GKGRCPFDPN FKSTALWDG ELYTGTVSSF QGNDPAISRS QSLRPTKTES SLNWLQDPAF 240 

VASAYIPESL GSLQGDDDKI YFFFSETGQE FEFFENTIVS RIARICKGDE GGERVLQQRW 300 

TSFLKAQLLC SRPDDGFPFN VLQDVFTLSP SPQDWRDTLF YGVFTSQWHR GTTEGSAVCV 360 

FTMKDVQRVF SGLYKEVNRE TQQWYTVTHP VPTPRPGACI TNSARERKIN SSLQLPDRVL 420 

NFLKDHFLMD GQVRSRMLLL QPQARYQRVA VHRVPGLHHT YDVLFLGTGD GRLHKAVSVG 480 

PRVHIIEELQ IFSSGQPVQN LLLDTHRGLL YAASHSGWQ VPMANCSLYR SCGDCLLARD 540 

PYCAWSGSSC KHVSLYQPQL ATRPWIQDIE GASAKDLCSA SSWSPSFVP TGEKPCEQVQ 600 

FQPNTVNTLA CPLLSNLATR LWLRNGAPVN ASASCHVLPT GDLLLVGTQQ LGEFQCWSLE 660 

EGFQQLVASY CPEWEDGVA DQTDEGGSVP VIISTSRVSA PAGGKASWGA DRSYWKEFLV 720 

MCTLPVLAVL LPVLFLLYRH RNSMKVFLKQ GECASVHPKT CPWLPPETR PLNGLGPPST 780 

PLDHRGYQSL SDSPPGSRVP TESEKRPLSI QDSFVEVSPV CPRPRVRLGS EIRDSW 837 

Seq ID NO: C264 Protein Sequence 
Protein Accession #: NPJJ08950.1 

1 11 21 31 41 SI 

I I 1 I I I 

MASQNRDPAA TSVAAARKGA BPSGGAARGP VGKRLQQELM TLMMSGDKGI SAFPESDNLF 60 

KWVGTIHGAA GTVYEDLRYK LSLEFPSGYP YNAPTVKFTiT PCYHPNVDTQ GNICLDILKE 120 

KWSALYDVRT ILLSIQSLLG EPNIDSPLNT HAAELHKNPT AFKKYLQETY SKQVTSQEP 179 

Seq ID NO: C265 Protein Sequence 
'Protein Accession #: NP_055399.1 



. _ 1 11 21 31 41 51 

45 | | i | | | 

MGRGWGFLFG LLGAVWLLSS GHGEEQPPET AAQRCFCQVS GYLDDCTCDV ETIDRFNNYR 60 

LFPRLQKLLE SDYFRYYKVN LKRPCPFWND ISQCGRRDCA VKPCQSDEVP DGIKSASYKY 120 

SEEANNLIEE CEQAERLGAV DESLSEETQK AVLQWTKHDD SSDNFCEADD IQSPEAEYVD 180 

LLLNPERYTG YKGPDAWKIW NVIYEENCFK PQTIKRPLNP LASGQGTSEB NTFYSWLEGL 240 

CVEKRAFYRL ISGLHASINV HLSARYLLQE THLEKKWGHN ITEFQQRFDG ILTEGEGPRR 300 

LKNLYFLYLI ELRALSKVLP FFERPDFQLF TGNKIQDEEN KMLLLEILHE IKSFPLHFDB 360 

NSFFAGDKKE ARKLKEDFRL HFRNISRIMD CVGCFKCRLW GKLQTQGLGT ALKILFSEKL 420 

IANMPESGPS YEFHLTRQEI VSLFNAFGRI STSVKBLENF RNXjLQNIH 468 



Seq ID NO: C266 Protein Sequence 
Protein Accession #: NP_002879.l 

1 11 21 31 41 51 

I I I I I I 

MQPRRQRLPA PWSGPRGPRP TAPLLALLLL LAPVAAPAGS GGPDDPGQPQ DAGVPRRLLQ 60 
QKARAALHFF NFRSGSPSAL RVLAEVQBGR AWINPKEGCK VHWFSTERY NPESLLQEGE 120 
GRLGKCSARV FFKNQKPRPT INVTCTRLIE KKKRQQEDYL LYKQMKQLKN PLEIVSIPDN 180 
HGHIDPSLRL IWDIiAFLGSS YVMWEMTTQV SKYYLAQLTS VRQWVRKT 228 



65 Seq ID NO i C267 Protein Sequence 
Protein Accession #-. NP 005400.1 



1 11 21 31 41 51 

I ! I I I I 

MSVKGMAIAL AVILCATWQ GPPMFKRGRC LCIGPGVKAV KVADIEKASI MYPSNNCDKI 60 

EVIITLKENK GQRCLNPKSK QARLIIKKVE RKNF 94 

Seq ID NO: C268 Protein Sequence 
Protein Accession #: FGENESH predicted 

1 11 21 31 41 51 

I I I I I I 

MLRQVLRRGL QSFCHRLGLC VSRHPVFFLT VPAVLTITFG LSALNRFQPE GDXjERLVAPS 60 

HSLAKIERSL ASSLFPLDQS KSQLYSDLHT PGRYGRVILL SPTGDNILLQ AEGILQTHRA 120 

VLEMKVNHKG YNYTFSHLCV LRNQDKKCVL DDI I SVLEDL RQAAVSNKTT ARVQVRYPNT 180 

KLKVCSFCML LPIKEAALHF LP 202 

Seq ID NO: C269 Protein Sequence 
Protein Accession #: NPJJ02429.1 
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1 11 21 31 41 51 

I I I I I t 

MRLPLLLVFA SVIPGAVLLL DTRQFLIYNE DHKRCVDAVS PSAVQTAACN QDAESQKFRW 60 

5 VSESQIMSVA FKLCLGVPSK TDWVAITLYA CDSKSEFQKW ECKNDTLLGI KGEDLFFNYG 120 

NRQEKNIMLY KGSGLWSRWK IYGTTDNLCS RGYEAMYTLL GNAKGATCAF PFKFENKWYA 180 

DCTSAGRSDG WLWCGTTTDY DTDKLFGYCP LKFEGSBSLW NKDPLTSVSY QINSKSALTW 240 

HQARKSCQQQ NAELLSITEI HEQTYLTGLT SSLTSGLWIG LNSLSFNSGW QWSDRSPFRY 300 

UIWLPGSPSA EPGKSCVSLN PGKNAKWENL ECVQKLOYIC KKGNTTLNSF VIPSESDVPT 360 

10 HCPSQWWPYA GHCYKIHRDE KKIQRDALTT CRKEGGDI/TS IHTIEELDPI ISQLGYEPND 420 

ELWIGLNDIK IQMYFEWSDG TPVTFTKWLR GEPSHENNRQ EDCWMKGKD GYWADRGCEW 480 

PLGYICKMKS RSQGPEIVEV EKGCRKGWKK HHFYCYMIGH TLSTFAEANQ TCNNENAYLT 540 

TIEDRYEQAF LTSFVGLRPE KYFWTGLSDI QTKGTFQWTI EEEVRFTHWN SDMPGRKPGC 600 

VAMRTGIAGG LWDVLKCDEK AKFVCKHWAE GVTHPPKPTT TPEPKCPEDW GASSRTSLCF 660 

15 KLYAKGKHEK KTWFESRDFC RALGGDLASI NNKEEQQTIW RLITASGSYH KLFWLGLTYG 720 

SPSEGFTWSD GSPVSYENWA YGEPNNYQNV EYCGELKGDP TMSWNDINCE HLNNWICQIQ 780 

KGQTPKPEPT PAPQDNPPVT EDGWVIYKDY QYYFSKEKET MDNARAFCKR NFGDLVSIQS 840 

ESEKKFLWKY VNRNDAQSAY PIGLLISLDK KFAWMDGSKV DYVSWATGEP NFANEDENCV 900 

TMYSNSGFWN DINCGYPNAF 1CQRHNSSIN ATTVMPTMPS VPSGCKEGWN FYSNKCFKIF 960 

20 GFMEEERKNW QEARKACIGF GGNLVSIQNE KEQAFLTYHM KDSTFSAWTG LNDVNSEHTF 1020 

LWTDGRGVHY TNWGKGYPGG RRSSLSYEDA DCWIIGGAS NEAGKWMDDT CDSKRGYICQ 1080 

TRSDPSLTNP PATIQTDGFV KYGKSSYSLM RQKFQWHEAE TYCKLHNSLI AS I LDP YEN A 1140 

FAWLQMETSN ERVWIALNSN LTDNQYTV1TD KWRVRYTNWA ADEPKLKSAC VYLDLDGYWK 1200 

TAHCNESFYF LCKRSDEI PA TEPPQLPGRC PESDHTAWIP FHGHCYYIBS SYTRNWGQAS 1260 

25 LECLRMGSSL VSIESAAESS FLSYRVEPLK SKTNFWIGLF RNVEGTWLWI NNSPVSFVNW 1320 

NTGDPSGERN DCVALHASSG FWSNIHCSSY KGYICKRPKI IDAKPTHELL TTKADTRKMD 1380 

PSKPSSNVAG WIIVILLIL TGAGLAAYFF YKKRRVHliPQ EGAFENTLYF NSQSSPGTSD 1440 

MKDLVGNIEQ MEHSVI 1456 

30 Seq ID NO: C270 Protein Sequence 
Protein Accession 8: Eos sequence 

1 11 21 31 41 51 

~C I I I I I I 

J J MVLLHWCLLW LLFPLSSRTQ KLPTRDEELF QMQIRDKAFF HDSSVIPDGA EISSYLFRDT 60 

PKRYFFWEE DNTPLSVTVT PCDAPLEWKL SLQELPEDRS GEGSGDLEPL EQQKQQIINE 120 

EGTELFSYKG NDVBYFISSS SPSGLYQLDL LSTEKDTHFK VYATTTPESD QPYPELPYDP 180 

RVDVTSLGRT TVTLAWKPSP TASLLKQPIQ YCWINKEHN FKSLCAVEAK LSADDAFMMA 240 

PKPGLDFSPF DFAHFGFPSD NSGKERSFQA KPSPKLGRHV YSRPKVDIQK ICIGNKNIFT 300 

VSDLKPDTQY YFDVFWNIN SNMSTAYVGT FARTKEEAKQ KTVBLKDGKI TDVFVKRKOA 360 

KFLRFAPVSS HQKVTFFIHS CLDAVQIQVR RDGKLLLSQN VEGIQQFQLR GKPKAKYLVR 420 

LKGNKKGASM LKILATTRPT KQSFPSLPED TRIKAFDKLR TCSSATVAWL GTQERNKFCI 480 

YKKEVDDNYN EDQKKREQNQ CLGPDIRKKS EKVLOCYFHS QNLQKAVTTE TIKGLQPGKS 540 

YLLDVYVIGH GGHSVKYQSK WKTRKFC 568 
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Seq ID NO: C271 Protein Sequence 
Protein Accession #: AAH34229.1 



1 11 21 31 41 51 

50 | | | | | | 

MEKVQLEFEN QEMEKKLOEF RSTRNKEKED RESSEYYNKS GKVGKLVNQS YMMSQNKGNV 60 

VKFSAGKVKL KLLKEQIQEP VKPTVNYKMA NS9ECEKPKI NGKVCGQCEN KAALLVCLEC 120 

GEDYCSGCFA NVHQKGALKL HRTTLLQARS QILFNVLDVA KQFIKDVNPD EPKEENNSTK 180 

ETSKIQHKPK SVLLQRSSSE VEITTMKRAQ RTKPRKSLLC EGSFDEEASA QSFQEVLSQW 240 

55 RTGNHDDNKK QNLHAAVKDS LEECEVQTNL KIWREPLNIE LKEDILSYME KLHLKKHRRT 300 

PQEQLFKCYQ IRSHIHMKPL VMHSVLKMKT MKIVMVRRPK YNTQLFYCQ 349 



Seq ID NO: C272 Protein Sequence 
Protein Accession #s NP_078963-1 



1 11 21 31 41 51 

I I I I I I 

MEKLWLKKHR RTPQEQLFKM LSDTFPHPHE TTGDAQCSQN ENDED SDGEE TKVQHTALLL 60 

PVETLNIERP EPSLKIVELD DTYEEEFEEA ENIVPYKVKL ADADSQRSCA FHDCQKNSFP 120 

YENGIHQHHV FDKGKRDFLN LCLRNSSTYY KDNSKGETSN TDFDNIVDPD VYSSDIEKIE 180 

ESTSFERNLK EKNIGLESNQ KSDDSCVSLE SKDTLLGRDL EKAPIEEKLS QDIKESLBLS 240 

NLYKRPSFEE SKTTKSSLLL QEIACRSKPI TKQYQGLERF FIFDTNERLN LLPSHRLECN 300 

NSSTRITLAE DREWIPDHSL SEYADNAIVL GVLQGAQSPS SSRKQQKMGQ KSQRPSTANF 360 

PLSNSVKESS SCLSSSHPRS RSAAAQSSSR AASEISEIEY IDITDQNELS LDDTTDQHTL 420 

70 DNLEKELQVL RSLADTSEKL YSLTSEEFPD FSSQSLNISQ ISTDFLKTSH VRGPCGVEEL 480 

SCSGRDTKIQ SLLSLSESST DEEEEDFLNK QHVITLPWSK ST 522 



Seq ID NO: C273 Protein Sequence 
Protein Accession #: NP_005399.1 

1 11 21 31 41 51 

I I I I I I 

MKVSAVLLCL LLMTAAFNPQ GLAQPDALNV PSTCCFTFSS KKISLQRLKS YVITTSRCPQ 60 

KAVIFRTKLG KEICADPKEK HVQNYMKHLG RKAHTLKT 98 

Seq ID NO: C274 Protein Sequence 
Protein Accession #: BAC05158.1 

1 11 21 31 41 51 
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1 I I I I I 

MFLLTGGVSL KSAEKNPDPT WLQDKSWEEI CRASEFPAFR GLRQHFCEHI YEWREIYDSK 60 

EPHNAKFPAP MDKNLNELQK IIILRCLRPD KITPAITNYV TDKLGKKFVE PPPFDLTKSY 120 

LDSNCTIPL1 FVLSPGADPM ASLLKFANDK SMSGNKFQAI SLGQGQGPIA AKMIKAAIEE 180 

5 GTWVCLQNCH LAVSWMPMLE KICEDFTSET CNSSFRLWLT SYPSSKFPVT ILQNGVKMTN 240 

EPPTGLRLNL LQSYLTDPVS DPEFFKGCRG KELLFINEYD TIPFEAISYL TGECNYGGRV 300 

TDDWDRRLLL TMLADFYNLY IVENPHYKFS PSGNYFAPPK GTYEDYIEFI KKLPFTQHPE 360 

IFGLHENVDI SKDLQQTKTL FESLLLTQGG SKQTGASGST DQILLEITKD ILNKLPSDFD 420 

IEMALRKYPV RYEESMNTVL VQEMERFNNL IITIRNTLRD LEKAIKGWV MDSALEALSS 480 

10 SLLVGKVPEI WAKRSYPSLK PLGSYITDFL ARLNFLQDWY NSGKPCVFWL SGFFFTQAFL 540 

TGAMQNYARK YTTPIDLLGY EFEVIPSDTS DTSPEDGVYI HGLYLDGARW DRESGLLAEQ 600 

YPKLLFDLMP IIWIKPTQKS RIIKSDAYVC PLYKTSERKG TLSTTGHSTN FVIAMLLKTD 660 

QPTRHWIKRG VALLCQLDD 679 

15 Seq ID NO j C275 Protein Sequence 
Protein Accession #i AAA60212.1 
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1 11 21 31 41 51 

I I I I I I 

MAESHLLQWL LLLLPTLCGP GTAAWTTSSL ACAQGPEFWC QSLEQALQCR ALGHCLQEVW 60 

GHVGADDLCQ ECEDIVHILN KMAKEAIFQD TMRKFLEQEC NVLPLKLLMP QCNQVLDDYF 120 

PLVIDYFQNQ TDSNGICMHL GLCKSRQPEP EQEPGMSDPL PKPLRDPLPD PLLDKLVLPV 180 

LPGALQARPG PHTQDLSEQQ PPIPLPYCWL CRALIKRIQA MIPKGALAVA VAQVCRWPL 240 

VAGGICQCLA ERYSVILLDT LLGRMLPQLV CRLVLRCSMD DSAGPRSPTG EWLPRDSECH 300 

LCMSVTTQAG NSSEQAIPQA MLQACVGSWL DREKCKQFVE QHTPQLL.TLV PRGWDAHTTC 360 

QALGVCGTMS SPLQCIHSPD L 381 

Seq ID NO: C276 Protein Sequence 
Protein Accession #: NP_6 3 1911.1 

1 11 21 31 41 51 

I I I I I I 

MLGCGIPALG LLLLLQGSAD GNGIQGFFYP WSCEGDIWDR ESCGGQAAID SPNLCLRLRC 60 

CYRNGVCYHQ RPDENVRRKH MWALVWTCSG LLLLSCSICL FWWAKRRDVL HMPGFLAGPC 120 

DMSKSVSLLS KHRGTKKTPS TGSVPVALSK ESRDVEGGTE GEGTEEGEET EG EEEE D 177 

Seq ID NO: C277 Protein Sequence 
Protein Accession #t NP_473364.1 

1 11 21 31 41 51 

I t I I I I 

MKLVTIFLLV TISLCSYSAT AFLINKVPLP VDKLAPLPLD NILPFMDPLK LLLKTLGISV 60 
EHLVEGLRKC VNELGPEASE AVKKLLEALS HLV 93 



45 Seq ID NO: C278 Protein Sequence 

Protein Accession ft: FGENESH predicted 



1 11 21 31 41 51 

I I 1 I I I 

MPLSYAYKNA ETLAGRRTSS WMSRGAYQRR NTRAAGRPEE CTDRNWHAGR TRGIMLGQLE 60 

ERCSDVFGVS FFHWVRGLAG SGAKLQTFTP AQEGAPTVQR QAEALLKCRQ SGRPGRGGAE 120 

SERARDASML SPLSAAMRNY PTSSTIPPRR" SYSPTEIAHK SYSCSIiPDMK ISMAESGPSL 180 

DSLDILBDGE SGSPFLVTHL YFLGWTTGM EQLDFETGPN IFDLQIYVKD EVGVTDLQVL 240 

TVQVTDVNEP PQFQGNLAED HLRADQPHFN AHSHTYVRW ATALARHRLR SSIGSPFLGT 300 

FCVWGMQYF LISPPKSFRM SANGTLFSTT ELDFEAGHRS FHLIVEVRDS GGLKASTELQ 360 

VNIVNUTDEV PRFTSPTRVY TVLEELSPGT IVANITAEDP DDEGFPSHLL YSITTVSKYF 420 

MINQLTGTIQ VAQRIDRDAG ELRQNPTISL EVLVKDRPYG GQENRIQITF IVEDVNDNPA 480 

TCQKFTFRSS LHPALCSKTL THMDTVLDCF HAADKDIPVT GRFTKERGLI GLTVPHGWGS 540 

LTIMAEGKEE QVTSYMDGSR QRDRACVGKL LLIKPSDLMR LSHYHENNSG KTCPHDSISS 600 

YQVPPTTCRN SRIQATNNED TSSVTVTVNI LEENDEKPIC TPNSYFLAI*P VDLKVGTNIQ 660 

NFKLTCTDLD SSPRSFRYSI GPGNVNNHPT FSPNAGSNVT RLLLTSRFDY AGGFDKIWDY 720 

KLLVYVTDDN LMSDRKKAEA LVETGTVTLS IKVIPHPTTI ITTTPRPRVT YQVLRKNVYS 780 

PSAWYVPFVI TLGSILLLGL LVYIiWLLAK AIHRHCPCKT GKNKEPIiTKK GETKTAKRDV 840 

WETIQMNTI FDGEAIDPEP EQASLBLYAL LPSCCDPSPV TLRKVQVCGE SEETGQCSGH 900 

65 ITLPGKIPVD DPRKQETGLQ GDFEVWTLCP AVKWVGSPQ AERCIRLALS LKKYSSD 957 



Seq ID NO: C279 Protein Sequence 
Protein Accession #: XP 168571.1 



11 21 31 41 51 

I I I I I I 

MINQLTGTIQ VAQRIDRDAG ELRQNPTISL EVLVKDRPYG GQENRIQITF IVEDVNDNPA 60 

TCQKFTFSIM VPERTAKGTL LLDLNKFCFD DDSEAPNNRF NFTMPSGVGS GSRFLQDPAG 120 

SGKIVLIGDL DYENPSNLAA GNKYTVIIQV QDVAPPYYKN NVYVYILTSP ENEFPLIFDR 180 

75 PSYVFDVSER RPAQGHLSGP EEKRLLSICM VRAVCHHFGL HIASGSPRVP GRPIGQSHPQ 240 

TLPLQDWEEQ GTSDKERRNE DCRERRRGGN YPDEHYL 277 



Seq ID NO: C280 Protein Sequence 
Protein Accession #: NP_005257.2 

1 11 21 31 41 51 

I I I I I I 

MGDWSFLGNF LEEVHKHSTV VGKVWLTVLF IFRMLVLGTA AESSWGDEQA DFRCDTIQPG 60 
CQNVCYDQAF PISHIRYWVL QIIFVSTPSL VYMGHAMHTV RMQEKRKLRE AERAKEVRGS 120 
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GSYEYPVAEK AELSCWEEGN GRIALQGTLL NTYVCSILIR TTMBVGPIVG QYFIYGIFLT 180 

TLHVCRRSPC PHPVNCYVSR PTEKNVFIVP MLAVAALSLL LSLAELYHLG WKKIRQRFVK 240 

PRQHMAKCQL SGPSVGIVQS CTPPPDFNQC LENGPGGKFF NPFSNNMASQ QNTDNLVTEQ 300 

VRGQEQTPGE GFIQVRYGQK PEVPNGVSPG HRLPHGYHSD KRRLSKASSK ARSDDLSV 358 

Seq ID NO: C281 Protein Sequence 
Protein Accession #: NP_0S5274.2 

1 11 21 31 41 51 

I 1 I I I I 

MYLSICCCFL LWAPALTLKN LNYSVPEEQG AGTVIGNIGR DARLQPGLPP AERGGGGRSK 60 

SGSYRVLENS APHLLDVDAD SGULYTKQRI DRESLCRHNA KCQLSLEVFA NDKEICMIKV 120 

EIQDINDNAP SFSSDQIEMD ISENAAPGTR FPLTSAHDPD AGENGLRTYL LTRDDHGLFG 180 

LDVKSRGDGT KFPELVIQKA *LDREQQNHHT LVLTALDGGE PPRSATVQIN VKVIDSNDNS 240 

PVFEAPSYLV ELPENAPLGT WIDLNATDA DEGPNGEVLY SFSSYVPDRV RELFSIDPKT 300 

GLIRVKGNLD YEENGMLEID VQARDLGPNP IPAHCKVTVK LIDRNDNAPS IGFVSVRQGA 360 

LSEAAPPGTV IALVRVTDRD SGKNGQLQCR VLGGGGTGGG GGLGGPGGSV PFKLEENYDN 420 

FYTWTDRPL DRETQDEYNV TIVARDGGSP PLNSTKSFAI KILDENDNPP RFTKGLYVLQ 480 

VHENNIPGEY LGSVLAQDPD LGQNGTVSYS ILPSHIGDVS IYTYVSVNPT NGAIYALRSF 540 

NFEQTKAFEF KVLAKDSGAP AHLESNATVR VTVLDVNDNA PVIVLPTLQN DTAELQVPRN 600 

AGLGYLVSTV RALDSDFGES GRLTYEIVDG NDDHLFEIDP SSGEIRTLHP FWEDVTPWE 660 

LWKVTDHGK PTLSAVAKLI IRSVSGSLPE GVPRVNGEQH HWDMSLPLIV TLSTISIILL 720 

AAMITIAVKC KRENKEIRTY NCRIAEYSHP QLGGGKGKKK KINKNDIKLV QSEVEERNAM 780 

KVMNWSSPS LATSPMYFDY QTRLPLSSPR SEVMYLKPAS NNLTVPQGHA GCHTSFTGQG 840 

TNASETPATR MSIIQTDNFP AEPNYMGSRQ QFVQSISVAP RLRTQKEPA 889 

Seq ID NO: C282 Protein Sequence 
Protein Accession #: NP_005592.1 

1 11 21 31 41 51 

I I I I I I 

MELCRSIiALL GGSLGLMFCL IALSTDFWFE AVGPTHSAHS GLWPTGHGDI ISGYIHVTQT 60 
FSIMAVLWAL VSVSFLVLSC FPSLFPPGHG PLVSTTAAFA AAISMWAMA VYTSERWDQP 120 
PHPQIQTFFS WSFYLGWVSA ILLLCTGALS LGAHCGGPRP GYETL 165 

Seq ID NO: C283 Protein Sequence 
Protein Accession ft: NP_006424.2 

1 11 21 31 41 51 

! I I I I I 

MATVJALLLLA AMLLGNPGLV FSRLSPEYYD LARAHLRDEE KSCPCLAQEG PQGDLLTKTQ 60 
ELGRDYRTCL TIVQKLKKMV DKPTQRSVSN AATRVCRTGR SRWRDVCRNF MRRYQSRVTQ 120 
GLVAGETAQQ ICEDLRLCIP STGPL 145 

Seq ID NO: C2B4 Protein Sequence 
Protein Accession #: NP_005594.1 

1 11 21 31 41 51 

I I I I I I 

MKVSAAALAV ILIATALCAP ASASPYSSDT TPCCFAYIAR PLPRAKIKEY FYTSGKCSNP 60 

AWFVTRKNR QVCANPEKKW VREYINSLEM S 91 



Seq ID NO: C285 Protein Sequence 
Protein Accession #: NP_071437.1 

1 11 21 31 41 51 

I I I I I I 

MAPGRAVAGL LIiLAAAGLGG VAEGPGIAFS EDVLSVFGAN LSLSAAQLQH LLEQKGAASR 60 

VGVPEPGQLH FNQCLTAEEI FSLHGPSNAT QITSSKFSVI CPAVLQQIOT HPCEDRPKHK 120 

TRPSHSEVWG YGFLSVTIIN LASLLGLILT PLIKKSYFPK ILTFFVGLAI GTLFSNAIFQ 180 

LIPEAFGFDP KVDSYVEKAV AVFGGFYLLF FFERMLKKLL KTYGQNGHTH FGNDNFGPQE 240 

KTHQPKALPA INGVTCYANP AVTEANGHIH FDNVSWSLQ DGKKEPSSCT CLKGPKLSBI 300 

GTIAWMITLC DALHNFIDGL AIGASCTLSL LQGLSTSIAI LCBEPPHELG DFVILLNAGM 360 

STRQALLFNF LSACSCYVGL AFGILVGNNF APNIIFALAG GMFLYISLAD MFPEMNDMLR 420 

EKVTGRKTDF TFFMIQNAGM LTGFTAILLI TLYAGEIBLE 460 



Seq ID NO: C286 Protein Sequence 
Protein Accession tf: NP_004175.1 

1 11 21 31 41 51 

i I 1 i I I 

MPNSEPASLL ELFNSIATQG ELVRSLKAGN ASKDEIDSAV KMLVSLKMSY KAAAGEDYKA 60 

DCPPGNPAPT SNHGPDATEA EEDFVDPWTV OTSSAKGIDY DKLIVRFGSS KIDKELINRI 120 

ERATGQRPHH FLRRGIFFSH RDMNQVLDAY ENKKPFYLYT GRGPSSEAMH VGHLIPFIFT 180 

KWLQDVFNVP LVTQMTDDEK YLWKDLTLDQ AYGDAVENAK DIIACGFDIN KTFIFSDLDY 240 

MGMSSGFYKN WKIQKHVTF NQVKGIFGFT DSDCIGKISP PAIQAAPSFS NSFPQIFRDR 300 

TDIQCLIPCA IDQDPYFRMT RDVAPRIGYP KPALLHSTFF PALQGAQTKM SASDPNSSIF 360 

LTDTAKQIKT KVNKHAFSGG RDTIEEHRQF GGNCDVDVSF MYLTFFLEDD DKLEQIRKDY 420 

TSGAMLTGEL KKALIEVLQP LIAEHQARRK EVTDEIVKEP MTPRKLSFDF Q 471 



Seq ID NO: C2B7 Protein Sequence 
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Protein Accession H : NP_004929.l 

1 11 21 31 41 SI 

I I I I I I 

MTVFRQENVD DYYDTGEELG SGQFAWKKC REKSTGLQYA AKFIKKRRTK SSRRGVSRED 60 

IEREVSILKE IQHPNVITLH EVYENKTDVI LILELVAGGE LFDFLAEKES LTEEEATEPL 120 

KQILMGVYYL HSLQIAHFDL KPEKIMLLDR NVPKPRIKII DFGLAHKIDF GNEFKNIFGT 180 

PEFVAPBIVN YEPLGLEADM WSIGVITYIL LSGASPFLGD TKQETLANVS AVNYEFEDEY 240 

PSNTSALAKD FIRRLLVKDP KKRMTIQDSL QHPWIKPKDT OQALSRKASA VNMEKFKKFA 300 

ARKKWKQSVR LISLCQRLSR SFLSRSNMSV ARSDDTLDEE DSFVMKAIIH AINDDNVPGL 360 

QHLLGSLSNY DVNQPNKHGT PPLLIAAGCG NIQILQLLIK RGSRIDVQDK GGSNAVYWAA 420 

RHGHVDTLKF LSENKCPLDV KDKSGEMALH VAARYGHADV AQVTCAASAQ IPISRTKEEE 480 

TPLHCAAWHG YYSVAKALCE AGCNVNIKNR EGETPLLTAS ARGYHDIVEC LAEHGADLNA 540 

CDKDGHIALH LAVRRCQMEV IKTLLSQGCF VDYQDRHGNT PLHVACKDGN MPIWALCEA 600 

NCNLDISNKY GRTPLHLAAN NGILDWRYL CLMGASVEAL TTDGKTAEDL ARSEQHEHVA 660 

GLLARLRKDT HRGLFIQQLR PTQNLQPRIK LKLFGHSGSG KTTLVESLKC GLLRSFFRRR 720 

RPRLSSTNSS RFPPSPLASK PTVSVSINNLt YPGCENVSVR SRSMMFEPGL TKGMIiEVFVA 780 

PTHHPHCSAD DQSTKAIDIQ NAYLNGVGDF SVWEFSGNPV YFCCYDYFAA NDPTSIHWV 840 

FSLEEPYEIQ LNPVIFWLSF LKSLVPVEEP IAFGGKLKNP LQWLVATHA DIHNVPRPAG 900 

GEFGYDKDTS LLKEIRNRFG NDLHISNKLF VLDAGASGSK DMKVUINHLQ EIRSQIVSVC 960 

PPMTHLCEKI ISTLPSWRKL NGPNQLMSLQ QFVYDVQDQL NPLASEEDLR RIAQQLHSTG 1020 

EINIMQSETV QDVLLLDPRW LCTNVLGKLli SVETPRALHH YRGRYTVEDI QRLVPDSDVE 1080 

ELLQILDAMD I CAROLS SGT MVDVPALIKT DNLHRSWADE EDEVMVYGGV RIVPVEHLTP 1140 

PPCGIFHKVQ VNLCRWIHQQ STEGDADIRL WVNGCKLANR GAELLVLLVN HGQGIEVQVR 1200 

GLETEKIKCC LLI45SVCSTI ENVMATTLPG LLTVKHYLSP QQLREHHEPV MIYQPRDFFR 1260 

AQTLKETSLT NTMGGYKESF SSIMCFGCHD VYSQASLGMD IHASDLNLLT RRKLSRLLDP 1320 

PDPLGKDWCL LAMNLGI*PDL VAKYNTNNGA PKDFLPSPLH ALLREWTTYP ESTVGTLMSK 1380 

LRELGRRDAA DLLLKASSVF KINLDGNGQE AYASSCNSGT SYNSISSWS R 1431 

Seq ZD NO: C288 Protein Sequence 
Protein Accession #: NP — 002072. 1 

1 11 21 31 41 51 

I I I I I I 

MELRARGWWIi LCAAAALVAC ARGDPASKSR SCGEVRQIYG AKGFSLSDVP QAEISGEHLR 60 

ICPQGYTCCT SEMEENLANR SHAELETALR DSSRVLQAML ATQLRSFDDH FQHLLNDSER 120 

TLQATFPGAF GELYTQNARA FRDLYSELRL YYRGANLHLE ETLAEFWARL LERLFKQLHP 180 

QLLLPDDYLD CLGKQAEALR PFGEAPRELR LRATRAFVAA RSFVQGLGVA SDWRKVAQV 240 

PLQPECSRAV MKLVYCAHCL GVPGARPCPD YCRNVLKGCL ANQADLDAEW RNLLDSMVLI 300 

TDKFWGTSGV ESVIGSVHTW LAEAINALQD NRDTLTAKVI QGCGNPKVNP QGPGPEEKRR 360 

RGKLAPRERP PSGTLEKXjVS EAKAQLRDVQ DFHISLPGTL CSEKMALSTA SDDRCWNGMA 420 

RGRYLPEVKG DGLANQINNP EVEVDITKPD MTIRQQIMQL KIMTNRLRSA YNGNDVDFQD 480 

ASDDGSGSGS 490 

Seq ID NO: C289 Protein Sequence 
Protein Accession #: AAH30205.1 

1 11 21 31 41 51 

I I I I I I 

MIILIYLFLL LWEDTQGWGF KDGIFHNSIW LERAAGVYHR EARSGKYKLT YAEAKAVCEF 60 

EGGHLATYKQ LEAARKIGFH VCAAGWMAKG RVGYPIVKPG PNCGFGKTGI IDYGIRLNRS 120 

ERWDAYCYNP HAKECGGVFT DPKQIFKSPG FPNEYEDNQI CYWHIRLKYG QRIHLSFLDF 180 

DLEDDPGCLA DYVEIYDSYD DVHGFVGRYC GDELPDDIIS TGNVMTLKFL SDASVTAGGF 240 

QIKYVAMDPV SKSSQGKNTS TTSTGNKNPIi AGRFSHL 277 

Seq ID NO: C290 Protein Sequence 
Protein Accession #: NP_001973.1 

1 11 21 31 41 51 

I I I I I I 

MRANDALQVL GLLFSLARGS EVGNSQAVCP GTLNGLSVTG DAENQYQTLY KLYERCEWM 60 

GNLEIVLTGH NADLSFLQWI REVTGYVLVA MNEFSTLPLP NLRWRGTQV YDGKFAIPVM 120 

LNYNTNSSHA LRQLRLTQLT EILSGGVYIE KNDKLCHMDT IDWRDIVRDR DAEIWKDNG 180 

RSCPPCHEVC KGRCWGPGSE DCQTLTKTIC APQCNGHCFG PNPNQCCHDB CAGGCSGPQD 240 

TDCFACRHFN DSGACVPRCP QPLVYNKLTF QLEPNPHTKY QYGGVCVASC PHNPWDQTS 300 

CVRACPPDKM EVDKNGLKMC EPCGGLCPKA CEGTGSGSRF QTVDSSNIDG FVNCTKILGN 360 

LDFIilTGLNG DPWHKIPALD PEKLNVFRTV REITGYLNIQ SWPPHMHNFS VFSNLTTIGG 420 

RSLYNRGFSL LIMKNUJVTS LGFRSLKEIS AGRIYISANR QLCYHHSLNW TKVLRGPTEE 480 

RLDIKHNRPR RDCVAEGKVC DPIiCSSGGCW GPGPGQCLSC RNYSRGGVCV THCNFLNGEP 540 

REFAHEAECF SCHPECQPMG GTATCNGSGS DTCAQCAHFR DGPHCVSSCP HGVLGAKGPI 600 

YKYPDVQNEC RPCHENCTQG CKGPELQDCL GQTLVLIGKT HLTMALTVIA GLWIFKMLG 660 

GTFLYWRGRR IQNKRAMRRY LERGESIEPL DPSEKANKVL ARIFKETELR KLKVLGSGVF 720 

GTVHKGVWIP EGESIKIPVC IKVIEDKSGR QSFOAVTDHM LAIGSLDHAH IVRLLGLCPG 780 

SSLQLVTQYL PLGSLLDHVR QHRGALGPQL LLNWGVQIAK GMYYLEEHGM VHRNLAARNV 840 

LLKSPSQVQV ADFGVADLLP PDDKQLLYSE AKTPIKWMAL ESIHFGKYTH QSDVWSYGVT 900 

VWELMTFGAE PYAGLRLAEV PDIiLEKGERI* AQPQICTIDV YMVMVKCWMI DENIRPTFKE 960 

LANEFTRMAR DPPRYLVIKR ESGPGIAPGP EPHGLTNKKL EEVELEPELD LDLDLEAEED 1020 

NIATTTLGSA LSLPVGTLNR PRGSQSLLSP SSGYMPMNQG NLGGSCQESA VSGSSERCPR 1080 

PVSLHPMPRG CLASESSEGH VTGSEAELQE KVSMCRSRSR SRSPRPRGDS AYHSQRHSLL 1140 

TPVTPLSPPG LEEEDVNGYV MPDTHLKGTP SSREGTLSSV GLSSVLGTEE EDEDEEYEYM 1200 

NRRRRHSPPH PPRPSSLEEL GYEYMDVGSD LSASLGSTQS CPLHPVPIMP TAGTTPDEDY 1260 

EYMNRQRDGG GPGGDYAAMG ACPASEQGYE EMRAFQGPGH QAPHVHYARL KTLRSLEATD 1320 

SAFDNPDYWH SRLFPKANAQ RT 1342 
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Seq ID NO: C291 Protein Sequence 
Protein Accession ft: NP_001207.1 

„ 1 11 21 31 41 51 

5 | | | | 1 I 

MAPLCPSPWL PLLIPAPAPG LTVQLLLSLL LLMPVHPQRL PRMQEDSPLG GGSSGEDDPL GO 

GEEDLPSEED SPREEDPPGE EDLPGEEDLP GEEDLPEVKP KSEEEGSLKL EDLPTVEAPG 120 

DPQEPQNNAH RDKEGDDQSH WRYGGDPPHP RVSPACAGRF QSPVDIRPQL AAFCPALRPL 180 

ELLGFQtiPPL PELRLRNNGH SVQLTLPPGL EMALGPGREY RALQLHLHWG AAGRPGSEHT 240 

10 VEGHRFPAEI HWHLSTAFA RVDEALGRPG GLAVLAAFLE EGPEENSAYE QLLSRLEEIA 300 

EEGSETQVPG LDISALLPSD FSRYFQYEGS LTTPPCAQGV IWTVFNQTVM LSAKQLHTLS 360 

DTLWGPGDSR LQLNFRATQP LNGRVIEASF PAGVDSSPRA AEPVQLNSCL AAGDILALVF 420 

GLLFAVTSVA FLVQMRRQHR RGTKGGVSYR PAEVAETGA 459 

15 Seq ID NO: C292 Protein Sequence 
Protein Accession ft: NP_004198.1 

1 11 21 31 41 51 

on I I I I I I 

ZU MGGAWDEGP TGVKAPDGGW GWAVLFGCFV ITGFSYAFPK AVSVFFKELI QEFGIGYSDT 60 

AWISSILLAM LYGTGPLCSV CVNRFGCRPV MLVGGLFASIi GMVAASFCRS IIQVYLTTGV 120 

ITGLGLALNF QPSLIMLNRY FSKRRPMANG I*AAAGSPVFL CALSPLGQLL QDRYGWRGGF 180 

LILGGLLLNC CVCAALMRPL WTAQPGSGP PRPSRRLLDLi SVFRDRGFVL YAVAASVMVL 240 

GLFVPPVFW SYAKDLGVPD TKAAFLLTIL GFIDIFARPA AGFVAGLGKV RPYSVYLFSF 300 

25 SMFFNGLADL AGSTAGDYGG LWFCIFFGI SYGMVGALQF BVLMAIVGTH KFSSAIGLVL 360 

LMEAVAVLVG PPSGGKLLDA THVYMYVFIL AGAEVLTSSL ILLLGNFFCI RKKPKEPQPE 420 

VAAAEEEKLH KPPADSGVDL REVEHFLKAE PEKNGEWHT PETSV 465 

Seq ID NO: C293 Protein Sequence 
30 Protein Accession ft: NP_0 0034 9.1 

1 11 21 31 41 51 

I I I I 1 I 

MALFVRLLAL AIALALGPAA TLAGPAKSPY QLVLQHSRLR GRQHGPNVCA VQKVIGTNRK 60 

35 YFTNCKQWYQ RKICGKSTVI SYECCPGYEK VPGEKGCPAA LPLSNLYETL GWGSTTTQL 120 

YTDRTEKLRP EMEGPGSFTI FAPSNEAWAS LPAEVLDSLV SNVNIELLNA LRYHMVGRRV 180 

LTDELKHGMT LTSMYQNSNI QIHHYPNGIV TVNCARLLKA DHHATNGWH LIDKVISTIT 240 

NNIQQIIEIE DTFBTLRAAV AASGLNTMLE GNGQYTLLAP TNEAFEKIPS ETLNRILGDP 300 

EALRDLLNNH ILKSAMCAEA IVAGLSVETL EGTTLEVGCS GDMLTINGKA IISNKDILAT 360 

40 NGVIHYIDEL LIPDSAKTLF ELAAESDVST AIDLFRQAGL GNHLSGSERL TLLAPLNSVF 420 

KDGTPPIDAH TRNLLRNHII RDQLASKYLY HGQTLETLGG KKLRVFVYRN SLCIENSCIA 480 

AHDKRGRYGT LFTMDRVLTP PMGTVMDVLK GDNRFSMLVA AIQSAGLTET LNREGVYTVF 540 

APTNEAFRAL PPRERSRIjLG DAKELANILK YHIGDEILVS GGIGALVRLK SLQGDKLEVS 600 

LKNNWSVNK EPVABPDIMA TNGWHVITN VLQPPANRPQ ERGDELADSA LEIFKQASAF 660 

45 SRASQRSVRL APVYQKLLER MKH 683 

Seq ID NO: C294 Protein Sequence 
Protein Accession ft: NP_006527.1 

50 1 11 21 31 41 51 

I 1 1/1 I I 

MTQRSIAGPI CNLKFVTLLV ALSSELPFLG AGVQLQDNGY NGLLIAINPQ VPENQNLISN 60 

IKEMITEASF YIiFNATKRRV FFRNIKILIP ATWKANNNSK IKQESYEKAN VIVTDWYGAH 120 

GDDPYTLQYR GCGKEGKYIH FTPNFLLNDN LTAGYGSRGR VFVHEWAHLR WGVFDEYNND 180 

55 KPFYINGQNQ IKVTRCSSDI TGIFVCEKGP CPQENCIISK LFKEGCTFIY NSTQNATASI 240 

MFMQSLSSW EFCNASTHNQ EAPNLQNQMC SLRSAWDVIT DSADFBRSFP MNGTELPPPP 300 

TFSLVQAGDK WCLVLDVSS KMAEADRLLQ LQQAAEPYLM QIVEIHTFVG IASFDSKGEI 360 

RAQLHQINSN DDRKLLVSYL PTTVSAKTDI SICSGLKKGF BWEKUIGKA YGSVMILVTS 420 

GDDKLLGNCL PTVLSSGSTI HSIALGSSAA PNLEELSRLT GGLKFFVPDI SNSNSMIDAF 480 

60 SRISSGTGDI FQQHIQLEST GENVKPHHQL KNTVTVDNTV GNDTMFLVTW QASGPPEIIL 540 

FDPDGRKYYT NNFITNLTFR TASIiWIPGTA KPGHWTYTLN NTBHSLOALK VTVTSRASNS 600 

AVPPATVEAF VERDSLHFPH PVMIYANVKQ GFYPILNATV TATVEPETGD PVTLRLLDDG 660 

AGADVIKNDG IYSRYFFSFA ANGRY SLKVH VNHSPSISTP AHS IPG SHAM YVPGYTANGN 720 

IQMNAPRKSV GRNEEERKWG FSRVSSGGSF SVLGVPAGPH PDVFPPCKII DLEAVKVEEE 780 

65 LTLSWTAPGB DFDQGQATSY EIRMSKSLQN IQDDPNNAIL VNTSKRNPQQ AGIREIFTFS 840 

PQISTNGPEH QPNGBTHESH RIYVAIRAMD RNSLQSAVSN IAQAPLFIPP NSDPVPARDY 900 

LILKGVLTAM GLIGIICLII WTHHTLSRK KRADKKEKGT KLL 943 

Seq ID NO: C295 Protein Sequence 
70 Protein Accession ft: Eos sequence 

1 11 21 31 41 51 

I I I 1 I I 

MKFLLIIjLLQ ATASGALPLN SSTSLEKNNV LFGERYLEKF YGLEINKLPV TKMKYSGNLM 60 

75 KEKIQEMQHF LGLKVTGQLD TSTLEMMHAP RCGVPDVHHF REMPGGPVWR KHYITYRINN 120 

YTPDMNREDV DYAIRKAFQV WSNVTPLKFS KINTGMADIL WFARGAHGD FHAFDGKGGI 180 

LAHAFGPGSG IGGDAHFDED EFWTTHSGGT NLFLTAVHEI GHSLGLGHSS DPKAVMFPTY 240 

KYVDINTFRL SADDIRGIQS LYGDPKENQR LPNPDNSEPA LCDPNLSFDA VTTVGNKIFF 300 

FKDRFFWLKV SERPKTSVNL ISSLWPTLPS GIBAAYEIEA RNQVFLFKDD KYWLISNLRP 360 

80 EPNYPKSIHS FGFPNFVKKI DAAVFNPRFY RTYFFVDNQY WRYDBRRQMM DPGYPKLITK 420 

NFQGIGPKID AVPYSKNKYY YFFQGSNQFE YDFLLQRITK TLKSNSWFGC . 470 



Seq ID NO: C296 Protein Sequence 
Protein Accession ft: Eos sequence 
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1 11 21 31 41 51 

I I I I I I 

MKFLLILLLQ ATASGALPLN SSTSLEKNNV LFGERYLEKF YGLEINKLPV TKMKYSGNLM 60 

5 KEKIQEMQHP LGLKVTGQLD TSTLEMMHAP RCGVPDVHHF REMPGGPVWR KHYITYRINN 120 

YTPDMNREDV DYAIRKAFQV WSNVTPLKFS KINTGMADIL WPARGAHGD FHAPDGKGGI 180 

LAHAFGPGSG IGGDAHFDED EFWTTHSGGT NLFLTAVHAI GHSLGLGHS9 DPKAVMFPTY 240 

KYVDINTFRL SADDIRGIQS LYGDPKENQR LPNPDNSEPA LCDPNLSFDA VTTVGNKIFF 300 

PKDRFFWLKV SERPKTSVNI, ISSLWPTLPS GIEAAYEIEA RNQVFLFKDD KYWLISNLRP 360 

10 EPNYPKSIHS FGFPNFVKKI DAAVFNPRPY RTYFFVDNQY WRYDERRQMM DPGYPKLITK 420 

NFQGIGPKID AVFYSKNKYY YPFQGSNQFE YDFLLQRITK TLKSNSWFGC 470 
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Seq ID NO: C297 Protein Sequence 
Protein Accession #: NP_008883.1 



1 11 21 31 41 51 

I I I I I I 

MAKDNSTVRC FQGLLIFGNV IIGCCGIALT AECIFFVSDQ HSLYPLLEAT DNDDIYGAAW 60 

IGIFVGICLF CLSVLGIVGI MKSSRKILLA YFILMPIVYA FEVASCITAA TQRDFFTPNL 120 

20 FLKQMLERYQ NNSPPNNDDQ WKNNGVTKTW DRLMLQDNCC GVNGPSDWQK YTSAFRTENN 180 

DADYPWPRQC CVMNNLKEPL NLEACKLGVP GFYHNQGCYE LISGPMNRHA WGVAWFGFAI 240 

LCWTFWVLLG TMFYWSRIEY 260 

Seq ID NO: C298 Protein Sequence 
25 Protein Accession #: NP_001784.2 

1 11 21 31 41 51 

I I 1 I I I 

MGLPRGPLAS LLLLQVCWLQ CAASEPCRAV FREAEVTLEA GGAEQEPGQA LGKVFMGCPG 60 

30 QEPALFSTDN DDFTVRNGET VQERRSLKER NPLKIFPSKR ILRRHKRDWV VAPISVPENG 120 

KGPFPQRLNQ LKSNKDRDTK I FYS I TG PGA DSPPEGVFAV EKETGWLLLN KPLDREEIAK 180 

YELFGHAVSE NGASVEDPMN ISIIVTDQND HKPKFTQDTF RGSVLEGVLP GTSVMQVTAT 240 

DEDDAIYTYN GWAYSIHSQ EPKDPHDLMF TIHRSTGTIS VISSGLDREK VPEYTLTIQA 300 

TDMDGDGSTT TAVAWE I LD ANDNAPMFDP QKYEAHVPEN AVGHEVQRLT VTDLDAPNSP 360 

35 AWRATYLIMG GDDGDHFTIT THPESNQGIL TTRKGLDFEA KNQHTLYVEV TNEAPFVLKL 420 

PTSTATIWH VEDVNEAPVF VPPSKWEVQ EGIPTGEPVC VYTAEDPDKB NQKISYRILR 480 

DPAGWLAMDP DSGQVTAVGT LDREDEQFVR NNIYEVMVLA MDNGSPPTTG TGTLLLTLID 540 

VNDHGPVPEP RQITICNQSP VRQVLNITDK DLSPHTSPFQ AQLTDDSDIY WTAEVNEEGD 600 

TWLSLKKFL KQDTYDVHLS LSDHGNKEQL TVIRATVCDC HGHVETCPGP WKGGFILPVL 660 

GAVLALLFLL LVLLLLVRKK RKIKEPLLLP EDDTRDNVFY YGEEGGGEED QDYDITQLHR 720 

GLEARPEWL RNDVAPTIIP TPMYRPRPAN PDEIGNFIIE NLKAANTDPT APPYDTLLVF 780 
DYEGSGSDAA SLSSLTSSAS DQDQDYDYLN EWGSRFKKLA DMYGGGEDD 829 



Seq ID NO: C299 Protein Sequence 
Protein Accession #: NP_005620.1 



1 11 21 31 41 51 

I I I I I I 

MAKKSAENGI YSVSGDEKKG PLIAPGPDGA PAKGDGFVGL GTPGGRLAVP PRETWTRQMD 60 

FIMSCVGFAV GLGNVWRFPY LCYKNGGGVF LIPYVLIALV GGI PIFFLE I SLGQFMKAGS 120 

INVWNICPLF KGLGYASMVI VPYCNTYYIM VLAWGFYYLV KSFTTTLPWA TCGHTWNTPD 180 

CVEIFRHEDC ANASLANLTC DQLADRRSPV IEFWENKVLR LSGGLEVPGA LNWEVTLCLL 240 

ACWVLVYFCV WKGVKSTGKI VYFTATFPYV VLWLLVRGV LLPGALDGII YYLKPDWSKL 300 

GSPQVWIDAG TQIFFSYAIG LGALTALGSY NRFNNNCYKD AIILALINSG TSFFAGFWF 360 

55 SILGFMAAEQ GVHISKVABS GPGLAFIAYP RAVTLMPVAP LWAALFFFML LIiLGLDSQFV 420 

GVEGFITGLLi DLLPASYYFR FQREISVALC CALCFVIDLS MVTDGGMYVF QLFDYYSASG 480 

TTLLWQAFWE CVWAWVYGA DRFMDDIACM IGYRPCPWMK WCWSFFTPLV CMGIFIFNW 540 

YYEPLVYNNT YVYPWWGEAM GWAFALSSML CVPLHLLGCL LRAKGTMAER WQHLTQPIHG 600 

LHHLEYRAQD ADVRGLTTI»T PVSESSKWV VESVM 635 



Seq ID NO: C300 Protein Sequence 
Protein Accession #: NP 006507.1 



1 11 21 31 41 51 

65 | | | | | | 

MEPSSKKLTG RLMLAVGGAV LGSLQFGYNT GVINAPQKVI EEFYNQTWVH RYGESILPTT 60 

LTTLWSLSVA IFSVGGMIGS FSVGLFVNRF GRRNSMLMMN LLAFVSAVLM GFSKLGKSFE 120 

MLILGRFIIG VYOGLTTGFV PMYVGEVSPT AFRGALGTLH QLGIWGILI AQVFGLDSIM 180 

GNKDLWPLLL SIIFIPALLQ CIVLPFCPES PRFLLINRNE ENRAKSVLKK LRGTADVTHD 240 

70 LQEMKEESRQ MMREKKVTIL ELFRSPAYRQ PILIAWLQL SQQLSGINAV FYYSTSIFEK 300 

AGVQQPVYAT IGSGIVNTAF TWSLFWER AGRRTLHLIG LAGMAGCAIL MTIALALLEQ 360 

LPWMSYLSIV AIFGFVAFFE VGPGPIPWFI VAELPSQGPR PAAIAVAGFS NWTSNFIVGM 420 

CFQYVEQLCG PYVFIIFTVL LVIiFFIFTYF KVPETKGRTF DEIASGFRQG GASQSDKTPE 480 

ELFHP LOADS QV 492 



Seq ID NO: C301 Protein Sequence 
Protein Accession #: XP_035292.2 



nry 1 11 21 31 41 51 

80 | | | | I I 

MAGAGPKRRA LAAPAAEEKE EAREKMLAAK SADG9APAGE GEGVTLQRNI TLLNGVAIIV 60 

GTIIGSGIFV TPTGVLKEAG SPGLALWWA ACGVFSIVGA LCYAELGTTI SKSGGDYAYM 120 

LEVYGSLPAF LKLWIELLII RPSSQYIVAL VFATYLLKPL FPTCPVPEEA AKLVACLCVL 180 

LLTAVNCYSV KAATRVQDAF AAAKLLALAL IILLGFVQIG KGDVSNLDPN FSFEGTKLDV 240 
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GNIVLALYSG LFAYGGWNYL NFVTEEMINP YRNLPLAIII SLPIVTLVYV LTNLAYFTTL 
STEQMLSSEA VAVDFGNYHL GVMSWIIPVP VGLSCFGSVN GSLPTSSRLF FVGSREGHLP 
SILSMIHPQL LTPVPSLVFT CVMTLLYAFS KDIFSVINFF SFFNWLCVAL AIIGMIWLRH 
RKPELBRPIK VNLALPVFFI LACLFLIAVS FWKTPVECGI GFTIILSGLP VYFFGVWWKN 
KPKWLLQGIF STTVLCQKLM QWPQET 

Seq ID NO: C302 Protein Sequence 
Protein Accession ft: NP_0 052 59.1 

1 11 21 31 41 51 

I I 1 I I I 

MNWSIFEGLL SGVNKYSTAF GRIWLSLVFI FRVLVYLVTA ERVWSDDHKD FDCNTRQPGC 
SNVCFDEFFP VSHVRLWALQ LILVTCPSLL WMHVAYREV QEKRHREAHG ENSGRLYLNP 
GKKRGGLWWT YVCSLVFKAS VDIAFLYVFH SFYPKYILPP WKCHADPCP NIVDCFISKP 
SEKNIFTLFM VATAAICILL NLVELIYLVS KRCHECLAAR KAQAMCTGHH PHGTTSSCKQ 
DDLLSGDLIF LGSDSHPPLL PDRPRDHVKK TIL 

Seq ID NO: C303 Protein Sequence 
Protein Accession ft: NP_005121.1 



1 11 21 31 41 51 

I I I I I I 

MKICSLTLLS FLLLAAQVLL VEGKKKVKNG LHSKWSEQK DTLGNTQIKQ KSRPGNKGKF 

VTKDQANCRW AATEQEEGIS LKVECTQLDH EFSCVFAGNP TSCLKLKDER VYWKQVARNL 

RSQKDICRYS KTAVKTRVCR KDFPESSLKL VSSTLFGNTK PRKEKTEMSP REHIKGKETT 

PSSLAVTQTM ATKAPBCVED PDMANQRKTA LEFCGETWSS LCTFFLSIVQ DTSC 

Seq ID NO: C304 Protein Sequence 
Protein Accession ft: AAH22542 



1 11 21 31 41 51 

I I I I I I 

MCSEIILRQE VLKDGFHRDL LIKVKFGESI EDLHTCRLLI KQDIPAGLYV DPYELASLRE 
RNITEAVMVS ENFDIEAPNY LSKESEVLIY ARRDSQCIDC FQAFLPVHCR YHRPHSEDGB 
ASIWNNPDL LMFCDQAGSR RMIRFRFDSF DKTIBFPILK CWAHSEVAAP CALENEDICQ 
WNKMKYKSVY KNVILQVPVG LTVHTSLVCS VTLLITILCS KKKKK 



Seq ID NO: C305 Protein Sequence 
Protein Accession ft: NP 004985.1 



1 
I 

MSLWQPLVLV 



ITYWIQNYSE 
FDGKDGLLAH 
YSACTTDGRS 
ACTTDGRSDG 
CTSEGRGDGR 
PMYRFTEGPP 
PTAQPTGPPS 
RFSEGRGSRP 
LDKLGLGADV 
THDVFQYREK 



11 

I 

LLVLGCCFAA 
LLLLQKQLSI* 
DLPRAVIDDA 
AFPPGPGIQG 
DGLPWCSTTA 
YRKCATTANY 
LWCATTSNFD 
LHKDDVNGIR 
AGPTGPPTAG 
QGPFLIADKW 
AQVTGALRSG 
AYFGQDRFYW 



21 
i 

PRQRQSTLVL 
PETGELDSAT 
FARAFALWSA 
DAHFDDDELW 
NYDTDDRFGF 
DRDKLFGFCP 

HLYGPRPEPE 
PSTATTVPLS 
PALPRKLDSV 
RGKMLLFSGR 



31 
I 

FPGDLRTNLT 
LKAMRTPRCG 
VTPLTFTRVY 
SLGKGWVPT 
CPSERLYTRD 
TRADSTVMGG 
QGYSLFLVAA 
PRPPTTTTPQ 
PVDDACNVNI 
FEEPLSKKLF 
RLWRFDVKAQ 
VDQVGYVTYD 



41 
I 

DRQLAEEYLY 
VPDLGRFQTF 
SRDADIVIQF 
RFGNADGAAC 
GNADGKPCQF 
NSAGELCVFP 
HEFGHALGLD 
PTAPPTVCPT 
FDAIAEIGNQ 
FPSGRQVWVY 
MVDPRSASEV 
ILQCPED 



51 
I 

RYGYTRVAEM 



GVAEHGDGYP 
HFPFIFEGRS 
PFIFQGQSYS 
FTFLGKEYST 
HSSVPEALMY 



Seq ID NO: C306 Protein Sequence 
Protein Accession ft: NP 000204 



1 
I 

MAGPRPSPWA 
CNTQAELLAA 
EVFEPLESPV 
PQTDMRPBKL 
QTAVCTRDIG 
TQDYPSVPTL 
EAFNRIRSNL 
THVCQLPEDQ 



11 
I 

RLLLAALISV 
GCQRESIWM 
DLYILMDFSN 
KEPWPNSDPP 
HRPDSTHLLV 
VRLLAKHNII 
DIRALDSPRG 
KGNIHLKPSF 
II 



21 
I 

SLSGTLANRC 
ESSFQITEET 
SMSDDLDNLK 
FSFKNVISLT 
FSTESAFHYE 
PIFAVTNYSY 
LRTEVTSKMF 
SDGLKMDAGI 



FQCPRTSGFL 
CHCHQQSIiYT 
LKRAEEVWR 
LLPT.T.AT.T.LL 
SGNLKGRDW 
AQLRQEVEEN 
LTEKQVEQRA 
LVEAIDVPAG 
KSQVSYRTQD 
VQLSNPKFGA 
RKIHFNWLPP 
AQGEGPYSSL 
NDDNRPIGPM 
KRPMSIPIIP 
EELDLRRVTW 



DTICEINYSA 
CSFRDEDDDC 
LCWKYCACCK 



LNEVYRQISG 
FHDLKVAPGY 
TATLGRRLVN 
GTAQGNRDYI 
HLGQPHSTTI 
SGKPKGYRVK 
VSCRTHQEVP 
KKVLVDNPKN 
DIPIVDAQSG 
RLPPELIPRL 



QCVCEPGWTG 
IHPGLCEDLR 
TYSYTMEGDG 
ACLALLPCCN 
PGFATHAASI 
VHKLQQTKFR 
YTLTADQDAR 
ITIIKEQARD 
PVEGELLFQP 
IIRDPDELDR 
YWIQGDSESE 
SEPGRLAFNV 
RMLLIENLRE 
EDYDSFLMYS 



31 
I 

KKAPVKSCTE 
QIDTTLRRSQ 
KMGQNLARVL 
EDVDEFRNKL 
ADGANVLAGI 
SYYEKLHTYF 
QKTRTGSFHI 
ICDVCTCELQ 
PCSGRGECQC 
PSCDCPLSNA 
SCVQCQAWGT 
APGPNSTVLV 
RGHMVGFKED 
NPTELVPYGL 
QQPNAGKKQD 
GMVEFQEGVE 
WSFEQPEFS 
GEAWKELQVK 



41 
I 

CVRVDKDCAY 
MSPQGLRVRL 
SQLTSDYTIG 
QGERISGNLD 



AHLLDSKVPS 
VSSTVTQLSW 
SQPYRYTVKA 
DDVLRSPSGS 
EAPTAPRTTA 



PVSSLGVLQE 
RRGEVGIYQV 
KBVRSARCSF 
GHCVCYGEGR 
TCIDSNGGIC 
GEKKGRTCEE 
HKKKDCPPGS 
HYMLRENLMA 
SLRLARLCTE 
HTIVDTVLMA 
LVDVRVPLFI 
VSRGDQVARI 
LiLELQEVDSL 
PPPHGDLGAP 
VELTNLYPYC 
AEPAETNGBI 
RNGAGWGPER 
QRPSVSDDTG 
ARAGRAAAVP 



LYLFKDGKYW 
TGASVLGPRR 
DRMFPGVPLD 



51 
I 

CTDEMFRDRR 
RPGEERHFEL 
FGKFVDKVSV 
APEGGFDAIL 
DTTGTYTQYR 
DSSNIVELLE 
QLRALEHVDG 
NGDFVCGQCV 
YEGQFCEYDN 



CNFKVKMVDE 
FWWLIPLLLL 
SDHLDTPMLR 
NLLKPDTREC 
PRSAKPALLK 
RPEDDDEKQL 
PVIRRVLDGG 
LRGRQVRRFH 
QNPNAKAAGS 
DYEMKVCAYG 
TAYEVCYGLV 
EAIINLATQP 
CGWKFEPLLG 
RSATPGPPGE 



300 
360 
420 
480 
507 



60 

120 

180 

240 

273 



60 
120 
180 
234 



60 
120 
180 
225 



60 

120 

180 

240 

300 

360 

420 

460 

540 

600 

660 

707 



60 

120 

180 

240 

300 

360 

420 

4 B0 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 
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10 



20 



60 



HLVNGRMDFA FPGSTNSLHR MTTTSAAAYO THLSPHVPHR VLSTSSTLTR DYNSLTRSEH 1500 

SHSTTLPRDY STLTSVSSHD SRLTAGVPDT PTRLVFSALG PTSLRVSWQB PRCERPLQGY IS SO 

SVEYQLLNGG ELHRLNIPNP AQTSWVEDL LPNHSYVFRV RAQSQEGWGR EREGVITIES 1620 

QVHPQSPLCP LPGSAFTLST PSAPGPLVFT ALSPDSLQLS WERPRRPNGD IVGYLVTCEM 16 BO 

AQGGGPATAF RVDGDSPESR LTVPGLSENV PYKFKVQART TEGFGPEREG IITIESQDGG 1740 

PFPQLGSRAG LFQHPLQSEY SSITTTHTSA TEPFLVDGLT LGAQHLEAGG SLTRHVTQEP 1800 

VSRTLTTSGT LSTHMDQQFP QT 1822 

Seq ID NO: C307 Protein Sequence 
Protein Accession #: NP 076404,1 



1 11 21 31 41 51 

I I I I I I 

MGFNLTLAKL PNNELHGQES HNSGNRSDGP GKNTTLHNEF DTIVLPVLYL IIFVASILLN 60 

Ij GLAVWIFFHI RNKTSFIFYL KNIWADLIM TLTFPFRIVH DAGFGPWYFK FILCRYTSVL 120 

FYANMYTSIV FLGLISIDRY LKWKPFGDS RMYSITFTKV LSVCVWVIMA VLSLPNIILT 180 

NGQPTBDNIH DCSKLKSPLG VKHHTAVTYV NSCLFVAVLV ILIGCYIAIS RYIHKSSRQF 240 

ISQSSRKRKH NQSIRVWAV FFTCFLPYHL CRIPFTFSHL DRLLDESAQK ILYYCKEITL 300 

FLSACNVCLD PIIYFFMCRS FSRRLFKKSN IRTRSESIRS LQSVRRSEVR IYYDYTDV 358 



Seq ID NO: C30B Protein Sequence 
Protein Accession #: NP 065840.1 



oc 1 11 21 31 41 51 

25 I I I I 1 I 

MVWCLGLAVL SLVISQGADG RGKPEWSW GRAEESWLG CDLLPPAGRP PLHVIEWLRF 60 

GFLLPIFIQF GLYSPRIDPD YVGRVRLQKG ASLQIEGLRV EDQGWYECRV PFLDQHIPED 120 

DFANGSWVHL TVNSPPQFQE TPPAVLEVQE LEPVTLRCVA RGSPLPHVTW KLRGKDLGQG 180 

« n QGQVQVQNGT LRIRRVERGS SGVYTCQASS TEGSATHATQ LLVLGPPVIV VPPKNSTVNA 240 

JK) SQDVSLACHA EAYPANI»TYS WFQDNINVFH ISRLQPRVQI LVDGSLRLLA TQPDDAGCYT 300 

CVPSNGLLHP PSASAYLTVL CMPGVIRCPV RANPPLLFVS WTKDGKALQL DKFPGWSQGT 360 

EGSLIIALGN EDALGEYSCT PYNSLGTAGP SPVTRVLLKA PPAFIERPKE EYFQEVGREL 420 

L1PCSAQGDP PPWSWTKVG RGLQGQAQVD SNSSLILRPL TKEAHGHWEC SASNAVARVA 480 

TSTNVYVLGT SPHWTNVSV VALPKGANVS WEPGFDGGYL QRFSVWYTPL AKRPDRMHHD 540 

3D WVSLAVPVGA AHLLVPGLQP HTQYQFSVLA QNKLGSGPFS EIVLSAPEGL PTTPAAPGLP 600 

PTEIPPPL5P PRGLVAVRTP RGVLLHWDPP ELVPKRLDGY VLEGRQGSQG WEVLDPAVAG 660 

TETELLVPGL IKDVLYEFRL VAFAGSFVSD PSNTANVSTS GLEVYPSRTQ LPGLLPQPVL 720 

AGWGGVCFL GVAVLVSILA GCLLNRRRAA RRRRKRLRQD PPLIFSPTGK SAAPSALGSG 780 

^ SPDSVAKLKL QGSPVPSLRQ SLLWGDPAGT PSPHPDPPSS RGPLPLEPIC RGPDGRFVMG 840 

4U PTVAAPQERS GREQAEPRTP AQRLARSFDC SSSSPSGAPQ PLCIEDISPV APPPAAPPSP 900 

LPGPGPLLQY LSLPFPREMN VDGDWPPLEE PSPAAPPDYM DTRRCPTSSF LRSPETPPVS 960 

PRESLPGAW GAGATAEPPY TALADWTLRB RUiPGLLPAA PRGSLTSQSS GRGSASFLRP 1020 

PSTAPSAGGS YLSPAPGDTS SWASGPERHP RREHWTVSK RRNTSVDENY EWDSEFPGDM 1080 

ELLETLHLGL ASSRLRPEAE TELGVKTPEE GCLLNTAHVT GPEARCAALR EEFIAFRRRR 1140 

4D DATRARLPAY RQPVPHPBQA TLL 1163 

Seq ID NO: C309 Protein Sequence 
Protein Accession #: 

50 1 11 21 31 41 51 

I I 1 I I I 

MLTKPLQGPP APPGTPTPPP GGKDREAFEA EYRLGPLLGK GGFGTVFAGH RLTDRLQVAI 60 

KVIPRNRVLG WSPLSDSVTC PLEVALLWKV GAGGGHPGVI RLLDWFETQE GFMLVLERPL 120 

PAQDLFDYIT EKGPLGEGPS RCFFGQWAA IQHCHSRGW HRDIKDENIL IDLRRGCAKL 180 

J J IDFGSGALLH DEPYTDFDGT RVYSPPEWIS RHQYHALPAT VWSLGILLYD MVCGDIPFER 240 

DQEILEAELH FPAHVSPDCC ALIRRCLAPK PSSRPSLEEI LLDPWMQTPA EDVTPQPLQR 300 

RPCPFGLVLA TLSLAWPGLA PNGQKSHPHA MSQG 334 



Seq ID NO: C310 Protein Sequence 
Protein Accession #* NP 002501.1 



1 11 21 31 41 51 

I I I I I i 

MECLYYFLGF LLLAARLPLD AAKRFHDVLG NERPSAYMRE HNQLNGWSSD ENDWNEKLYP 60 

OD VWKRGDMRWK NSWKGGRVQA VLTSDSPALV GSNITFAVNL IFPRCQKEDA NGNIVYEKNC 120 

RNEAGLSADP YVYNWTAWSE DSDGENGTGQ SHHNVFPDGK PFPHHPGWRR WNFIYVFHTL 180 

GQYFQKLGRC SVRVSVNTAN VTLGPQLMEV TVYRRKGRAY VPIAQVKDVY WTDQIPVFV 240 

TMFQKNDRNS SDETFLKDLP IMFDVI*IHDP SHFLNYSTIN YKWSFGDNTG LFVSTNHTVN 300 

„ HTYVLNGTFS LNLTVKAAAP GPCPPPPPPP RPSKPTPSLG PAGDNPLSLS RIPDENOQIN 360 

/U RYGHFQATIT IVEGILEVNI IQMTDVLMPV PWPESSLIDF WTCQGSIPT EVCTIISDPT 420 

CEITQNTVCS PVDVDEMCLL TVRRTFNGSG TYCVNLTLGD DTSLALTSTL ISVPDRDPAS 480 

PLRMANSALI SVGCLAIFVT VISLLVYKKH KEYNPIENSP GNWRSKGLS VFLNRAKAVF 540 

FPGNQBKDPIi LKNQEFKGVS S60 

75 Seq ID NO: C311 Protein Sequence 
Protein Accession #: Eos seq 

1 11 21 31 41 51 

OA I I I I I 1 

OKJ MRILKRFLAC IQLLCVCRJLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 120 

FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQI YCFDA DRFSSFEEAV KGKGKLRALS 180 

ILFEVGTEEN LDFKAI IDGV ESVSRFGRQA ALDPFIUiNL LPNSTDKYYI YNGSLTSPPC 240 

TDTVDWIVFK DTVSISESQI* AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300 
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GSKTVLRSPH 
ENISQGYIFS 
TAQPDVGSGR 
TEVTPHAFTP 
LVIVSALTFI 
IKHFPKHVAD 
KLAQLAEKDG 
NLVEKGRRKC 
GRWTQYHYT 
SKLQQIQHEG 
YVNALLIPGP 
SSLSGEGTDY 
AEDEFVYWPN 
CPKWPNPDSP 
SVDVYQVAKM 
AESLESLV 



CSSEPENVQA 
LGAILNNLLP 
EEEGKDIEEG 
GKGDVPNTSL 
MNLSGTAESL 
SENPETITYD 
ESFLQTNYTE 
SSRQQDLVST 
CLWLVGILI 
LHASSGFTEE 
KLTDYINANY 
DQYWPADGSE 
QWPDMGVPEY 
TVNIFGFLKH 
AGKTKLEKQF 
INASYIMGYY 
KDEPINCESF 
ISKTFELISV 
INLMRPGVFA 



DPENYTSLLV 
NMSYVLQIVA 
AIVNPGRDSA 
NSTSQPVTKL 
NTVSITEYEE 
VLIPESARNA 
IRVDESEKTT 
VNWYSQTTQ 
YWRKCFQTAH 
FEEVQSCTVD 
VDGYNRPKAY 
EYGNFLVTQK 
SLPVLTFVRK 
IRSQRNYLVQ 
QLLSQSNIQQ 
QSNEFIITQH 
KVTLMAEEHK 
IKEEAANRDG 
DIEQYQFLYK 



TWERPRWYD 
ICTNGLYGKY 
TNQIRKKEPQ 
ATEKDISLTS 
ESLLTSFKLD 
SEDSTSSGSE 
KSFSAGPVMS 
PVYNEASNSS 
FYLEDSTSPR 
LGITADSSNH 
IAAQGPLKST 
SVQVLAYYTV 
AAYAKRHAVG 
TEEQYVFIHD 
SDYSAALKQC 
PLLHTIKDFW 
CLSNEEKLII 
PMIVHDEHGG 
VILSLVSTRQ 



TMIEKFAVLY 
SDQLIVDMPT 
ISTTTHYNRI 
QTVTELPPHT 
TGAEDSSGSS 
ESLKDPSMEG 
QGPSVTDLEM 
HESRIGLAEG 
VISTPPTPIF 
PDNKHKNRYI 
AEDFWRMIWB 
RNPTLRNTKI 
PWVHCSAGV 
TLVEAILSKE 



RMIWDHNAQL 
QDFILEATQD 
VTAGTFCALT 



QQLDGEDQTK 
DNPELDLFPE 
GTKYNEAKTN 
VEGTSASLND 
PATSAIPFIS 
NVWFPSSTDI 
PHYSTFAYFP 
LESEKKAVIP 
PISDDVGAIP 
NIVAYDHSRV 
HNVEVIVMIT 
KKGSQKGRPS 
GRTGTYIVLD 
TEVLDSHIHA 
IPVERSRVGI 
WMIPDGQNM 
DYVLEVRHFQ 
TLMHQLEKEN 
NGAALPDGNI 



360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1448 



Seq ID NO: C312 Protein Sequence 
Protein Accession #: XPJ)31379 



MRILKRFLAC 
QSPINIDEDL 
FKASKITFHW 
ILFEVGTEEN 
TDTVDWIVFK 
TGKEEIHEAV 
HEFLTDGYQD 
LIGTEEI IKE 
RSPTRGSEFS 
GSKTVLRSPH 
ENISQGYIFS 
TAQPDVGSGR 
TEVTPHAFTP 
LNTTPAASSS 
ILPQVTSATE 
KTLMFSQVEP 
SLFSGPSHIP 
PVSVAEFTYT 
LNASLQBTSV 
KPVLSANSEP 
AVPSDPILVE 
TISYASEKYE 
EPLNTLINKL 
PHRDGSVTST 
KCHSCSSYRE 
SPGKSPSANG 
ENETSTDFSF 
HESRIGLAEG 
VISTPPTPIF 
TADSSNHPDN 
QGPLKSTAED 
VLAYYTVRNF 
AKRHAVGPW 
QYVFIHDTLV 
SAALKQCNRE 
HTIKDFWRMI 
NEEKLIIQDF 
VHDEHGGVTA 



11 

I " 

IQLLCVCRLD 
TQVNVNLKKL 
GKCNMSSDGS 
LDFKAIIDGV 
DTVSISESQL 
CSSEPENVQA 
LGAILNNLLP 



GKGDVPNTSL 
MNLSGTAESL 
SENPETITYD 
ESFLQTNYTE 
SSRQQDLVST 
DSALHATPVF 
SDKVPLHASL 
PSSDAMMHAR 
IPKSSLITPT 
TSVPGDDNKA 
SISSTKGMFP 
ASSDPASSEM 
TPKVDKISST 
PVLLKSESSH 
IHSDEILTST 
KLLFPSKATS 
SQEKVMNDSD 
LSQKHNDGKE 
ADTNEKDADG 
LESEKKAVIP 
PISDDVGAIP 
KHKNRYINIV 
FWRMIWEHNV 
TLRNTKIKKG 



EAILSKETEV 
KNRTSSIIFV 
WDHNAQLWM 
ILEATQDDYV 
GTFCALTTLM 
PSTSLDSNGA 



21 
I 

WANGYYRQQR 
KFQGWDKTSL 
EHSLBGQKFP 
ESVSRFGKQA 
AVFCEVLTMQ 
DPENYTSLLV 
NMSYVLQIVA 
AIVNPGRDSA 
NSTSQPVTKL 
NTVSITEYEE 
VLIPESARNA 
IRVDESEKTT 
VNWYSQTTQ 
PSVDVSFESI 
PVAGGDLLLE 
SSGPEPSYAL 
ASLLQPTHAL 
LSKSEIIYGN 
GSLAHTTTKV 
LSPSTQLLFY 
MLHLIVSNSA 
QWPSLYSND 
KSSVTGKVFA 
ELSHSAKSDA 
THENSLMDQN 
ENDIQTGSAL 
ILAAGDSEIT 
LVIVSALTFI 
IKHFPKHVAD 
AYDHSRVKLA 
BVIVMITNLV 
SQKGRPSGRV 
GTYIVLDSML 
LDSHIHAYVN 
ERSRVGISSL 
IPDGQNMAED 
LEVRHFQCPK 



ALPDGNIAES 



31 
I 

KLVEEIGWSY 
ENTFIHNTGK 
LEMQIYCFDA 
ALDPFILLNL 
QSGYVMLMDY 
TWERPRWYD 
ICTNGLYGKY 
TNQIRKKEPQ 
ATEKDISLTS 
ESLLTSFKLD 
SEDSTSSGSE 
KSFSAGPVMS 
PVYNGETPLQ 
LSSYDGAPLL 
PSLAQYSDVL 
SDNBGSQHIF 
SGDGEWSGAS 
ETELQIPSFN 
FDHEISQVPE 
ETSASFSTEV 
SSENMLHSTS 
ELFQTANLEI 
GIPTVASDTF 
GLVGGGEDGD 
NPISYSLSEN 
LPLSPESKAW 
PGFPQSPTSS 
CLWLVGILI 
LHASSGFTBB 
QLAEKDGKLT 
EKGRRKCDQY 
VTQYHYTQWP 
QQIQHEGTVN 
ALLIPGPAGK 
SGEGTDYINA 
EFVYWPNKDE 
VfPNPDSPISK 
VYQVAKMINL 
LESLV 



41 

I 

TGALNQKNWG 
TVEINLTNDY 



LPNSTDKYYI 
LQNNFREQQY 
TMIEKFAVLY 
SDQLIVDMPT 
ISTTTHYNRI 
QTVTELPPHT 
TGAEDSSGSS 
ESLKDPSMEG 
QGPSVTDLEM 
PSYSSEVFPL 
PFSSASFSSE 
STTHAASETL 
TVSYSSAIPV 
SDSEFLLPDT 
EMVYPSESTV 
NNFSVQPTHT 
LLQPSFQASD 
VPVFDVSPTS 
NQAHPPKGRH 
VSTDHSVPIG 
TDDDGDDDDD 
SEEDNRVTSV 
AVLTSDEBSG 
VTSENSEVFH 
YWRKCFQTAH 
FETLKEFYQE 
DYINANYVDG 
WPADGSEEYG 
DMGVPEYSLP 
IFGFLKHIRS 
TKLEKQFQLL 
SYIMGYYQSN 
PINCBSFKVT 
TFELISVIKE 
MRPGVFADIE 



51 
I 

KKYPTCNSPK 
RVSGGVSEMV 
KGKGKLRALS 
YNGSLTSPPC 
KFSRQVFSSY 
QQLDGEDQTK 
DNPELDLFPE 
GTKYNEAKTN 
VEGTSASLND 
PATSAIPFIS 
NVWFPSSTDI 
PHYSTFAYFP 
VTPLLLDNQI 
LFRHLHTVSQ 
EFGSBSGVLY 
HDSVGVTYQG 
DGLTALNISS 
MPNMYDNVNK 
VSQASGDTSL 
VDTLLKTVLP 
HMHSASLQGL 
VFATPVLSID 
NGHVAITAVS 
DRGSDGLSIH 



SGQGTSDSLN 



FYLEDSTSPR 
VQSCTVDLGI 
YNRPKAYIAA 
NFLVTQKSVQ 
VLTFVRKAAY 
QRNYLVQTEB 
SQSNIQQSDY 
EFIITQHPLL 
LMAEEHKCLS 
EAANRDGPMI 
QYQFLYKVIL 



Seq ID NO: C313 Protein Sequence 
Protein Accession ft: NP_002842 



1 
I 

MRILKRFLAC 
QSPINIDEDL 
FKASKITFHW 
ILFEVGTEEN 
TDTVDWIVFK 
TGKEEIHEAV 
HEFLTDGYQD 
LIGTEEI IKE 



11 
I 

IQLLCVCRLD 
TQVNVNLKKL 



21 
I 

WANGYYRQQR 
KFQGWDKTSL 



LDFKAIIDGV 
DTVSISESQL 
CSSEPENVQA 
LGAILNNLLP 



GSKTVLRSPH 
ENISQGYIFS 
TAQPDVGSGR 
TEVTPHAFTP 



GKGDVPNTSL 
MNLSGTAESL 
SENPETITYD 
ESFLQTNYTE 
SSRQQDLVST 



ESVSRFGKQA 
AVFCEVLTMQ 
DPENYTSLLV 
NMSYVLQIVA 
AIVNPGRDSA 
NSTSQPVTKL 
NTVSITEYEE 
VLIPESARNA 
IRVDESEKTT 
VNWYSQTTQ 



31 
I 

KLVEEIGWSY 
ENTFIHNTGK 
LEMQIYCFDA 
ALDPFILLNL 
QSGYVMLMDY 
TWERPRWYD 
ICTNGLYGKY 
TNQIRKKEPQ 
ATEKDISLTS 
ESLLTSFKLD 
SEDSTSSGSE 
KSFSAGPVMS 
PVYNAEASNS 



41 
I 

TGALNQKNWG 
TVEINLTNDY 



LPNSTDKYYI 
LQNNFREQQY 
TMIEKFAVLY 
SDQLIVDMPT 
ISTTTHYNRI 
QTVTELPPHT 
TGAEDSSGSS 
ESLKDPSMEG 
QGPSVTDLEM 
SHESRIGLAE 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2315 



51 
I 

KKYPTCNSPK 
RVSGGVSEMV 
KGKGKLRALS 
YNGSLTSPPC 
KFSRQVFSSY 
QQLDGEDQTK 
DNPELDLFPE 
GTKYNEAKTN 
VEGTSASLND 
PATSAIPFIS 
NVWFPSSTDI 
PHYSTFAYPP 
GLESEKKAVI 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 



1355 



WO 03/042661 



PCT/US02/36810 



5 

10 

15 

20 

25 

30 

35 

40 

45 
50 
55 
60 
65 
70 
75 
80 



PLVIVSALTF 
PIKHFPKHVA 
VAYDHSRVKL 
VEVIVMITNL 
GSQKGRPSGR 
TGTYIVLDSM 
VLDSHIHAYV 
VERSRVGISS 
MIPDGQNMAE 
VLEVRHFQCP 
MHQLEKENSV 
AALPDGNIAE 



ICLWLVGIL 
DLHASSGPTE 
AQLAEKDGKL 
VEKGRRKCDQ 
WTQYHYTQW 
LQQIQHEGTV 
NALLIPGPAG 
LSGEGTDYIN 
DEFVYWPNKD 
KWPNPDSPIS 
DVYQVAKMIN 
SLESLV 



IYWRKCFQTA 
EFETLKEFYQ 
TDYINANYVD 
YWPADGSEEY 
PDMGVPEYSL 
NIFGFLKHIR 
KTKLEKQFQL 
ASYIMGYYQS 
EPINCESFKV 
KTFELISVIK 
LMRPGVFADI 



HFYLEDSTSP 
EVQSCTVDLG 
GYNRPKAYIA 
GNFLVTQKSV 
PVLTFVRKAA 
SQRNYLVQTE 
LSQSNIQQSD 
NEFIITQHPL 
TLMAEEHKCL 
EEAANRDGPM 
EQYQFLYKVI 



RVISTPPTPI 
ITADSSNHPD 
AQGPLKSTAE 
QVLAYYTVRN 
YAKRHAVGPV 
EQYVFIHDTL 
YSAALKQCNR 
LHTIKDFWRM 
SNEEKLI IQD 
IVHDEKGGVT 
LSLVSTRQEE 



FPISDDVGAI 
NKHKNRYINI 
DFWRMIWEHN 
FTLRNTKIKK 
WHCSAGVGR 
VEAILSKETE 
EKNRTSSIIP 
IWDHNAQLW 
FILEATQDDY 
AGTFCALTTL 
NPSTSLDSNG 



Seq ID NO: C314 Protein Sequence 
Protein Accession ft: Eos sequence 



1 
I 

MRILKRFLAC 
QSPINIDEDL 
MVFKASKITF 
LSILFEVGT5 
PCTDTDWIVF 
YTGKEEIHEA 
KHEFLTDGYQ 
ELIGTEEIIK 
NRSPTRGSEF 
DGSKTVLRSP 
SENISQGYIF 
ITAQPDVGSG 
PTEVTPRAFT 
PLVIVSALTF 
PIKHFPKHVA 
VAYDHSRVKL 
VEVIVMITNL 
GSQKGRPSGR 
TGTYIVLDSM 
VLDSHIHAYV 



MIPDGQNMAE 
VLEVRHFQCP 
MHQLEKENSV 
AALPDGNIAE 



11 

I 

IQLLCVCRLD 
TQVNVNLKKL 
HWGKCNMSSD 
ENLDFKAIID 
KDTVSISESQ 
VCSSEPENVQ 
DLGAILNNLL 
EEEEGKDIEE 
SGKGDVPNTS 
HMNLSGTAES 
SSENPETITY 
RESFLQTNYT 
PSSRQQDLVS 
ICLWLVGIL 
DLHASSGFTE 
AQLAEKDGKL 
VEKGRRKCDQ 
WTQYHYTQW 
LQQIQHEGTV 
NALLIPGPAG 
LSGEGTDYIN 
DEFVYWPNKD 
KWPNPDSPIS 
DVYQVAKMIN 
SLESLV 



21 
I 

WANGYYRQQR 
KFQGWDKTSL 
GSEHSLEGQK 
GVESVSRFGK 
LAVFCEVLTM 
ADPENYTSLL 
PNMSYVLQIV 
GAIVNPGRDS 
LNSTSQPVTK 
LNTVSITBYE 
DVLIPESARN 
EIRVDESEKT 
TVNWYSQTT 
IYWRKCFQTA 
EFETLKEFYQ 
TDYINANYVD 
YWPADGSEEY 
PDMGVPEYSL 
NIFGFLKHIR 
KTKLEKQFQL 
ASYIMGYYQS 
EPINCESFKV 
KTFELISVIK 
LMRPGVFADI 



31 
I 

KLVEEIGWSY 
ENTFIHNTGK 
FPLEMQIYCF 
QAALDPFILL 
QQSGYVMLMD 
VTWERPRWY 
AICTNGLYGK 
ATNQIRKKEP 
LATEKDISLT 
EESLLTSFKL 



41 
I 

TGALNQKNWG 
TVEINIiTNDY 
DADRFSSFEE 
NLLPNSTDKY 
YLQNNFREQQ 
DTMIEKFAVL 
YSDQLIVDMP 
QISTTTHYNR 
SQTVTELPPH 
DTGAEDSSGS 



TKSFSAGPVM 
QPVYNEASNS 
HFYLEDSTSP 
EVQSCTVDLG 
GYNRPKAYIA 
GNFLVTQKSV 
PVLTFVRKAA 
SQRNYLVQTE 
LSQSNIQQSD 
NEFIITQHPL 
TLMAEEHKCL 
EEAANRDGPM 
EQYQFLYKVI 



SQGPSVTDLE 
SHESRIGLAE 
RVISTPPTPI 
ITADSSNHPD 
AQGPLKSTAE 
QVLAYYTVRN 
YAKRHAVGPV 
EQYVFIHDTL 
YSAALKQCNR 
LHTIKDFWRM 
SNEEKLI IQD 
IVHDEHGGVT 
LSLVSTRQEE 



51 
I 

KKYPTCNSPK 
RVSGGVSEMV 
AVKGKGKLRA 
YIYNGSLTSP 
YKFSRQVFSS 
YQQLDGEDQT 
TDNPELDLFP 
IGTKYNEAKT 
TVEGTSASLN 
SPATSAIPFI 
GNVWFPSSTD 
MPHYSTFAYF 
GLESEKKAVI 
FPISDDVGAI 
NKHKNRYINI 
DFWRMIWEHN 
FTLRNTKIKK 
WHCSAGVGR 
VEAILSKETE 
EKNRTSSIIP 
IWDHNAQLW 
FILEATQDDY 
AGTFCALTTL 
NPSTSLDSNG 



840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

14S6 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

10B0 

1140 

1200 

1260 

1320 

1380 

1440 

1456 



Seq ID NO; C315 Protein Sequence 
Protein Accession Eos sequence 



1 
I 

MRILKRFLAC 
QSPINIDEDL 
FKASKITFHW 
ILFEVGTEEN 
TDTVDWIVFK 
TGKEEIHEAV 
HEFLTDGYQD 
LIGTEEIIKE 
RSPTRGSEFS 
GSKTVLRSPH 
ENISQGYIFS 
TAQPDVGSGR 
TEVTPHAFTP 
LVXVSALTFI 
I KHFPKHVAD 
AYDHSRVKLA 
EVIVMITNLV 
SQKGRPSGRV 
GTYIVLDSML 
LDSHIHAYVN 
SRVAGTILLS 
YIMGYYQSNE 
INCESFKVTL 
FELISVIKEE 
RPGVFADIEQ 



11 
I 

IQLLCVCRLD 
TQVNVNLKKL 
GKCNMSSDGS 
LDFKAIIDGV 
DTVSISESQL 



LGAILNNLLP 
EEEGKDIEEG 
GKGDVPNTSL 
MNLSGTAESL 
SENPETITYD 
ESFLQTNYTE 
SSRQQDLVST 
CLWLVGILI 
LHASSGFTEE 
QLAEKDGKLT 
EKGRRKCDQY 
VTQYHYTQWP 
QQIQHEGTVN 
ALLIPGPAGK 
QSNIQQSDYS 
FIITQHPLLH 
MAEEHKCLSN 
AANRDGPMIV 
YQPLYKVILS 



21 
I 

WANGYYRQQR 
KFQGWDKTSL 
EHSLEGQKFP 
BSVSRFGKQA 
AVFCEVLTMQ 
DPENYTSLLV 
NMSYVLQIVA 
AIVNPGRDSA 
NSTSQPVTKL 
NTVSITEYEE 
VLIPESARNA 
IRVDESEKTT 
VNWYSQTTQ 
YWRKCFQTAH 
FETLKEFYQE 
DYINANYVDG 
WPADGSEEYG 
DMGVPEYSLP 
IFGFLKHIRS 
TKLEKQFQGL 
AALKQCNREK 
TIKDFWRMIW 
EEKLI IQDFI 
HDEHGGVTAG 
LVGTRQEENP 



31 

I 

KLVEEIGWSY 
ENTFIHNTGK 
LEMQIYCFDA 
ALDPFILLNL 
QSGYVMLMDY 
TWBRPRWYD 
ICTNGLYGKY 
TNQIRKKEPQ 
ATEKDISLTS 
ESLLTSFKLD 



41 
I 

TGALNQKNWG 
TVEINLTNDY 
DRFSSFEEAV 
LPNSTDKYYI 
LQNNFREQQY 
TMIEKFAVLY 
SDQLIVDMPT 
ISTTTHYNRI 
QTVTELPPHT 



KSFSAGPVMS 
PVYNEASNSS 
FYLEDSTSPR 
VQSCTVDLGI 
YNRPKAYIAA 
NFLVTQKSVQ 
VLTFVRKAAY 
QRNYLVQTEE 
TLSPRLECRG 
NRTSSIIPVE 
DHNAQLWMI 
LEATQDDYVL 
TFCALTTLMH 
STSLDSNGAA 



QGPSVTDLEM 
HESRIGLAEG 
VISTPPTPIF 
TADSSNHPDN 
QGPLKSTAED 
VLAYYTVRNF 
AKRHAVGPW 
QYVFIHDTLV 
TISAHCNLPL 
RSRVGISSLS 
PDGQNMAEDB 
EVRHFQCPKW 
QLEKENSVDV 
LPDGNIAESL 



51 
I 

KKYPTCNSPK 
RVSGGVSEMV 
KGKGKLRALS 
YNGSLTSPPC 
KFSRQVFSSY 
QQLDGEDQTK 
DNPELDLFPE 
GTKYNEAKTN 
VEGTSASLND 
PATSAIPFIS 
NVWFPSSTDI 
PHYSTFAYFP 
LESEKKAVIP 
PISDDVGAIP 
KHKNRYINIV 
FWRMIWEHNV 
TLRNTKIKKG 
VHCSAGVGRT 
EAILSKETEV 
PGLTDPPTSA 
GEGTDYINAS 
FVYWPNKDEP 
PNPDSPISKT 
YQVAKMINLM 
ESLV 



Seq ID NO: C316 Protein Sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 
QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 
FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 
ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 

1356 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 
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1080 
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1200 
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1440 

1494 



60 
120 
180 
240 



WO 03/042661 



TDTVDWIVFK 
TGKEEIHEAV 
HEFLTDGYQD 
ESRIGLAEGL 
ISTPPTPIFP 
ADSSNHPDNK 
GPLKSTAEDP 
LAYYTVRNFT 
KRHAVGPVW 
YVFIHDTLVE 
AALKQCNREK 
TIKDFWRMIW 
EEKLIIQDFI 
HDEHGGVTAG 
LVSTRQEENP 



DTVSISESQL 
CSSEPENVQA 
LGAILNNLLP 
ESEKKAVIPL 
ISDDVGAIPI 
HKNRYINIVA 
WRMIWEHNVE 
LRNTKIKKGS 
HCSAGVGRTG 
AILSKETEVL 
NRTSSIIPVE 
DHNAQLWMI 
LBATQDDYVL 
TFCALTTLMH 
STSLDSNGAA 



AVFCEVLTMQ 
DPENYTSLLV 
NMSYVLQIVA 
VIVSALTFIC 
KHFPKHVADL 
YDHSRVKLAQ 
VIVMITNLVE 
QKGRPSGRW 
TYIVLDSMLQ 
DSHIHAYVNA 
RSRVGISSLS 
PDGQNMAEDE 
EVRHFQCPKW 
QLEKENSVDV 
LPDGNIAESL 



QSGYVMLMDY 
TWERPRWYD 
ICTNGLYGKY 
LWLVGILIY 
HASSGFTEEF 
LAEKDGKLTD 
KGRRKCDQYW 
TQYHYTQWPD 
QIQHEGTVNI 
LLIPGPAGKT 
GEGTDYINAS 
PVYWPNKDEP 
PNPDSPISKT 
YQVAKMINLM 



LQNNFREQQY 
TMIEKFAVLY 
SDQLIVDMPT 
WRKCFQTAHP 
ETLKEFYQEV 
YINANYVDGY 
PADGSEEYGN 
MGVPEYSLPV 
FGPLKHIRSQ 
KLEKQFQLLS 
YIMGYYQSNB 
INCESFKVTL 
FELISVIKEE 
RPGVFADIEQ 



KFSRQVFSSY 
QQLDGEDQTK 
DNPEASNSSH 
YLEDSTSPRV 
QSCTVDLGIT 
NRPKAYIAAQ 
PLVTQKSVQV 
LTFVRKAAYA 
RNYLVQTEEQ 
QSNIQQSDYS 
FIITQHPLLH 
MAEEHKCLSN 
AANRDGPMIV 
YQFLYKVILS 



Seq ID NO: C317 Protein Sequence 
Protein Accession ft : Eos sequence 



1 
I 

MRILKRFLAC 
QSPINIDEDL 
FKASKITFHW 
ILFEVGTEEN 
TDTVDWIVFK 
TGKEEIHEAV 
HEFLTDGYQD 
LIGTEEIIKE 



11 
1 

IQLLCVCRLD 
TQVNVNLKKL 



21 
I 

WANGYYRQQR 
KFQGWDKTSL 



LDFKAI IDGV 
DTVSISESQL 
CSSEPENVQA 
LGAILNNIiliP 



GSKTVLRSPH 
ENISQGYIFS 
TAQPDVGSGR 
TEVTPHAFIP 
LVIVSALTFI 
IKHFPKHVAD 
AHDHSRVKLA 
EVIVMITNLV 
SQKGRPSGRV 
GTYIVLDSML 
LDSHIHAYVN 
SRV ART ILLS 
YIMGYYQSNB 
INCESFKVTL 
FELISVIKEE 
RPGVFADIEQ 



GKGDVPNTSL 
MNLSGTAESL 
SBNPETITYD 
ESFLQTNYTE 
SSRQQDLVST 
CLWLVGILI 
LHASSGFTEE 
QLAEKDGKLT 
EKGRRKCDQY 
VTQYHYTQWP 
QQIQHEGTVN 
ALLIPGPAGK 
QSNIQQSDYS 
FIITQHPLLH 
MAEEHKCLSN 
AANRDGPMTV 
YQFLYKVILS 



ESVSRFGKQA 
AVFCEVLTMQ 
DPENYTSLLV 
NMSYVLQIVA 
AIVNPGRDSA 
NSTSQPVTKL 
NTVSITEYEB 
VLIPESARNA 
IRVDESEKTT 
VNWYSQTTQ 
YWRKCFQTAH 
FETLKEFYQE 
DYINANYVDG 
WPADGSEBYG 
DMGVPEYSLP 
IFGFLKHIRS 
TKLEKQFQGL 
AALKQCNREK 
TIKDFWRMIW 
EEKLIIQDFI 
HDEHGGVTAG 
LVSTRQEENP 



31 
I 

KLVEEIGWSY 
ENTFIHNTGK 
LEMQIYCFDA 
ALDPPILLNL 
QSGYVMLMDY 
TWERPRWYD 
ICTNGLYGKY 
TNQIRKKEPQ 
ATEKDISLTS 
ESLLTSFKLD 
SEDSTSSGSE 
KSFSAGPVMS 
PVYNEASNSS 
FYLEDSTSPR 
VQSCTVDLGI 
YNRPKAYIAA 
NFLVTQKSVQ 
VLTFVRKAAY 
QRNYLVQTEE 
TLSPRLECRG 
NRTSSIIPVE 
DHNAQLWMI 
LEATQDDYVL 
TFCALTTLMH 
STSLDSNGAA 



41 
I 

TGALNQKNWG 
TVEINLTNDY 
DRFSSFEEAV 
LPNSTDKYYI 
LQNNFREQQY 
TMIEKFAVLY 
SDQLIVDMPT 
ISTTTHYNRI 
QTVTELPPHT 
TGAEDSSGSS 
ESLKDPSMEG 
QGPSVTDLEM 
HESRIGLAEG 
VISTPPTPIF 
TADSSNHPDN 
QGPLKSTAED 
VLAYYTVRNF 
AKRHAVGPW 
QYVFIHDTLV 
TISAHCNLPL 
RSRVGISSLS 
PDGQNMAEDB 
EVRHFQCPKW 
QLEKENSVDV 
LPDGNIAESL 



51 
I 

KKYPTCNSPK 
RVSGGVSEMV 
KGKGKLRALS 
YNGSLTSPPC 
KFSRQVFSSY 
QQLDGEDQTK 
DNPELDLFPE 
GTKYNEAKTN 
VEGTSASLND 
PATSAIPFIS 
NVWFPSSTDI 
PHYSTFAYFP 
LESEKKAVIP 
PISDDVGAIP 
KHKNRYINIV 
FWRMIWEHNV 
TLRNTKIKKG 
VHCSAGVGRT 
EAILSKETEV 
PGLTDPPTSA 
GEGTDYINAS 
PVYWPNKDEP 
PNPDSPISKT 
YQVAKMINLM 
ESL 



300 

360 

420 

480 

S40 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1113 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1493 



Seq ID NO: C318 Protein Sequence 
Protein Accession #: Eos sequence 



MRILKRFLAC 
QSPINIDEDL 
FKASKITFHW 
ILFEVGTEEN 
TDTVDWIVFK 
TGKEEIHEAV 
HEFLTDGYQD 
LIGTEEIIKE 
RSPTRGSEFS 
GSKTVLRSPH 
ENISQGYIFS 
TAQPDVGSGR 
TEVTPHAFTP 
LVIVSALTFI 
IKHFPKHVAD 
AYDHSRVKLA 
EVIVMITNLV 
SQKGRPSGRV 
GTYIVLDSML 
LDSHIHAYVN 
ERSRVGISSL 
IPDGQNMAED 
DGRNFLCSDN 



11 
I 

IQLLCVCRLD 
TQVNVNLKKL 



WANGYYRQQR 
KFQGWDKTSL 



LDFKAI IDGV 
DTVSISESQL 
CSSEPENVQA 
LGAILNNLLP 



GKGDVPNTSL 
MNLSGTAESL 
SENPETITYD 
ESFLQTNYTE 
SSRQQDLVST 
CLWLVGILI 
LHASSGFTEE 
QLAEKDGKLT 
EKGRRKCDQY 
VTQYHYTQWP 
QQIQHEGTVN 
ALLIPGPAGK 
SGEGTDYINA 
EFVYWPNKDE 
PYAPTRKRKF 



ESVSRFGKQA 
AVFCEVLTMQ 
DPENYTSLLV 
NMSYVLQIVA 
AIVNPGRDSA 
NSTSQPVTKL 
NTVSITEYEB 
VLIPESARNA 
IRVDESEKTT 
VNWYSQTTQ 
YWRKCFQTAH 
FETLKEFYQE 
DYINANYVDG 
WPADGSEBYG 
DMGVPEYSLP 
IFGFLKHIRS 
TKLEKQFQLL 
SYIMGYYQSN 
PINCESFKVT 
RGCLPGSQDD 



31 
I 

KLVEEIGWSY 
ENTFIHNTGK 
LEMQIYCFDA 
ALDPPILLNL 
QSGYVMLMDY 
TWERPRWYD 
ICTNGLYGKY 
TNQIRKKEPQ 
ATEKDISLTS 
ESLLTSFKLD 



TGALNQKNWG 
TVEINLTNDY 



KSFSAGPVMS 
PVYNEASNSS 
FYLEDSTSPR 
VQSCTVDLGI 
YNRPKAYIAA 
NFLVTQKSVQ 
VLTFVRKAAY 
QRNYLVQTEE 
SQSNIQQSDY 
EFIITQHPLL 
LMAEEHKCLS 



LPNSTDKYYI 
LQNNFREQQY 
TMIEKFAVLY 
SDQLIVDMPT 
ISTTTHYNRI 
QTVTELPPHT 
TGAEDSSGSS 
ESLKDPSMEG 
QGPSVTDLEM 
HESRIGLAEG 
VISTPPTPIF 
TADSSNHPDN 
QGPLKSTAED 
VLAYYTVRNF 
AKRHAVGPW 
QYVFIHDTLV 
SAALKQCNRB 
HTIKDFWRMI 
NEEKLIIQDF 



51 
I 

KKYPTCNSPK 
RVSGGVSEMV 
KGKGKLRALS 
YNGSLTSPPC 
KFSRQVFSSY 
QQLDGEDQTK 
DNPELDLFPE 
GTKYNEAKTN 
VEGTSASLND 
PATSAIPFIS 
NVWFPSSTDI 
PHYSTFAYFP 
LESEKKAVIP 
PISDDVGAIP 
KHKNRYINIV 
FWRMIWEHNV 
TLRNTKIKKG 
VHCSAGVGRT 
EAILSKETEV 
KNRTSSIIPV 
WDHNAQLWM 
ILEATQAWRS 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1359 



Seq ID NO: C319 Protein Sequence 
Protein Accession #: XP 002914.4 



1 11 21 31 41 51 

I I I I I I 

MKDIDIGKEY IIPSPGYRSV RERTSTSGTH RDREDSKFRR TRPLECQDAL ETAARAEGLS 
LDASMHSQLR ILDEEHPKGK YHHGLSALKP IRTTSKHQHP VDNAGLFSCM TFSWLSSLAR 
VAHKKGELSM EDVWSLSKHE SSDVNCRRLB RLWQEELNEV GPDAASLRRV VWIFCRTRLI 
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LSIVCLMITQ LAGFSGPAFM VKHLLEYTQA TESNLQYSLL LVLGLLLTEI VRSWSLALTW 240 

ALNYRTGVRL RGAILTMAFK KILKLKNIKE KSLGELINIC SNDGQRMFEA AAVGSLLAGG 300 

PWAILGMIY NVIILGPTGF LGSAVFILFY PAMMFASRLT AYFRRKCVAA TDERVQKMNE 360 

VLTYIKFIKM YAWVKAFSQS VQKIREEERR ILEKAGYFQS ITVGVAPIW VIASWTFSV 420 

HMTLGFDLTA AQAFTWTVF NSMTFALKVT PFSVKSLSEA SVAVDRFKSL FLMBEVHMIK 480 

NKPASPHIKI EMKNATLAWD SSHSSIQNSP KLTPKMKKDK RASRGKKEKV RQLQRTEHQA S40 

VLAEQKGHLL LDSDERPSPE EEEGKHIHLG HLRLQRTLHS IDLEIQEGKL VGICGSVGSG 600 

KTSLISAILG QMTLLEGSIA ISGTFAYVAQ QAWILNATLR DNILFGKEYD EERYNSVLNS 660 

CCLRPDLAIL PSSDLTEIGE RGANLSGGQR QRISLARALY SDRSIYILDD PLSALDAHVG 720 

NHIFNSAIRK HLKSKTVLFV THQLQYLVDC DEVIFMKEGC ITERGTHEEL MNLNGDYATI 780 

FNNLLLGETP PVEINSKKET SGSQKKSQDK GPKTGSVKKE KAVKPEEGQL VQLEEKGQGS 840 

VPWSVYGVYI QAAGGPLAFL VIMALFMLNV GSTAFSTWWL SYNIKQGSGN TTVTRGNETS 900 

VSDSMKDNPH MQYYASIYAL SMAVMLILKA IRGWFVKGT LRASSRLHDE LFRRILRSPM 960 

KFFDTTPTGR ILNRFSKDMD EVDVRLPFQA EMFIQNVILV FFCVGMIAGV FPWFLVAVGP 1020 

LVILFSVLHI VSRVLIRELK RLDNITQSPF LSHITSSIQG LATIHAYNKG QEFLHRYQEL 1080 

LDDNQAPFFL FTCAMRWLAV RLDLISIALI TTTGLMIVLM HGQIPPAYAG LAISYAVQLT 1140 

GLFQPTVRLA SET EAR FT SV ERINHYIKTL SLEAPARIKN KAPSPDWPQE GEVTFENAEM 1200 

RYRENLPLVL KKVSFTIKPK EKIGIVGRTG SGKSSLGMAL FRLVELSGGC IKIDGVRISD 1260 

IGLADLRSKL SIIPQEPVLF SGTVRSNLDP FNQYTEDQIW DALERTHMKE CIAQLPLKLE 1320 

SEVMENGDNF SVGERQLLCI ARALLRHCKI LILDEATAAM DTETDLLIQE TIREAFADCT 1380 

MLTIAHRLHT VLGSDRIMVL AQGQWEFDT PSVLLSNDSS RFYAMFAAAE NKVAVKG 1437 

Seq ID NO: C320 Protein Sequence 
Protein Accession #: NP_005679.1 

1 11 21 31 41 51 

t I I I I I 

MKDIDIGKEY IIPSPGYRSV RERTSTSGTH RDREDSKFRR TRPLECQDAL ETAARAEGIiS 60 

LDASMHSQLR ILDEEHPKGK YHHGLSALKP IRTTSKHQHP VDNAGLFSCM TFSWLSSLAR 120 

VAHKKGELSM EDVWSLSKHE SSDVNCRRLB RLWQEELNEV GPDAASLRRV VWIFCRTRLI 180 

LSIVCLMITQ LAGFSGPAFM VKHLLEYTQA TESNLQYSLL LVLGLLLTEI VRSWSLALTW 240 

ALNYRTGVRL RGAILTMAFK KILKLKNIKE KSLGELINIC SNDGQRMFEA AAVGSLLAGG 300 

PWAILGMIY NVIILGPTGF LGSAVFILFY PAMMFASRLT AYFRRKCVAA TDERVQKMNE 360 

VLTYIKFIKM YAWVKAFSQS VQKIREEERR ILEKAGYFQG ITVGVAPIW VIASWTFSV 420 

HMTLGFDLTA AQAFTWTVF NSMTFALKVT PFSVKSLSEA SVAVDRFKSL FLMEEVHMIK 480 

NKPASPHIKI EMKNATLAWD SSHSSIQNSP KLTPKMKKDK RASRGKKEKV RQLQRTEHQA 540 

VLAEQKGHLL LDSDERPSPE EEEGKHIHLG HLRLQRTLHS IDLEIQEGKL VGICGSVGSG 600 

KTSLISAILG QMTLLEGSIA ISGTFAYVAQ QAWILNATLR DNILFGKEYD EERYNSVLNS 660 

CCLRPDLAIL PSSDLTEIGE RGANLSGGQR QRISLARALY SDRSIYILDD PLSALDAHVG 720 

NHIFNSAIRK HLKSKTVLFV THQLQYLVDC DEVIFMKEGC ITERGTHEEL MNLNGDYATI 780 

FNNLLLGETP PVEINSKKET SGSQKKSQDK GPKTGSVKKE KAVKPEEGQL VQLEEKGQGS 840 

VPWSVYGVYI QAAGGPLAFL VIMALFMLNV GSTAFSTWWL SYWIKQGSGN TTVTRGNETS 90 0 

VSDSMKDNPH MQYYASIYAL SMAVMLILKA IRGWFVKGT LRASSRLHDE LFRRILRSPM 960 

KFFDTTPTGR ILNRFSKDMD EVDVRLPFQA EMFIQNVILV FFCVGMIAGV FPWFLVAVGP 1020 

LVILFSVLHI VSRVLIRELK RLDNITQSPF LSHITSSIQG LATIHAYNKG QEFLHRYQEL 1080 

LDDNQAPFFL FTCAMRWLAV RLDLISIALI TTTGLMIVLM HGQIPPAYAG LAISYAVQLT 1140 

GLFQFTVRLA SETEARFTSV ERINHYIKTL SLEAPARIKN KAPSPDWPQE GEVTFENAEM 1200 

RYRENLPLVL KKVSFTIKPK EKIGIVGRTG SGKSSLGMAL FRLVELSGGC IKIDGVRISD 1260 

IGLADLRSKL SIIPQEPVLF SGTVRSNLDP FNQYTEDQIW DALERTHMKE CIAQLPLKLE 1320 

SEVMENGDNF SVGERQLLCI ARALLRHCKI LILDEATAAM DTETDLLIQE TIREAFADCT 1380 

MLTIAHRLHT VLGSDRIMVL AQGQWEFDT PSVLLSNDSS RFYAMFAAAE NKVAVKG 1437 

Seq ID NO: C321 Protein Sequence 
Protein Accession #x NP_005553.1 

1 11 21 31 41 51 

I I I I I I 

MPALWLGCCL CFSLLLPAAR ATSRREVCDC NGKSRQCIFD RELHRQTGNG FRCLNCNDNT 60 

DGIHCEKCKN GFYRHRERDR CLPCNCNSKG SLSARCDNSG RCSCKPGVTG ARCDRCLPGF 120 

HMLTDAGCTQ DQRLLDSKCD CDPAGIAGPC DAGRCVCKPA VTGBRCDRCR SGYYNLDGGN 180 

PEGCTQCFCY GHSASCRSSA EYSVHKITST FHQDVDGWKA VQRNGSPAKL QWSQRHQDVF 240 

SSAQRLDPVY FVAPAKFLGN QQVSYGQSLS FDYRVDRGGR HPSAHDVILB GAGLRITAPL 300 

MPLGKTLPCG LTKTYTFRLN EHPSNNWSPQ LSYFEYRRLL RNLTALRIRA TYGEYSTGYI 360 

DNVTLISARP VSGAPAPWVE QCTCPVGYKG QFCQDCASGY KRDSARLGPF GTCIPCNCGG 420 

GGACDPDTGD CYSGDENPDI ECADCPIGFY NDPHDPRSCK PCPCHNGFSC SVMPETEEW 480 

CNNCPPGVTG ARCELCADGY FGDPFGEHGP VRPCQPCQCN NNVDPSASGN CDRLTGRCLK 540 

CIHNTAGIYC DQCKAGYFGD PLAPNPADKC RACNCNPMGS EPVGCRSDGT CVCKPGFGGP 600 

NCEHGAFSCP ACYNQVKIQM DQFMQQLQRM EALISKAQGG DGWPDTELE GRMQQAEQAL 660 

QDILRDAQIS EGASRSLGLQ LAKVRSQENS YQSRLDDLKM TVERVRALGS QYQNRVRDTH 720 

RLITQMQLSL AESEASLGNT NIPASDHYVG PNGFKSLAQE ATRLAESHVE SASNMEQLTR 780 

ETEDYSKQAL SLVRKALHEG VGSGSGSPDG AWQGLVEKL EKTKSLAQQL TREATQAEIE 840 

ADRSYQHSLR LLDSVSRLQG VSDQSFQVEB AKRIKQKADS LSTLVTRHMD EFKRTQKNLG 900 

NWKEEAQQLL QNGKSGREKS DQLLSRANLA KSRAQEALSM GNAT FY EVES ILKNLREFDL 960 

QVDNRKAEAE EAMKRLSYIS QKVSDASDKT QQAERALGSA AADAQRAKNG AGEALEISSE 1020 

IEQEIGSLNL EANVTADGAL AMEKGLASLK SEMREVEGEL BRKELBFDTN MDAVQMVITB 1080 

AQKVDTRAKN AGVTIQDTLN TLDGLLHLMD QPLSVDEEGL VLLEQKLSRA KTQINSQLRP 1140 

MMSELEERAR QQRGHLHLLE TSIDGILADV KNLENIRDNL PPGCYNTQAL EQQ 1193 

Seq ID NO: C322 Protein Sequence 
Protein Accession #: NP_066924.1 

1 11 21 31 41 51 

I I I I I I 

MANAGLQLLG FILAFLGWIG AIVSTALPQW RIYSYAGDNI VTAQAMYEGL WMSCVSQSTG 60 
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QIQCKVFDSL LNLSSTLQAT RALMWGILL GVIAIFVATV GMKCMKCLED DEVQKMRMAV 120 

IGGAIFLLAG LAILVATAWY GNRIVQEFYD PMTPVNARYE FGQALFTGWA AASLCLLGGA 180 

LLCCSCPRKT TSYPTPRPYP KPAPSSGKDY V 211 

Seq ID NO: C323 Protein Sequence 
Protein Accession ft: AAM77876 

1 11 21 31 41 51 

I I 1 I I I 

MSSWIRWHGP AMARLWGFCW LWGFWRAAF ACPTSCKCSA SRIWCSDPSP GIVAFPRLEP 60 

NSVDPENITE IFIANQKRLE I INEDDVEAY VGLRNLTIVD SGLKFVAHKA FLKNSNLQHI 120 

NFTRNKLTSL SRKHFRHLDL SELILVGNPP TCSCDIHWIK TLQEAKSSPD TQDLYCLNES 180 

SKNIPLANLQ IPNCGLPSAN LAAPNLTVEB GKSITLSCSV AGDPVPNMYW DVGNLVSKHM 240 

NETSHTQGSL RITNISSDDS GKQISCVAEN LVGEDQDSVN LTVHFAPTIT FLESPTSDHH 300 

WCIPFTVKGN PKPALQWFYN GAILNESKYI CTKIHVTNHT EYHGCLQLDN PTHMNNGDYT 360 

L1AKNEYGKD EKQISAHFMG WPGIDDGANP NYPDVIYEDY GTAANDIGDT TNRSNEIPST 420 

DVTDKTGREH LSVYAWVIA SWGFCLLVM LFLLKLARHS KFGMKGFVLF HKIPLDG 477 

Seq ID NO: C324 Protein Sequence 
Protein Accession ft: NP_006171.1 

1 11 21 31 41 SI 

I I I 1 I I 

MSSWIRWHGP AMARLWGFCW LWGFWRAAF ACPTSCKCSA SRIWCSDPSP GIVAFPRLEP 60 

NSVDPENITE IFIANQKRLE I INEDDVEAY VGLRNLTIVD SGLKFVAHKA FLKNSNLQHI 120 

NFTRNKLTSL SRKHFRHLDL SELILVGNPP TCSCDIMWIK TLQEAKSSPD TQDLYCLNES 180 

SKNIPLANLQ IPNCGLPSAN LAAPNLTVEE GKSITLSCSV AGDPVPNMYW DVGNLVSKHM 240 

NETSHTQGSL RITNISSDDS GKQISCVAEN LVGEDQDSVN LTVHFAPTIT FLESPTSDHH 300 

WCIPFTVKGN PKPALQWFYN GAILNESKYI CTKIHVTNHT EYHGCLQLDN PTHMNNGDYT 360 

LIAKNEYGKD EKQISAHFMG WPGIDDGANP NYPDVIYEDY GTAANDIGDT TNRSNEIPST 420 

DVTDKTGREH LSVYAWVIA SWGFCLLVM LFLLKLARHS KFGMKGPASV ISNDDDSASP 480 

LHHISNGSNT PSSSEGGPDA VIIGMTKIPV IENPQYFGIT NSQLKPDTFV QHIKRHNIVL 540 

KRBLGEGAFG KVFLAECYNL CPEQDKILVA VKTLKDASDN ARKDFHREAE LLTNLQHEHI 600 

VKPYGVCVEG DPLIMVFEYM KHGDLNKFLR AHGPDAVLMA EGNPPTELTQ SQMLHIAQQI 660 

AAGMVYLASQ HFVHRDLATR NCLVGENLLV KIGDFGMSRD VYSTDYYRVG GHTMLPIRWM 720 

PPESIMYRKF TTESDVWSLG WLWEIFTYG KQPWYQLSNN EVIECITQGR VLQRPRTCPQ 780 

EVYELMLGCW QREPHMRKNI KGIHTLLQNL AKASPVYLDI LG 822 

Seq ID NO: C325 Protein Sequence 
Protein Accession ft; Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MSSWIRWHGP AMARLWGFCW LWGFWRAAF ACPTSCKCSA SRIWCSDPSP GIVAFPRLEP 60 

NSVDPENITE IFIANQKRLE I INEDDVEAY VGLRNLTIVD SGLKFVAHKA FLKNSNLQHI 120 

NFTRNKLTSL SRKHFRHLDL SELILVGNPP TCSCDIMWIK TLQEAKSSPD TQDLYCLNES 180 

SKNIPLANLQ IPNCGLPSAN LAAPNLTVEE GKSITLSCSV AGDPVPNMYW DVGNLVSKHM 240 

NETSHTQGSL RITNISSDDS GKQISCVAEN LVGEDQDSVN LTVHFAPTIT FLESPTSDHH 300 

WCIPFTVKGN PKPALQWFYN GAILNESKYI CTKIHVTNHT EYHGCLQLDN PTHMNNGDYT 360 

LIAKNEYGKD EKQISAHFMG WPGIDDGANP NYPDVIYEDY GTAANDIGDT TNRSNEIPST 420 

DVTDKTGREH LSVYAWVIA SWGFCLLVM LFLLKLARHS KFGMKGFVLF HKIPLDG 477 

Seq ID NO: C326 Protein Sequence 
Protein Accession 8: NP_570843.1 

1 11 21 31 41 51 

I I I I I I 

MPLKHYLLLL VGCQAWGAGL AYHGCPSECT CSRASQVECT GARIVAVPTP LPWNAMSLQI 60 

LNTHITELNE SPFLNISALI ALRIEKNBLS RITPGAFRNL GSLRYLSLAN NKLQVLPIGL 120 

PQGLDSLESL LLSSNQLLQI QPAHFSQCSN LKELQLHGNH LEYIPDGAFD HLVGLTKLNL 180 

GKNSLTHISP RVFQHLGNLQ VLRLYENRLT DIPMGTFDGL VNLQELALQQ NQIGLLSPGL 240 

FHNNHNLQRL YLSNNHISQL PPSIFMQLPQ LNRLTLFGNS LKELSLGIFG PMPNLRELWL 30 0 

YDNHISSLPD NVFSNLRQLQ VLILSRNQIS FISPGAFNGL TELRELSLHT NALQDLDGNV 360 

FRMLANLQNI SLQNNRLRQL PGNIFANVNG LMAIQLQNNQ LENLPLGIFD HLGKLCELRL 420 

YDNPWRCDSD ILPLRNWLLL NQPRLGTDTV PVCFSPANVR GQSLIIINVN VAVPSVHVPE 480 

VPSYPBTPWY PDTPSYPDTT SVSSTTELTS PVEDYTDLTT IQVTDDRSVW GMTQAQSGLA 540 

IAAIVIGIVA LACSLAACVG CCCCKKRSQA VLMQMKAPNE C 581 

Seq ID NO: C327 Protein Sequence 
Protein Accession 8: NP_002649.1 

1 11 21 31 41 51 

I i I I I I 

MRALLARLLL CVLWSDSKG SNELHQVPSN CDCLNGGTCV SNKYFSNIHW CNCPKKFGGQ 60 

HCEIDKSKTC YEGNGHFYRG KASTDTMGRP CLPWNSATVL QQTYHAHRSD ALQLGLGKHN 120 

YCRNPDNRRR PWCYVQVGLK PLVQECMVHD CADGKKPSSP PEELKFQCGQ KTLRPRFKII 180 

GGEFTTIENQ PWFAAIYRRH RGGSVTYVCG GSLISPCWVI SATHCFIDYP KKEDYIVYLG 240 
RSRLNSNTQG EMKFEVENLI LHKDYSADTL AHHNDIALLK IRSKEGRCAQ PSRTIQTICL 300 
PSMYNDPQFG TSCEITGFGK ENSTDYLYPE QLKMTWKLI SHRECQQPHY YGSEVTTKML 360 
CAADPQWKTD SCQGDSGGPL VCSLQGRMTL TGIVSWGRGC ALKDKPGVYT RVSHFLPWIR 420 

SHTKEENGLA L 431 

Seq ID NO: C328 Protein Sequence 
Protein Accession ft: XP_0 872 54.1 
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1 11 21 31 41 51 

I I I I I I 

MQFRECSING MKYQEINGRL VPEGPTPDSS EGNLSYLSSL SHLNNLSHLT TSSSPRTSPE 60 

NETELIKEHD LFFKAVSLCH TVQ1SNVQTD CTGDGPWQSN LAPSQLEYYA SSPDEKALVE 120 

AAARIGIVFI GNSEETMEVK TLGKLERYKL LHILEFDSDR RRMSVIVQAP SGEKLLFAKG 180 

AESSILPKCI GGEIEKTRIH VDEPALKGLR TLCIAYRKFT SKEYEEIDKR IFEARTALQQ 240 

REEKLAAVFQ FIBKDLILLG ATAVEDRLQD KVRETIEALR MAGIKVWVLT GDKHETAVSV 300 

SLSCGHFHRT KNILELINQK SDSECAEQIiR QLARRITEDH VIQHGLWDG TSLSLAIiREH 360 

EKLFMEVCRN CSAVLCCRMA PLQKAKVIRL IKISPEKPIT LAVGDGANDV SMIQEAHVGI 420 

GIMGKEGRQA ARNSDYAIAR FKFLSKLLFV HGHFYYIRIA TLVQYFFYKN VCFITPQFLY 480 

QFYCLFSQQT LYDSVYLTLY NICFTSLPIL IYSLLEQHVD PHVLQNKPTL YRDISKNRLL 540 

SIKTFLYWTI LGFSHAFIFF FGSYLLIGKD TSLLGNGQMF GNWTFGTLVF TVMVITVTVK 600 

MALETHPWTW INHLVTWGSI IFYFVFSLFY GGILWPPLGS QNMYFVFIQL LSSGSAWFAI 660 

ILMWTCLFL DIIKKVFDRH LHPTSTEKAQ LTETNAGIKC LDSMCCFPEG EAACASVGRM 720 

LERVIGRCSP THISRSWSAS DPFYTNDRSI LTLSTMDSST C 761 



Seq ID NO: C329 Protein Sequence 
Protein Accession #: XPJ) 87461.1 

1 11 21 31 41 51 

I 1 I I I I 

MLPLLAALLA AACPLPPVRG GAADAPGLLG VPSNASVNAS SAASPSPRGC WPRRPPGPPS 60 

ARARRRRRRR RRLCNISVQR QMLSSLLVRW GRPRGFQCDL LLFSTNAHGR AFFAAAFHRV 120 

GPPLLIEHLG LAAGGAQQDL RLCVGCGWVR GRRTGRLRPA AAPSAAAATA GAPTALPAYP 180 

AAEPPGPLWL QGEPLHFCCL DFSLEELQGE PGWRLNRKPI ESTLVACFMT LVIWWSVAA 240 

LIWPVPIIAG FLPNGMEQRR TTASTTAATP AAVPAGTTAA AAAAAAAAAA AVTSGVATK 299 

Seq ID NO: C330 Protein Sequence 
Protein Accession #: XP_051522.2 

1 11 21 31 41 51 

I I 1 I I I 

MDLHLFDYSE PGNFSDISWP CNSSDCIWD TVMCPNMPNK SVLLYTLSFI YIFIFVIGMI 60 
ANSVWWVNI QAKTTGYDTH CYILNLAIAD LWWLTIPVW WSLVQHNQW PMGELTCKVT 120 
HLIFSINLFG SIFFLTCMSV DRYLSITYFT NTPSSRKKMV RRWCILVWL LAFCVSLPDT 180 
YYLKTVTSAS NNETYCRSFY PEHSIKEWLI GMELVSWLG FAVPFSI1AV FYFLLARAIS 240 
ASSDQEKHSS RKIIFSYVW FLVCWLPYHV AVLLDIFSIL HYIPFTCRLE HALFTALHVT 300 
QCLSLVHCCV NPVLYSFINR NYRYELMKAF I FKYSAKTGL TKLIDASRVS ETEYSALEQS 360 
TK 362 



Seq ID NO: C331 Protein Sequence 
Protein Accession #s NP_000341.1 

1 11 21 31 41 51 

I I 1 I I 1 

KGFVRQIQLL LWKNWTLRKR QKIRFWELV WPLSLPLVliI WLRNANPLYS HHECHFPNKA 60 

MPSAGMLPWL QGIFCNVNNP CFQSPTPGES PGIVSNYNNS ILARVYRDFQ ELLMNAPESQ 120 

HLGRIWTELH ILSQFMDTLR THPERIAGRG IRIRDILKDE ETLTLFLIKN IGLSDSWYL 180 

LINSOVRPEQ FAHGVPDIAL KDIACSEALL ERFI IFSQRR GAKTVRYALC SLSQGTLQWI 240 

EDTLYANVDF FKLFRVLPTL LDSRSQGINL RSWGGILSDM SPRIQEFIHR PSMQDLLWVT 300 

RPLMQNGGPE TFTKLMGILS DLLCGYPEGG GSRVLSFNWY EDNNYKAFLG IDSTRKDPIY 360 

SYDRRTTSFC NALIQSLESN PLTKIAWRAA KPLLMGKILY TPDSPAARRI LKNANSTFEE 420 

LEHVRKLVKA WEEVGPQIWY FFDNSTQMNM IRDTLGNPTV KDFLNRQLGE EGITAEAILN 480 

FLYKGPRESQ ADDMANFDWR DIFNITDRTL, RLVNQYLECL VLDKFESYND ETQLTQRALS S40 

LLEENMFWAG WFPDMYPWT SSLPPHVKYK IRMDIDWEK TNKIKDRYWD SGPRADPVED 600 

FRYIWGGFAY LQDMVEQGIT RSQVQAEAPV GIYLQQMPYP CFVDDSFMII LNRCFPIFMV 660 

LAWIYSVSMT VKSIVLEKEL RLKETLKNQG VSNAVIWCTW FLDSFSIMSM SIFLLTIFIM 720 

EGRILHYSDP PILFLFLLAF STATIMLCFL LSTPPSKASL AAACSGVIYF TLYLPHILCF 780 

AWQDRMTAEL KKAVSLLSPV AFGFGTEYLV RFEEQGLGLQ WSNIGNSPTE GDEFSFLLSM 840 

QMMLLDAACY GLLAWYLDQV FPGDYGTPLP WYFLLQESYW LSGEGCSTRE ERALEKTEPL 900 

TEETEDPEHP EGIHDSFFER EHPGWVPGVC VKNLVKIFEP CGRPAVDRLN ITFYENQITA 960 

FLGHNGAGKT TTLSILTGLL PPTSGTVLVG GRDIETSLDA VRQSLGMCPQ HNILFHHLTV 1020 

AEHMLFYAQL KGKSQEEAQL EMEAMLEDTG LHHKRNEEAQ DLSGGMQRKL SVAIAPVGDA 1080 

KWILDEPTS GVDPYSRRSI WDLLLKYRSG RTIIMPTHHM DEADHQGDRI AIIAQGRLYC 1140 

SGTPLFLKNC FGTGLYLTLV RKMKNIQSQR KGSEGTCSCS SKGFSTTCPA HVDDLTPEQV 1200 

LDGDVNELMD WLHHVPEAK LVECIGQELI FLLPNKNFKH RAYASLFREL EETLADLGLS 1260 

SFGISDTPLE EIFLKVTEDS DSGPLFAGGA QQKRENVNPR HPCLGPREKA GQTPQDSNVC 1320 

SPGAPAAHPE GQPPPEPECP GPQLNTGTQL VLQHVQALLV KRFQHTIRSH KDFLAQIVLP 1380 

ATFVFLALML SIVILPFGEY PALTLHPHIY GQQYTFFSMD EPGSEQFTVL ADVLLNKPGP 1440 

GNRCLKEGWL PEYPCGNSTP WKTPSVSPNI TQLFQKQKWT QVNPSPSCRC STREKLTMLP 1500 

ECPEGAGGLP PPQRTQRSTE ILQDLTDRNI SDFLVKTYPA LIRSSLKSKF WVNEQRYGGI 1560 

SIGGKLPWP ITGEALVGFL SDLGRIMNVS GGPITREASK EIPDFLKHLE TEDNIKVWFN 1620 

NKGWHALVSF LNVAHNAILR ASLPKDRSPE EYGITVISQP LNLTKEQLSE ITVLTTSVDA 1680 

WAICVIFSM SFVPASFVLY LIQERVNKSK HLQFISGVSP TTYWVTNFLW DIMNYSVSAG 1740 

LWGIFIGFQ KKAYTSPENL PALVALLLLY GWAVIEMMYP ASFLFDVPST AYVAI*SCANL 1800 

FIGINSSAIT FILELFDNNR TLLRFNAVLR KLLIVFPHFC LGRGLIDLAL SQAVTDVYAR 1860 
FGEEHSANPF HWDLIGKNLF AMWEGWYF LLTLLVQRHF FLSQWIAEPT KEPIVDEDDD 1920 
VAEERQRIIT GGNKTDILRL HELTKIYLGT SSPAVDRLCV GVRPGECFGL IjGVNGAGKTT 1980 
TFKMLTGDTT VTSGDATVAG KSILTNISEV HQNMGYCPQP DAIDELLTGR EHLYLYARLR 2040 
GVPAEEIEKV ANWSIKSLGL TVYADCLAGT YSGGNKRKLS TAIALIGCPP LVLLDEPTTG 2100 
MDPQARRMLW NVIVSIIRKG RAWLTSHSM EECEALCTRL AIMVKGAFRC MGTIQHLKSK 2160 
FGDGYIVTMK IKSPKDDLLP DLNPVEQFFQ GNFPGSVQRE RHYNMLQFQV SSSSLARIFQ 2220 
LLLSHKDSIiIi IEEYSVTOTT LDOVFVNFAK QQTESHDLPL HPRAAGASRQ AQD 2273 
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PCT/US02/36810 



Seq ID NO: C332 Protein Sequence 
Protein Accession ft: NP_006662.2 

5 1 11 21 31 41 51 

I 1 I I I I 

MVPHAILARG RDVCRRNGLL ILSVLSVIVG CLLGFFLRTR RLSPQBISYP QFPGBLLMRM 60 

LKMMILPLW SSLMSGLASL DAKTSSRLGV LTVAYYLWTT FMAVIVGIFM VSIIHPGSAA 120 

QKETTEQSGK P IMS SAD ALL DLIRNMFPAN LVEATFKQYR TKTTPWKSP KVAPEEAPPR 180 

10 RILIYGVQEB NGSHVQNFAL DLTPPPEWY KSEPGTSDGM NVLGIVFFSA TMGIMLGRMG 240 

DSGAPLVSFC QCLNESVMKI VAVAVWYFPF GIVFLIAGKI LEMDDPRAVG KKLGFYSVTV 300 

VCGLVLHGLF ILPLLYFFIT KKNPIVPIRG I LQ ALL I ALA TSSSSATLPI TFKCLLENNH 360 

IDRRIARFVL PVGATINMDG TALYEAVAAI FIAQVNNYEL DFGQIITISI TATAASIGAA 420 

GIPQAGLVTM VIVLTSVGLP TDDITLIIAV DWALDRFRTM INVLGDALAA GIMAHICRKD 480 

15 FARDTGTEKL LPCETKPVSL QEIVAAQQNG CVKSV ABASE LTLGPTCPHH VPVQVERDEE 540 

LPAASLNHCT IQISELETNV 560 



20 



Seq ID NO: C333 Protein Sequence 
Protein Accession NP 005680.1 



11 21 31 41 51 

1 I I I I I 

MVTVGNYCKA EGPVGPAWMQ DGLSPCFFFT LVPSTRMALG TLALVLALPC RRRERPAGAD 60 

SIiSWGAGPRI SPYVLQLLLA TLQAALPLAG LAGRVGTARG APLPSYLLLA SVLESLAGAC 120 

25 GLWLLWERS QARQRLAMGI HIKPRHSPGL LLLWTVAFAA ENLALVSWNS PQWWWARADL 180 

GQQVQFSLWV LRYWSGGLF VLGLWAPGLR PQSYTLQVHE EDQDVERSQV RSAAQQSTWR 240 

DFGRKLRLLS GYLWPRGSPA LQLWLICLG LMGLERALNV LVPIFYRNIV NLLTEKAPWN 300 

SLAWTVTSYV FLKFLQGGGT GSTGFVSNLR TFLWIRVQQP TSRRVELLIF SHLHELSLRW 360 

HLGRRTGEVL RIADRGTSSV TGLLSYLVFN VIPTLADIII GIIYFSMFFN AWFGLIVFLC 420 

30 MSLYLTLTIV VTEWRTKFRR AMNTQENATR ARAVDSLLNF ETVKYYNAES YEVERYREAI 480 

I KYQGLEWKS SASLVLLNQT QNLVIGLGLL AGSLLCAYFV TEQKLQVGDY VLFGTYIIQL 540 

YMPLNWFGTY YRMIQTNFID MENMFDLLKE ETEVKDLPGA GPLRFQKGRI EFENVHFSYA 600 

DGRETLQDVS FTVMPGQTLA LVGPSGAGKS TILRLLFRFY DISSGCIRID GQDISQVTQA 660 

SLRSHIGWP QDTVLFNDTI ADNIRYGRVT AGNDEVEAAA QAAGIHDAIM AFPEGYRTQV 720 

35 GERGLKLSGG EKQRVAIART ILKAPGIILL DEATSALDTS NERAIQASLA KVCANRTTIV 780 

VAHRLSTWN ADQILVIKDG CIVERGRHEA LLSRGGVYAD MWQLQQGQEE TSEDTKPQTM 840 

RP 842 



Seq ID NO: C334 Protein Sequence 
40 Protein Accession NP_000667.1 

1 11 21 31 41 51 

I I I I I I 

MLLETQDALY VALELVIAAL SVAGNVLVCA AVGTANTLQT PTNYFLVSLA AADVAVGLFA 60 

45 IPFAITISLG FCTDFYGCLF LACFVLVLTQ SSIFSLLAVA VDRYLAICVP LRYKSLVTGT 120 

RARGVIAVLW VLAFGIGLTP FLGWNSKDSA TNNCTEPWDG TTNESCCLVK CLFENWPMS 180 

YMVYFNFFGC VLPPLLIMLV IYIKIFLVAC RQLQRTELMD HSRTTLQREI HAAKSLAMIV 240 

GIFALCWLPV HAVNCVTLPQ PAQGKNKPKW AMNMAILLSH ANSWNPIVY AYRNRDFRYT 300 

FHKIISRYLL CQADVKSGNG QAGVQPALGV GL 332 



50 



60 



Seq ID NO: C335 Protein Sequence 
Protein Accession #: NP 443164 



1 11 21 31 41 51 

55 | | | | I I 

MGLGARGAWA ALLLGTLQVL ALLGAAHESA AMAETLQHVP SDHTNETSNS TVKPPTSVAS 60 

DSSNTTVTTM KPTAASNTTT PGMVSTNMTS TTLKSTPKTT SVSQNTSQIS TSTMTVTHNS 120 

SVTSAASSVT ITTTMHSEAK KGSKFDTGSF VGGIVLTLGV LSILYIGCKM YYSRRGIRYR 180 

TIDEHDAII 189 



Seq ID NO: C336 Protein Sequence 
protein Accession #: NP 004186.1 



1 11 21 31 41 51 

65 | i I I I 1 

MAQHGAMGAF RALCGLALLC ALSLGQRPTG GPGOGPGRLL LGTGTDARCC RVHTTRCCRD 60 

YPGEECCSEW DCMCVQPEFH CGDPCCTTCR HHPCPPGQGV QSQGKFSFGF QCIDCASGTF 120 

EGGHEGHCKP WTDCTQFGFL TVFPGNKTHN AVCVPGSPPA EPLGWLTWL LAVAACVLLL 180 

_ TSAQLGLHIW QLRSQCMWPR ETQLLLEVPP STEDARSCQF PEEERGERSA EEKGRLGDLW 240 

70 V 241 



Seq ID NO: C337 Protein Sequence 
Protein Accession ft: BAC03767.1 

75 1 11 21 31 41 51 

I I I I 1 1 

MGCDGRVSGL LRRNLQPTLT YWSVFFSFGL CIAFLGPTLL DLRCQTHSSL PQISWVFFSQ 60 

QLCLLLGSAL GGVFKRTLAQ SLWALFTSSL AISLVFAVIP FCRDVKVLAS VMALAGLAMG 120 

ori CIDTVANMQL VRMYQKDSAV FLQVLHFFVG FGALLSPLIA DPFLSEANCL PANSTANTTS 180 

80 RGHLFHVSRV LGQHHVDAKP WSNQTFPGLT PKDGAGTRVS YAFWIMALID LPVPMAVLML 240 

LSKERLLTCC PQRRPLLLSA DELALETQPP EKEDASSLPP KFQSHLGHED LFSCCQRKNL 300 

RGAPYSFFAI HITGALVLFM TDGLTGAYSA FVYSYAVEKP LSVGHKVAGY LPSLFWGFIT 360 

LGRLLSIPIS SRMKPATMVF INWGVWTF LVLLIFSYNV VFLFVGTASL GLFLSSTFPS 420 

MLAYTEDSLQ YKGCATTVLV TGAGVGEMVL QMLVGSIFQA QG9YSFLVCG VIFGCLAFTF 480 
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YILLLFFHRM HPGLPSVPTQ DRSIOMENSE CYQR 514 

Seq ID NO: C338 Protein Sequence 
Protein Accession ft: NP_002194.1 

1 11 21 31 41 51 

I I I I I I 

MGPERTGAAP LPLLLVLALS QGILNCCLAY NVGLPBAKIF SGPSSEQFGY AVQQFIMPKG 60 

NWLLVGSPWS GFPEMRMGDV YKCPVDLSTA TCEKUILQTS TSIPNVTEMK TNMSLGLILT 120 

RNMGTGGFLT CGPLWAQQCG NQYYTTGVCS DISPDFQLSA SFSPATQPCP SLIDWWCD 180 

ESNSIYPWDA VKNFLEKFVQ GLDIGPTKTQ VGLIQYANKP RWFNLNTYK TKEEMIVATS 240 

QT5QYGGDLT NTFGAIQYAR KYAYSAASGG RRSATKVMW VTDGESHDGS MLKAVIDQCN 300 

HDNILRFGIA VLGYLNRNAI* DTKNLIKEIK AIASIPTERY FFNVSDEAAL LEKAGTLGEQ 360 

IPSIEGTVQG GDNFQMEMSQ VGFSADYSSQ NDILMLGAVG AFGWSGTIVQ KTSHGHLIPP 420 

KQAFDQILQD RNHSSYLGYS VAAISTGEST HFVAGAPRAN YTGQIVLYSV NENGNITVIQ 480 

AHRGDQIGSY FGSVLCSVDV DKDTITDVLL VGAPMYMSDL KKEEGRVYLF TIKKGILGQH 540 

QFLEGPEGIE NTRFGSAIAA LSDINMDGFN DVIVGSPLEN QNSGAVYIYN GHQGTIRTKY 600 

SQKILGSDGA FRSHLQYFGR SLDGYGDLNG DSITDVSIGA FGQWQLWSQ SIADVAIEAS 660 

FTPEKITLVN KNAQIILKL.C FSAKFRPTKQ NNQVAIVYNI TLDADGFSSR VTSRGLFKEN 720 

NERCLQKNMV VNQAQSCPEH IIYIQEPSDV VNSLDLRVDI SLENPGTSPA LEAYSETAKV 780 

FSIPFHKDCG EDGLCISDLV LDVRQIPAAQ EQPFIVSNQN KRLTFSVTLK NKRESAYNTG 840 

IWDFSENLF FASFSLPVDG TEVTCQVAAS QKSVACDVGY PALKREQQVT FTINFDFNLQ 900 

NLQNQASLSF QALSESQEEN KADNLVNLKI PLLYDAEIHL TRSTNINFYE ISSDGNVPSI 960 

VHSFEDVGPK FIFSLKVTTG SVPVSMATVI IHIPQYTKEK NPLMYLTGVQ TDKAGDISCN 1020 

ADINPLKIGQ TSSSVSFKSE NFRHTKELNC RTASCSNVTC WLKDVHMKGE YFVNVTTRIW 1080 

NGTFASSTFQ TVQLTAAAEI NTYNPEIYVI EDNTVTIPLM IMKPDEKAEV PTGVIIGSII 1140 

AGILLLLiALV AILWKLGFFK RKYEKMTKNP DEIDETTELS S 1181 

Seq ID NO: C339 Protein Sequence 
Protein Accession # x NP_113648.1 

1 11 21 31 41 51 

I I I I I I 

MYRPRARAAP EGRVRGCAVP STVLLLLAYIi AYLALGTGVF WTLEGRAAQD SSRSFQRDKW 60 

ELLQNFTCLD RPALDSLIRD WQAYKNGAS LLSNTTSMGR WELVGSFFFS VSTITTIGYG 120 

NLSPNTMAAR LFCIFFALVG IPLNLWLNR LGHLMQQGVN HWASRLGGTW QDPDKARWLA 180 

GSGALLSGLL LFLLLPPLLF . SHMEGWSYTE GFYFAFITLS TVGFGDYVIG MNPSQRYPLW 240 

YKNMVSLWIL FGMAWLALII KLILSQLETP GRVCSCCHHS SKEDFKSQSW RQGPDREPES 300 

HSPQQGCYPE GPMGIIQHLE PSAHAAGCGK DS 332 

Seq ID NO: C340 Protein Sequence 
Protein Accession tf: NP_004145.1 

1 11 21 31 41 51 

I I I I I I 

MEWDNGTGQA LGLPPTTCVY RENFKQLLLP PVYSAVLAAG LPLNICVITQ ICTSRRALTR 60 

TAVYTLNLAL ADLLYACSLP LLIYNYAQGD HWPFGDFACR LVRFLFYANL HGSIIiFLTCT 120 

SFQRYLGICH PLAPWHKRGG RRAAWLVCVA VWLAVTTQCL PTAIFAATGI QRNRTVCYDL 180 

SPPAIATHYM PYGMALTVIG FLLPFAALLA CYCLI.ACRLC RQDGPAEPVA QERRGKAARM 240 

AVWAAAFAI SFLPFHITKT AYLAVRSTPG VPCTVLEAFA AAYKGTRPFA SANSVLDPIL 300 

FYFTQKKFRR RPHELLQKLT AKWQRQGR 328 

Seq ID NO: C341 Protein Sequence 
Protein Accession #: NPJJ09128.1 

1 11 21 31 41 51 

I I I I I I 

MQRPGPRLWL VLQVMGSCAA ISSMDMERPG DGKCQPIEIP MCKDIGYNMT RMPNLMGHEN 60 

QREAAIQLHE FAPLVEYGCH GHLRFFLCSL YAPMCTEOVS TPIPACRVMC EQARLKCSPI 120 

MEQFNFKWPD SLDCRKLPNK NDPNYLCMEA PNNGSDEPTR GSGLFPPLFR PQRPHSAQEH 180 

PLKDGGPGRG GCDNPGKFHH VEKSASCAPIj CTPGVDVYWS REDKRFAWW LAIWAVLCFF 240 

SSAFTVLTFL IDPARFRYPE RPIIFLSMCY CVYSVGYLIR LFAGAESIAC DRDSGQLYVI 300 

QEGLESTGCT LVPLVLYYFG MASSLWWWL TLTWFLAAGK KWGHEAIEAN SSYFHLAAWA 360 

IPAVKTILIL VMRRVAGDEL TGVCYVGSMD VNALTGFVLI PLACYLVIGT SFILSGFVAL 420 

FHIRRVKKTG GENTDKLEKI* MVRIGLFSVL YTVPATCVIA CYFYBRLNKD YWKILAAQHK 480 

CKMNNQTKTL DCLMAASIPA VEIFMVKIFM LLWGITSGM WIWTSKTLQS WQQVCSRRLK 540 

KKSRRKPASV ITSGGIYKKA QHPQKTHHGK YEIPAQSPTC V 581 

Seq ID NO: C342 Protein Sequence 
Protein Accession #: NP_0057S2.l 

1 11 21 31 41 51 

I I I I I I 

KEVSRRKAPP RPPRPAAPLP LLAYLLALAA PGRGADEPVW RSEQAIGAIA ASQEDGVFVA 60 

SGSCLDQIjDY SLEHSLSRLY RDQAGNCTEP VSLAPPARPR PGSSFSKLLL PYREGAAGLG 120 

GLLLTGWTFD RGACEVRPLG NLSRNSLRNG TEWSCHPQG STAGWYRAG RNNRWYLAVA 180 

ATYVLPEPET ASRCNPAASD HDTAIALKDT EGRSLATQEL GRLKLCEGAG SLHFVDAFLW 240 

NGSIYFPYYP YNYTSGAATG WPSMARIAQS TEVLFQGQAS LDCGHGHPDG RRLLLSSSLV 300 

EALDVWAGVF SAAAGEGQER RSPTTTALCL FRMSEIQARA KRVSWDFKTA ESHCKEGDQP 360 

ERVQPIASST LIHSDLTSVY GTWMNRTVL FLGTGDGQLL KVILGENLTS NCPBVIYEIK 420 

EETPVFYKLV PDPVKNIYIY LTAGKEVRRI RVANCNKHKS CSECLTATDP HCGWCHSMJR 480 

CTFQGDCVHS ENLENWLDIS SGAKKCPKIQ IIRSSKEKTT VTMVGSFSPR HSKCMVKNVD 540 

SSRELCQNKS QPNRTCTCSI PTRATYKDVS WNVMFSFGS WNLSDRFNFT NCSSLKECPA 600 

CVETGCAWCK SARRCIHPPT ACDPSDYERN QEQCPVAVEK TSGGGRPKEN KGNRTNQALQ 660 

VFYIKSIEPQ KVSTLGKSNV IVTGANFTRA SNITMILKGT STCDKDVIQV SHVLKDTHMK 720 
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FSLPSSRKEM KDVCIQFDGG NCSSVGSLSY IALPHCSHF PATTWISGGQ N1TMMGRNFD 780 

VIDNLIISHE LKGNINVSEY CVATYCGFLA PSLKSSKVRT NVTVKLRVQD TYLDCGTLQY 840 

REDPRFTGYR VBSBVDTELE VKIQKENDNF NISKKDIEIT LFHGENGQLN CSFENITRNQ 900 

DLTTILCKIK GIKTASTIAN SSKKVRVKLG NLELYVEQES VPSTWYFLIV LPVLLVIVIF 960 

AAVGVTRHKS KELSRKQSQQ LELLESELRK EIBDGFAELQ MDKLDWDSF GTVPFLDYKH X020 

FALRTFFPES GGFTHIFTED MHNRDANDKN ESLTALDALI CHKSFLVTVI HTLEKQKNFS 1080 

VKDRCLFASF LTIALQTKLV YLTSILEVLT RDLMEQCSNM QPKLMLRRTE SWEKLLTNW 1140 

MSVCLSGFLR ETVGEPFYLL, VTTLNQKINK GPVDVITCKA LYTLNEDWLL WQVPEFSTVA 1200 

LNWFEKIPE NESADVCRNI SVNVLDCDTI GQAKEKIFQA FLSKNGSPYG LQLNEIGLEL 1260 

QMGTRQKELL DIDSSSVILE DGITKLNTIG HYEISNGSTI KVFKKIANFT SDVEYSDDHC 1320 

HLILPDSEAP QDVQGKRHRG KHKFKVKEMY LTKLLSTKVA IHSVLEKLFR SIWSLPHSRA 1380 

PFAIKYFFDF LDAQAENKKI TDPDWHIWK TNSLPLRFWV NILKNPQFVF DIKKTPHIDG 1440 

CLSVIAQAFM DAFSLTEQQL GKEAPTNKLL YAKDIPTYKE EVKSYYKAIR DLPPLSSSEM 1500 

EEFLTQESKK HENEFNEEVA LTEIYKYIVK YFDEILNKLE RERGLEEAQK QLLHVKVLFD 1S60 

EKKKCKWM 1S68 



Seq ID NO: C343 Protein Sequence 
Protein Accession #: NP_002176.1 

1 11 21 31 41 51 

I t I I I I 

MTILGTTFGM VFSLLQWSG ESGYAQNGDL EDAELDDYSF SCYSQLEVNG SQHSLTCAF^E 60 

DPDVNTTNLE FEICGALVEV KCLNFRKLQB IYFIETKKFL LIGKSNICVK VGEKSLTCKK 120 

IDLTTIVKPE APFDLSVIYR EGANDFWTF NTSHLQKKYV KVLMHDVAYR QEKDENKWTH 180 

VNLSSTKLTL LQRKLQPAAM YEIKVRSIPD HYFKGFWSEW SPSYYFRTPE IKNSSGEMDP 240 

ILLTISILSF FSVALLVILA CVLWKKRIKP IVWPSLPDHK KTLEHLCKKP RKNLNVSFNP 300 

ESFLDCQIHR VDDIQARDEV EGFLQDTFPQ QLEESEKQRL GGDVQSPKCP SEDWVTPES 360 

FGRDSSLTCL AGNVSACDAP ILSSSRSLDC RESGKNGPHV YQDLLLSLGT TNSTLPPPPS 420 

LQSGILTLNP VAQGQPILTS LGSNQEEAYV TMSSFYQNQ 459 



Seq ID NO: C344 Protein Sequence 
Protein Accession ft: NP_002713.1 

l 11 21 31 41 SI 

I ! I I I I 

MAAARLCLSL LLLSTCVALL LQPLLGAQGA PLEPVYPGDN ATPEQMAQYA ADLRRYINML 60 

TRPRYGKRHK EDTLAFSEWG SPHAAVPREL SPLDIi 95 



Seq ID NO: C34S Protein Sequence 
Protein Accession #: NPJL15934.1 

1 11 21 31 41 51 

| 1 1 I I I 

MTWRHHVRLL FTVSIALQII NLGNSYQREK HNOGREEVTK VATQKHRQSP LNWTSSHFGE 60 

VTGSAEGWGP EEPLPYSRAF GEGASARPRC CRNGGTCVLG SPCVCPAHFT GRYCEHDQRR 120 

SECGALEHGA WTLRACHLCR CIFGALHCLP LQTPDRCDPK DFLASHAHGP SAGGAPSLLL 180 

LLPCALLHRIj LRPDAPAHPR SLVPSVLQRE RRPCGRPGLG HRI» 223 



Seq ID NO: C346 Protein Sequence 
Protein Accession #: NP_006524.l 

1 11 21 31 41 51 

I I ! I.I I 

MARSLVCLGV IILLSAFSGP GVRGGPMPKL ADRKLCADQE CSHPISMAVA LQDYMAPDCR 60 
FLTIHRGQW YVFSKLKGRG PXFWGGSVQG DYYGDLAARL GYFPSSIVRE DQTLKPGKVD 120 
VKTDKWDFYC Q 131 



Seq ID NO: C347 Protein Sequence 
Protein Accession #s Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MTQVTEKSTE HPEKTTSTTE KTTRTPEKPT LYSEKTICTK GKNTPVPEKP TENLGNTTLT 60 

TETIKAPVKS TENPEKTAAV TKTIKPSVKV TGDKSLTTTS SHLNKTEVTH OVPTGSFTLI 120 

TSRTKLSSIT SEATGNBSHP YLNKDGSQKG IHAGQMGEND SFPAWAIVIV VLVAVILLLV 180 

FLGLIFLVSY MMRTRRTLTQ NTQYNDAEDE GGPNSYPVYL MEQQNLGMGQ IPSPR 235 



Seq ID NO i C348 Protein Sequence 
Protein Accession #: NP_543146.1 

1 11 21 31 41 51. 

I I I I 1 I 

MTQVTEKSTE HPEKTTSTTE KTTRTPEKPT LYSEKTICTK GKNTPVPEKP TENLGNTTLT 60 

TETIKAPVKS TENPEKTAAV TKTIKPSVKV TGDKSLTTTS SHLNKTEVTH QVPTGSFTLI 120 

TSRTKLSSIT SEATGNESHP YLNKDGSQKG IHAGQMGEND SFPAWAIVIV VLVAVILLLV 180 

FLGLIFLVSY MMRTRRTLTQ NTQYNDAEDE GGPNSYPVYL MEQQNLGMGQ IPSPR 235 

Seq ID NO: C349 Protein Sequence 
Protein Accession #: FGENESH predicted 

1 11 21 31 41 51 

i I I I I I 
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MWPRLAFCCW GLALVSGWAT FQQMSPSRNF SFRLFPETAP GAPGSIPAPP APGDEAAGSR 60 

VERLGQAFRR RVRLLRELSE RLELVFLVDD SSSVGEVNFR SELMFVRKLL SDFPWPTAT 120 

RVAIVTFSSK NYWPRVDYI STRRARQHKC ALLLQEIPAI SYRGGGTYTK GAFQQAAQIL 180 

LHARENSTKV VFLITDGYSN GGDPRPIAAS LRDSGVEIFT FGIWQGNIRE LNDMASTPKE 240 

EHCYLLHSFE EFEALARRAL HEDLPSGSFI QDDMVHCSYL CDEGKDCCDR MGSCKCGTHT 300 

GHFECICEKG YYGKGLQYEC TACPSGTYKP EGSPGGISSC IPCPDENHTS PPGSTSPEDC 360 

VCREGYRASG QTCELVHCPA LKPPENGYFI QNTCNNHFNA ACGVRCHPGF DLVGSSIILC 420 

LPNGLWSGSE SYCRVRTCPH LRQPKHGHIS CSTREMLYKT TCLVACDEGY RLEGSDKLTC 480 

QGNSQWDGPE PRCVERHCST FQMPKDVIIS PHNCGKQPAK FGTICYVSCR GGFILSGVKE 540 

MLRCTTSGKW NVGVQAAVCK DVEAPQINCP KDIEAKTLEQ QDSANVTWQI PTAKDNSGEK 600 

VSVHVHPAFT PPYLFPIGDV AIVYTATDLS GNQASCIFHI KVIDAEPPVI DWCRSPPPVQ 660 

VSEKVHAASW DEPQFSDNSG AELVITRSHT QGDliFPQGET IVQYTATDPS GNNRTCDIHI 720 

VIKGSPCEIP FTPVNGDFIC TPDNTGVNCT LTCLEGYDFT EGSTDKYYCA YEDGVWKPTY 780 

TTEWPDCAKK RFANHGFKSF EMFYKAARCD DTDLMKKFSE AFETTLGKMV PSFCSDAEDI 840 

DCRLEENLTK KYCLEYNYDY ENGFAIGPGG WGAANRLDYS YDDFLDTVQE TATSIGNAKS 900 

SRIKRSAPLS DYKIKLIFNI TASVPLPDER NDTLEWENQQ RLLQTLETIT NKXiKRTLNKD 960 

PMYSFQLASE ILIADSNSLE TKKASPFCRP GSVLRGRMCV NCPLGTYYNL EHFTCESCRI 1020 

GSYQDEEGQL ECKLCPSGMY TEYIHSRNIS DCKAQCKQGT YSYSGLETCE SCPLGTYQPK 1080 

FGSRSCLSCP ENTSTVKRGA VNISACGVPC PEGKFSRSGL MPCHPCPRDY YQPNAGKAPC 1140 

LACPFYGTTP FAGSRSITEC STSVLNITIF GGFGHLELLN CPSEVFHECF FNPCHNSGTC 1200 

QQLGRGYVCL CPLGYTGLKC ETDIDECSPL PCLNNGVCKD LVGEFICECP SGYTGQRCEE 1260 

NINECSSSPC LNKGICVDGV AGYRCTCVKG FVGLHCETEV NECQSNPCLN NAVCEDQVGG 1320 

FLCKCPPGFL GTRCGKNVDE CLSQPCKNGA TCKDGANSFR CLCAAGFTGS HCBLNINECQ 1380 

SNPCRKQATC VDELNSYSCK CQPGFSGKRC ETEQSTGFNL DFEVSGIYGY VMLDGMLPSL 1440 

HALTCTFWMK SSDDMNYGTP ISYAVDNGSD NTLLLTDYNG WVLYVNGREK ITNCPSVNDG 1500 

RWHHIAITWT SANGIWKVYI DGKLSDGGAG LSVGLPIPGG GALVLGQEQD KKGEGFSPAE 1560 

SFVGSISQLN LWDYVLSPQQ VKSLATSCPE ELSKGNVLAW PDFLSGIVGK VKIDSKSIFC 1620 

SDCPRLGGSV PHLRTASEDL KPGSKVNLFC DPGFQLVGNP VQYCLNQGQW TQPLPHCERI 1680 

SCGVPPPLEN GFHSADDFYA GSTVTYQCNN GYYLLGDSRM FCTDNGSWNG VSPSCLDVDE 1740 

CAVGSDCSEH ASCIiNVDGSY ICSCVPPYTG DGKNCAEPIK CKAPGNPENG HSSGEIYTVG 1800 

AGVTFSCQEG YQLMGVTKIT CLESGEWNHL IPYCKAVSCG KPAIPENGCI EEIjAFTFGSK 1860 

VTYRCNKGYT LAGDRESSCL ANSSWSHSPP VCEPVKCSSP ENINNGKYIL SGL.TYLSTAS 1920 

YSCDTGYSLQ GPSIIECTAS GIWDRAPPAC "HLVFCGEPPA IKDAVITGNN FTFRNTVTYT 1980 

CKEGYTLAGL DTIECLADGK WSRSDQQCLA VSCDBPPIVD HASPETAHRL FGDIAFYYCS 2040 

DGYSLADNSQ LLCNAQGKWV PPEGQDMPRC IAHFCEKPPS VSYSILESVS KAKFAAGSW 2100 

SFKCMEGFVL NTSAKIECMR GGQWNPSPMS IQCIPVRCGE PPSIMNGYAS GSNYSFGAMV 2160 

AYSCNKGFYI KGEKKSTCEA TGQWSSPIPT CHPVSCGEPP KVENGFLEHT TGRIFESEVR 2220 

YQCNPGYKSV GSPVPVCQAN RHWHSESPLM CVPLDCGKPP PIQNGFMKGE NFEVGSKVQF 2280 

FCNEGYELVG DSSWTCQKSG KWNKKSNPKC MPAKCPEPPL LENQLVLKEL TTEVGWTFS 2340 

CKEGHVLQGP SVLKCLPSQQ WNDSFPVCKI VLCTPPPIiIS FGVPIPSSAL HPGSTVKYSC 2400 

VGGFFLRGNS TTLCQPDGTW SSPLPECVPV ECPQPEEIPN GIIDVQGLAY LSTALYTCKP 2460 

GFELVGNTTT LCGENGHWLG GKPTCKAIEC LKPKEILNGK F9YTDLHYGQ TVTYSCNRGF 2520 

RLEGPSALTC LETGDWDVDA PSCNAIHCDS PQPIENGFVE GADYSYGAII IYSCFPGFQV 2580 

AGHAMQTCEE SGWSSSIPTC MPIDCGLPPH IDFGDCTKLK DDQGYFEQED DMMEVPYVTP 2640 
HPPYELGAVA KTWENTKESP ATHSSNFLYG TMVSYTCNPG YELLGNPVLI CQEDGTWNGS 2700 
APSCISIECD LPTAPEMGPL RPTETSMGSA VQYSCKPGHI LAGSDLRLCL ENRKWSGASP 2760 
RCEAISCKKP NPVMNGSIKG SNYTYLSTLY YECDPGYVLN GTERRTCQDD KNWDEDEPIC 2820 
IPVDCSSPPV SANGQVRGDE YTFQKEIEYT CNEGFLLEGA RSRVCLANGS WSGATPDCVP 2880 
VRCATPPQLA NGVTEGLDYG FMKEVTFHCH EGYILHGAPK LTCQSDGNWD AEIPLCKPVN 2940 
CGPPEDLAHG FPNGFSFIHG GHIQYQCFPG YKLHGNSSRR CLSNGSWSGS SPSCLPCRCS 3000 
TPVIEYGTVN GTDFDCGKAA RIQCFKGFKL LGLSEITCEA DGQWSSGFPH CEHTSCGSLP 3060 
MIPNAFISET SSWKENVITY SCRSGYVIQG SSDLICTEKG VWSQPYPVCB PLSCGSPPSV 3120 
ANAVATGEAH TYESEVKLRC LEGYTMDTDT DTFTOQKDGR WFPERISCSP KKCPLPENIT 3180 
HILVHGDDFS VNRQVSVSCA EGYTFEGVNI SVCQLDGTWE PPFSDESCSP VSOGKPESPE 3240 
HGFWGSKYT FESTIIYQCE PGYELEGNRE RVCQEKRQWS GGVAICKETR CETPLEFLNG 3300 
KADIENRTTG PNWYSCNRG YSLEGPSEAH CTENGTWSHP VPLCKPNPCP VPFVIPENAL 3360 
LSEKEFYVDQ NVSIKCREGF LLQGHGIITC NPDETWTQTS AKCEKISCGP PAHVENAIAR 3420 
GVHYQYGDMI TYSCYSGYML EGFLRSVCLE NGTWTSPPIC RAVCRFPCQN GGICQRPNAC 3480 
SCPEGWMGRli CEEPICILPC LNGGRCVAPY QCDCPPGWTG SRCHTAVCQS PCLNGGKCVR 3540 
PNRCHCLSSW TGHNCSR 3557 

Seq ID NO: C350 Protein Sequence 
Protein Accession #: FGENBSH predicted 

1 11 21 31 41 51 

I I I I I I 

MRFSVSGMRT DYPRSVIAPA YVSVCLLLLC PREVIAPAGS EPWLCQPAPR CGDKIYNPLE 60 
QCCYNDAIVS LSETRQCGPP CTFWPCFELC CLDSFGLTND FWKLKVQGV NSQCHSSPIS 120 

129 



Seq ID NO: C351 Protein Sequence 
Protein Accession ft: AAH35671.1 

1 11 21 31 41 51 

I I I I I I 

MVPGARGGGA LARAAGRGLL ALLLAVSAPL RLQAEELGDG CGHLVTYQDS GTMTSKNYPG 60 

TYPNHTVCEK TITVPKGKRL ILR1X3DLDIE SQTCASDYLL FTSSSDQYGP YCGSMTVPKE 120 

LLLNTSEVTV RFESGSHISG RGFLLTYASS DHPDLITCLE RASHYLKTEY SKFCPAGCRD 180 

VAGDISGNMV DGYRDTSLLC KAAIHAGIIA DELGGQISVL QRKGISRYEG ILANGVLSRD 240 

GSLSDKRFLF TSNGCSRSLS FEPDGQIRAS SSWQSVNESG DQVHWSPGQA RLQDQGPSWA 300 

SGDSSNNHKP REWLEIDLGE KKKITGIRTT GSTQSNFNFY VKSFVMNFKN NNSKWKTYKG 360 

IVNNEEKVFQ GNSNFRDPVQ NNPIPPIVAR YVRWPQTWH QRIALKVELI GCQITQGNDS 420 

LVWRKTSQST SVSTKKEDET ITRPIPSEET STGINITTVA IPLVLLWLV FAGMGIFAAF 480 

RKKKKKGSPY GSAEAQKTDC WKQIKYPFAR HQSAEFTISY DNEKEMTQKL DLITSDMAG 539 
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Seq ID NO: C3S2 Protein Sequence 
Protein Accession fl: Eos sequence 

1 11 21 31 41 51 

I I I I I 1 

MGFGAGQRLR PVPAPRSSAE EAARPGQLRL GIRRGEAELA KLAPSGVMVP GARGGGALAR 60 

AAGRGLIiALL LAVSAPLRLQ AEELGDGCGH LVTYQDSGTM TSKNYPGTYP NHTVCEKTIT 120 

VPKGKRLILR LGDLDIESQT CASDYLLFTS SSDQYGPYCG SMTVPKELLL NTSEVTVRFE 180 

SGSHISGRGF LLTYASSDHP DLITCLERAS HYLKTEYSKF CPAGCRDVAG DISGNMVDGY 240 

RDTSLLCKAA IHAGIIADEL GGQISVLQRK GISRYEGILA NGVLSRDGSL SDKRFLFTSN 300 

GCSRSLSFEP DGQIRASSSW QSVNESGDQV HWSPGQARLQ DQGPSWASGD SSNKHKPREW 360 

LEIDLGBKKK ITGIRTTGST QSNFNFYVKS FVMNFKNNNS KWKTYKGIVN NEEKVFQGNS 420 

NFRDPVQNNF IPPIVARYVR WPQTWHQRI ALKVELIGOQ ITQGNDSLVW RKTSQSTSVS 480 

TKKEDETITR PIPSEETSTQ INITTVAIPL VLLWLVPAG MGIFAAFRKX KKKGSPYGSA 540 

EAQKTDCWKQ IKYPFARHQS AEFTISYDNE KEMTQKLDLI TSDMAG 586 

Seq ID NO: C353 Protein Sequence 
Protein Accession #: FGBNESH predicted 

1 11 21 31 41 51 

I I I I I I 

MFQRQERFLD LSSAEAVAAW ILHQHPDIIN KGDGCGHLVT YQDSGTMTSK NYPGTYPNHT 60 

VCEKTITVPK GKRLILRLGD LDIESQTCAS DYLLPTSSSD QYGMQKEEET EVLCLSVAGA 120 

QRVDIPVQLL PSFLEGWKGH ADARGPYCGS MTVPKELLLN TSEVTVRFES GSHISGRGFL 180 

LTYASSDHPD LITCLERASH YLKTEYSKFC PAGCRDVAGD ISGNMVDGYR DTSLLCKAAI 240 

HAGIIADBLG GQISVLQRKG ISRYEGILAN GVLSRDGSLS DKRFLFTSNG CSRSLSFEPD 300 

GQIRASSSWQ SVNESGDQVH WSPGQARLQD QGPSWASGDS SNNHKPREWL EIDLGEKKKI 360 

TGIRTTGSTQ SNFNFYVKSF VMNFKNNNSK WKTYKGIVNN EEKVFQGNSN FRDPVQNNFI 420 

PPIVARYVRV VPQTWHQRIA LKVELIGCQI TQGNDSLVWR KTSQSTSVST KKEDETITRP 480 

IPSEETSTDA MPVQIVGDHT QMISQRENLG PDEGKIPFKG TAESMVRWP AWVNDLGML 540 

FLAHTPEEDI DHYCWKQIKY PFARHQSAEF TISYDNEKEM TQKLDLITSD MADYQQPLMI 600 

GTGTVTRKGS TFRPMDTDAE EAGVSTDAGG HYDCPQRAGR HEYALPLAPP EPEYATPIVE 660 

RHVLRAHTFS AQSGYRVPGP QPGHKHSLSS GGFSPVAGVG AQDGDYQRPH SAQPADRGYD 720 

RPKAVSALAT ESGHPDSQKP PTHPGTSDSY SAPRDCLTPL NQTAMTALL 769 

Seq ID NO: C354 Protein Sequence 
Protein Accession g: NP_004607.1 

1 11 21 31 41 SI 

I I I 1 I 1 

MAGVSACIKY SMFTFNFLFW LCGILILALA IWVRVSNDSQ AIFGSEDVGS SSYVAVDILI 60 

AVGAIIMILG FLGCCGAIKE SRCMUjLFFI GLLLILLLQV ATGIL/3AVFK SKSDRIVNET 120 

LYENTKLLSA TGESBKQFQE AIIVFQEEFK CCGLVNGAAD WOJNFQHYPE LCACLDKQRP 180 

CQSYNGKQVY KETCISFIKD FLAKNLIIVI GISFGLAVIE ILGLVFSMVL YCQIGNK 237 

Seq ID NO: C355 Protein Sequence 
Protein Accession ft: NP_004608.l 

1 11 21 31 41 51 

I I I I i I 

MCTGGCARCL GGTLIPLAFF GFLANILLFF PGGKVIDDND HLSQBIWFPG GILGSGVLMI 60 

FPALVFLGLK NNDCCGCCGN EGCGKRFAMF TSTIFAWGF LGAGYSFIIS AISINKGPKC 120 

LMANSTWGYP FHDGDYLNDE ALWNKCREPL NWPWNLTLF SILLWGGIQ MVLCAIQWN 180 

GLLGTLCGDC QCCGCCGGDG PV 202 

Seq ID NO j C356 protein Sequence 
Protein Accession #: NP_002372.l 

1 11 21 31 41 51 

I I I I I I 

MPRPAPARRli PGLLLLLWPL LLLPSAAPDP VARPGFRRLE TRGPGGSPGR RPSPAAPDGA 60 

PASGTSEPGR ARGAGVCKSR PLDLVFIIDS SRSVRPLEFT KVKTFVSRII DTLDIGPADT 120 

RVAWNYAST VKIEFQLQAY TDKQSLKQAV GRITPLSTGT MSGLAIQTAM DEAFTVEAGA 180 

REPSSNIPKV AIIVTDGRPO. DQVNEVAARA QASGIELYAV GVDRADMASL KMMASEPLEE 240 

HVFYVETYGV IEKLSSRFQE TFCAU)PCVL GTHQCQHVCI SDGEGKHHCE CSQGYTLNAD 300 

KKTCSALDRC ALNTHGCEHI CVNDRSGSYH CECYEGYTLN EDRKTCSAQD KCALGTHGCQ 360 

HICVNDRTGS HHCECYEGYT LNADKKTCSV RDKCALGSHG OQHICVSDGA ASYHCDCYPG 420 

YTLNEDKKTC SATEEARRLV STEDACGCEA TLAFQDKVSS YLQRLNTKLD DILEKLKINE 480 

YGQIHR 486 

Seq ID NO: C357 Protein Sequence 
Protein Accession #: NP_057723.1 

1 11 21 31 41 51 

I I I I I I 

MARGSLRRLL RJjLVLiGLWLlA. LLRSVAGEQA PGTAPCSRGS SWSADLDKCM DCASCRARPH 60 

SDFCLGCAAA PPAPFRLLWP ILGGALSLTF VLGLLSGFLV WRRCRRREKF TTPIEBTGGB 120 

GCPAVALIQ 129 

Seq ID NO: C358 Protein Sequence 
Protein Accession ft: NP_001810.1 
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1 11 21 31 41 SI 

1 1 1 1 1 1 

MQPTLLLSLL GAVGLAAVNS MPVDNRNHNE GMVTRCIIEV LSNALSKSSA PPITPECRQV 60 

LKTSRKDVKD KETTENENTK FEVRLLRDPA OASBAHBSSS RGEAGAPGEE DIQGPTKADT 120 

EKHAEGGGHS RBRADEPQWS LYPSDSQVSE EVKTRHSEKS QREDEEEEEG ENYQKGERGB 180 

DSSEEKHLEE PGETQNAFLN ERKQASAIKK BELVARSETH AAGHSQEKTH SREKSSQESG 240 

EEAGSQENHP QESKGQPRSQ EESEEGEEDA TSEVDKRRTR PRHHHGRSRP DRSSQGGSLP 300 

SEEKGHPQEE SBESNVSMAS LGBKRDHHST HYRASEEEPE YGEEIKGYPG VQAPEDLEWE 360 

RYRGRGSEEY RAPRPQSEES WDEEDKRNYP SLELDKMAHG YGEESEEERG LEPGKGRHHR 420 

GRGGEPRAYF MSDTREEKRP LGEGHHRVQE KQMDKARRHP QGAWKELDRN YLNYGEEGAP 480 

GKWQQQGDLQ DTKENREEAR FQDKQYSSHH TABKRKRLGE LFNPYYDPIjQ WKSSHFERRD 540 

NMNDNFLEGE EENELTLNEK NFFPEYNYDW WEKKPFSEDV NWGYEKRNLA RVPKLDLKRQ 600 

YDRVAQLDQL LHYRKKSAEF PDFYDSEEPV STHQEAENEK DRADQTVLTE DEKKELENLA 660 

AMDLELQKIA EKFSQRG 677 

Seq ID NO: C359 Protein Sequence 
Protein Accession ft: XP_0 93082.1 

1 11 21 31 41 51 

I I I I I I 

MKLLCEGLKQ PNCVLQTLRW YRCLISSASC GALAAVLSTS QWLTELEFSE TKLEASALKL 60 

LYGGLKDPNC KLQKLNLQFS LSVTAAKLPV GMVGNCSGFS GSLVQSHFGY CQDSSFKCDL 120 

CKLLWPSTRV AAAKDCGSPK SFLSEGLNWA GRLEAVEEVL GLGVLVQPGD PASQGGGHCE 180 

NYGSFRDLVD LEVKAEPSLR KGGMDLQRPT LQWLLCKIF SLKLFLFIAL PNSPGQVSW 240 

QVTIPDGFVN VTVGSNVTLI CIYTTTVASR EQLSIQWSFF HKKEMEPISS PWEEGKWPDV 300 

EAVKGTLDGQ QAELQIYFSQ GGQAVAIGQF KDRITGSNDP GNASITISHM QPADSGIYIC 360 

DVNNPPDFLG QNQGILNVSV LVKPSKPLCS VQGRPETGHT ISLSCLSALG TPSPVYYWHK 420 

LEGRDIVPVK ENFNPTTGIL VIGNLTNFEQ GYYQCTAINR LGNSSCEIDL TSSHPEVGII 480 

VGALIGSLVG AAIIISWCF ARNKAKAKAK ERNSKTIAEL EPMTKINPRG ESEAMPREDA 540 

TQLEVTLPSS IHETGPDTIQ EPDYEPKPTQ EPAPEPAPGS EPMAVPDLDI EIiELEPETQS 600 

ELEPEPEPEP ESEPGWVEP LSEDEKGWK A 631 

Seq ID NO: C360 Protein Sequence 
Protein Accession ft: FGENESH predicted 

1 11 21 31 41 51 

I I I I I I 

MVFAFWKVFIi ILSCLAGQVS WOVTIPDGF VNVTVGSNVT LICIYTTTVA SREQLSIQWS 60 

FFHKKEMEPI SSPWEEGKWP DVEAVKGTLD GQQAELQIYF SQGGQAVAIG QFKDRITGSN 120 

DPGNASITIS HMQPADSGIY ICDVNNPPDF LGQNQGILNV SVLVKPSKPL CSVQGRPETG 180 

HTISLSCLSA LGTPSPVYYVJ HKLEGRD1VP VKENFNPTTG ILVIGNLTNF EQGYYQCTAI 240 

NRLGNSSCEI DLTSSHPEVG IIVGALIGSL VGAAIIISW CPARNKAKAK AKERNSKTIA 300 

ELEPMTKINP RGESEAMPRE DATQLEVTLP SSIHETGPDT IQEPDYEPKP TQEPAPEPAP 360 

GSEPMAVPDL DIELELEPET QSELEPEPEP EPESEPGVW EPLSEDEKGV VKA 413 

Seq ID NO: C361 Protein Sequence 
Protein Accession ft: NP_003011.1 

1 11 21 31 41 51 

I I I I I I 

MVSRMVSTML SGLLFWIASG WTPAFAYSPR TPDRVSEADI QRLLHGVMBQ LGIARPRVEY 60 

PAHQAMNLVG PQSIEGGAHE GLQHLGPFGN IPNIVAELTG DNIPKDFSED QGYPDPPNPC 120 

PVGKTDDGCL ENTPDTAEFS REFQLHQHIiF DPEHDYPGLG KWNKKLLYEK MKGGERRKRR 180 

SVNPYLQGQR LDNWAKKSV PHFSDEDKDP E 211 

Seq ID NO: C362 Protein Sequence 
Protein Accession ft: NP_076926.2 

1 11 21 31 41 51 

I I I I I I 

MTTKQGMEQA MPGAGPGVPQ LGNMAVIHSH LWKGLQEKFI* KGEPKVLGW QILTALMSLS 60 
MGITMMCMAS NTYGSNPISV YIGYTIWGSV MFIISGSLSI AAGIRTTKGL VRGSDGMMIT 120 
SSVLAASGIL INTFSLAFYS FHHPYCNYYG NSKNCHGTMS ILMGLDGMVL LLSVLEFCIA 180 
VSLSAFGCKV LCCTPGGWL ILPSHSHMAE TASPTPLNEV 220 

Seq ID NO: C363 Protein Sequence 
Protein Accession ft: NP_002082.1 

1 11 21 31 41 51 

I I I I I I 

MRGSELPLVL LALVLCLAPR GRAVPIiPAGG GTVLTKMYPR GNHWAVGHLM GKKSTGESSS 60 
VSERGSLKQQ LREYIRWEEA ARNLLGLIEA KENRNHQPPQ PKALGNQQPS WDSEDSSNFK 120 
DVGSKGKVGR LSAPGSQREG RNPQLNQQ 14 B 

Seq ID NO: C364 Protein Sequence 
Protein Accession #: NP_036393.1 

1 11 21 31 41 51 

I I I I i 1 

MDLQGRGVPS IDRLRVLLML FHTMAQIMAE QEVENLSGLS TNPEKDIFW RENGTTCLMA 60 

EFAAKFIVPY DVWASNYVDL ITEQADIALT RGAEVKGRCG HSQSELQVFW VDRAYALKML 120 

FVKESHNMSK GPEATWRLSK VQFVYDSSEK THFKDAVSAG KHTANSHHLS ALVTPAGKSY 180 

ECQAQQTISL ASSDPQKTVT MILSAVHIQP FDIISDFVFS EEHKCPVDER EQLEETLPLI 240 

LGLILGLVIM VTLAIYHVHH KMTANQVQIP RDRSQYKHMG 280 
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Seq ID NO: C365 Protein Sequence 
Protein Accession ft: NP_003217.1 

1 11 21 31 41 51 

I I I I 1 I 

MLGLVLALLS SSSAEEYVGL SANQCAVPAK DRVDCGYPHV TPKECNNRGC CFDSRIPGVP 
WCFKPLTRKT ECTF 

Seq ID NO i C366 Protein Sequence 
Protein Accession ft: NP_0029B4.1 

1 11 21 31 41 51 

I I I I I I 

MSLPSSRAAR VPGPSGSLCA LLALLLLLTP PGPLASAGPV SAVLTELRCT CLRVTLRVNP 
KTIGKLQVFP AGPQCSKVEV VASLKNGKQV CLDPEAPFLK KVIQKILDSG NKKN 

Seq ID NO: C367 Protein Sequence 
Protein Accession ft: NP 005233.2 



MRSPSAAWLb 
VDEFSASVLT 
DLLSVIWFPL 
MGKSRKKANI 
NYFLSLAIGV 
TPSNLLLWH 
CRSVRTVKQM 



11 
I 

GAAILLAASL 
GKLTTVFLPI 
KIAYHIHANN 
AIGISLAIWL 
FLFPAFLTAS 
YFLIKSQGQS 
QVSLTSKKHS 



21 
I 

SCSGTIQGTN 
VYTIVFWGIi 
WIYGEALCNV 
LIIiLVTIPLY 
AYVLMIRMLR 
HVYALYIVAL 
RKSSSYSSSS 



31 
I 

RSSKGRSLIG 
PSNGMALWVF 
LIGFFYGNMY 
WKQTIFIPA 
SSAMDENSBK 
CLSTLNSCID 
TTVKTSY 



41 

I 

KVDGTSHVTG 
LFRTKKKHPA 
CSILFMTCLS 
LNITTCHDVL 
KRKRAIKLIV 
PFVYYFVSHD 



Seq ID NO: C368 Protein Sequence 
Protein Accession #: NP 003460.1 



MAEAKTHWLG 
YIENLRQQAH 
AENEPQSAPK 
RTNEIVEEQY 
YEDWGGEDW 
DQLSDDVSKV 
PPEDLIEMLK 
RAGTEAIiPDG 
AAWIPHVENR 
EQIEQAIKEH 
KAEKGREHIA 



11 
I 

AALSLIPLIF 
KEESSPDYNP 
ENKPYALNSE 
TPQSLATLES 
NPVEEKIESQ 
IAYLKRLVNA 
TGEKPNGSVE 
LSVEDILNLL 
QMAYENLNDK 
LNQOSSQETD 
KRAMENM 



21 
I 

LISGAEAASF 
YQGVSVPLQQ 
KNFPMDHSDD 
VFQELGKLTG 
TQEEVRDSKE 
AGSGRLQNGQ 
PERELDLPVD 
GMESAANQKT 
DQELGEYIAR 
KLAPVSKRPP 



31 
I 

QRNQLLQKEP 
KENGDESHLP 
YETQQWPERK 
PNNQKRERMD 
NIGKNEQIND 
NGERATRLFE 
LDDISBADLD 
SYFPNPYNQE 
MLVKYPEIIN 
VGPPKNDDTP 



DLRLENVQKF 



LKHMQFPPMY 
EEQKLYTDDE 
EMKRSGQLGI 
KPLDSQSIYQ 
HPDLFQNRML 
KVLPRLPYGA 
SNQVKRVPGQ 
NRQYWDEDU, 



51 
I 

KGVTVETVFS 
VIYMANLALA 
VQRYWVIVNP 
PEQLLVGDMF 
TVLAMYLICF 
FRDHAKNALL 



51 
I 

PS P EMI RALE 
MRIILEALRQ 
EENSRDNPFK 
DDIYKANNIA 
QEEDLRKESK 
LIEISRNLQI 
SKSGYPKTPG 
GRSRSNQLPK 
GSSEDDLQEE 
MKVLEYLNQE 



60 
74 



60 
114 



60 

120 

180 

240 

300 

360 

397 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

617 



Seq ID NO: C369 Protein Sequence 
Protein Accession #: NP_112217.1 

1 11 21 31 41 51 

I I I I I I 

MPCAQRSWLA NLSWAQLLN FGALCYGRQP QPGPVRFPDR RQEHFIKGLP EYHWGPVRV 60 

DASGHFLSYG LHYPITSSRR KRDLDGSEDW VYYRISHEEK DLFFNLTVNQ GFLSNSYIME 120 

KRYGNLSHVK MMASSAPLCH LSGTVLQQGT RVGTAALSAC HGLTGFFQLP HGDFFIEPVK 180 

KHPLVEGGYH PHIVYRRQKV PETKEPTCGI* KDSVNISQKQ ELWREKWERH NLPSRSLSRR 240 

SISKERWVET LWADTKMIE YHGSENVESY ILTIMNMVTG LFHNPSIGNA IHIVWRLIL 300 

LEEEEQGLKI VHHAEKTLSS FCKWQKSINP KSDLNPVHHD VAVLLTRKDI CAGFNRPCBT 360 

LGLSHLSGMC QPHRSCNINE DSGLPLAFTI AHELGHSFGI QHDGKENDCE PVGRHPYIMS 420 

RQLQYDPTPL TWSKCSEEYI TRFLDRGWGF CLDDIPKKKG UCSKVIAPGV IYDVHHQCQL 480 

QYGPNATFCQ EVENVOQTLW CSVKGFCRSK LDAAADGTQC GEKKWCMAGK CITVGKKPES 540 

IPGGWGRWSP WSHCSRTCGA GVQSAERLCN NPEPKFGGKY CTGERKRYRL CNVHPCRSEA 600 

PTFRQMQCSE FDTVPYKNEL YHWFPIFNPA HPCELYCRPI DGQFSEKMLD AVIDGTPCFE 660 

GGNSRNVCIN GICKMVGCDY EIDSNATEDR CGVCLGDGSS CQTVRKMFKQ KEGSGYVDIG 720 

LIPKGARDIR VMEIEGAGNF LAIRSEDPEK YYLNGGFIIQ WNGNYKLAGT VFQYDRKGDL 780 

- EKLMATGPTN ESVWIQLLFQ VTNPGIKYEY TIQKDGLDND VEQMYFWQYG HWTECSVTCG 840 

TGIRRQTAHC IKKGRGMVKA TFCDPETQPN GRQKKCHEKA CPPRWWAGEW EACSATCGPH 900 

GEKKRTVLCI QTMVSDEQAL PPTDCQHLLK PKTIiLSCNRD ILCPSDWTVG NWSECSVSCG 960 

GGVRIRSVTC AKNHDEPCDV TRKPNSRALC GLQQCPSSRR VLKPNKGTIS NGKNPPTLKP 1020 

VPPPTSRPRM LTTPTGPESM STSTPAISSP SPTTASKEGD LGGKQWQDSS TQPELSSRYlt 1080 

ISTGSTSQPI LTSQSLSIQP SEENVSSSDT GPTSEGGLVA TTTSGSGLSS SRNPITWPVT 1140 

PPYNTLTKGP EMEIHSGSGE EREQPEDKDE SNPVIWTKIR VPGNDAPVES TEMPLAPPLT 1200 

PDLSRESWWP PPSTVMEGLL PSQRPTTSET GTPRVEGMVT EKPANTLLPL GGDHQPEPSG 1260 

KTANRNHLKI* PNNMNQTKSS EPVLTEEDAT SLITEGFLLN ASNYKQLTNG HGSAHWIVGN 1320 

WSECSTTCGL GAYWKRVECT TQMDSDCAAI QRPDPAKRCH LRPCAGWKVG NWSKCSRNCS 1380 

GGFKIREIQC VDSRDHRNLR PFHCQFLAGI PPPLSMSCNP EPCEAWQVEP WSQCSRSCGG 1440 

GVQERGVFCP GGLCDWTKRP TSTMSCNEHL CCHWATGNWD LCSTSCGGGF QKRIVQCVPS 1500 

EGNKTEDQDQ CLCDHKPRPP EFKKCNQQAC KKSADLLCTK DKLSASFCQT LKAMKKCSVP 1560 

TVRAECCFSC PQTHITHTQR QRRQRLLQKS KEL 1593 



Seq ID NO: C370 Protein Sequence 
Protein Accession ft: NP 001053.1 
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I 

MRQSHQLPLV 
LKLVGIQIQT 
KLENKFQAEI 
SQFSVDTGAM 
TFSTGEAMQA 
DINKDSSCVS 
VMEKAQKMND 
RNGENLEVRW 



1 

GLLLFSFIPS 
LMQKMIQQIK 
ENMEAHNGTP 
AVLALTCVKK 
LPVSSDYYNE 
ASGNFNI SAD 
TIFGFTMEER 
SKY 



I 

QLCEICEVSB 
YNVKSRLSDV 
LTNYYQLSLD 
SLINGQIKAD 
NDWNCQQTLN 
EPITVTPPDS 
SWGPYITCIQ 



I 

ENYIRLKPLL 
SSGELALI IL 
VLALCLFNGN 
EGSLKNISIY 
TVLTEISQGA 
QSYISVNYSV 
GLCANNNDRT 



NTMIQSNYNR 
ALGVCRNAEE 
YSTAEWNHF 
TKSLVEKILS 
FSNPNAAAQV 
RINETYFTNV 
YWELliSGGEP 



GTSAVNWLS 
NLIYDYHLTD 
TPENKNYYFG 
EKKENGL1GN 
LPALMGKTPL 
TVUIGSVFLS 
LSQGAGSYW 



Seq ID NO: C371 Protein Sequence 
Protein Accession ft: NP 004582.1 



1 11 21 31 41 51 

| I I I I I 

MCCTKSLLLA ALMSVLLLHL CGESEAASNF DCCLGYTDRI LHPKFIVGFT RQLANEGCDI 
NAIIFHTKKK LSVCANPKQT WVKYIVRLLS KKVKNM 

Seq ID NO: C372 Protein Sequence 
Protein Accession Jh NP_037403.1 

1 11 21 31 41 51 

I I I I I I 

MAGSPLLWGP RAGGVGLLVL LLLGLFRPPP ALCARPVKEP RGLSAASPPL AETGAPRRFR 
RSVPRGEAAG AVQELARALA HLLEAERQER ARAEAQEAED QQARVLAQLL RVWGAPRNSD 
PALGLDDDPD APAAQLARAL LRARLDPAAL AAQLVPAPVP AAALRPRPPV YDDGPAGPDA 
BEAGDETPDV DPELLRYLLG RIIiAGSADSE GVAAPRRLRR AADHDVGSEL PPEGVLGALL 
RVKRLETPAP QVPARRLLPP 



Seq ID NO: C373 Protein Sequence 
Protein Accession #: NP_O0223G.l 



MLQSLAGSSC 
RRFLEEHECL 
GHTVPLSDGG 
IVHAVLLGFV 
FRELYKIGIT 
FSSITDQAAG 



11 
I 

VRLVERHRSA 
SEQQLEQFLG 
KAFCIIYSVI 
TVSCFFFIPA 
CYLIiLGLIAM 
MKEDQKQNEP 



21 
I 

WCFGFIiVLGY 
RVLEASNYGV 
GIPFTLLFIiT 
AVFSVLEDDW 
LWLETFCEL 
PVATQSSACV 



31 41 

I I 
LLYLVFGAW FSSVELPYED 
SVLSNASGNW NWDFTSALFF 
AWQRITVHV TRRPVLYFHI 
NFLBSFYFCF ISLSTIGLGD 
HELKKFRKMP YVKKDKDEDQ 
DGPANH 



51 
I 

LLiRQELRKLK 
ASTVLSTTGY 
RWGFSKQWA 
YVPGEGYNQK 
VHIIEHDQLS 



60 

120 

180 

240 

300 

360 

420 

433 



60 
96 



60 

120 

180 

240 

260 



60 

120 

180 

240 

300 

336 



Seq ID NO: C374 Protein Sequence 
Protein Accession #: NP_005463.1 

1 11 21 31 41 51 

I I I I I 1 

METTNGTETW YESLHAVLKA LNATLHSNliL CRPGPGLGPD NQTEERRASL PGRDDNSYMY 
ILFVMFLFAV TVGSLILGYT RSRKVDKRSD PYHVYIKNRV SMI 

Seq ID NO: C375 Protein Sequence 
Protein Accession #t NP 005236.1 



1 
I 

KGRHLALLLL 
VYITHPAWEV 
KALEKNTNVE 
NGEFYYSFKD 
AKLTVHIEQA 
QQFRTVRSFP 
KAGPVKFEKD 
SILEPVKRQQ 
TIMSLSAVDP 
SDWGLPYRRE 
LVQYQIEAGN 
ITVAASHKIiV 
PTGIQVKENQ 
DRETTDKYTL 
IIQVEATDKD 
AREEPQLFST 
QVRYSLLDHG 
EWDVNENLH 
VFKIGEETGV 
EPVYYPEIME 
RKLDREQQDB 
DRERNARREP 
DILSIKAVDN 
SVEPPGIPLW 
QVFIKVIDTN 
S LKK.FR LD PA 
WFTASSYKGR 
SIKTAKELDR 
ETVSIGSFVG 
QGTNMAGLST 



11 
i 

LLLLFQHFGD 
RYKIVSGDSB 
ARTKVRVQVL 
RTDMFAIHPT 
NECAPVITAV 
GSKEYKVKAI 
VYRAEISEFA 
AAHFELEVTT 
DEGENGYVTY 
VEVLATITLN 
ELDLFSLNPN 
NLQCEETGVA 
PVGSSVIFMN 
NITVYDLGIP 
LGPNGHVTYS 
VWKVSLEDV 
EGNFDVDKLS 
PPVFSSFVEK 
IETSDRLDRE 
NSPKDVSWQ 
RILBVTVTDN 
LYRVIATDKD 
GRPQKSSTTR 
FDITGGNYDS 
DHRPQFSTSK 
TGSIjYTSEKL 
VYESAAVGSV 
SNQAEYDLMV 
MVTAHSQSSV 
NTTVLVHLQD 



21 
I 



NLFKAEEYIL 
DTNDLRPLFS 
SGVIVLTGRL 
TLSPSELDRD 



PPNTPWMVK 
SDRKASTKVL 
SIANIiNHVPF 
NLNDNTPLFB 
SGVLSLKRSL 
KMLAEKLLQA 
STDLDTGFNG 
QKAAWRliLHV 
ILTDTDTFSI 
NDNPPTFIPP 
GAVRIVQQLD 
GTVKEDAPVG 
STSHYWLTVF 
IEAFDPDSSS 
GSPPKSTIAR 
EGPNAEISYS 
LHIEWISKPK 
HFDVDKGTGT 
YEWIPEDTA 
DHEAVSPAHXj 
VLQVTALDKD 
KATDKGSPPM 
VYEIKDGNTG 
ENDNAPVFMQ 



31 
I 

PLQFTHLEYN 
GDFCFLRIRT 
PTSYSVSLPE 
DYLETKLYEM 
PAYATVTVDD 
GYNLTLQAKD 
AIPAYSHLRY 
VKVLGANSNP 
AIDHFTGAVS 
KINCEGTIPR 
MDGLGAKVSF 
NKLHNQGEVE 
KLVYAVSGGN 
VWDANDNPP 
DSVTGWNIA 
NYRVKVREDL 
FEKKQVYNLT 
SLVMTVSAHD 
ATDQGWPLS 
NDKLMYKITS 
VIVKILDEND 
IEDGNEHGKF 
QSLEPISFEE 
IIVAKPLDAE 
PETEILQISA 
TVMVRDQDVP 
KGKNAEVLYS 
SEITSVRIFV 
DAFDINPHSG 
AEYTGLISES 



41 
I 

VTVQENSAAK 
KGGNTAILNR 
NTAIRTSIAR 
EILAADRGMK 
CDQGANGDXA 
KGTPPQFSSV 
VFKRTPGKAK 
PEFTQTAYKA 
TSENLDYBLM 
DLGVGEQITT 
HSLRITATDG 
DIFFDSHSVN 
EDSCFMIDME 
EFLQESYFVE 
RPLDRELQHE 
PEGTVIMWLE 
VRAKDKGKPV 
BDAGRDGBIR 
SFIBIYIEVE 
GNPQGFFSIH 
NKPQFLQKFY 
FIEPKTGWS 
SFFTFTVMES 
QKSNYNLTVE 
VDQDEKNKLI 
VKRNFARIW 
IESGNIGNIG 
TIADNASPKF 
TIITQKALDF 
ASINSWLTD 



51 
I 

TYVGHPVKMG 
EVKDHYTLIV 
VSATDADIGT 
LYGSSGISSM 
SLSIVAGDLL. 
KVIHVTSPQF 
FSLNYNTGLI 
AFDENVPIGT 
PRVYTLRIRA 
VSAIDADELQ 
ENFATPLYIN 
AHIPQFRSTL 
TGMLKILSPZj 
VSEDKEVHSE 
HSLKIEARDQ 
AHDPDLGQSG 
SLSSTCYVEV 
YSIRDGSGVG 
DVNDNAPQTS 
PKTGIiITTTS 
KIRLPBREKP 
SKRFSAAGEY 
DPVAHMIGVI 
ATDGTTTILT 
YTLQSSRDPL 
NVSDTNDHAP 
NSFMIDPVU3 
TSKEYSVELS 
ETLPIYTLII 
RNVPLVIRAA 



60 
103 



60 

120 

100 

240 

300 

360 

420 

480 

540 

600 

660 

720 

7B0 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 
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DADKDSNALL VYHIVBPSVH TYFAIDSSTG AIHTVLSLDY EETSIFHPTV QVHDMGTPRL 1860 

FAEYAANVTV HVIDINDCPP VFAKPLYBAS LLLPTYKGVK VITVNATDAD SSAFSQLIYS 1920 

ITEGNIGEKF SMDYKTGALT VQNTTQLRSR YELTVRASDG RFAGLTSVKI NVKESKESHL 1980 

KFTQDVYSAV VKENSTEAET LAVITAIGSP INEPLFYHIL NPDRRFKISR TSGVLSTTGT 2040 

5 PFDREQQEAF DVWEVIEEH KPSAVAHVW KVIVEDQNDN APVPVNLPYY AWKVDTEVG 2100 

HVIRYVTAVD RDSGRNGEVH YYliKEHHEHF QIGPLGEISI# KKQPELDTLN KEYLVTWAK 21G0 

DGGNPAFSAE VIVPITVMNK AMPVFEKPFY SAEIAESIQV HSPWHVQAN SPEGLKVFYS 2220 

ITDGDPFSQF TINFNTGVIN VIAPLDFEAH PAYKLSIRAT DSL.TGAHAEV FVDUVDDIN 2280 

DNPPVFAQQS YAVTLSEASV IGTSWQVRA TDSDSEPNRG ISYQMFGNHS KSHDHFHVDS 2340 

10 STGLISLLRT LDYEQSRQHT IFVRAVDGGM PTLSSDVIVT VDVTDLNGNP PLFEQQIYEA 2400 

RISEHAPHGH FVTCVKAYDA DSSDIDKLQY SILSGNDHKH PVIDSATGII TLSNLHRHAL 2460 

KPFYSLNLSV SDGVFRSSTQ VHVTVIGGNL HSPAFLQNEY EVELAENAPL HTLVMEVKTT 2S20 

DGDSGIYGHV TYHIVNDFAK DRFYINERGQ IFTLEKLDRE TPAEKVISVR LMAKDAGGKV 2S80 

AFCTVNVILT DDNDNAPQFR ATKYEVNIGS SAAKGTSWK SASDADEGSN ADIT YAI BAD 2640 

15 SESVKENLEI NKLSGVITTK ESLIGLENEF FTFFVRAVDN GSPSKESWL VYVKTLPPBM 2700 

QLPKFSEPFY TFTVSEDVPV GTEIDLIRAE HSQTVLYSLV KGNTPESNRD ESFVIDRQSG 2760 

RLKLEKSLDH ETTKWYQFS I LARCTQDDHE MVASVDVS1Q VKDANDNSPV FESSPYEAFI 2820 

VENLPGGSRV IQIRASDADS GTNGQVMYSL DQSQSVBVIB SFAINMETGW ITTLKELDHE 2880 

KRDNYQIKW ASDHGEKIQL SSTAIVDVTV TDVNDSPPRF TAEIYKGTVS EDDPQGGVIA 2940 

20 ILSTTDADSE EINRQVTYFI TGGDPLGQFA VETIQNEWKV YVKKPLDREK RDNYLLTITA 3000 

TDGTFSSKAI VEVKVLDAND NSPVCEKTLY SDTIPEDVLP GKLIMQISAT DADIRSNAEI 3060 

TYTLLGSGAE KFKLNPDTGE LKTSTPLDRE EQAVYHLLVR ATDGGGRFCQ ASIWTLEDV 3120 

NDNAPEFSAD PYAITVFENT EPGTLLTRVQ ATDADAGLNR KILYSLIDSA DGQFSINELS 3180 

GIIQLEKPLD RELQAVYTLS LKAVDQGLPR RLTATGTVIV SVLDINDNPP VFEYREYGAT 3240 

25 VSEDIiiVGTE VLQVYAASRD IEANAEITYS IISGNEHGKF SIDSKTGAVF IIENIiDYESS 3300 

HEYYLTVEAT DGGTPSLSDV ATVNVNVTDI NDNTPVFSQD TYTTVISEDA VLEQSVITVM 3360 

ADDADGPSNS HIHYSIIDGN QGSSFTIDPV RGEVKVTKLL DRETISGYTL TVQASDNGSP 3420 

PRVNTTTVNI DVSDVNDNAP VFSRGNYSVI IQENKPVGFS VLQLWTDED SSHNGPPFFF 3480 

TIVTGNDEKA FEVNPQGVLL TSSAIKRKEK DHYLLQVKVA DNGKPQLSSL TYIDIRVIEE 3540 

30 SIYPPAILPL EIFITSSGEE YSGGVIGKIH ATDQDVYDTb TYSLDPQMDN LFSVSSTGGK 3600 

LIAHKKLDIG QYLLNVSVTD GKFTTVADIT VHIRQVTQEM LNHTIAIRFA NLTPEEFVGD 3660 

YWRNFQRALR NILGVRRNDI QIVSLQSSEP HPHLDVLLFV EKPGSAQIST KQLLHKINSS 3720 

VTDIEEIIGV RILNVFQKLC AGLDCPWKFC DEKVSVDESV MSTHSTARLS FVTPRHHRAA 3780 

VCLCKEGRCP PVHHGCEDDP CPEGSECVSD PWEEKHTCVC PSGRFGQCPG SSSMTLTGNS 3840 

35 YVKYRLTENE NKLEMKLTMR LRTYSTHAW MYARGTDYSI LEIHHGRLQY KFDCGSGPGI 3900 

VSVQSIQVND GQWHAVALEV NGNYARLVLD QVHTASGTAP GTLKTLNLDN YVFFGGHIRQ 3960 

QGTRHGRSPQ VGNGFRGCMD SIYLNGQELP LNSKPRSYAH IEESVDVSPG CFLTATEDCA 4020 

SNPCQNGGVC NPSPAGGYYC KCSALYIGTH CEISVNPCSS NPCLYGGTCV VDNGGFVCQC 4080 

RGLYTGQRCQ LSPYCKDEPC KNGGTCFDSL DGAVCQCDSG FRGERCQSDI DECSGNPCLH 4140 

40 GALCENTHGS YHCNCSHEYR GRHCEDAAPN QYVSTPWNIG LAEGIGIWF VAGIFLLVW 4200 

FVLCRKMISR KKKKQAEPKD KHLGPATAFL QRPYFDSKLN KNIYSDIPPQ VPVRPISYTP 4260 

SIPSDSRNNL DRNSFEGSAI PEHPEFSTFN PESVHGHRKA VAVCSVAPNIi PPPPPSNSPS 4320 

DSDSIQKPSW DFDYDTKWD LDPCLSKKPL EEKPSQPYSA RESLSEVQSL SSFQSBSCDD 4380 

NGYHWDTSDW MPSVPLPDIQ EFPNYEVIDE QTPLYSADPN AIDTDYYPGG YDIESDFPPP 4440 

45 PEDFPAADEL PPLPPEFSNQ FESIHPPRDM PAAGSLGSSS RNRQRFNLNQ YLPNFYPLDM 4500 

SEPQTKGTGE NSTCREPHAP YPPGYQRHFE APAVESMPMS VYASTASCSD VSACCEVESE 4560 

VMMSDYESGD DGHFEEVTIP PLDSQQHTEV 4590 

Seq ID NO: C376 Protein Sequence 
50 Protein Accession #: NP_05S035.1 

1 11 21 31 41 51 

I I I I I I 

MCYGKCARCI GHSLVGLALL CIAANILLYF PNGETKYASE NHLSRFVWFF SGIVGGGLLM 60 

55 LLPAFVFIGL EQDDCCGCCG HENCGKRCAM LSSVLAALIG IAGSGYCVIV AALGLAEGPL 120 

CLDSLGQWNY TFASTEGQYL LDTSTWSECT EPKHTVEWNV SLFSILLALG GIEFILCLIQ 180 

VINGVI/3GIC GFCCSHQQQY DC 202 

Seq ID NO: C377 Protein Sequence 
60 Protein Accession Jh NP_003750.l 

1 11 21 31 41 51 

I I 1 I I I 

MSTENVEGKP SNLGERGRAR SSTFIiRWQP MFNHSIFTSA VSPAAERIRF ILGEEDDSPA 60 

65 PPQLFTELDE LLAVDGQEME WKETARWIKF EEKVEQGGER WSKPHVATLS LHSLFELRTC 120 

MEKGSIMLDR EASSLPQLVE MIVDHQIETG LLKPBLKDKV TYTLLRKHRH QTKKSNLRSL 180 

ADIGKTVSSA SRMFTNPDNG SPAMTHRNLT SSSLNDISDK PEKDQLKNKF MKKLPRDAEA 240 

SNVLVGEVDF LDTPPIAFVR LQQAVMLGAL TBVPVPTRFL FILIX3PKGKA KSYHEIGRAI 300 

ATLMSDEVFH DIAYKAKDRH DLIAGIDEFL DEVIVLPPGE WDPAIRIEPP KSLPSSDKRK 360 

70 NMYSGGENVQ MNGDTPHDGG HGGGGHGDCE ELQRTGRFCG GLIKDIKRKA PFFASDFYDA 420 

LNIQALSAIL FIYLATVTNA ITFGGLLGDA TDNMQGVLES FLGTAVSGAI FCLFAGQPLT 480 

ILSSTGPVLV FERLLFNFSK DNNFDYLEFR LWIGLMSAFL CLILVATDAS FLVQYFTRFT 540 

EEGFSSLISF IFIYDAFKKM IKXADYYPIN SNFKVGYNTL FSCTCVPPDP ANISISNDTT 600 

LAPEYLPTMS STDMYHNTTF DWAFLSKKEC SKYGGNLVGN NCNFVPDITL MSFILFLGTY 660 

75 TSSMALKKFK TSPYFPTTAR KLISDFAIIXj SILIFCVIDA LVGVDTPKLI VPSEFKPTSP 720 

NRGWPVPPFG ENPWWVCIAA AIPALLVTIL IFMDQQITAV IVNRXEHKLK KGAGYHLDLF 780 

WVAILMVICS LMALPWYVAA TVISIAHIDS LKMBTBTSAP GEQPKFLGVR EQRVTGTLVF 840 

ILTGLSVFMA PILKFIPMPV LYGVFLYMGV ASLNGVQFMD RLKLLLMPLK HQPDFIYLRH 900 

VPLRRVHLFT FIjQVLCLALL WILKSTVAAI IFPVMIIALV AVRKGMDYLF SQHDLSFLDD 960 

80 VIPEKDKKKK EDEKKKKKKK GSLDSDNDDS DCPYSEKVPS IKIPMDIMEQ QPFLSDSKPS 1020 

DRERSPTFLE RHTSC 1035 

Seq ID NO: C37B Protein Sequence 
Protein Accession ih NP_000949.1 
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1 11 21 31 41 51 

I I I I I I 

MSTPGVNSSA SLSPDRLNSP VTIPAVMFIF GWGNLVAIV VLCKSRKEQK ETTFYTLVCG 60 

LAVTDLLGTL LVSPVTIATY MKGQWPGGQP LCEYSTFILL FFSLSGLSII CAMSVERYLA 120 

INHAYFYSHY VDKRLAGLTL FAVYASNVLF CALPNMGLGS SRLQYPDTWC FIDWTTNVTA 180 

HAAYSYMYAG FSSFLILATV LCNVLVCGAL LRMHRQFMRR TSLGTEQHHA AAAASVASRG 240 

HPAASPALPR LSDFRRRRSF RRIAGAEIQM VILLIATSLV VLICSIPLW RVFVNQLYQP 300 

SLEREVSKNP DLQAIRIASV NPILDPWIYI LLRKTVLSKA IEKIKCLFCR IGGSRRERSG 360 

QHCSDSQRTS SAMSGHSRSF ISRELKBISS TSQTLLPDLS LPDLSENGLG GRNLLPGVPG 420 

MGLAQEDTTS LRTLRISETS DSSQGQDSES VLLVDEAGGS GRAGPAPKGS SLQVTFPSET 480 
LNLSEKCI 



488 



Seq ID NO: C379 Protein Sequence 
Protein Accession ft: NP_002650.1 

1 11 21 31 41 51 

I I I I I I 

MGHPPLLPLL LLLHTCVPAS WGLRCMQCKT NGDCRVEECA LGQDLCRTTI VRLWEEGEEL 60 

ELVEKSCTHS EKTNRTLSYR TGLKITSLTE WCGLDLCNQ GNSGRAVTYS RSRYLECISC 120 

GSSDMSCERG RHQSLQCRSP EEQCLDWTH WIQEGEEGRP KDDRHLRGCG YLPGCPGSNG 180 

FHNNDTFHFL KCCNTTKCNE GPILELENLP QNGRQCYSCK GNSTHGCSSE ETFLIDCRGP 240 

MNQCLVATGT HEPKNQSYMV RGCATASMCQ HAHLGDAFSM NHIDVSCCTK SGCNHPDLDV 300 

QYRSGAAPQP GPAHLSLTIT LLMTARLWGG TLLWT 335 

Seq ID NO: C380 Protein Sequence 
Protein Accession ft: BAB55406.1 

1 11 21 31 41 51 

I 1 I I I I 

MDFESGQVDP LASVILPPNL LENLSPEDSV LVRRAQFTFF NKTGLFQDVG PQRKTLVSYV 60 

MACSIGNITI QNLKDPVQZK IKHTRTQEVH HPICAFWDLN KNKSFGGWNT SGCVAHRDSD 120 

ASETVCLCNH FTHFGVLMDL PRSASQLDAR NTKVLTFISY IGCGISAIFS AATLLTYVAF 180 

EKLRRDYPSK ILMNLSTALL FLNLLFLLDG WITSFNVDGL CIAVAVLLHF FLLATFTWMG 240 

LEAIHMYIAL VKVFNTYIRR YILKFCHGW GLPALWSW LASRNNNEVY GKESYGKEKG 300 

DEFCWIQDPV IFYVTCAGYF GVMFPLNIAM FIWMVQICG RNGKRSNRTL REEVLRNLRS 360 

WSLTFLLGM TWGFAFFAWG PLNIPFMYLF SIFNSLQGLF IFIFHCAMKE NVQKQWRRHL 420 

CCGRFRLADN SDWSKTATNI IKKSSDNLGK SLSSSSIGSN STYLTSKSKS SSTTYFKRNS 480 

HTDNVSYEHS FNKSGSLRQC PHGQVLVKTG PC 512 

Seq ID NO: C381 Protein Sequence 
Protein Accession ft: NP_O00S65.1 

1 11 21 31 1 41 51 

t I I I I I 

MTVARPSVPA ALPIiLGELPR LLLLVLLCLP AVWGDCGLPP DVPNAQPALE GRTSFPEDTV 60 

ITYKCEESFV KIPGEKDSVI CLKGSQWSDI EEFCNRSCEV PTRLNSASLK QPYITQNYFP 120 

VGTWEYECR PGYRREPSLS PKLTCLQNLK W STAVE FCKK KSCPNPGEIR NGQIDVPGGI 180 

LFGATISFSC NTGYKLFGST SSFCLISGSS VQWSDPLPEC REIYCPAPPQ IDNGI IQGER 240 

DHYGYRQSVT YACNKGFTMI GEHSIYCTVN NDEGEWSGPP PECRGKSLTS KVPPTVQKPT 300 

TVNVPTTEVS PTSQKTTTKT TTPNAQATRS TPVSRTTKHF HETTPNKGSG TTSGTTRLLS 360 

GHTCFTLTGL LGTLVTMGLL T 381 

seq ID NO: C382 Protein Sequence.. 
Protein Accession ft: Eos sequence 

1 11 21 31 41 SI 

I I I I I I 

MDTSRLGVLL SLPVLLQLAT GGSSPRSGVL LRGCPTHCHC EPDGRMLLRV DCSDLGLSEL 60 

PSNLSVFTSY LDLSMNNISQ LLPNPLPSLR PLEELRLAGM ALTYIPKGAF TGLYSLKVLM 120 

LQNNQLRHVP TEALQNLRSL QSLRLDANHI SYVPPSCFSG LHSLRHLWLD DNALTBIPVQ 180 

AFRSLSALQA MTLALNKIHH IPDYAFGNLS SLWLHLHNN RIHSLGKKCF DGLHSLETLD 240 

LNYNNLDEFP TAIRTLSNLK ELHFYDNPIQ FVGRSAFQHL PELRTLTLNG ASQITEFPDIi 300 

TGTANLESLT LTGAQISSLP QTVCNQLPNL QVLDLSYNLL EDLPSFSVCQ KLQKIDLRHN 360 

EIYEIKVDTF QQLLSLRSLN LAWNKI AI IH PNAFSTLPSL IKLDLSSNLL SSFPITGLHG 420 

. LTHLKLTGNH ALQSLISSEN FPELKVIEMP YAYQCCAPGV CENAYKISNQ WNKGDNSSMD 480 

DLHKKDAGMF QAQDERDLED FLLDFBEDLK ALHSVQCSPS PGPFKPCEHL LDGWLIRIGV 540 

WTIAVLALTC NALVTSTVFR SPLYISPIKL LIGVIAAVNM LTGVSSAVLA GVDAFTFGSF 600 

ARHGAWWENG VGCHVIGFLS IFASESSVFL LTLAALERGF SVKYSAKFET KAPFSSLKVI 660 

ILLCALLALT MAAVPLLGGS KYGASPLCLP LPFGEPSTMG YMVALILLNS LCFLMMTIAY 720 

TKLYCNLDKG DLENIWDCSM VKHIALLLFT NCILNCPVAF LSFSSLINLT FISPEVIKFI 780 

LLVWPLPAC LNPLLYILFN PHFKEDLVSL RKQTYVWTRS KHPSLMSINS DDVEKQSCDS 840 

TQALVTFTSS SITYDLPPSS VPSPAYPVTE SCRLSSVAFV PCL 883 

Seq ID NO: C383 Protein Sequence 
Protein Accession ft: NP_003658.1 

1 11 21 31 41 51 

I I I I I I 

MDTSRLGVLL SLPVLLQLAT GGSSPRSGVL LRGCPTHCHC EPDGRMLLRV DCSDLGLSEL 60 

PSNLSVFTSY LDLSMNNISQ LLPNPLPSLR FLEELRLAGN ALTYIPKGAF TGLYSLKVLM 120 

LQNNQLRHVP TEALQNLRSL QSLRLDANHI SYVPPSCFSG LHSLRHLWLD DNALTEIPVQ 180 

AFRSLSALQA MTLALNKIHH IPDYAFGNLS SLWLHLHNN RIHSLGKKCF DGLHSLETLD 240 

LNYNNLDEFP TAIRTLSNLK ELGFHSNNIR SIPEKAFVGN PSLITIHFYD NPIQFVGRSA 300 
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FQHLPELRTL TLNGASQITE 
YNL.LEDL.PSF SVCQKLQKID 
LPSLIKLDLS SNLLSSFPIT 
AFGVCENAYK ISNQWNKGDN 
CSPSPGPFKP CEHLLDGWLI 
AVNMLTGVSS AVLAGVDAFT 
ERGFSVKYSA KFETKAPFSS 
STMGYMVALI LLNSLCFLMM 
PVAFLSFSSL INLTFISPEV 
WTRSKHPSLM SINSDDVEKQ 
VAFVPCL 

Seq ID NO i C3B4 Protein Sequence 
Protein Accession #: NP_003497.1 

1 11 21 31 41 Si 

| | 1 1 I I 

MEMFTFLLTC IFLPLLRGHS LPTCEPITVP RCMKMAYNMT FFPNLMGHYD QSIAAVEMEH 60 
FLPLANLECS PNIETFLCKA FVPTCIEQIH WPPCRKLCE KVYSDCKKLI DTFGIRWPEE 120 
LECDRLQYCD ETVPVTFDPH TEFLGPQKKT EQVQRDIGFW CPRHLKTSGG OGYKPLGIDQ 180 
CAPPCPNMYP KSDELEFAKS FIGTVSIFCL CATLFTPLTF LIDVRRFRYP EHPIIYYSVC 240 
YSIVSLMYFI GFLLGDSTAC NKADEKLELG DTWLGSQNK ACTVLFMLLY FFTMAGTVWW 300 
VILTITWFLA AGRKWSCEAI EQKAVWFHAV AWGTPGFLTV MLLALNKVEG DNISGVCFVG 360 
LYDLDASRYF VLLPLCLCVF VGLSLLLAGI ISLNHVRQVI QHDGRNQEKL KKFMIRIGVF 420 
SGLYLVPLVT LLGCYVYEQV NRITWEITWV SDHCRQYHIP CPYQAKAKAR PELALFMIKY 480 
LMTLIVGISA VFWVGSKKTC TEWAGFFKRN RKRDPISESR RVLQESCEFF LKHNSKVKHK 540 
KKHYKPSSHR LKVISKSMGT STGATANHGT SAVAITSHDY LGQBTLTEIQ TSPETSMREV 600 
KADGASTPRL REQDCGEPAS PAASISRLSG BQVDGKGQAG SVSESARSEG RISPKSDITD 660 
TGLAQSNNLQ VPSSSEPSSL KGSTSLLVHP VSGVRKEQGG GCHSDT 706 

Seq ID NO: C385 Protein Sequence 
Protein Accession #: NP_000573 

1 11 21 31 41 51 

I | I I I I 

MRIAVICFCL LGITCAIPVK QADSGSSEEK QLYNKYPDAV ATWLNPDPSQ KQNLLAPOTL 60 
PSKSNESHDH MDDMDDEDDD DHVDSQDSID SNDSDDVDDT DDSHQSDESH HSDESDELVT 120 
DFPTDLPATE VFTPWPTVD TYDGRGDSW YGLRSKSKKF RRPDIQYPDA TDEDITSHME 180 
SEELNGAYKA IPVAQDLNAP SDWDSRGKDS YETSQLDDQS AETHSHKQSR LYKRKANDES 240 
NEHSDVIDSQ ELSKVSREFH SHEPHSHEDM LWDPKSKEE DKHLKFRISH ELDSASSEVN 300 

Seq ID NO: C386 Protein Sequence 
Protein Accession #: NP_002812 

1 11 21 31 41 51 

I I I I I I 

MGAARGSPAR PRRLPLLSVL LLPLLGGTOT AIVFIKQPSS QDALQGRRAL LRCEVEAPGP 60 

VHVYWLLDGA PVQDTERRFA QGSSLSFAAV DRLQDSGTFQ CVARDDVTGE EARSANASFN 120 

IKWIEAGPW LKHPASEAEI QPQTQVTLRC HIDGHPRPTY QWFRDGTPLS DGQSNHTVSS 180 

KERNLTLRPA GPEHSGLYSC CAHSAFGQAC SSQNFTLSIA DBSFARVVLA PQDVWARYE 240 

EAMFHCQPSA QPPPSLQWLF EDETPITNRS RPPHLRRATV FANGSLLLTQ VRPRNAGIYR 300 

CIGQGQRGPP IILEATLHLA EIEDMPLFEP RVFTAGSEER VTCLPPKGLP EPSVWWEHAG 360 

VRLPTHGRVY QKGHELVLAN IAESDAGVYT CHAANLAGQR RQDVNITVAT VPSWLKKPQD 420 

SQLEEGKPGY LDCLTQATPK PTWWYRNQM LISEDSRFEV FKNGTLRINS VEVYDGTWYR 480 

CMSSTPAGSI EAQARVQVLE KLKFTPPPQP QQCMEFDKEA TVPCSATGRE KPTIKWERAD 540 

GSSLPEWVTD NAGTLHFARV TRDDAGNYTC IASNGPQGQI RAHVQLTVAV FITFKVEPER 600 

TTVYQGHTAL LQCEAQGDPK PLIQWKGKDR ILDPTKLGPR MHIFQNGSLV IHDVAPEDSG 660 

RYTCIAGNSC NIKHTBAPLY WDKPVPEES EGPGSPPPYK MIQTIGLSVG AAVAYIIAVL 720 

GLMFYCKKRC KAKRLQKQPE GEEPEMECLN GGPLQNGQPS AEIQEEVALT SIX3SGPAATN 780 

KRHSTSDKMH FPRSSLQPIT TLGKSEFGEV FLAKAOGLEB GVAETLVLVK SIjQTKDEQQQ 840 

LDFRRELEMF GKLNHANWR LLGLCREAEP HYMVLEYVDL GDLKQFLRIS KSKDEKLKSQ 900 

PLSTKQKVAL CTQVALGMEH LSNNRFVHKD LAARNCLVSA QRQVKVSALG LSKDVYNSEY 960 

YHFRQAWVPI* RWMSPEAILE GDFSTKSDVW AFGVLMWEVF THGEMPHGGQ ADDEVIADLQ 1020 

AGKARLPQPB GCPSKLYRLM QRCWALSPKD RPSFSEIASA LGDSTVDSKP 1070 

. Seq ID NO: C387 Protein Sequence 
Protein Accession #■ NP_002300.1 

1 11 21 31 41 51 

| 1 I I I I 

MKVLAAGWP LLLVLHWKHG AGSPLPITPV NATCAIRHPC HNNLMNQIRS QIAQLNGSAN 60 

ALFILYYTAQ GEPFPNNLDK LCGPNVTDFP PFHANGTEKA KLVELYRIW YLGTSLGNIT 120 

RDQKILNPSA LSLHSKLNAT ADILRGLLSN VLCRLCSKYH VGHVDVTYGP DTSGKDVFQK 180 

KKLGCQLLGK YKQIIAVLAQ AF 202 

Seq ID NO: C3B8 Protein Sequence 
Protein Accession #: XP_097508 

1 11 21 31 41 51 

I I I I.I I 

MGRPRLTLVC HVSIIISARD LSMNNLTELQ PGLFHHLRFL EELRLSGNHL SHIPGQAFSG 60 

LYSIiKILMLQ NNQLGGIPAE ALWELPSLQS LRLDANLISL VPERSFEGLS SLRHLWLDDN 120 

ALTEIPVRAL NNLPALQAMT LALNRISHIP DYAFQNLTSL WLHLHNNRI QHLGTHSFEG 180 

LHNLETLDLN YNKLQEFPVA IRTLGRLQEL GFHNNNIKAI PEKAFMGNPL IiOTIHFYDNP 240 



FPDLTGTANL ESLTLTGAQI SSLPQTVCNQ LPNLQVLDLS 360 

LRHNEIYEIK VDTFQQLLSL RSLNLAWNKI AIIHPNAFST 420 

GLHGLTHLKL TGNHALQSLI SSENFPEUCV IEMPYAYQCC 480 

SSMDDLHKKD AGMFQAQDER DLEDFLLDFE BDLKALHSVQ 540 

RIGVWTIAVL ALTCNALVTS TVFRSPLYIS PIKLLIGVIA 600 

FGSFARHGAW HENGVGCHVI GFLSIFASES SVFLLTLAAIi 660 

LKVIILLCAL LALTMAAVPL LGGSKYGASP LCLPLPFGEP 720 

TIAYTKLYCN LDKGDLENIW DCSMVKHIAL LLFTNCILNC 780 

IKFILLWVP LPACLNPLLY .ILFNPHFKED LVSLRKQTYV 840 

SCDSTQALVT FTSSSITYDL PPSSVPSPAY PVTESCHLSS 900 

907 



1371 
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IQFVGRSAFQ YLPKLHTLSL NGAMDIQEFP DLKGTTSLEI LTLTRAGIRL LPSGMCQQLP 300 

RLRVLELSHN QIEELPSLHR CQKLEEIGLQ HNRIHEIGAD TFSQLSSLQA LDLSWNAIRS 360 

IHPEAFSTLH SLVKLDLTDN QLTTLPLAGL GGLMHLKLKG NLALSQAFSK DSFPKLRILE 420 

VPYAYQCCPY GMCASFFKAS GQWEAEDIiHL DDEESSKRPL GLLARQAENH YDQDLDELQL 480 

5 EMEDSKPHPS VQCSPTPGPF KPCEYLFESW GIRLAVWAIV- LLSVLCNGLV LLTVFAGGPV 540 

PLPPVKFWG AIAGANTI/TG ISCGLLASVD ALTFGQFSEY GARWETGLGC RATGFLAVLG 600 

SEASVLLLTL AAVQCSVSVS CVRAYGKSPS LGSVRAGVLG CLALAGLAAA LPLASVGEYG 660 

ASPLCLPYAP PEGQPAALGF TVALVMMNSF C PL WAG AY I KLYCDLPRGD FEAVWDCAMV 720 

RHVAWLIFAD GLLYCPVAFL SFASMLGLFP VTPEAVKSVL LWLPLPACL NPLLYLLFNP 780 

10 HFRDDLRRLR PRAGDSGPLA YAAAGELEKS SCDSTQALVA FSDVDLILEA SEAGRPPGLE 840 

TYGFPSVTLI SCQQPGAPRL EGSHCVEPEG NHFGNPQPSM DGELLLRAEG STPAGGGLSG 900 

GGGFQPSGLA FASHV 915 

Seq ID NO: C389 Protein Sequence 
15 Protein Accession #: NP_S7090l 

1 11 21 31 41 Si 

I I I I I I 

MASLVSLELG LLLAVLVVTA TASPPAGLLS LLTSGQGALD QEALGGbLNT LADRVHCTNG 60 

20 PCGKCLSVED ALGLGEPEGS GLPPGPVLEA RYVARLSAAA VLYLSNPEGT CEDTRAGLWA 120 

SHADHLLALL ESPKALTPGL SWLLQRMQAR AAGQTPKTAC VDIPQLLEEA VGAGAPGSAG 180 

GVLAALLDHV RSGSCFHALP SPQYFVDFVF QQHSSEVPMT LAELSALMQR LGVGREAHSD 240 

HSHRHRGASS RDPVPLISSS NSSSVWDTVC LSARDVMAAY GLSEQAGVTP EAWAQLSPAL 300 

LQQQLSGACT SQSRPPVQDQ LSQSERYLYG SLATLLICLC AVFGLLLLTC TGCRGVAHYI 360 

25 LQTFLSLAVG ALTGDAVLHL TPKVLGIiHTH SEEGLSPQPT WRLLAMLiAGL YAFFLFENLF 420 

NLLLPRDPED LEDGPOGHSS HSHGGHSHGV SLQIAPSELR QPKPPHEGSR ADLVAEESPE 480 

LLNPEPRRLS PELRLLPYMI TLGDAVHNFA DGLAVGAAFA SSWKTGLATS LAVFCHELPH 540 

ELGDFAALLH AGLSVRQALL LNLASALTAF AGLYVALAVG VSEESEAWIL AVATGLFLYV 600 

ALCDMLPAML KVRDPRPWLL FLLHNVGLLG GWTVLLLLSL YEDDITF 648 



30 



65 



Seq ID NO: C390 Protein Sequence 
Protein Accession #: NP_061844 



1 11 21 31 41 51 

35 | | | I I I 

MANASEPGGS GGGEAAALGL KLATLSLLLC VSUVGNVLFA LLIVRERSLH RAPYYLLLDL 60 

CLADGLRALA CLPAVKLAAR RAAAAAGAPP GALGCKLLAF LAALFCFHAA FLLLGVGVTR 120 

YLAIAHHRFY AERLAGWPCA AMLVCAAWAL ALAAAFPPVL DGGGDDEDAP CALEQRPDGA 180 

PGALGFLLLL AWVGATHLV YLRLLFFIHD RRKMRPARLV PAVSHDWTFH GPGATGQAAA 240 

40 NWTAGFGRGP TPPALVGIRP AGPGRGARRL LVLEEFKTEK RLCKMFYAVT LLFLLLWGPY 300 

WASYLRVLV RPGAVPQAYL TASVWLTFAQ AGINPWCFL FNRELRDCFR AQFPCCQSPR 360 

TTQATHPCDL KGIGL 375 

Seq ID NO: C391 Protein Sequence 
45 Protein Accession #♦. NP_005622 

1 11 21 31 41 51 

! I I 111 

MAAARPARGP ELPLLGLLLL LLLGDPGRGA ASSGNATGPG PRSAGGSARR SAAVTGPPPP 60 

50 LSHCGRAAPC EPLRYNVCLG SVLPYGATST LLAGDSDSQE EAHGKLVLWS GLRNAPRCWA 120 

VIQPLLCAVY MPKCENDRVE LPSRTLCQAT RGPCAIVERE RGWPDFLRCT PDRFPEGCTN 180 

EVQNIKFNSS GQCEVPLVRT DNPKSWYEDV EGCGIQCQNP LFTEAEHQDM HSYIAAFGAV 240 

TGLCTLFTLA TFVADWRNSN RYPAVILFYV NACFFVGSIG WLAQFMDGAR REIVCRADGT 300 

MRLGEPTSNE TLSCVIIFVI VYYALMAGW WFWLTYAWH TSFKALGTTY QPLSGKTSYF 360 

55 HLLTWSLPPV LTVAILAVAQ VDGDSVSGIC FVGYKNYRYR AGFVLAPIGI* VLIVGGYFLI 420 

RGVMTLFSIK SNHPGLLSEK AASKINETML RLGIFGFLAF GFVLITFSCH FYDFFNQAEW 480 

ERSFRDYVLC QANVTIGLPT KQPIPDCEIK NRPSLLVEKI NLFAMFGTGI AMSTWVWTKA 540 

TLLIWRRTWC RLTGQSDDEP KRIKKSKMIA KAFSKRHELL QNPGQELSFS MHTVSHDGPV 600 

AGLAFDLNEP SADVSSAWAQ HVTKMVARRG AILPQDISVT PVATPVPPEE QANLWLVEAE 660 

60 ISPELQKRLG RKKKRRKRKK EVCPLAPPPE LHPPAPAPST IPRLPQLPRQ KCLVAAGAWG 720 

AGDSCRQGAW TLVSNPFCPE PSPPQDPFLP SAPAPVAWAH GRRQGLGPIH SRTNLMDTEL 780 

MDADSDF 788 



Seq ID NO: C392 Protein Sequence 
Protein Accession #: BAC04382 



1 11 21 31 41 51 

I I I I I I 

MGARSGARGA LLLALLLCWD PRLSQAGRKR SGEVLPDSFP SAPAEPLPYF LQEPQDAYIV 60 

70 KNKPVELRCR AFPATQIYFK CNGEWVSQND HVTQEGLDEA TLGARGGLRV REVQIEVSRQ 120 

QVEELFGLED YWCQCVAWSS AGTTKSRRAY VRIAYLRKNF DQEPLGKEVP LDHEVLLQCR 180 

PPEGVPVAEV EWLKNEDVID PTQDTNFLLT IDHNLIIRQA RLSDTANYTC VAKNIVAKRR 240 

STTATVIVYV NGGWSSWAEW SPCSNRCGRG WQKRTRTCTN PAPLNGGAFC EGQAFQKTAC 300 

TTICPVDGAW TEWSKWSACS TECAHWRSRE CMAPPPQNGG RDCSGTLLDS KNCTDGLCMQ 360 

75 NKKTLSDPNS HLLEASGDAA LYAGLWAIF WVAILMAVG VWYRRNCRD FDTDITDSSA 420 

ALTGGFHPVN FKTARPSNPQ LLHPSVPPDI» TASAGIYRGP VYALQDSTDK IPMTNSPLLD 480 

PLPSLKVEVY SSSTTGSGPG LADGADLLGV LPPGTYPSDF ARDTHFLHLR SASLGSQQLL 540 

GLPRDPGSSV SGTFGCLGGR LSIPGTGVSL LVPNGAIPQG KFYEMYLLIN KAESTLPLSE 600 

0rt GTQTVLSPSV TCGPTGLLLC RPVILTMPHC AEVSARDWIF QLKTQAHQGH WEEWTLDBE 660 

80 TLNTPCYCQL EPRACHILLD QLGTYVFTGE SYSRSAVKRL QLAVFAPALC TSLEYSLRVY 720 

CLEDTPVALK EVLELERTLG GYLVEEPKPL MFKDSYHNLR LSLHDLPHAH WRSKliLAKYQ 780 

EIPFYHIWSG SQKALHCTFT LERHSLASTE LTCKICVRQV EGEGQIFQLH TTLABTPAGS 840 

LDTIiCSAPGS TVTTQLGPYA FKIPLSIRQK ICNSLDAPNS RGNDWRMLAQ KLSMDRYLNY 900 

FATKASPTGV ILDLWEALQQ DDGDLNSLAS ALEEMGKSEM LVAVATDGDC 950 

1372 
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Seq ID NO: C393 Protein Sequence 
Protein Accession fl: NP__004616 

1 11 21 31 41 51 

| | I 1 I I 

MNRKARRCLG HLFLSLGMVY LRIGGFSSW ALQASIICNK IPGLAPRQRA ICQSRPDAII 60 
VIGEGSQMGL DECQFQFRNG RWNCS ALGER TVFGKELKVG SREAAFTYAI IAAGVAHAIT 120 
AACTQGNLSD CGCDKEKQGQ YHRDEGWKWG GCSADIRYGI GFAKVFVDAR EIKQNARTLM 180 
NLHNNEAGRK ILEENMKLEC KCHGVSGSCT TKTCWTTLPQ FRELGYVLKD KYNEAVHVEP 240 
VRASRNKRPT FLKIKKPLSY RKPMDTDLVY IEKSPNYCEE DPVTGSVGTQ GRACNKTAPQ 300 
ASGCDLMCCG RGYNTHQYAR VWQCNCKFHW CCYVKCNTCS ERTEMYTCK 349 



Seq ID NO: C394 Protein Sequence 
Protein Accession #: NP_003777 

1 11 * 21 31 41 51 

1 | I I I I 

MDALCGSGEL GSKFWDSNLS VHTENPDLTP CFQNSLIAWV PCIYLWVALP CYLLYLRHHC 60 

RGYIILSHLS KLKMVLGVLL WCVSWADLFY SFHGLVHGRA PAPVFFVTPL WGVTMLLAT 120 

LLIQYERLQG VQSSGVLIIF WFLCWCAIV PFRSKILLAK AEGEISDPFR FTTFYIHFAL 180 

VLSALILACF REKPPFFSAK NVDPNPYPET SAGFLSRLFF WWFTKMAIYG YRHPLEEKDL 240 

WSLKEEDRSQ MWQQLLEAW RKQEKQTARH KASAAPGKNA SGEDEVLLGA RPRPRKPSFL 300 

KALLATFGSS FLISACFKLI QDLLSFINPQ LLSILIRFIS NPMAPSWWGF LVAGLMFLCS 360 

MMQSLILQHY YHYIFVTGVK FRTGIMGVIY RKALVITNSV KRASTVGEIV NLMSVDAQRF 420 

MDLAPFLNUj WSAPLQIIIA lYFLWQNLGP SVLAGVAFMV'LLIPLNGAVA VK MRAF QVKQ 480 

MKLKDSRIKL MSEILNGIKV LKLYAWEPSF LKQVEGIRQG ELQLLRTAAY LHTTTTFTWM 540 

CSPFLVTLIT LWVYVYVDPN NVLDAEKAFV SVSLFNILRL PLNMLPQLIS NLTQASVSLK 600 

RIQQFLSQEE LDPQSVERKT ISPGYAITIH SGTFTWAQDL PPTLHSLDIQ VPKGALVAW 660 

GPVGCGKSSL VSALLGEMEK LEGKVHMKGS VAYVPQQAWI QNCTLQENVL FGKALNPKRY 720 

QQTLEACALL ADLEMLPGGD QTEIGEKGIN LSGGQRQRVS LARAVYSDAD IFLLDDPLSA 780 

VDSHVAKHIF DHVIGPEGVL AGKTRVLVTH GISFLPQTDF I IVLADGQVS EMGPYPALLQ 840 

RNGSFANFLC NYAPDEDQGH LEDSWTALEG AEDKEALLIE DTLSNHTDLT DNDPVTYWQ 900 

KQFMRQLSAL SSDGEGQGRP VPRRHLGPSE KVQVTEAKAD GALTQEEKAA IGTVELSVFW 960 

DYAKAVGLCT TLAICLLYVG QSAAAIGANV WLSAWTNDAM ADSRQNNTSL RLGVYAALGI 1020 

LQGFLVMLAA MAMAAGGIQA ARVLHQALLH NKIRSPQSFF DTTPSGRILN CFSKDIYWD 1080 

EVLAPVILML LNSFFNAIST LWIMASTPL FTWILPLAV LYTLVQRFYA ATSRQLKRLE 1140 

SVSRSPIYSH FSETVTGASV IRAYNRSRDF EIISDTKVDA NQRSCYPYII SNRWLSIGVE 1200 

FVGNCWLFA ALFAVIGRSS LNPGLVGLSV SYSLQVTFAL NWMIRMMSDL ESNIVAVERV 1260 

KEYSKTETEA PWWEGSRPP EGWPPRGEVE FRNYSVRYRP GLDLVLRDLS LHVHGGEKVG 1320 

IVGRTGAGKS SMTLCLFRIL EAAKGEIRID GLNVADIGLH DLRSQLTIIP QDPILFSGTL 1380 

RMNLDPFGSY SEBDIWWALE LSHLHTFVSS QPAGLDFQCS EGGENLSVGQ RQLVCLARAI* 1440 

LRKSRILVLD EATAAIDLET DNLIQATIRT QFDTCTVLTI AHRLNTIMDY TRVLVLDKGV 1500 

VAEFDSPANL IAARGIFYGM ARDAGLA 1527 



Seq ID NO: C395 Protein Sequence 
Protein Accession ft» NP_004617 

1 11 21 31 41 51 

I 1 I I i I 

MRARPQVCEA LLFALALQTG VCYGIKWLAL SKTPSALALN QTQHCKQLEG LVSAQVQLCR 60 

SNLBLMHTW HAAREVMKAC RRAFADMRWN CSSIELAPNY LLDLERGTRE SAFVYALSAA 120 

AISHAIARAC TSGDLPGCSC GPVPGEPPGP GNRWGGCADN LSYGLLMGAK FSDAPMKVKK 180 

TGSQANKLMR LHNSEVGRQA LPASLEMKCK CHGVSGSCSI RTCWKGLQEL QDVAADLKTR 240 

YLSATKWHR PMGTRKHLVP KDLDIRPVKD SELVYLQSSP DFCMKNEKVG SHGTQDRQCN 300 

KTSNGSDSCD LMCCGRGYNP YTDRWERCH CKYHWCCYVT CRRCERTVER YVCK 354 

Seq ID NO: C396 Protein Sequence 
Protein Accession #: NP_114072 

1 11 21 31 41 51 

| | I I I I 

MEWGYLLEVT SLLAALALLQ RSSGAAAASA KELACQEITV PLCKGIGYNY TYMPNQFNHD 60 

TQDEAGIiEVH QFWPLVEIQC SPDLKFFLCS MYTPICLEDY KKPLPPCRSV CERAKAGCAP 120 

LMRQYGFAWP DRMRCDRLPE QGNPDTLCMD YNRTDLTTAA PSPPRRLPPP PPGEQPPSGS 180 

GHGRPPGARP PHRGGGRGGG GGDAAAPPAR GGGGGGKARP PGGGAAPCEP GCQCRAPMVS 240 

VSSERHPLYN RVKTGQIANC ALPCHNPFFS QDERAFTVFW IGLWSVLCFV STFATVSTFL 300 

IDMERFKYPE RPIIPLSACY LFVSVGYLVR LVAGHEKVAC SGGAPGAGGA GGAGGAAAGA 360 

GAAGAGAGGP GGRGEYEELG AVEQHVRYET TGPALCTWF LLVYFFGMAS SIWWVILSLT 420 

WFIiAAGMKWG NEAIAGYSQY FHLAAWLVPS VKSIAVLALS SVDGDPVAGI CYVGNQSLDN 480 

LRGFVLAPLV IYLFIGTMFL LAGFVSLFRI RSVIKQQDGP TKTHKLEKLM IRLGbFTVLY S40 

TVPAAWVAC LFYEQHNRPR WEATHNCPCL RDLQPDQARR PDYAVFMLKY FMCLWGITS 600 

GVWVWSGKTL ESWRSLCTRC CWASKGAAVG GGAGATAAGG GGGPGGGGGG GPGGGGGPGG 660 

GGGSLYSDVS TGLTWRSGTA SSVSYPKQMP LSQV 694 



Seq ID NO: C397 Protein Sequence 
Protein Accession #: XP_050625 

1 11 21 31 41 51 

| | 1 I 1 I 

MLQGPGSLLL LFLASHCCLG SARGLFIiFGQ PDFSYKRSNC KPI PANLQLC HGIEYQNMRL 60 

PNLLGHETMK EVLEQAGAWI PLVMKQCHPD TKKFLCSLFA PVCLDDLDET IQPCHSLCVQ 120 

VKDRCAPVMS AFGFPWPDML ECDRFPQDND LCIPLASSDH LLPATEEAPK VCEACKNKND 180 

DDNDIMETLC KNDFALKIKV KEITYINRDT KIILETKSKT IYKLNGVSER DLKKSVLWLK 240 

1373 
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DSLQCTCEEM NDINAPYLVM GQKQGGELVI TSVKRWQKGQ RKFKRISRSI RKLQC 

Seq ID NO: C3 98 Protein Sequence 
Protein Accession tt: NP 001297.1 



1 11 21 31 41 51 

I I I I I I 

MSMGLEITGT ALAVLGWLGT IVCCALPMWR VSAPIGSNII TSQNIWEGLW MNCWQSTGQ 60 

MQCKVYDSLL ALPQDLQAAR ALIWAILLA AFGLLVALVG AQCTNCVQDD TAKAKITIVA 120 

10 GVLPLLAALL TLVPVSWSAN TIIRDFYNPV VPEAQKREMG AGLYVGWAAA ALQLLGGALL 1B0 
CCSCPPREKK YTATKWYSA PRSTGPGASL GTGYDRKDYV 220 



Seq ID NO: C399 Protein Sequence 
Protein Accession «: NP 036581.1 



1 11 21 31 41 51 

I I I I I I 

MESRKDITNQ EELWKMKPRR NLEEDDYLHK DTGETSMLKR PVLLHLHQTA HADEFDCPSE 60 

OA LQHTQELFPQ WHLPIKIAAI IASLTFLYTL LREVIHPLAT SHQQYFYKIP ILVINKVLPM 120 

20 VSITLLALVY LPGVIAAIVQ LHNGTKYKKF PHWLDKWMLT RKQFGLLSPF FAVLHAIYSL 180 

SYPMRRSYRY KLLNWAYQQV QQNKEDAWIE HDVWRMEIYV SLGIVGLAIL AUAVTSIPS 240 

VSDSLTWREF HYIQSKLGIV SLLLGTIHAL IFAWNKWIDI KQFVWYTPPT FMIAVFLPIV 300 

VLIFKSILFL PCLRKKILKI RHGWEDVTKI NKTEICSQL . 339 

25 Seq ID NO: C400 Protein Sequence 
Protein Accession 8: NP_001766.i 

1 11 21 31 .41 51 

nf\ 1 I I I I I 

J\J MANCEFSPVS GDKPCCRLSR RAQLCLGVSI LVLILVWLA VWPRWRQTW SGPGTTKRFP 60 

ETVLARCVKY TEIHPEMRHV DCQSVWDAFK GAFISKHPCN ITEEDYQPLM KLGTQTVPCN 120 

KILLWSRIKD IiAHQFTQVQR DMFTLEDTLL GYLADDLTWC GEFNTSKINY QSCPDWRKDC 180 

SNNPVSVFWK TVSRRFAEAA CDWHVMLNG SRSKIFDKNS TFGSVEVHNL QPEKVQTLEA 240 

WVIHGGREDS RDLCQDPTIK ELESIISKRN IQFSCKNIYR PDKFLQCVKN PEDSSCTSEI 300 



Seq ID NO: C401 Protein Sequence 
Protein Accession 8: XP 120513.2 



1 11 21 31 41 51 

40 | i | | | | 

KVSCTFSGPL RETNENVKKF YALRAFMFRM SSEAAMLGES RTPKPRKHRA TTRAKIFKRF 60 

FSEGSESNSR LVEELAVIHT YSDDPAPTTS PSSVQPREFG VMOGAPRARF GSRTPPAAAE 120 

ASSPHLGIGE AACQSGARAA APRAGARROQ PQRQAAAAAA TAQTHTLPHA RTRADPAGRR 180 

RRHPRSPAPG GEGTCSEGPA PRRRMEEEMQ PAEEGPSVPK IYKQRSPYSV LKTFPSKRPA 240 

45 LAKRYERPTL VELPHGHLRT PAQPPPASPA ASSSSSFAAV VRLGAPPRPP RRGFRARGTI 300 

PPLLPAPGVA GTLLPPPTSS SPPSPRPRPW HAAAPRGGTS HTHMMRSQST LPGSDTMVSV 360 

FGLMAQRRWQ HRSLKQFEWG ILGSHGTWPC GQDWLEKEGQ VAVLLPRSEG NTAPKKSRMI 420 

LDAFAQQCSR VLSLLNCGGK LLDSNHSQSM ISCVKQEGSS YNBRQEHCHI GKGVHSQTSD 480 

NVDIEMQYMQ RKQQTSAFLR VFTDSLQNYL LSGSFPTPNP SSASEYGHLA DVDPLSTSPV 540 

HTLENISLDS TASLCKSRHL SREPPVKSDF PNPLQQALAG GASRPFSGAQ QSIAYRVNSE 600 

LEDGIRSPVP LSCEALEMDL TSLGSKQLLN NYPVYITSKQ WDEAVNSSKK DGRRLLRYLI 660 

RFVFTTDELK YSCGLGKRKR SVQSGETGPE RRPLDPVKVT CLRGTASFRS VSPSVISFHR 720 

IGCGSPRTSV QPSVF 735 

55 Seq ID NO: C402 Protein Sequence 
Protein Accession §: BAA92562.1 



1 11 21 31 41 51 

i I I I 1 I 

METTVLSGIN FEYKGMTGWE VAGDHIYTAA GASDNDFMIL TLWPGFRPP QSVMADTENK 60 

EVARITFVFE TLCSVNCELY FMVGVNSRTN TPVETWKGSK GKQSYTYIIE ENTTTSFTWA 120 

FQRTTFHEAS RKYTNDVAKI YSINVTNVMN GVASYCRPCA LEASDVGSSC TSCPAGYYID 180 

RDSGTCHSCP PNTILKAHQP YGVQACVPCG PGTKNNKIHS LCYNDCTFSR NTPTRTFNYN 240 

FSALANTVTIi AGGPSFTSKG LKYFHHFTIiS LCGNQGRKMS VCTDNVTDLR IPEGESGFSK 300 

SITAYVCQAV IIPPEVTGYK AGVSSQPVSL ADRLIGVTTD MTLDGITSPA ELFHLESLGI 360 

PDVIFFYRSN DVTQSCSSGR STTIRVRCSP QKTVPGSLLL PGTCSDGTCD GCNFHPLWES 420 

AAACPLCSVA DYHAIVSSCV AGIQKTTYVW REPKLCSGGI SLPEQRVTIC KTIDFWLKVG 480 

ISAGTCTAIL LTVLTCYFWK KNQKLEYKYS KLVMNATLKD CDLPAADSCA IMEGEDVBDD 540 

LIFTSRKSLP GKIKSFTSKQ PAPVTISLSE DS 572 

Seq ID NO: C403 Protein Sequence 
Protein Accession #: NP 055139.1 



1 11 21 31 41 51 

75 | | | | | | 

MALQGISWE LSGLAPGRXC AMVLADFGAR WRVDRPGSR YDVSRLGRGK RSLVLDLKQP 60 

REPRAAASVQ AVGCAAGALP PRCHGETPAG PRDSAAGKSK AYLCQAEWIW PVQESFCRLA 120 

GHDINYLALS GVLSKIGRSG ENPYAPLNLV ADFAGGGLMC ALGIIMALFD RTRTDKGQVI 180 

DANMVEGTAY LSSFLWKTQK SSLWEAPRGQ NMLDGGAPFY TTYRTADGBF MAVGAIEPQF 240 

OO YELLIKGLGL KSDELPNQMS TDDWPEMKKK FADVFAKKTK AEWCQIFDGT DACVTPVLTF 300 

EEWHHDHNK ERGSFITSEE QDVSPRLAPL LUTTPAIPSS KGDPFIGEHT EEILEEFGFS 360 

REEIYQLNSD KIIESNKVKA Sh 382 

Seq ID NO: C404 Protein Sequence 
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Protein Accession ft: XP_091332.1 

1 11 21 31 41 SI 

c I I I t I I 

J MQRHTLWAAA FLTLHSAQAP PQTDISISPA LPELPLPSLC PLFWMEFKGH CYRFFPLNKT 60 

WAEADLYCSE FSVGRKSAKL ASIHSWBENV FVYDLVNSCV PGIPADVWTG LHDHRQEGQF 120 

EWTDGSSYDY SYWDGSQPDD GVHADPBEED CVQIWYRPTS EQLQAPEPQL PLSISKATDV 180 

YIiPEDFPAEP KLMDQSWVSR KSLKPSKSHL MEPPTPVAKH QKAKTRHRSL RQVWWPSGKA 240 

A GSWKERMNAD YGRRKRSAPR QEGRLRCRER RLRAASGQGR PEGQRKQRQQ ERQERGWEEL 300 

10 GGVSPMRGAQ AWQHGLGAGS QRGAAPECGE NHQAPELGST WRGQRLQPQT AALCHFALRK 360 

LPGNAHGLAA AFVQPALQVQ EEKNNRTRFS GAYFTMSDPT CDQDSKEQSL RRHGREAEKD 420 

GPYRLVKKKR GPVACPSSFE LQSGGEVCLD FPVBLRAGTW IAREPP 466 



15 



35 



Seq ID NO: C40S Protein Sequence 
Protein Accession ft: XP_054869.2 



; f * j 1 11 21 31 * 41 SI 

I I I I I ! 

MHTCCPPVTL EQDLHRKMHS WMLQTLAFAV TSLVLSCAET IDYYGEICDN ACPCEEKDGI 60 

20 LTVSCENRGI ISLSEISPPR FPIYHLLLSG NLLNRLYPNE FVNYTGASIL HLGSNVIQDI 120 

ETGAFHGLRG LRRLHLNNNK LELLRDDTFL GLENLEYLQV DYNYISVIEP NAFGKIiHLLQ 180 

VLILNDNLLS SLPNNLFRFV PLTHLDLRGN RLKUiPYVGL LQHMDKWEL QLEENPWNCS 240 

CELISLKDWL DSISYSALVG DWCETPFRL HGRDLDEVSK QELCPRRLIS DYEMRPQTPL 300 

STTGYLHTTP ASVNSVATSS SAVYKPPLKP PKGTRQPNKP RVRPTSRQPS KDLGYSNYGP 360 

25 SIAYQTKSPV PLECPTACSC NLQISDLGLN VNCQERKIES IAELQPKPYN PKKMYLTENY 420 

IAWRRTDFL EATGLDLLHL GNNRISMIQD RAFGDLTNLR RLYLNGNRIE RLSPELFYGL 480 

QSLQYLFLQY NLIREIQSGT FDPVPNLQLL FLNMNLLQAM PSGVFSGLTL LRLNLRSNHF 540 

TSLPVSGVLD QLRSLIQIDL HDNPWDCTCD IVGMKLWVEQ LKVGVLVDEV ICKAPKKFAE 600 

TDMRSIKSEL LCPDYSDVW STPTPSSIQV PARTSAVTPA VRLNSTGAPA SLGAGGGASS 660 

30 VPLSVLILSL LLVPIMSVFV AAGLFVLVMK RRKKNQSDHT STNNSDVSSF NMQYSVYGGG 720 

GGTGGHPHAH VHHRGPALPK VKTPAGHVYE YIPHPLGHMC KNPIYRSREG NSVEDYKDLH 780 

ELKVTYSSNH HLQQQQQPPP PPQQPQQQPP PQLQLQPGEE ERRESHHLRS PAYSVSTIEP 840 

REDLLSPVQD ADRFYRGILE PDKHCSTTPA GNSLPEYPKP PCSPAAYTFS PNYDLRRPHQ 900 

YLHPGAGDSR LREPVLYSPP SAVFVEPNRN EYLELKAKLN VEPDYLEVLE KQTTFSQF 958 



50 
55 
60 
65 



Seq ID NO: C406 Protein Sequence 
Protein Accession ft: NP_000784.2 



1 11 21 31 41 SI 

40 | | | | | | 

MGILSVDLLI TLQILPVFFS NCLFLALYDS VILLKHWLL LSRSKSTRGB WRRMLTSEGL . 60 

RCVWKSFLLD AYKQVKLGED APNSSWHVS STEGGDNSGN GTQEKIAEGA TCHLLDFASP 120 

ERPLWNFGS ATUPPFTSQI* PAFRKLVEEF SSVADFLLVY IDBAHPSDGW AIPGDSSLSF 180 

EVKKHQNQED RCAAAQQLLB RFSLPPQCRV VADRMDNNAN IAYGVAFERV CIVQRQKIAY 240 

45 LGGKGPFSYN LQEVRHWLEK NFSKRUKKTR LAG 273 



Seq ID NO: C407 Protein Sequence 
Protein Accession ft: NP_006540.2 

1 11 21 31 41 51 

I I I I I I 

MSSCVSSQPS SNRAAPQDEL GGRGSSSSES QKPCEALRGL SSLSIHLGME SFIWTECEP 60 

GCAVDLGLAR DRPLEADGQE VPLDSSGSQA RPHLSGRKLS LQERSQGGLA AGGSLDMNGR 120 

CICPSLPYSP VSSPQSSPRL PRRPTVESHH VSITGMQDCV QLNQYTLKDE IGKGSYGWK 180 

LAYNENDNTY YAMKVLSKKK LIRQAGFPRR PPPRGTRPAP GGCIQPRGPI EQVYQEIAIL 240 

KKLDHPNWK LVEVLDDFNE DHLYMVFELV NQGPVMEVPT LKPLSEDQAR FYFQDLIKGI 300 

EYLHYQKIIH RDIKPSNLLV GEDGHIKIAD FGVSNEFKGS DALLSNTVGT PAFMAPESLS 360 

ETRKIFSGKA LDVHAMGVTL YCFVFGQCPF MDERIMCLHS XIKSQALEFP DQPDIAEDLK 420 

DLITRMLDKN PESRIWPEI KLHPWVTRHG AEPLPSEDEN CTLVEVTEEE VENSVKHIPS 480 

LATVILVKTM IRKRSFGNPF EGSRREERSL SAPGNLLTKK PTRECESLSE LKEARQRRQP 540 

PGHRPAPRGG GGSALVRGSP CVESCWAPAP GSPARMHPLR PEEAMEPB 588 

Seq ID NO: C408 Protein Sequence 
Protein Accession ft: NP_061116.2 



.1 11 21 31 41 51 

I I I I I I 

MGLSLPREKG LILCLWSKFC RWFQRRESWA QSRDEQNLLQ QKRIWESPLL LAAKDNDVQA 60 

LNKLLKYEDC KVHQRGAMGE TALHIAALYD NLEAAMVLMB AAPELVFEPM TSEtrYEGQTA 120 

70 LHIAWNONM NLVRALLARR ASVSARATGT AFRRSPCNLI YFGEHPLSFA ACVNSEEIVR 180 

LLIEHGADIR AQDSLGNTVL HILILQPNKT FACQMYNLLL SYDRHGDHLQ PLDLVPNHQG 240 

LTPFKLAGVE GNTVMPQHLM QKRKHTQWTY GPLTSTLYDI* TEIDSSGDEQ SLLELIITTK 300 

KREARQILDQ TPVKELVSLK WKRYGRPYFC MLGAIYLLYI ICFTMCCIYR PLKPRTNNRT 360 

SPRDNTLLQQ KLLQEAYMTP KDDIRLVGEL VTVIGAIIIIi LVEVPDIFRM GVTRFFGQTI 420 

75 LGGPFHVLII TYAFMVLVTM VMRLISASGB WPMSFALVL GWCNVMYFAR GFQMLGPFTI 480 

MIQKMIFGDIi MRFCWLMAW ILGFASAFYI IPQTEDPEEL GHFYDYPMAL FSTFELFLTI 540 

IDGPANYNVD LPFHYSITYA AFAIIATLLM LNLLIAMMGD THWRVAHERD KLWRAQIVAT 600 

TVMLERKLPR CLWPRSGICG REYGLGDRWF LRVKDRQDLN RQRIQRYAQA FHTRGSEDLD 660 

KDSVEKLELG CPFSPHLSLP MPSVSRSTSR SSANWERLRQ GTLRRDLRGI I NRG LEDGES 720 

80 WEYQI 725 

Seq ID NO: C409 Protein Sequence 
Protein Accession ft: NP_068710.1 
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1 11 21 31 41 51 

I I 1 I I i 

MQKVTLGLLV FLAGFPVLDA NDLEDKNSPF YYDWHSLQVG GLICAGVLCA MGIIIVMSEW 60 

RSSGEQAGRG WGSPPLTTQL SPTGAKCKCK FGQKSGHHPG ETPPLITPGS AQS 113 

Seq ID NO: C410 Protein Sequence 
Protein Accession if: NP_005962.1 

1 11 21 31 41 51 

I I I I 1 I 

MQKVTLGLLV FLAGFPVLDA NDLEDKNSPF YYDWHSLQVG GLICAGVLCA MGIIIVMSAK 60 
CKCKFGQKSG HHPGETPPLI TPGSAQS 87 

Seq ID NO: C411 Protein Sequence 
Protein Accession #: NP_004952.1 

1 11 21 31 41 51 

I I I I I I 

MLSKVLPVLL GILLILQSRV EGPQTESKNE ASSRDWYGP QPQPLENQLL SEETKSTETE 60 

TGSRVGKLPE ASRILNTILS NYDHKLRPGI GEKPTWTVE IAVNSLGPLS ILDMEYTIDI 120 

IFSQTWYDER LCYNDTFESL VLNGNWSQL WIPDTFFRNS KRTHEHEITM PNQMVRIYKD 180 

GKVLYTIRMT IDAGCSLHML RFPMDSHSCP LSFSSFSYPE NEMIYKWENF KLEINEKNSW 240 

KLFQFDPTGV SNKTEIITTP VGDFMVMTIF FNVSRRFGYV AFQNYVPSSV TTMLSWVSFW 300 

IKTESAPART SLGITSVLTM TTLGTFSRKN FPRVSYITAL DFYIAICFVF CFCALLEFAV 360 

LNFLIYNQTK AHASPKLRHP RINSRAHART RARSRACARQ HQEAFVCQIV TTEGSDGEER 420 

PSCSAQQPPS PGSPEGPRSL CSKLACCEWC KRFKKYFCMV PDCEGSTWQQ GRLCIHVYRL 480 

DNYSRWFPV TFFFFNVLYW LVCLNL 506 

Seq ID NO: C412 Protein Sequence 
protein Accession Jh NP_068819.1 

1 11 21 31 41 51 

I I I I I I 

MEYTIDIIFS QTWYDERLCY NDTFESLVLN GNWSQLWIP DTFFRNSKRT HEHEITMPNQ 60 

MVRIYKDGKV LYTIRMTIDA GCSLHMLRFP MDSHSCPLSF SSFSYPENEM IYKWENFKLE 120 

INEKNSWKLF QLDFTGVSNK TEIITTPVGD FMVMTIFFNV SRRFGYVAFQ NYVPSSVTTM 180 

LSWVSFWIKT ESAPARTSLG ITSVLTMTTL GTFSRKNFPR VSYITALDFY IAICFVFCFC 240 

ALLEFAVLNF LIYNQTKAHA SPKLRHPRIN SRAHARTRAR SRACARQHQE AFVCQIVTTE 300 

GSDGEERPSC SAQQPPSPGS PEGPRSLCSK LACCEWCKRF KKYFCMVPDC EGSTWQQARL 360 



CIHVYRLDNY SRWFPVTFF FFNVLYWLVC LNL 393 

Seq ID NO: C413 Protein Sequence 
Protein Accession #: NP_068822.1 

1 11 21 31 41 51 

I I I I I I 

MEYTIDIIFS QTWNSKRTHE HEITMPNQMV RIYKDGKVLY TIRMTIDAGC SLHMLRFPMD 60 

SHSCPLSFSS FSYPENEMIY KWENFKLEIN EKNSWKLFQF DFTGVSNKTE IITTPVGDFM 120 

VMTIFFNVSR RFGYVAFQNY VPSSVTTMLS WVSFWIKTES APARTSLGIT SVLTMTTLGT 180 

FSRKNFPRVS YITALDFYIA ICFVFCFCAL LEFAVLNFLI YNQTKAHASP KLRHPRINSR 240 

AHARTRARSR ACARQHQEAF VCQIVTTEGS DGEERPSCSA QQPPSPGSPE GPRSLCSKLA 300 

CCEWCKRFKK YFCMVPDCEG STWQQGRLCI HVYRLDNYSR WFPVTFFFF NVLYWLVCLN 360 

L 361 



Seq ID NO: C414 Protein Sequence 
Protein Accession <h NP__068830.1 

1 11 21 31 41 51 

11(111 

MEYTIDIIFS QTWYDERLCY NDTFESLVLN GNWSQLWIP DTFFRNSKRT HEHEITMPNQ 60 
MVRIYKDGKV LYTIRMTIDA GCSLHMLRFP MDSHSCPLSF SSFSYPENEM IYKWENFKLE 120 
INEKNSWKLF QFDFTGVSNK TEIITTPVGD FMVMTIFFNV SRRFGYVAFQ NYVPSSVTTM 180 
LSWVSFWIKT ESAPARTSLG ITSVLTMTTL GTFSRKNFPR VSYITALDFY IAICFV FCFC 240 
ALLEFAVLNF LIYNQTKAHA SPKLRHPRIN SRAHARTRAR SRACARQHQE AFVCQIVTTE 300 
GSDGEERPSC SAQQPPSPGS PEGPRSLCSK LACCEWCKRF KKYFCMVPDC EGSTWQQGRL 360 
CIHVYRLDNY SRWFPVTFF FFNVLYWLVC LNL 393 

Seq ID NO: C415 Protein Sequence 
Protein Accession #: NPJ)68S91.1 

1 11 21 31 41 51 

I I I I I I 

MPAVSGPGPL FCLLLLLLDP HSPETGCPPL RRFEYKLSFK GPRLALPGAG IPFWSHHGDA 60 

ILGLEEVRLT PSMRNRSGAV WSRASVPFSA WBVEVQMRVT GLGRRGAHGM AVWYTRGRGH 120 

VGSVLGGLAS WDGIGIFFDS PAEDTQDSPA IRVLASDGHI PSEQPGDGAS QGLGSCHWDF 180 

RNRPHPFRAR ITYWGQRLRM SLNSGLTPSD PGEFCVDVGP LLLVPGGFFG VSAATGTLAD 240 



DHDVLSFLTF SLSEPSPEVP PQPFLEMQQL RLARQLEGLW ARLGLGTRED VTPKSDSEAQ 300 

GEGBRLFDLE ETLGRHRRIL QALRGLSKQL AQAERQWKKQ LGPPGQARPD GGWALDASCQ 360 

IPSTPGRGGH LSMSLNKDSA KVGALLHGQW TLLQALQEMR DAAVRMAAEA QVSYLPVGIE 420 

HHFLELDHIL GLLQEELRGP AKAAAKAPRP PGQPPRASSC LQPGIFLFYL LIQTVGFFGY 480 

VHFRQELNKS LQECLSTGSL PLGPAPHTPR ALGILRRQPL PASMPA 526 



Seq ID NO: C416 Protein Sequence 
Protein Accession ft: XP_117036.1 
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1 11 21 31 41 51 

I I I I I I 

MERRTRGALG SRRPPPPLPA LRHLCTGLQA AGMAWPGTLW RHTCQGRAXA AEGPWGLFRP 60 

HRCPREAGQA PVGPSPETQG VAHVCSRARV SVDEREPGGG AYAMHVTPRW KGCHRHSGRT 120 

VRGSVSWKRP EQAAPETGRG PAVARGSGDG NECGWG 156 

Seq ID NO: C417 Protein Sequence 
Protein Accession #: XP_167B03.2 

1 11 21 31 41 51 

I I I 1 I I 

MPGKGQRKTA TNKPGGLPGA PGVGIGGHCL YVLECKCFIK NKTKTHHHKK KNFAAKRNEE 60 
KLKKKKKQEK KNHTKFFHHT YPLSQQDFLF AKSYFCGNGP CFLWQGLF 108 

Seq ID NO: C41B Protein Sequence 
Protein Accession #: NP_079056.1 

1 11 21 31 41 51 

I I I I I t 

MFRLVERYEM PRHEVYVLLI RNIFLKISII GILCYYWLNT VALSGEECWB TLIGQDIYRL 60 

LLMDFVFSLV NSFLGEFLRR IIGMQLITSL GLQEFDIARN VLELIYAQTL VWIGIFFCPL 120 

LPFIQMIMLF IMFYSKNISL MMNFQPPSKA WRASQMMTFF , IFLLFFPSFT GVLCTLAITI 180 

WRLKPSADCG PFRGLPLFIH SIYSWIDTLS TRPGYLWWW IYRNLIGSVH FFFILTLIVL 240 

IITYLYWQIT EGRKIMIRLL HEQIINEGKD KMFLIEKLIK LQDMEKKANP SSLVLERREV 300 

EQQGFLHLGE HDGSLDLRSR RSVQEGNPRA 330 

Seq ID NO: C419 Protein Sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

III I I I 

MLSDDHVNEI IIQVENVSSG VQSHPSSNQI FQEKVLLDSS INMVLSISDI DVIDSQTVSK 60 

RNDQKGNQVL RESTS LNESM SQTLHSLECM GIDTPGSSHE TVQGQKLIAS LIPMTSRDRI 120 

KAIRNQPRTM EEKRNLRKIV DKEKSKQTHR ILQLNCCIQC LNSISRAYRR SKNSLSEILN 180 

SISLWQKTLK I IGGKFGTSV LSYFNFLRWL LKFNIFSFI L • NFSFI I IPQF TVAKKNTLQF 240 

TGLEFFTGVG YFRDTVMYYG FYTNSTIQHG NSGASYNMQL AYIFTIGACL TTCFFSLLFS 300 

MAKYFRNNFI NPHIYSGGIT KLIFCWDFTV THEKAVKLKQ KNLSTEIREN LSELRQENSK 360 

LTFNQLLTRF SAYMVAWWS TGVAIACCAA VYYLAEYNLE FIiKTHSNPGA VLLLPFWSC 420 

INLAVPCIYS MFRLVERYEM PRHEVYVLLI RNIFLKISII GILCYYWLNT VALSGEECWE 480 

TLIGQDIYRL LLMDFVFSLV NSFLGEFLRR IIGMQLITSL GLQEFDIARN VLELIYAQTL 540 

VWIGIFFCPL LPFIQMIMLF IMFYSKNISL MMNFQPPSKA WRASQMMTFF IFLLFFPSFT 600 

GVLCTLAITI WRLKPSADCG PFRGLPLFIH SIYSWIDTLS TRPGYLWWW IYRNLIGSVH 660 

FFFILTLIVL IITYLYWQIT EGRKIMIRLL HEQIINEGKD KMFLIEKLIK LQDMEKKANP 720 

SSLVLERREV EQQGFLHLGE HDGSLDLRSR RSVQEGNPRA 760 

Seq ID NO: C420 Protein Sequence 
Protein Accession ft: NP_002241.1 

1 11 21 31 41 51 

I I I I I I 

MGGDLVLGLG ALRRRKRLLE QEKSLAGWAL VLAGTGIGLM VLHAEMLWFG GCSWALYLFL 60 

VKCTISISTF LLLCLIVAFH AKEVQLFMTD NGLRDWRVAL TGRQAAQTVL ELWCGLHPA 120 

PVRGPPCVQD LGAPLTSPQP WPGFLGQGEA LLSLAMLLRL YLVPRAVLLR SGVLLNASYR 180 

SIGALNQVRF RHWFVAKLYM NTHPGRLLLG LTLGLWLTTA WVLSVAERQA VNATGHLSDT 240 

LWLIPITFLT IGYGDWPGT MWGKIVCLCT GVMGVCCTAL LVAWARKLE FNKAEKHVHN 300 

FMMDIQYTKE MKESAARVLQ EAWMFYKHTR RKESHAARRH QRKLLAAINA FRQVRLKHRK 360 

LREQVNSMVD ISKMHMILYD LQQNLSSSHR ALEKQIDTLA GKLDALTELL STALGPRQLP 420 

EPSQQSK 427 

Seq ID NO: C421 Protein Sequence 
Protein Accession #: NP_079533.1 

1 11 21 31 41 51 

I I I I I I 

. MGGKQRDEDD EAYGKPVKYD PSFRGPIKNR SCTDVICCVL FLLFILGYIV VGIVAWLYGD 60 

PRQVLYPRNS TGAYCGMGEN KDKPYLLYFN IFSCILSSNI ISVAENGLQC PTPQVCVSSC 120 

PEDPWTVGKN EFSQTVGEVF YTKNRNFCLP GVPWNMTVIT SLQQELCPSF LLPSAPALGR 180 

CFPWTNITPP ALPGITNDTT IQQGI SOLID SLNARDISVK IFEDFAQSWY WILVALGVAL 240 

VLSLLFILLL RLVAGPLVLV LILGVLGVLA YGIYYCWEEY RVLRDKGASI SQLGFTTNLS 300 

AYQSVQETWL AALIVLAVLE AILLLVLIFL RQRIRIAIAL LKEASKAVGQ MMSTMFYPLV 360 

TFVLLLICIA YWAMTALYPL PTQPATLGYV LWASNISSPG CEKVPINTSC NPTAHLVNSS 420 

CPGLMCVFQG YSSKGLIQRS VFNLQIYGVL GLFWTLNWVL ALGQCVLAGA FASFYWAFHK 480 

PQDIPTFPLI SAFIRTLRYH TGSLAFGALI LTLVQIARVI LEYIDHKLRG VQNPVARCIM 540 

CCFKCCLWCL EKFIKFLNRN AYIMIAIYGK NFCVSAKNAF MLLMRNIVRV WLDKVTDLL 600 

LFFGKLLWG GVGVLSFFFF SGRIPGLGKD FKSPHLNYYW LPIMTSILGA YVIASGFFSV 660 

FGMCVDTLFL CFLEDLERNN GSLDRPYYMS KSLLKILGKK NEAPPDNKKR KK 712 

Seq ID NO: C422 Protein Sequence ' 
Protein Accession #: NP_057264.1 

1 11 21 31 41 51 

I I I I I I 

MGSNSGQAGR HIYKSLADDG PFDSVEPPKR PTSRLIMHSM AMFGREFCYA VEAAYVTPVL 60 
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LSVGLPSSLY SIVWFLSPIL GFLLQPWGS ASDHCRSRWG RRRPYILTLG VMMLVGMALY 120 

LNGATWAAL IANPRRKLVW AISVTMIGW LFDFAADFID GPIKAYLFDV CSHQDKEKGL 180 

HYHALFTGFG GALGYI*LGAI DWAHLELGRL LGTEFQVMFF FSALVLTLCF TVHLCSISEA 240 

PLTEVAKGIP PQQTPQDPPL SSDGMYEYGS IBKVKNGYVN PELAMQGAKN KNHAEQTRRA 300 

MTLKSLLRAL VNMPPHYRYL CISHLIGWTA FLSNMLFFTD FMGQIVYRGD PYSAHNSTEF 360 

LIYERGVEVG CWGFCINSVF SSLYSYFQKV LVSYIGLKGL YFTGYLLFGL GTGFIGLFPN 420 

VYSTLVLCSL FGVMSSTLYT VPFNLITEYH REEEKBRQQA PGGDPDNSVR GKGMDCATLT 480 

CMVQLAQILV GGGLGFLVNT AGTWWVIT ASAVALIGCC FVALFVRYVD 530 

Seq ID NO: C423 Protein Sequence 
Protein Accession #: NP_003264.1 

1 11 21 31 41 51 

11)111 

MEGFGGVGGR GTRGFAAKGV WRGRAEEGPV LGAAERGFMV STGSRRRVFE GPGGGGLRWT 60 

PGKGTGRQRG AWGPRAEDGV RRRTLGMPRG SRRDVRAPCG PAGSWGARGG RRRDGPSRRR 120 

RGSATAAARH HVPPAPGGPF GPRAPAGSTR VPARAGGAVE PTGAAAVARL ARPAGGALPT 180 

AGAQGAGPAR GRSGEGSEWA RRGKGRPGPY QSPLGPAVAE GQELKDKSRL RYPINGFQAL 240 

VLTALLVGLG MSAGLPLGAL PEMLLPLAFV ATLTAFIFSL FLYMKAQVAP VSALAPGGNS 300 

GNPIYDFFLG RBLNPRICFF DFKYFCELRP GLIGWVLINL ALLMKEAEbR GSPSLAMWLV 360 

NGFQLLYVGD ALWHEEAVLT TMDITHDGFG FMLAFGDMAW VPFTYSLQAQ FLLHHPQPLG 420 

LPMASVICLI NATGYYIFRG ANSQKNTFRK NPSDPRVAGti ETISTATGRK LLVSGWWGMV 480 

RHPNYLGDLI MALAWSLPCG VSHLLPYFYL LYFTALLVHR EARDERSACR STAWPGRSTA 540 

GVCLTASCPT STEAAPPPQV GHVPTHPPAH PGPGASTHLG LKGLHPTQP 589 

Seq id NO> C424 Protein Sequence 
Protein Accession #: NP_056535.l 

1 11 21 31 41 51 

I I I I I I 

MGRLLRAARL PPLLSPLLLL LVGGAFLGAC VAGSDEPGPE GLTSTSLLDL LLPTGLEPLD 60 

SEEPSETMGL GAGLGAPGSG FPSEENEESR ILQPPQYFWE EEEELNDSSL DLGPTADYVF 120 

PDLTEKAGSI EDTSQAQELP NLPSPLPKMN LVEPPWHMPP REEEEEEEBE EEREKEEVEK 180 

QEEEEEEELL FVNGSQEEAK PQVRDFSLTS SSQTPGATKS RHEDSGDQAS SGVEVESSMG 240 

PSLLLPSVTP TTVTPGDQDS TSQEAEATVL PAAGLGVEFE APQEASEEAT AGAAGLSGQH 300 

EEVPALPSFP QTTAPSGAEH PDEDPLGSRT SASSPIAPGD MELTPSSATL GQEDLNQQLL 360 

EGQAAEAQSR IPWDSTQVIC KDWSNLAGKN YIILNMTENI DCEVFRQHRG PQLLALVEEV 420 

LPRHGSGHHG AWHISLSKPS EKEQHLLMTL VGEOGWPTQ DVLSMW3DIR RSLEEIGIQN 480 

YSTTSSCQAR ASQVRSDYGT LFWLWIGA ICIIIIALGL LYNCWQRRLP KLKHVSHGEE 540 

LRFVENGCHD NPTLDVASDS QSEMQEKKPS LNGGGALNGP GSWGALMGGK RDPEDSDVFE 600 

EDTHL 605 

Seq ID NO; C425 Protein Sequence 
Protein Accession #: NP_001188.1 

1 11 21 31 41 51 

I I I I I I 

MSEVRPLSRD IU4ETLLYBQ LLEPPTMEVL GMTDSEEDLD PMEDFDSLEC MEGSDALALR 60 

LACIGDEMDV SLRAPRLAQL SEVAMHSLGL AFIYDQTEDI RDVLRSFMDG FTTIjKENIMR 120 

FWRSPNPGSW VSCEQVLIAL LLLLALLLPL LSGGLHLLLK 160 

Seq ID NO i C426 Protein Sequence 
Protein Accession #: AAF76225.1 

1 11 21 31 41 51 

I 1 I I'll 

MATPLPPPSP RHLRLLRLLL SGLVLGAALR GAAAGHPDVA ACPGSLDCAL KRRARCPPGA 60 

HACGPCLQPF QEDQQGLCVP RMRRPPGGGR PQPRLEDEID FIAQELARKE SGQSTPPLPK 120 

DRQRLPEPAT LGFSARGQGIi ELGLPSTPGT PTPTPHTSLG SPVSSDPVHM SPLEPRGGQG 180 

DGLALVLILA FCVAGAAALS VASLCWCRLQ REIRLTQKAD YATAKAPGSP AAPRISPGDQ 240 

RLAQSAEMYH YQHQRQQMLC LERHKEPPKB LDTASSDEEN BDGDFTVYEC PGLAPTGEME 300 

VRNPLFDHAA LSAPLPAPSS PPALP 325 

Seq ID NO: C427 Protein Sequence 
Protein Accession #; NP_004436.1 

1 11 21 31 41 51 

I I I 1 I 1 

MVCSLWVLLL VSSVLALEEV LLDTTGETSE IGWLTYPPGG WDEVSVLDDQ RRLTRTFEAC 60 

HVAGAPPGTG QDNWLQTHFV ERRGAQRAHI RLHFSVRACS SLGVSGGTCR ETFTLYYRQA 120 

EEPDSPDSVS SWHliKRWTKV DTIAADESFP SSSSSSSSSS SAAWAVGPHG AGQRAGLQLN 180 

VKERSFGPLT QRGFYVAFQD TGACLALVAV RLFSYTCPAV LRSFASFPET QASGAGGASL 240 

VAAVGTCVAH AEPEEDGVGG QAGGSPPRLH CNGEGKWMVA VGGCRCQPGY QPARGDKACQ 300 

ACPRGLYKSS AGNAPCSPCP ARSHAPNPAA PVCPCLEGFY RASSDPPRAP CTGPPSAPQE 360 

LWFEVQGSAL MLHWRLPREL GGRGDLLFNV VCKECEGRQE PASGGGGTCH RCRDEVHFDP 420 

RQRGLTESRV LVGGLRAHVP YILEVQAVNG VSELSPDPPQ AAAINVSTSH EVPSAVPWH 480 

QVSRASNSIT VSWPQPDQTN GNILDYQLRY YDQAEDESHS FTLTSETNTA TVTQLSPGHT 540 

YGFQVRARTA AGHGPYGGKV YFQTLPQGEL SSQLPERLSL VIGSIIjGALA FLLIAAITVL 600 

AWFQRKRRG TGYTEQLQQY SSPGLGVKYY IDPSTYEDPC QAIRELAREV DPAYIKIEEV 660 

IGTGSFGEVR QGRLQPRGRR EQTVAIQALW AGGAESLQMT FLGRAAVLGQ FQHPNILRLE 720 

GWTKSRPLM VLTEFMELGP LDSFLRQREG QFSSLQLVAM QRGVAAAMQY LSSFAFVHRS 780 

LSAHSVLVNS HLVCKVARLG HSPQGPSCLL RWAAPEVIAH GKHTTSSDVW SFGILMWEVM 840 

SYGERPYWDM SEQEVLNAIE QEFRLPPPPG CPPGLHIiLML DTWQKDRARR PHFDQLVAAF 900 

DKMIRKPDTL QAGGDPGERP SQALLTPVAL DFPCLDSPQA WLSAIGLECY QDNFSKFGLC 960 
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TFSDVAQLSL EDLPALGITL AGHQKKLLHH IQLLQQHLRQ QGSVEV 1006 

Seq ID NO: C428 Protein Sequence 
Protein Accession 8: XP_043340.2 

1 11 21 31 41 51 

1 I I I 1 I 

MPFDFRRFDI YRKVPKDLTQ PTYTGAIISI CCCLFILFLF LSELTGFITT EWNELYVDD 
PDKDSGGKID VSLNISLPNL HCELVGLDIQ DEMGRHEVGH IDNSMKIPLN NGAGCRFEGQ 
FSINKVPGNF HVSTKSATAQ PONPDMTHVI HKLSFGDTI/Q VQNIHGAFNA LGGADRLTSN 
PLASHDYILK IVPTVYEDKS GKQRYSYQYT VANKEYVAYS HTGRIIPAIW FRYDLSPITV 
KYTERRQPLY RPITTICAII GGTFTVAGIL DSCIFTASEA WKKIQLGKMH 

Seq ID NO: C429 Protein Sequence 
Protein Accession #: NPJJ02142.1 

1 11 21 31 41 51 

I I 1 I I I 

MAQKEGGRTV PCCSRPKVAA LTAGTLLLLT AIGAASWAIV AVLLRSDQEP LYPVQVSSAD 60 

ARLMVFDKTE GTWRLLCSSR SNARVAGLSC EEMGPLRALT HSELDVRTAG ANGTSGFFCV 120 

DEGRLPHTQR LLEVISVCDC PRGRPLAAIC QDCGRRKLPV DRIVGGRDTS LGRWPWQVSL 180 

RYDGAHLCGG SLLSGDWVLT AAHCFPERNR VLSRWRVFAG AVAQASPHGL QLGVQAWYH 240 

GGYLPFRDPN SEENSNDIAL VHLSSPLPLT EYIQPVCLPA AGQALVDGKI CTVTGWGNTQ 300 

YYGQQAGVLQ EARVPIISND VCNGADFYGN QIKPKMFCAG YPEGGIDACQ GDSGGPFVCE 360 

DSISRTPRWR LCGIVSWGTG CAIiAQKPGVY TKVSDFREWI FQAIKTHSEA SGMVTQL 417 

Seq ID NO: C430 Protein Sequence 
Protein Accession #: BAA92562.1 

1 11 21 31 41 51 

I I I I I I 

METTVLSGIN FEYKGMTGWE VAGDHIYTAA GASDNDFMIL TLWPGFRPP QSVMADTENK 60 

EVARITFVFE TLCSVNCELY FMVGVNSRTN TPVETWKGSK GKQSYTYIIE ENTTTSFTWA 120 

FQRTTFHEAS RKYTNDVAKI YSINVTNVMN GVASYCRPCA LEASDVGSSC TSCPAGYYID 180 

RDSGTCHSCP PNTILKAHQP YGVQACVPCG PGTKNNKIHS LCYNDCTFSR NTPTRTFNYN 240 

FSALANTVTL AGGPSFTSKG LKYFHHFTLS LCGNQGRKMS VCTDNVTDLR IPEGESGFSK 300 

SITAYVCQAV IIPPEVTGYK AGVSSQPVSL ADRIiIGVTTD MTJuDGITSPA ELFHLESLGI 360 

PDVIFFYRSN DVTQSCSSGR STTIRVRCSP QKTVPGSLLL PGTCSDGTCD GCNFHFLWES 420 

AAACPLCSVA DYHAIVSSCV AGIQKTTYVW REPKLCSGGI SLPEQRVTIC KTIDFWLKVG 480 

ISAGTCTAIL LTVLTCYFWK KNQKLEYKYS KLVMNATLKD CDLPAADSCA IMEGEDVEDD 540 

LIFTSKKSLF GKIKSFTSKQ PAPVTISLSE DS 572 

Seq ID NO: C431 Protein Sequence 
Protein Accession #: NP_004855.1 

1 11 21 31 41 51 

I I I I I 1 

MPGQELRTVN GSQMUjVLLV LSWLPHGGAL SIAEASRASP PGPSELHSED SRFRELRKRY 60 

EDLLTRLRAN QSWEDSNTDL VPAPAVRILT PBVRLGSGGH LHLRISRAAL PEGLPEASRL 120 

HRALFRLSPT ASRSWDVTRP LRRQLSLARP QAPALHLRLS PPPSQSDQLL AESSSARPQL 180 

ELHLiRPQAAR GRRRARARNG DDCPLGPGRC CRLHTVRASL EDLGWADWVL SPREVQVTMC 240 

IGACPSQFRA ANMHAQIKTS LHRLKPDTBP APCCVPASYN PMVLIQKTDT GVSLQTYDDL 300 

LAKDCHCI 30 fl 

Seq ID NOi C433 Protein Sequence 
Protein Accession ttx NP_443090.1 

1 11 21 31 41 51 

I I ! I I I 

MED P SCARE P RARPRERDPG RRPHPDQGRT HDRPRDRPGD PRRKRSSDGN RRRDGDRDPK 60 
RDQERDGNRD RNRDRERERB RERDPDRGPR RDTHRDAGPR AGEHGVWEKP RQSRTRDGAR 120 
GLTWDAAAPP GPAPWEAPEP PQPQRKGDPG RRRPESEPPS ERYLPSTPRP GREEVEYYQS 180 
EAEGLLECHK CKYLCTGRAC CQMLEVLLNL LILACSSVSY SSTGGYTGIT SLGGIYYYQF 240 
GGAYSGFDGA DGEKAQQIxDV QFYQLKLPMV TVAMACSGAL TALCCLFVAM GVLRVPWHCP 300 
LLLVTEGLLD MLIAGGYIPA LYFYFHYLSA AYGSPVCKER QALYQSKGYS GFGCSFHGAD 360 
IGAGIFAALG IWFALGAVL AIKGYRKVRK LKEKPAEMFE P 401 

Seq ID NO: C435 Protein Sequence 
Protein Accession #t Eos sequence 

1 n 21 31 41 51 

I I I I I I 

MGAAGRQDFL FKAMLTISWL TLTCFPGATS TVAAGCPDQS PELQPWNPGH DQDHHVHIGQ 60 

GKTLLLTSSA TVYSIHISEG GKLVIKDHDE PIVLRTREIL IDNGGELHAG SALCPFQGNF 120 

TIILYGRADE GIQPDPYYGL KYIGVGKGGA LELHGQKKLS WTFLNKTLHP GGMAEGGYFF 180 

ERSWGHRGVI VHVIDPKSGT VIHSDRFDTY RSKKESERLV QYLNAVPDGR ILSVAVNDEG 240 

SRNLDDMARK AMTKLGSKHF LHLGFRHPWS FLTVKGNPSS SVEDHIEYHG HRGSAAARVF 300 

KLFQTEHGBY FNVSLSSEWV QDVEWTEWFD HDKVSQTKGG EKISDLWKAH PGKICNRPID 360 

IQATTMDGVN LSTEWYKKG QDYRFACYDR GRACRSYRVR FLCGKPVRPK LTVTIDTNVN 420 

STILNLEDNV QSMKPGDTLV IASTDYSMYQ AEBFQVLPCR SCAPNQVKVA GKPMYLHIGE 480 

EIDGVDMRAE VGLLSRNIIV MGEMEDKCYP YRNHICNFFD FDTFGGHIKF ALGFKAAHLE 540 

GTELKHMGQQ LVGQYPIHFH LAGDVDERGG YDPPTYIRDL SIHHTFSRCV TVHGSNGLLI 600 

KDWGYNSLG HCFPTEDGPE ERNTFDHCU3 LLVKSGTLLP SDRDSKMCKM ITEDSYPGYI 660 

PKPRQDCNAV STFWMANPNN NLINCAAAGS EETGFWFIFH HVPTGPSVGM YSPGYSEHIP 720 
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LGKFYNNRAH SNYRAGMI ID NGVKTTEASA KDKRPFLSII SARYSPHQDA DPLKPREPAI 780 
IRHFIAYKNQ DHGAWLRGGD VWLDSCHFRG EAQEGFLLTG MKAGGILLGG DEAASGMAQG 840 
FSPPCRCLLK LVTTGSPFAH VSLAHS fi ^ 6 
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It is understood that the examples described above in no way serve to limit the true 
scope of this invention, but rather are presented for illustrative purposes. All publications, 
5 sequences of accession numbers, and patent applications cited in this specification are herein 
incorporated by reference as if each individual publication, accession number, or patent 
application were specifically and individually indicated to be incorporated by reference. 
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WHAT IS CLAIMED IS: 



1 1 . A method for determining the presence or absence of a pathological cell in a 

2 patient, said method comprising detecting a nucleic acid comprising a sequence at least 80% 

3 identical to a sequence as described in Tables 2A-80 in a biological sample from said patient, 

4 thereby determining the presence or absence of said pathological cell. 

1 2. The method of Claim 1 , wherein: 

2 a) said pathology is described in Table 1, including a cancer; and/or 

3 b) said biological sample comprises isolated nucleic acids. 

1 3 . The method of Claim 1 , wherein said biological sample is tissue from an organ ' 

2 which is affected by said pathology of Table 1 , including a cancer. 

1 4. The method of Claim 2, wherein said nucleic acids are mRNA 

1 5 . The method of Claim 2: 

2 a) further comprising a step of amplifying nucleic acids before said step of detecting ; 

3 said nucleic acid; or 

4 b) where said detecting is of a protein encoded by said nucleic acid. 

1 6. The method of Claim 1 , wherein said nucleic acid comprises a sequence as 

2 described in Tables 2A-80. 

1 7. The method of Claim 2, wherein: 

2 a) said detecting step is carried out by: 

3 i) using a labeled nucleic acid probe; 

4 ii) utilizing a biochip comprising a sequence at least 80% identical to a sequence 

5 as described in Tables 2A-80; or 

6 iii) detecting a polypeptide encoded by said nucleic acid; or 

7 b) said patient is: 

8 i) undergoing a therapeutic regimen to treat said pathology of Table 1 ; or 

9 ii) is suspected of having said pathology or cancer. 

1 8. An isolated nucleic acid molecule comprising a sequence as described in 

2 Tables 2A-80. 
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1 9. The nucleic acid molecule of Claim 8, which is labeled. 

1 1 0. An expression vector comprising the nucleic acid of Claim 8. 

1 1 1 . A host cell comprising the expression vector of Claim 10. 

1 12. An isolated polypeptide which is encoded by a nucleic acid molecule 

2 comprising a sequence as described in Tables 2A-80. 

1 13. An antibody that specifically binds a polypeptide of Claim 12. 

1 14. The antibody of Claim 13: 

2 a) conjugated to an effector component; 

3 b) conjugated to a detectable label, including a fluorescent label, a radioisotope, or a 

4 cytotoxic chemical; 

5 c) which is an antibody fragment; or 

6 d) which is a humanized antibody. 

1 15. A method for specifically targeting a compound to a pathological cell in a 

2 patient, said method comprising administering to said patient an antibody of Claim 13, 

3 thereby providing said targetting. 

1 1 6. A method for determining the presence or absence of a pathological cell in a 

2 patient, said method comprising contacting a biological sample with an antibody of Claim 13. 

1 17. The method of Claim 16, wherein: 

2 a) said antibody is conjugated to: 

3 i) an effector component; or 

4 ii) a fluorescent label; or , 

5 b) said biological sample is a blood, serum, urine, or stool sample. 

1 1 8. A method for identifying a compound that modulates a pathology-associated 

2 polypeptide, said method comprising the steps of: 
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3 a) contacting said compound with a pathology-associated polypeptide, said 

4 polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence 

5 at least 80% identical to a sequence as described in Tables 2A-80; and 

6 b) determining the functional effect of said compound upon said polypeptide. 

1 1 9. A drug screening assay comprising the steps of: 

2 a) administering a test compound to a mammal having a pathology of Table 1 or a 

3 cell isolated therefrom; and 

4 b) comparing the level of gene expression of a polynucleotide that selectively 

5 hybridizes to a sequence at least 80% identical to a sequence as described in 

6 Tables 2A-80 in a treated cell or mammal with the level of gene expression of said 

7 polynucleotide in a control cell or mammal, wherein a test compound that 

8 modulates said level of expression of the polynucleotide is a candidate for the 

9 treatment of said pathology. 
10 
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